github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-01-09 06:38:00 -05:00

Author	SHA1	Message	Date
Peter Park	4f53183696	docs: Add JAX MaxText benchmark v25.7 (#5182 ) * Update previous versions * Add data file * fix filename and anchors * add templates * update .wordlist.txt * Update template and data add missing step fix fmt * update template * fix data * add jax 0.6.0 * update history * update quantized training note	2025-09-08 21:42:56 -04:00
Peter Park	4bc1bf00c6	Update PyTorch training benchmark docker doc to 25.7 (#5255 ) * Update PyTorch training benchmark docker doc to 25.7 * update .wordlist.txt * update conf.py * update data sheet * fix sphinx warnings	2025-09-05 12:07:51 -04:00
Matt Williams	1d42f7cc62	Deep learning frameworks edits for scale (#5189 ) * Deep learning frameworks edits for scale Based on https://ontrack-internal.amd.com/browse/ROCDOC-1809 * update table table * leo comments * formatting * format * update table based on feedback * header * Update machine learning page * headers * Apply suggestions from code review Co-authored-by: anisha-amd <anisha.sankar@amd.com> * Update .wordlist.txt * formatting * Update docs/how-to/deep-learning-rocm.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> --------- Co-authored-by: Matt Williams <Matt.Williams+amdeng@amd.com> Co-authored-by: anisha-amd <anisha.sankar@amd.com> Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-08-22 11:46:07 -04:00
Peter Park	98029db4ee	docs: Add Primus (Megatron) training Docker documentation (#5218 )	2025-08-21 23:50:55 -04:00
Peter Park	7ee22790ce	docs: Update vLLM benchmark doc for 20250812 Docker release (#5196 )	2025-08-14 15:43:36 -04:00
anisha-amd	266387d816	Docs: Adding frameworks compatibility for Megablocks and Taichi (#5133 )	2025-07-31 13:00:31 -04:00
Istvan Kiss	fb30dafa29	Update precision support page part I. (#5127 )	2025-07-31 15:22:19 +02:00
yugang-amd	cc5bc5a882	Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B (#4870 )	2025-07-25 12:42:40 -04:00
Alex Xu	aa6f40e2e0	Merge remote-tracking branch 'external/develop' into sync-develop-from-external	2025-07-21 14:55:59 -04:00
Peter Park	5bcf3b0847	Update Megatron-LM training benchmark doc for v25.6 release (#5064 )	2025-07-18 15:57:25 -04:00
Jeffrey Novotny	b431415ade	Merge Verl, DGL, Megatron changes. (#5047 ) * Verl compatibility * verl compatibility * add Supported features Signed-off-by: Vicky Tsang <vtsang@amd.com> * updated and edited verl compat doc * added links to verl * add future release for sglang and megatron inference eng. Signed-off-by: Vicky Tsang <vtsang@amd.com> * fix lint Signed-off-by: Vicky Tsang <vtsang@amd.com> * fixed a typo and a table * Spolifroni amd/add to compat matrix (#430) * added verl to compatibility matrix * small change * fixed an error in csv * edited the verl compat based on leo's recommendations * updated compat matrix (#435) * Added a hardcoded link to the verl install This is a link to an RTD build and MUST be removed before publishing. * Update verl-compatibility.rst * Added a hardcoded link to the verl install This link is to an RTD build and it WILL break at publishing. It MUST be changed before publishing. * Added version support note (#448) * small fixes * Update verl-compatibility.rst * Update verl-compatibility.rst --------- Signed-off-by: Vicky Tsang <vtsang@amd.com> Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com> Co-authored-by: anisha-amd <anisha.sankar@amd.com> (cherry picked from commit `f9bd22626b`) * Stanford Megatron-LM Compatibility * Create stanford-megatron-lm-compatibility.rst * toc and wordlist * Update deep-learning-rocm.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * fixes and adding to main compat matrix * formatting fix * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> (cherry picked from commit `f4f096b44e`) * Framework: DGL Compatability * Introducing new file for DGL Compatability * Update dgl-compatibility.rst * Update .wordlist.txt * Update .wordlist.txt * Update deep-learning-rocm.rst * compatibility fixes * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update dgl-compatibility.rst * Update dgl-compatibility.rst * Update dgl-compatibility.rst * Update dgl-compatibility.rst * additions to use-cases and system support * wording and fixes * Update dgl-compatibility.rst * Update dgl-compatibility.rst * remove table heading * Update compatibility-matrix-historical-6.0.csv --------- Co-authored-by: anisha-amd <anisha.sankar@amd.com> Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> (cherry picked from commit `2a7554c0b9`) * Manually resolve merge conflict * Further merge conflict adjustments --------- Signed-off-by: Vicky Tsang <vtsang@amd.com> Co-authored-by: vickytsang <vtsang@amd.com> Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com> Co-authored-by: anisha-amd <anisha.sankar@amd.com> Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Co-authored-by: Mukhil M S <167260682+mukh1l@users.noreply.github.com>	2025-07-15 18:57:31 -04:00
vickytsang	f9bd22626b	Verl compatibility * verl compatibility * add Supported features Signed-off-by: Vicky Tsang <vtsang@amd.com> * updated and edited verl compat doc * added links to verl * add future release for sglang and megatron inference eng. Signed-off-by: Vicky Tsang <vtsang@amd.com> * fix lint Signed-off-by: Vicky Tsang <vtsang@amd.com> * fixed a typo and a table * Spolifroni amd/add to compat matrix (#430) * added verl to compatibility matrix * small change * fixed an error in csv * edited the verl compat based on leo's recommendations * updated compat matrix (#435) * Added a hardcoded link to the verl install This is a link to an RTD build and MUST be removed before publishing. * Update verl-compatibility.rst * Added a hardcoded link to the verl install This link is to an RTD build and it WILL break at publishing. It MUST be changed before publishing. * Added version support note (#448) * small fixes * Update verl-compatibility.rst * Update verl-compatibility.rst --------- Signed-off-by: Vicky Tsang <vtsang@amd.com> Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com> Co-authored-by: anisha-amd <anisha.sankar@amd.com>	2025-07-15 16:39:31 -04:00
anisha-amd	f4f096b44e	Stanford Megatron-LM Compatibility * Create stanford-megatron-lm-compatibility.rst * toc and wordlist * Update deep-learning-rocm.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * fixes and adding to main compat matrix * formatting fix * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-07-15 16:23:50 -04:00
Mukhil M S	2a7554c0b9	Framework: DGL Compatability * Introducing new file for DGL Compatability * Update dgl-compatibility.rst * Update .wordlist.txt * Update .wordlist.txt * Update deep-learning-rocm.rst * compatibility fixes * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update dgl-compatibility.rst * Update dgl-compatibility.rst * Update dgl-compatibility.rst * Update dgl-compatibility.rst * additions to use-cases and system support * wording and fixes * Update dgl-compatibility.rst * Update dgl-compatibility.rst * remove table heading * Update compatibility-matrix-historical-6.0.csv --------- Co-authored-by: anisha-amd <anisha.sankar@amd.com> Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-07-15 16:17:58 -04:00
Adel Johar	51cb6461b5	Docs: Pytorch compatibility page update	2025-06-18 11:12:47 +02:00
Adel Johar	c699aaf915	Docs: Overhaul JAX compatibility page	2025-06-12 14:35:30 +02:00
Peter Park	9ed65a81c4	Add Megatron-LM benchmark doc 5/2 (#4778 ) * reorg files * add tabs * update template * update template * update wordlist and toc * add previous version to doc * add selector paragraph * update wordlist.txt	2025-05-22 14:28:18 -04:00
Peter Park	0a77e7b3a5	docs: Add system health check doc under ROCm for AI (#4736 ) * add initial draft * add to toc and install page * update wording * improve documentation structure * resturcture and expand content * add to training section * add to conf.py article_pages * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update wordlist.txt * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * inference --> AI workloads * udpate toc * update article_pages in conf.py * Update system validation notes in training docs * fix links in prerequisite-system-validation * wording * add note * consistency * remove extra files * fix links * add links to training index page --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-05-13 15:54:48 -04:00
Peter Park	d44ea40a0d	Add MPT-30B + LLM Foundry doc (#4704 ) * add mpt-30b doc * add tunableop note * update MPT doc * add section * update wordlist * fix flash attention version * update "applies to" * address review feedback * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update docker details to pytorch-training-v25.5 * update --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-05-02 12:13:20 -04:00
Peter Park	c3faa9670b	Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) (#4654 ) * update vLLM links in deploy-your-model.rst * add pytorch inference benchmark doc * update toc and vLLM title * remove previous versions * update * wording * fix link and "applies to" * add pytorch to wordlist * add tunableop note to clip * make tunableop note appear to all models * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * fix incorrect links * wording * fix wrong docker pull tag --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-04-23 17:35:52 -04:00
Peter Park	9ff3c2c885	Update PyTorch training Docker doc for 25.5 (#4638 ) * update pytorch-training to 25.5 * remove llama 2 * Revert "remove llama 2" This reverts commit dab672fa7bcbd8bff730382c14177df4301a537d. * add previous version * fix run cmd * add link to docker hub * fix linting issue * add Llama 3.3 70B * update	2025-04-15 18:16:22 -04:00
Parag Bhandari	db3c46fccf	Merge branch 'develop-internal' into develop	2025-04-11 14:32:09 -04:00
Peter Park	424e6148bd	Add MaxText training Docker doc Add MaxText training Docker doc	2025-03-28 11:25:06 -04:00
Pratik Basyal	e980ea5e57	Pre ga 640 update (#333 ) * ROCProfiler deprecation notice udpated * Link error * Compatibility updated * New changelog and OS support updated * Upcoming changes removed from rocWWMA, added to hipTensor * Glibc added to wordlist * Instict docs content added * RHEL 9.5 to OS * Compatibility OS update * Leo's feedback incorporated and TOC updated for linux requirement	2025-03-21 16:09:53 -04:00
Istvan Kiss	635838e7ef	Add atomics operation support page	2025-03-20 17:11:02 +01:00
Peter Park	2fca094531	PyTorch training Docker update 25.4 (#4482 ) * remove orphan tag * add hugging face PEFT * update "previous versions" * data == ultrachat 200k * fix "llama 2" * add ultrachat to wordlist * fix previous versions table * add performance measurements * add mi325x * fix prev version * change 'validation' to 'testing * fix dir name * fix backtick	2025-03-13 13:40:00 -04:00
Peter Park	1fb42c2591	Update LLM inference performance validation on AMD Instinct MI300X guide to filter by desired model (#4424 ) * WIP (cherry picked from commit a06a5b5b959a9425e7384fb58b88c3716f380e48) rm unneeded files (cherry picked from commit f1d0c00056a83299bdea74a43cd17454999cf2d8) * add sphinxcontrib.datatemplates (cherry picked from commit d056b93a325d87b81f54f70c6eb4ae78f4fb0bc1) * add template (cherry picked from commit 0691d59f0a1efbda7908762b7a906e30a65c0ee1) fix template (cherry picked from commit 01e4bea5522aa5deeaade58c105ff850f449df8b) WIPO (cherry picked from commit 4d8daf7445e7be92cd9ee1d39dff564bd8de41f4) WIP (cherry picked from commit 9eefd1f5833bc4dc8de9d777ff65a5fe5f826dbd) update models yaml schema (cherry picked from commit a5f0fc1e6cc51104dc2d42029bfcf3eea276d270) add model groups functionality (cherry picked from commit 13f49f96dd3e5a160d37c52e48a4fbcccdcf4f9e) add selector headings and fix template (cherry picked from commit 35f7f2314bcf74b4fd0a8ca10aaabf0de7063bb0) update template (cherry picked from commit 9e2dcfe0c7f6e7c2c685866ea83375fbacbc5032) fix (cherry picked from commit be51e32791550ddc21785effccb889228394b242) use classes instead of data tags (cherry picked from commit cd52d68c504f7e7435d156ae70cf4bde1dfe703e) update template (cherry picked from commit 9ed89fee6874b39ee3535fbde54a0a59f346ea2b) clean up extra wip files (cherry picked from commit a9f965a104baa966c184054638e935b011526278) update wordlist (cherry picked from commit f783656814e896aedd21acd1c8c87b4700c14469) remove unused template (cherry picked from commit cac894bd9c2b1262c9c006e5fddbcb742dc6d882) improve script (cherry picked from commit ca20ffd4922916616e0924d625652a815f27c35f) fix template (cherry picked from commit 752c61fda856fd5b244734636c036c8877e823b9) fix standalone benchmark output path in template (cherry picked from commit d8c04203b5ec0f6c2e2307f7890304a3dc5687be) fix toc (cherry picked from commit 8df42faf53488ef29f5a263d25032f3d35cd58ed) update script to prevent flash of unstyled content import a11y (cherry picked from commit 46c852717f223a1d8744fab035807cebab4c5404) add tabindex to wordlist (cherry picked from commit 11492593f9692f5453045e7ec52c8f8ae9624ae9) text update script * remove unused config option * reorganize assets * fix linting warning * move js from data/ to extension/	2025-02-28 12:39:02 -05:00
Adel Johar	4be8096109	Merge pull request #4393 from ROCm/docs_fix_arch Docs: Fix gpu-arch-spec.rst	2025-02-26 14:19:38 +01:00
Peter Park	389fa7071b	Update docs on Megatron-LM and PyTorch training Dockers (#4407 ) * Update Megatron-LM and PyTorch Training Docker docs Also restructure TOC * Apply suggestions from code review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> update "start training" text Apply suggestions from code review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> update conf.py fix spacing fix branding issue add disable numa reorg remove extra text	2025-02-21 13:07:18 -05:00
Pratik Basyal	1b36ab4850	Final GA day prep for 633 (#313 ) * ROCProfiler deprecation notice udpated * Final GA day changes added * github issue no. added * ROCTx added * rocprofv added to wordlist * Minor fix	2025-02-19 15:19:44 -05:00
Adel Johar	0c6f660d59	Docs: Fix gpu-arch-spec.rst	2025-02-19 17:05:01 +01:00
Peter Park	2751a17cf0	Update vLLM benchmarking guide (#4347 ) * update vllm-benchmark fix hlist overflow update standalone benchmarking options update list of models fix typo and model name unnecessary duplicate info update formatting update vllm benchmark guide - remove Llama 2 FP8 - add Jais 13B - update commands update docker pull tag update MAD available models remove extra mad models not relevant to vllm update PyTorch version add changelog add model names to .wordlist.txt * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * fix typo * update link * fix link text * change changelog to previous versions * fix typo * remove "for" --------- Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>	2025-02-05 17:18:35 -05:00
Pratik Basyal	3af84601f8	Final changes moved to autotag template (#295 ) * Final changes moved to autotag template * VCN added	2025-01-28 13:25:10 -05:00
Pratik Basyal	3738297667	2nd POC for How to Use ROCm for AI (#282 ) * Initial draft for How-to POC * Zone.identifier file removed * Broken links in index.md fixed * Zone.identifier file removed * Review feedback incorporated * Title updated * New format for ROCm for AI TOC created * Folder structure changed * ROCm for AI index updated * Link to Llama recipe updated * Review feedback added * Feedback from Cindy added * Intro text from Cindy added * New flow suggested by Hongxia incorporated * Overview content from Cindy added, TOC updated, Meta data updated * Reference to HPC removed * Listing alignment updated * Overview page updated * Folder structure and link change resulted from TOC change updated * Content sequence updated * Meta data updated * Review feedback incorporated * Index file renamed * Conf file updated for OS compatibility info * update metadata (#4) update metadata fix spelling * Wordlist updated --------- Co-authored-by: Peter Park <peter.park@amd.com>	2025-01-24 17:42:20 -05:00
Peter Park	d534f755e4	Add metadata to docs (#3688 ) * add missing metadata add metadata to mi300 arch doc add metadata to contributing guide add metadata to mi300x tuning guides * update meta to yaml frontmatter * update to md metadata to myst frontmatter * remove extra file * fix spelling	2025-01-14 08:55:45 -05:00
Peter Park	26553d725b	Add TensorFlow compatibility docs (#4247 ) * Add Tensorflow * WIP * WIP * minor fmt * PR feedbacks * fix missed inconsistent formatting * WIP WIP WIP WIP * minor formatting update tensorflow-rocm docker images to rocm6.3.1 fix urls * WIP * fix typo and update wordlist * fix tables not rendering * fix table headings * add period * update tf dockers * fix link * fix link * wording * update historical compat * fix tensile link --------- Co-authored-by: Mátyás Aradi <matyas@streamhpc.com> Co-authored-by: Istvan Kiss <neon60@gmail.com>	2025-01-09 14:24:58 -05:00
Peter Park	ff1393142b	Add JAX compatibility doc (#4234 ) * Add JAX compatibility (cherry picked from commit 99215ab6b4cf6a1209d6c5fc781b5855251dcba5) * WIP (cherry picked from commit 54564a85d340b4149ed80a33377cf54c1eb48713) * Fix docker table (cherry picked from commit 8115a905764c869b390de2561e5f1356ec7e9743) * WIP (cherry picked from commit 45076e1fd20fd2c43f7a0ab6d8d5d246c498d801) * add minor formatting (cherry picked from commit c75706841092006c26766611b0407b79a13c7345) * PR feedbacks (cherry picked from commit 236b5daae4251c26cd697c6e20d5982771b05754) * fix inconsistent formatting (cherry picked from commit 0c6a2e3627f9e6159e3f400ab18769904c18097e) * Rename file (cherry picked from commit f17239aa8a9fa1ecdf8dab08c0348dc9216c5311) * jax_triton supported (cherry picked from commit fa56d697fbaa44c0c480df71dc236be8584291c0) * WIP (cherry picked from commit e8f0c5741fe96bb1e3272365906334d911a9a849) * WIP (cherry picked from commit 8ee4f3c62da8e11eea591340dc7c9fc1be8b7035) * WIP (cherry picked from commit 58c6bf441054fe3a21ba2d86808279e90de847b7) * WIP (cherry picked from commit 368ddf6925215a9bfd75a43c7c33def12238f81d) * update .wordlist.txt (cherry picked from commit 78ac332c8d6eba93e2b3e57440da3f60054bbadb) * update .wordlist.txt (cherry picked from commit 8d9492399f4b73b0c3c5359684d5b7faa328ba0f) * Fix typos (cherry picked from commit 394dede13b6de087237832fe3c693c11da7d733b) * update jax note (cherry picked from commit ceacc713c4295f8bbd20fc622579de9053b73337) * Update docs/compatibility/ml-compatibility/jax-compatibility.rst (cherry picked from commit b0613e914a2ba639fddea62eb495f97beaa8ba49) * Update docs/compatibility/ml-compatibility/jax-compatibility.rst (cherry picked from commit 8aac4344b6fd4120a3b8a31878f5316df99f3f99) * Add back hipGraph support (cherry picked from commit 028ddb3535073e0cd668c24614a0a73a491b5948) * WIP (cherry picked from commit 2e0ff9c5e3f88ceea6b0ca770bb4edb52ce08a47) * WIP (cherry picked from commit 186802585de5b7d58f9ac2a7947a83c037df1617) * add blurb about docker icon (cherry picked from commit aef650d4072578f75e7549151613f390f6545ce1) * update pytorch-compatibility path in conf.py * words --------- Co-authored-by: Mátyás Aradi <matyas@streamhpc.com> Co-authored-by: Istvan Kiss <neon60@gmail.com>	2025-01-07 09:57:19 -05:00
Peter Park	76d6e892bb	Add PyTorch compatibility doc (#4193 ) * Add compatibility framework pages * update formatting * WIP * satisfy spellcheck linter * PR feedbacks * caps * remove jax and tensorflow pages * comment out "?"s * update wordlist * fix toc and table * update toc and deep-learning-rocm.rst --------- Co-authored-by: Istvan Kiss <neon60@gmail.com>	2024-12-23 18:06:22 -05:00
Alex Xu	d275733631	Merge remote-tracking branch 'internal/develop' into sync-develop-from-internal	2024-12-20 14:00:20 -05:00
Pratik Basyal	14a6fd5837	Release.md and autotag template updated for 6.3.1 release prep (#268 ) * Release ready final changes and template update * wordlist updated --------- Co-authored-by: prbasyal <prbasyal@amd.com>	2024-12-20 12:56:13 -05:00
Istvan Kiss	21d26e52d0	Add graph safe support	2024-12-19 14:48:58 +01:00
alexxu-amd	c2be7ee900	Merge branch 'develop' into sync-develop-from-external	2024-12-18 16:19:12 -05:00
Pratik Basyal	279a241c11	Quickupdates release631 develop (#255 ) * Transferbench added * Minor fix * Table alignment fixed * Review feedback * Leo's feedback incorporated * Fixed issue added * Compatibility Matrix table fixed * JAX version updated * Debian support added * Transferbench added * Debian footnote updated * Debian added to wordlist * Debian footnote updated * wordlist updated --------- Co-authored-by: prbasyal <prbasyal@amd.com>	2024-12-18 16:08:13 -05:00
Alex Xu	0356ffd148	Merge remote-tracking branch 'external/develop' into sync-develop-from-external	2024-12-18 15:57:08 -05:00
Pratik Basyal	6a7d8654ad	Revamped PCIe into new format and incorporated style guide (#4051 ) * Revamped PCIe into new format and incorporated style guide * Title case fixed * Quick fix and changes * Added RMW to wordlist and updated titles * Grammatical fixes incorporated * Sandra's review feedback incorporated * Removed PCIe3 feature reference * Leo's feedback incorporated * Sandra's feedback incorporated * Replaced execute with run * Replaced executing with running * SME review feedback incorporated * Minor feedback updated * Sandra's feedback incorporated * Filename renamed * File rename changes updated * Document title updated --------- Co-authored-by: prbasyal <prbasyal@amd.com>	2024-12-17 12:00:00 -05:00
Peter Park	f9dbc1f21f	add megatron training doc (#4159 ) * add megatron training doc update toc add images update formatting and wording formatting update formatting update conf.py update formatting update docker img tweak formatting Fix stuff fix mock-data/data-path add specific commit hash to checkout update docker pull tag fix docker run cmd and examples path fix docker cmd * wording words words * improve title	2024-12-16 13:37:35 -05:00
Jeffrey Novotny	04fdc08328	Change reference to kernel-mode GPU compute driver in ROCm (#4147 ) * Change reference to kernel-mode GPU compute driver in ROCm * More changes for kernel-mode terminology * Fix linting	2024-12-13 11:46:02 -05:00
Peter Park	b0722b3228	Add @hongxiayang updates to MI300X workload tuning guide (#4123 ) minor fixes to formatting fix spelling errors more spelling fixes quantization update fix format simplify wording in tunableops and format fix Apply suggestions from code review review feedback by Peter Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review addressing feedback Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review feedback again Co-authored-by: Peter Park <peter.park@amd.com> add hipblaslt yaml file figure feedback and minor formatting formatting update wordlist.txt remove outdated sentence regarding fsdp and rccl (cherry picked from commit 87fa9fd83a2e623f6cab4e69d65f49e3db0a45f6) update wordlist Co-authored-by: hongxyan <hongxyan@amd.com>	2024-12-06 12:10:57 -05:00
Peter Park	34dd7ce288	Add minor stylistic updates in release notes (#4097 )	2024-12-04 16:02:38 -05:00
Sam Wu	f77e2dd7a7	Sync develop branch (#4078 )	2024-12-03 15:18:51 -07:00

1 2 3

101 Commits