github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-02-12 15:34:58 -05:00

Author	SHA1	Message	Date
yugang-amd	830f2d5edf	Update for vllm -05/27 (#4886) * Update vLLM inference benchmark Docker page for rocm/vllm 5/27 * update repo for Pytorch	2025-06-05 13:30:20 -04:00
yugang-amd	53d3e092d3	Fix broken link (#4854)	2025-05-31 13:01:34 -04:00
Peter Park	2eb8bf4963	Fix typo in Megatron-LM Docker pull tags (#4829)	2025-05-28 15:18:00 -04:00
Peter Park	cebf0f5975	Add latest rocm/vllm Docker details in vLLM inference benchmark guide (#4824) * update rocm/vllm Docker details to latest release * Add previous vLLM version * fix 'further reading' xrefs * improve model grouping names * fix links * update model picker text	2025-05-28 14:20:18 -04:00
Peter Park	505041d90a	Document specs for Radeon RX 9070 + small fix in megatron-lm doc (#4780) * Document specs for Radeon RX 9070 * fix wrong version in megatron-lm.rst	2025-05-22 16:28:17 -04:00
Peter Park	9ed65a81c4	Add Megatron-LM benchmark doc 5/2 (#4778) * reorg files * add tabs * update template * update template * update wordlist and toc * add previous version to doc * add selector paragraph * update wordlist.txt	2025-05-22 14:28:18 -04:00
Peter Park	0a77e7b3a5	docs: Add system health check doc under ROCm for AI (#4736 ) * add initial draft * add to toc and install page * update wording * improve documentation structure * resturcture and expand content * add to training section * add to conf.py article_pages * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update wordlist.txt * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * inference --> AI workloads * udpate toc * update article_pages in conf.py * Update system validation notes in training docs * fix links in prerequisite-system-validation * wording * add note * consistency * remove extra files * fix links * add links to training index page --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-05-13 15:54:48 -04:00
Peter Park	90a651d2b6	Merge pull request #4725 from peterjunpark/docs/quark-model-quantization Add quark in model-quantization.rst	2025-05-08 10:34:39 -04:00
Peter Park	bb7af3351a	Fix incorrect throughput benchmark command in inference/vllm-benchmark.rst (#4723) * update inference index to include pyt inference * fix incorrect command in throughput benchmark * wording	2025-05-08 09:24:51 -04:00
Peter Park	186c281aba	fix links in pytorch-inference-benchmark.rst (#4713)	2025-05-06 13:34:55 -04:00
Peter Park	d44ea40a0d	Add MPT-30B + LLM Foundry doc (#4704 ) * add mpt-30b doc * add tunableop note * update MPT doc * add section * update wordlist * fix flash attention version * update "applies to" * address review feedback * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update docker details to pytorch-training-v25.5 * update --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-05-02 12:13:20 -04:00
Peter Park	7458fcb7ab	Update JAX MaxText benchmark doc to v25.5 (#4695) * fix shell cmd formatting * add previous versions section * update docker details and add llama 3.3 * update missed docker image tags to 25.5	2025-04-28 17:52:53 -04:00
Peter Park	16d6e59003	fix link to pytorch-training v25.4 doc (#4696)	2025-04-28 17:52:33 -04:00
Peter Park	a66bc1d85e	fix link to previous version in vllm-benchmark.rst (#4689)	2025-04-24 17:54:04 -04:00
Peter Park	36b6ffaf7c	Add QwQ 32B to vllm-benchmark.rst (#4685) * Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml * Add QwQ-32B-Preview to vllm-benchmark-models.yaml * add links to performance results words * change "performance validation" to "performance testing" * remove "-Preview" from QwQ-32B * move qwen2 MoE after qwen2 * add TunableOp section * fix formatting * add link to TunableOp doc * add tunableop note * fix vllm-benchmark template * remove cmdline option for --tunableop on * update docker details * remove "training" * remove qwen2	2025-04-24 16:44:34 -04:00
Peter Park	40e4ba3ecc	Update vLLM inference benchmark Docker guide (#4653) * Remove JAIS 13B and 30B * update Docker details - vLLM 0.8.3 * add previous version * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst * fix link to previous version	2025-04-24 15:59:13 -04:00
Peter Park	1f41ce26be	Add note for chai-1 benchmark Docker in pytorch-inference-benchmark.rst (#4684)	2025-04-24 15:48:53 -04:00
Peter Park	c3faa9670b	Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) (#4654 ) * update vLLM links in deploy-your-model.rst * add pytorch inference benchmark doc * update toc and vLLM title * remove previous versions * update * wording * fix link and "applies to" * add pytorch to wordlist * add tunableop note to clip * make tunableop note appear to all models * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * fix incorrect links * wording * fix wrong docker pull tag --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-04-23 17:35:52 -04:00
Peter Park	9ff3c2c885	Update PyTorch training Docker doc for 25.5 (#4638) * update pytorch-training to 25.5 * remove llama 2 * Revert "remove llama 2" This reverts commit dab672fa7bcbd8bff730382c14177df4301a537d. * add previous version * fix run cmd * add link to docker hub * fix linting issue * add Llama 3.3 70B * update	2025-04-15 18:16:22 -04:00
Peter Park	d057d49af1	Fix vllm Dockerfile.rocm path (#4628)	2025-04-15 11:26:54 -04:00
Peter Park	310864e653	fix link to Dockerfile.rocm (#4573)	2025-04-14 10:10:03 -04:00
Parag Bhandari	493585dfbb	Merge branch 'develop' of github.com:ROCm/ROCm into develop	2025-04-11 15:15:43 -04:00
Parag Bhandari	e756d99f65	Merge branch 'develop-internal' into develop	2025-04-11 15:15:19 -04:00
Pratik Basyal	686fcece1d	PRE GA Day 640 update for resetting link and HPC application list (#367) * Links reset to point to latest from stg, internal, RTD, and develop * ROCm for HPC updated * GA prep changes	2025-04-11 14:12:57 -05:00
pbhandar-amd	131e34f582	Update w6000-v620.md	2025-04-11 15:11:34 -04:00
Parag Bhandari	db3c46fccf	Merge branch 'develop-internal' into develop	2025-04-11 14:32:09 -04:00
Dominic Widdows	715cce53de	Update workload.rst with small export fix (#4425) Tiny fix that removes the "export" directive. `export HIP_FORCE_DEV_KERNARG=1 hipblaslt-bench ...` leads to "bash: export: `hipblaslt-bench': not a valid identifier", whereas just starting with HIP_FORCE_DEV_KERNARG=1 passes this env var to the hipblaslt-bench process, which I think is the intention here.	2025-04-03 13:01:26 -04:00
Peter Park	ea66bf386a	Fix more links in documentation (#4551) * fix vllm engine args link * remove RDNA subtree in under system optimization in toc * fix RDNA 2 architecture PDF link * fix CLR LICENSE.txt link * fix rocPyDecode license link	2025-04-01 15:56:34 -04:00
Peter Park	ac2c5e72d4	Fix links in documentation	2025-04-01 15:39:20 -04:00
Peter Park	424e6148bd	Add MaxText training Docker doc Add MaxText training Docker doc	2025-03-28 11:25:06 -04:00
Pratik Basyal	a0faccba37	AMD GPU Docs System optimization migration changes in ROCm Docs Develop (#4538) * AMD GPU Docs System optimization migration changes in ROCm Docs (#296) * System optimization migration changes in ROCm * Linting issue fixed * Linking corrected * Minor change * Link updated to Instinct.docs.amd.com * ROCm docs grid updated by removing IOMMU.rst, pcie-atomics, and oversubscription pages * Files removed and reference fixed * Reference text updated * GPU atomics from 6.4.0 removed	2025-03-27 16:38:10 -04:00
Pratik Basyal	544149631a	AMD GPU Docs System optimization migration changes in ROCm Docs (#296) * System optimization migration changes in ROCm * Linting issue fixed * Linking corrected * Minor change * Link updated to Instinct.docs.amd.com * ROCm docs grid updated by removing IOMMU.rst, pcie-atomics, and oversubscription pages * Files removed and reference fixed * Reference text updated	2025-03-26 10:01:33 -04:00
Peter Park	58d42ec50b	Improve "tuning guides" landing page (#4504) * Improve "tuning guides" landing page * Update docs/how-to/gpu-performance/mi300x.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update docs/how-to/gpu-performance/mi300x.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * change tuning to optimization --------- Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>	2025-03-25 13:54:27 -04:00
Peter Park	8f359da39e	Update Megatron-LM doc for 25.4 (#4520) * update megatron-lm doc * update 'previous versions' * add missing space * update docker pull tag * Update options and docker pull tag * Add performance measurements link to megatron-lm doc * fix previous versions table * words * Simplify system validation section * minor fixes * fix perv versions tbl	2025-03-21 16:49:55 -04:00
Peter Park	2fca094531	PyTorch training Docker update 25.4 (#4482) * remove orphan tag * add hugging face PEFT * update "previous versions" * data == ultrachat 200k * fix "llama 2" * add ultrachat to wordlist * fix previous versions table * add performance measurements * add mi325x * fix prev version * change 'validation' to 'testing * fix dir name * fix backtick	2025-03-13 13:40:00 -04:00
Peter Park	9b2ce2b634	Update vLLM performance Docker docs (#4491) * add links to performance results words * change "performance validation" to "performance testing" * update vLLM docker 3/11 * add previous versions add previous versions * fix llama 3.1 8b model repo name * words	2025-03-13 10:04:21 -04:00
Peter Park	29ba151b48	Fix "VGPR" typo in workload tuning guide (#4484) * Fix "VGPR" typo in workload tuning guide * fix wording	2025-03-12 10:28:35 -04:00
Pratik Basyal	9aad9ce7ef	Content for modprobe added to MI300X system optimization (#4434) Added content for modprobe	2025-03-07 14:52:20 -05:00
Peter Park	1fb42c2591	Update LLM inference performance validation on AMD Instinct MI300X guide to filter by desired model (#4424 ) * WIP (cherry picked from commit a06a5b5b959a9425e7384fb58b88c3716f380e48) rm unneeded files (cherry picked from commit f1d0c00056a83299bdea74a43cd17454999cf2d8) * add sphinxcontrib.datatemplates (cherry picked from commit d056b93a325d87b81f54f70c6eb4ae78f4fb0bc1) * add template (cherry picked from commit 0691d59f0a1efbda7908762b7a906e30a65c0ee1) fix template (cherry picked from commit 01e4bea5522aa5deeaade58c105ff850f449df8b) WIPO (cherry picked from commit 4d8daf7445e7be92cd9ee1d39dff564bd8de41f4) WIP (cherry picked from commit 9eefd1f5833bc4dc8de9d777ff65a5fe5f826dbd) update models yaml schema (cherry picked from commit a5f0fc1e6cc51104dc2d42029bfcf3eea276d270) add model groups functionality (cherry picked from commit 13f49f96dd3e5a160d37c52e48a4fbcccdcf4f9e) add selector headings and fix template (cherry picked from commit 35f7f2314bcf74b4fd0a8ca10aaabf0de7063bb0) update template (cherry picked from commit 9e2dcfe0c7f6e7c2c685866ea83375fbacbc5032) fix (cherry picked from commit be51e32791550ddc21785effccb889228394b242) use classes instead of data tags (cherry picked from commit cd52d68c504f7e7435d156ae70cf4bde1dfe703e) update template (cherry picked from commit 9ed89fee6874b39ee3535fbde54a0a59f346ea2b) clean up extra wip files (cherry picked from commit a9f965a104baa966c184054638e935b011526278) update wordlist (cherry picked from commit f783656814e896aedd21acd1c8c87b4700c14469) remove unused template (cherry picked from commit cac894bd9c2b1262c9c006e5fddbcb742dc6d882) improve script (cherry picked from commit ca20ffd4922916616e0924d625652a815f27c35f) fix template (cherry picked from commit 752c61fda856fd5b244734636c036c8877e823b9) fix standalone benchmark output path in template (cherry picked from commit d8c04203b5ec0f6c2e2307f7890304a3dc5687be) fix toc (cherry picked from commit 8df42faf53488ef29f5a263d25032f3d35cd58ed) update 
script to prevent flash of unstyled content import a11y (cherry picked from commit 46c852717f223a1d8744fab035807cebab4c5404) add tabindex to wordlist (cherry picked from commit 11492593f9692f5453045e7ec52c8f8ae9624ae9) text update script * remove unused config option * reorganize assets * fix linting warning * move js from data/ to extension/	2025-02-28 12:39:02 -05:00
Peter Park	1ea1c5c6e0	fix tab sync and nested tab Megatron-LM doc (#4409)	2025-02-21 17:19:48 -05:00
Peter Park	389fa7071b	Update docs on Megatron-LM and PyTorch training Dockers (#4407) * Update Megatron-LM and PyTorch Training Docker docs Also restructure TOC * Apply suggestions from code review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> update "start training" text Apply suggestions from code review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> update conf.py fix spacing fix branding issue add disable numa reorg remove extra text	2025-02-21 13:07:18 -05:00
Peter Park	618b44ed23	add vllm docker to release highlights (#306)	2025-02-13 12:01:08 -05:00
Peter Park	2751a17cf0	Update vLLM benchmarking guide (#4347 ) * update vllm-benchmark fix hlist overflow update standalone benchmarking options update list of models fix typo and model name unnecessary duplicate info update formatting update vllm benchmark guide - remove Llama 2 FP8 - add Jais 13B - update commands update docker pull tag update MAD available models remove extra mad models not relevant to vllm update PyTorch version add changelog add model names to .wordlist.txt * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * fix typo * update link * fix link text * change changelog to previous versions * fix typo * remove "for" --------- Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>	2025-02-05 17:18:35 -05:00
Pratik Basyal	f885b5df6e	Updated ROCm install on Linux installation method link (#4313)	2025-01-31 16:48:33 -05:00
Jeffrey Novotny	d401b5f152	Add ToC and index links to the AI Developer Tutorials (#4312) * Add ToC and index links to the AI Developer Tutorials * Change link positioning * Change wording	2025-01-29 14:43:32 -05:00
Pratik Basyal	353d2fe1c1	2nd POC for How to Use ROCm for AI (#282) (#4299) * New TOC for ROCm for AI developed Co-authored-by: Peter Park <peter.park@amd.com>	2025-01-27 15:49:21 -05:00
Peter Park	8dd99fe3a4	fix link to llama cookbook (#4269)	2025-01-17 14:53:36 -05:00
Adel Johar	7754fc4b9d	Docs: resolve warnings from sphinx build output	2025-01-16 14:36:47 +01:00
Peter Park	d534f755e4	Add metadata to docs (#3688) * add missing metadata add metadata to mi300 arch doc add metadata to contributing guide add metadata to mi300x tuning guides * update meta to yaml frontmatter * update to md metadata to myst frontmatter * remove extra file * fix spelling	2025-01-14 08:55:45 -05:00
Peter Park	26553d725b	Add TensorFlow compatibility docs (#4247 ) * Add Tensorflow * WIP * WIP * minor fmt * PR feedbacks * fix missed inconsistent formatting * WIP WIP WIP WIP * minor formatting update tensorflow-rocm docker images to rocm6.3.1 fix urls * WIP * fix typo and update wordlist * fix tables not rendering * fix table headings * add period * update tf dockers * fix link * fix link * wording * update historical compat * fix tensile link --------- Co-authored-by: Mátyás Aradi <matyas@streamhpc.com> Co-authored-by: Istvan Kiss <neon60@gmail.com>	2025-01-09 14:24:58 -05:00

141 Commits