Peter Park
7380c89985
docs: Add system health check doc under ROCm for AI ( #4736 )
...
* add initial draft
* add to toc and install page
* update wording
* improve documentation structure
* restructure and expand content
* add to training section
* add to conf.py article_pages
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* update wordlist.txt
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* inference --> AI workloads
* update toc
* update article_pages in conf.py
* Update system validation notes in training docs
* fix links in prerequisite-system-validation
* wording
* add note
* consistency
* remove extra files
* fix links
* add links to training index page
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
(cherry picked from commit 0a77e7b3a5 )
2025-05-13 15:55:36 -04:00
Istvan Kiss
165ea54e12
Jax and PyTorch compatibility page update 6.4 ( #4732 )
...
* JAX compatibility page update (#4727 )
* Fix compatibility list (#4731 )
* PyTorch compatibility page update
* Fix unsupported section structure on JAX (#4733 )
2025-05-13 18:24:19 +02:00
Peter Park
065d1cdc95
Merge pull request #4725 from peterjunpark/docs/quark-model-quantization
...
Add quark in model-quantization.rst
(cherry picked from commit 90a651d2b6 )
2025-05-08 10:35:33 -04:00
Peter Park
5b859352b2
Merge pull request #4724 from peterjunpark/docs/6.4.0
...
[docs/6.4.0] Fix incorrect throughput benchmark command in inference/vllm-benchmar…
2025-05-08 09:31:38 -04:00
Peter Park
f15a1e830e
Fix incorrect throughput benchmark command in inference/vllm-benchmark.rst ( #4723 )
...
* update inference index to include pyt inference
* fix incorrect command in throughput benchmark
* wording
(cherry picked from commit bb7af3351a )
2025-05-08 09:27:44 -04:00
Pratik Basyal
a2628dce5d
rocSHMEM component added to ROCm 6.4.0 documentation ( #4719 ) ( #4720 )
...
* rocSHMEM added to ROCm 640
* Space removed
* link fixed
2025-05-07 15:42:38 -04:00
Peter Park
e0098d0668
fix links in pytorch-inference-benchmark.rst ( #4713 )
...
(cherry picked from commit 186c281aba )
2025-05-06 15:27:17 -04:00
Peter Park
71cffa9681
fix dynamic urls in toc.yml.in
2025-05-06 15:27:17 -04:00
Peter Park
94337a9887
Add MPT-30B + LLM Foundry doc ( #4704 )
...
* add mpt-30b doc
* add tunableop note
* update MPT doc
* add section
* update wordlist
* fix flash attention version
* update "applies to"
* address review feedback
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* update docker details to pytorch-training-v25.5
* update
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
(cherry picked from commit d44ea40a0d )
2025-05-02 12:13:56 -04:00
Peter Park
18d98ca692
Update vLLM docker pull tag 20250415 in vllm-benchmark.rst ( #4702 )
...
(cherry picked from commit 85778177a1 )
2025-04-30 16:10:27 -04:00
Peter Park
c8144c4a60
Update JAX MaxText benchmark doc to v25.5 ( #4695 )
...
* fix shell cmd formatting
* add previous versions section
* update docker details and add llama 3.3
* update missed docker image tags to 25.5
(cherry picked from commit 7458fcb7ab )
2025-04-28 17:53:37 -04:00
Peter Park
ed45d6add9
fix link to pytorch-training v25.4 doc ( #4696 )
...
(cherry picked from commit 16d6e59003 )
2025-04-28 17:53:37 -04:00
Peter Park
4f86b2801a
Update vLLM inference benchmark Docker guide ( #4653 )
...
* Remove JAIS 13B and 30B
* update Docker details - vLLM 0.8.3
* add previous version
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
* fix link to previous version
(cherry picked from commit 40e4ba3ecc )
2025-04-24 17:57:05 -04:00
Peter Park
9c07ed1726
fix link to previous version in vllm-benchmark.rst ( #4689 )
...
(cherry picked from commit a66bc1d85e )
2025-04-24 17:54:30 -04:00
Peter Park
34ca259220
Add QwQ 32B to vllm-benchmark.rst ( #4685 )
...
* Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml
* Add QwQ-32B-Preview to vllm-benchmark-models.yaml
* add links to performance results
words
* change "performance validation" to "performance testing"
* remove "-Preview" from QwQ-32B
* move qwen2 MoE after qwen2
* add TunableOp section
* fix formatting
* add link to TunableOp doc
* add tunableop note
* fix vllm-benchmark template
* remove cmdline option for --tunableop on
* update docker details
* remove "training"
* remove qwen2
(cherry picked from commit 36b6ffaf7c )
2025-04-24 16:46:48 -04:00
Peter Park
d04443ac13
Add note for chai-1 benchmark Docker in pytorch-inference-benchmark.rst ( #4684 )
...
(cherry picked from commit 1f41ce26be )
2025-04-24 16:45:33 -04:00
Peter Park
311b4cd62b
Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) ( #4654 )
...
* update vLLM links in deploy-your-model.rst
* add pytorch inference benchmark doc
* update toc and vLLM title
* remove previous versions
* update
* wording
* fix link and "applies to"
* add pytorch to wordlist
* add tunableop note to clip
* make tunableop note appear to all models
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* fix incorrect links
* wording
* fix wrong docker pull tag
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
(cherry picked from commit c3faa9670b )
2025-04-23 17:36:25 -04:00
Peter Park
d2ccd706a5
Update ML framework Docker compatibility docs for 6.4.0 ( #4667 )
...
* update pytorch-compatibility.rst
* update tensorflow compat
fix
* update jax and jax-community docker versions
(cherry picked from commit b29b3592bd )
2025-04-22 16:17:24 -04:00
Peter Park
699f668a2b
fix link to Dockerfile.rocm ( #4573 )
...
(cherry picked from commit 310864e653 )
2025-04-22 14:09:35 -04:00
Pratik Basyal
3bc09b6faa
615 column added to historical compatibility matrix in ROCm 640 ( #4655 )
...
* 6.1.5 column added and broken link fixed
2025-04-17 11:50:32 -04:00
Peter Park
824d760646
Update PyTorch training Docker doc for 25.5 ( #4638 )
...
* update pytorch-training to 25.5
* remove llama 2
* Revert "remove llama 2"
This reverts commit dab672fa7bcbd8bff730382c14177df4301a537d.
* add previous version
* fix run cmd
* add link to docker hub
* fix linting issue
* add Llama 3.3 70B
* update
(cherry picked from commit 9ff3c2c885 )
2025-04-15 18:17:06 -04:00
Peter Park
cb412a7a7f
Fix vllm Dockerfile.rocm path ( #4628 )
...
(cherry picked from commit d057d49af1 )
2025-04-15 11:28:09 -04:00
Peter Park
d1b426f2d0
Update KMD versions in compat matrix ( #4594 )
...
* update KMD versions in compat matrix
* update historical compat matrix
(cherry picked from commit 656db2bc84 )
2025-04-11 16:49:12 -04:00
Pratik Basyal
639e2dc232
Release notes Link update 640 branch ( #4593 )
...
* Link update (#4591 )
* Date updated
2025-04-11 16:26:26 -04:00
Parag Bhandari
5104389ab3
Merge branch 'develop' into docs/6.4.0
2025-04-11 15:15:54 -04:00
Parag Bhandari
493585dfbb
Merge branch 'develop' of github.com:ROCm/ROCm into develop
2025-04-11 15:15:43 -04:00
Parag Bhandari
e756d99f65
Merge branch 'develop-internal' into develop
2025-04-11 15:15:19 -04:00
Pratik Basyal
686fcece1d
PRE GA Day 640 update for resetting link and HPC application list ( #367 )
...
* Links reset to point to latest from stg, internal, RTD, and develop
* ROCm for HPC updated
* GA prep changes
2025-04-11 14:12:57 -05:00
pbhandar-amd
131e34f582
Update w6000-v620.md
2025-04-11 15:11:34 -04:00
Parag Bhandari
6b71afe8a2
Merge branch 'develop' into docs/6.4.0
2025-04-11 14:36:57 -04:00
Parag Bhandari
db3c46fccf
Merge branch 'develop-internal' into develop
2025-04-11 14:32:09 -04:00
pbhandar-amd
7d5ea2f2f9
Update versions.md
2025-04-11 13:16:06 -04:00
pbhandar-amd
18abbbda11
Update versions.md
2025-04-11 13:15:53 -04:00
pbhandar-amd
d2c914d477
Update documentation requirements
2025-04-11 10:28:37 -04:00
Peter Park
03137e1146
Remove "preview support" for PyT 2.6 ( #368 )
...
* remove pytorch 2.6 preview support note
* update pytorch support release note
2025-04-11 09:12:41 -04:00
Peter Park
8a24176528
Update Thrust and CUB versions for 6.4 + fix compatibility table not displaying ( #364 )
...
* Update Thrust and CUB versions
* fix whitespace issue causing build error
* fix onnx runtime ver
2025-04-10 13:38:48 -04:00
Pratik Basyal
1e231b4b28
640 RN known issues batch 4 ( #365 )
...
* ROCProfiler deprecation notice updated
* RHEL 9.6 support removed and 9.5 EOS rejected
* Feedback to KV cache highlight added
* Wrong entry of ROCprofiler-SDK removed
* Additional known issues added
* GA Release date updated
* Consolidated changelog sync
2025-04-10 09:05:34 -04:00
Pratik Basyal
c26f470c8a
6.4.0 Known issues update to RN batch 3 ( #362 )
...
* ROCProfiler deprecation notice updated
* RHEL 9.6 support removed and 9.5 EOS rejected
* Feedback to KV cache highlight added
* Wrong entry of ROCprofiler-SDK removed
* ROCm debugger known issues added
* JAX known issues added
* Ordering fixed
* Compute partition known issues added
* TP sizes known issues added
* Highlight and compatibility matrix updated
* ONNX auto-update corrected
* ROCm systems profiler known issues removed
* Title update
2025-04-09 10:14:14 -04:00
Istvan Kiss
13bd184ec3
Add RDNA4 ISA guide
2025-04-08 13:57:32 +02:00
Istvan Kiss
6c7f167650
Fix broken torchserve link
2025-04-07 16:07:31 +02:00
dependabot[bot]
defb276d93
Build(deps): Bump rocm-docs-core from 1.18.1 to 1.18.2 in /docs/sphinx ( #4556 )
...
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core ) from 1.18.1 to 1.18.2.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases )
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md )
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.18.1...v1.18.2 )
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-version: 1.18.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-03 17:02:06 -06:00
Peter Park
fdf24a9c40
fix link to CLR license ( #4560 )
2025-04-03 13:09:59 -04:00
Dominic Widdows
715cce53de
Update workload.rst with small export fix ( #4425 )
...
Tiny fix that removes the "export" directive.
`export HIP_FORCE_DEV_KERNARG=1 hipblaslt-bench ...`
leads to
bash: export: `hipblaslt-bench': not a valid identifier
whereas just starting with HIP_FORCE_DEV_KERNARG=1 passes this env var to the hipblaslt-bench process, which I think is the intention here.
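The difference described above can be sketched in a few lines of shell. This is a minimal illustration, not the command from workload.rst: `sh -c` stands in for `hipblaslt-bench` (an assumption, so the sketch runs without ROCm installed), and `not-a-variable` is a hypothetical hyphenated word playing the role of the program name.

```shell
# Start clean so the caller's environment does not affect the demonstration.
unset HIP_FORCE_DEV_KERNARG

# Prefix assignment: the variable exists only in the environment of this one command.
# `sh -c ...` is a stand-in for hipblaslt-bench (assumption for illustration).
child_value=$(HIP_FORCE_DEV_KERNARG=1 sh -c 'printf %s "$HIP_FORCE_DEV_KERNARG"')
echo "child process saw: $child_value"

# The calling shell itself is left untouched.
echo "calling shell sees: ${HIP_FORCE_DEV_KERNARG:-unset}"

# With `export`, every following word is treated as a variable name, so a word
# containing a hyphen is rejected. The subshell keeps the failure from ending this script.
export_status=0
( export HIP_FORCE_DEV_KERNARG=1 not-a-variable ) 2>/dev/null || export_status=$?
echo "export exit status: $export_status"
```

The prefix form is usually the better fit for a one-off benchmark run, since the setting does not leak into later commands in the same shell session.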
2025-04-03 13:01:26 -04:00
Jeffrey Novotny
c71201b801
Add Radeon PRO W7800 48GB to GPU hardware specs ( #356 )
...
* Add Radeon PRO W7800 48GB to GPU hardware specs
* Adjust row order
2025-04-01 16:44:56 -04:00
Peter Park
ea66bf386a
Fix more links in documentation ( #4551 )
...
* fix vllm engine args link
* remove RDNA subtree under system optimization in toc
* fix RDNA 2 architecture PDF link
* fix CLR LICENSE.txt link
* fix rocPyDecode license link
2025-04-01 15:56:34 -04:00
Peter Park
ac2c5e72d4
Fix links in documentation
2025-04-01 15:39:20 -04:00
Peter Park
53eb4f6edb
Change AMD SMI ver to 25.3.0 from 25.2.0 ( #345 )
2025-04-01 13:02:27 -04:00
amitkumar-amd
b178a7ca78
Update the TOC ( #355 )
...
* remove 1200
* update link on TOC
* Update docs/sphinx/_toc.yml.in
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
---------
Co-authored-by: Pratik Basyal <prbasyal@amd.com >
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
2025-03-28 15:59:27 -05:00
Peter Park
424e6148bd
Add MaxText training Docker doc
...
Add MaxText training Docker doc
2025-03-28 11:25:06 -04:00
Peter Park
15aca4be9d
Fix ML framework compatible versions for 6.4 ( #347 )
...
* Fix ML framework compatible versions for 6.4
* add footnote to historical compat matrix
2025-03-28 10:55:36 -04:00