Peter Park
98fde2bff1
Add RDNA4 OS support note in RELEASE.md and compat matrix ( #4764 )
...
* fix vllm link in release.md
* add RDNA4 note in compat matrix
* update hipcc github url to specific path in llvm-project repo
* remove non-existent HIP upcoming changes reference
* remove non-existent resolved issues internal link
* fix hip upcoming changes url
* duplicate amd smi known issue
2025-05-21 14:23:48 -04:00
Peter Park
0e8b745266
Fix toc ( #4762 )
2025-05-21 12:26:30 -04:00
Alex Xu
58a62bc00e
Merge remote-tracking branch 'external/develop' into sync-develop-from-external
2025-05-21 11:16:31 -04:00
Peter Park
8dc7016405
Add Radeon AI PRO R9700, Radeon RX 9070 XT, RX 9060 XT to gpu-arch-specs ( #411 )
...
* add Radeon AI PRO R9700, Radeon RX 9070 XT, Radeon RX 9060 XT to gpu-arch-specs.rst
* update compat matrices
* fix spacing in historical compat csv file
2025-05-21 11:04:46 -04:00
alexxu-amd
ddcad120a2
Update versions.md
2025-05-21 09:52:05 -04:00
Peter Park
ca5d0d0000
[6.4.1] update llvm-project version and add RCCL known issue ( #401 )
...
* update llvm-project version
* add RCCL known issue
2025-05-15 16:20:59 -04:00
Peter Park
0a77e7b3a5
docs: Add system health check doc under ROCm for AI ( #4736 )
...
* add initial draft
* add to toc and install page
* update wording
* improve documentation structure
* restructure and expand content
* add to training section
* add to conf.py article_pages
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* update wordlist.txt
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* inference --> AI workloads
* update toc
* update article_pages in conf.py
* Update system validation notes in training docs
* fix links in prerequisite-system-validation
* wording
* add note
* consistency
* remove extra files
* fix links
* add links to training index page
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2025-05-13 15:54:48 -04:00
Istvan Kiss
d1772b9ca3
Fix unsupported section structure on JAX ( #4733 )
2025-05-13 17:39:25 +02:00
Istvan Kiss
f65e1412df
Fix compatibility list ( #4731 )
2025-05-13 16:26:36 +02:00
Istvan Kiss
ea1072b11d
JAX compatibility page update ( #4727 )
2025-05-08 19:31:13 +02:00
Peter Park
90a651d2b6
Merge pull request #4725 from peterjunpark/docs/quark-model-quantization
...
Add quark in model-quantization.rst
2025-05-08 10:34:39 -04:00
Peter Park
bb7af3351a
Fix incorrect throughput benchmark command in inference/vllm-benchmark.rst ( #4723 )
...
* update inference index to include pyt inference
* fix incorrect command in throughput benchmark
* wording
2025-05-08 09:24:51 -04:00
Wei Luo
d1debc7e45
[doc]: Add quark in model-quantization.rst ( #374 )
...
* Add quark in model-quantization.rst
---------
Co-authored-by: Peter Park <peter.park@amd.com>
Co-authored-by: Peter Park <git@peterjunpark.com>
2025-05-08 14:28:51 +08:00
Pratik Basyal
8ef1bb0139
rocSHMEM component added to ROCm 6.4.0 documentation ( #4719 )
...
* rocSHMEM added to ROCm 640
* Space removed
* link fixed
2025-05-07 15:31:38 -04:00
Pratik Basyal
169f3bbe5e
641 Release notes update post RC2 batch1 ( #387 )
...
* Release highlight updated
* TOC updated for internal
* RC3 manifest added
* clarify docker image highlight
* update doc highlights
* RC3 changes added
* RC3 manifest added
* ROCm SMI version update
---------
Co-authored-by: Peter Park <peter.park@amd.com>
2025-05-06 15:07:54 -04:00
Peter Park
186c281aba
fix links in pytorch-inference-benchmark.rst ( #4713 )
2025-05-06 13:34:55 -04:00
Pratik Basyal
e28eac2fe1
License typo fixed ( #384 )
2025-05-02 12:37:08 -04:00
Peter Park
d44ea40a0d
Add MPT-30B + LLM Foundry doc ( #4704 )
...
* add mpt-30b doc
* add tunableop note
* update MPT doc
* add section
* update wordlist
* fix flash attention version
* update "applies to"
* address review feedback
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* update docker details to pytorch-training-v25.5
* update
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2025-05-02 12:13:20 -04:00
Pratik Basyal
217fb452f8
Initial changes to 6.4.1 RN ( #379 )
...
* Initial changes added
* Changelogs for RCCL, hipblaslt, compute profiler, and systems added
* 6.4.0 GA manifest
* 6.4.1 RC1 manifest
* RC2 Manifest added
* Update RELEASE.md
Add CLR Changelog entry for HIP 6.4.1
* Release highlight added
* AMD SMI changelog added
* ROCr runtime changelog added
* RCCL resolved issue added
* Minor change
* Minor fixes
* Quick changes to version
* Offline installer update
* Installation updated
* added rocALUTION to release notes
* Updated changelogs for components
* Changes to changelog
* Update RELEASE.md
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
* Update RELEASE.md
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
* rocSHMEM related changes added
* Changelog updated with new changes
* Heading level fixed
* AMD SMI version bumped to 25.4.0
* Reordered
* Table zebra pattern updated
* Consolidated updated
* Zebra pattern aligned
* Add ROCm SMI changes to 6.4.1
* Update CHANGELOG.md
Co-authored-by: Pratik Basyal <prbasyal@amd.com>
* update doc highlights
* Link to rocSHMEM
* update
* Minor changes
* Changelog feedback updated
---------
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com>
Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com>
Co-authored-by: Peter Park <peter.park@amd.com>
2025-05-01 13:54:31 -04:00
Peter Park
85778177a1
Update vLLM docker pull tag 20250415 in vllm-benchmark.rst ( #4702 )
2025-04-30 16:09:30 -04:00
Istvan Kiss
84177354de
PyTorch compatibility page update
2025-04-29 14:43:40 +02:00
Peter Park
7458fcb7ab
Update JAX MaxText benchmark doc to v25.5 ( #4695 )
...
* fix shell cmd formatting
* add previous versions section
* update docker details and add llama 3.3
* update missed docker image tags to 25.5
2025-04-28 17:52:53 -04:00
Peter Park
16d6e59003
fix link to pytorch-training v25.4 doc ( #4696 )
2025-04-28 17:52:33 -04:00
Peter Park
a66bc1d85e
fix link to previous version in vllm-benchmark.rst ( #4689 )
2025-04-24 17:54:04 -04:00
Peter Park
36b6ffaf7c
Add QwQ 32B to vllm-benchmark.rst ( #4685 )
...
* Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml
* Add QwQ-32B-Preview to vllm-benchmark-models.yaml
* add links to performance results
words
* change "performance validation" to "performance testing"
* remove "-Preview" from QwQ-32B
* move qwen2 MoE after qwen2
* add TunableOp section
* fix formatting
* add link to TunableOp doc
* add tunableop note
* fix vllm-benchmark template
* remove cmdline option for --tunableop on
* update docker details
* remove "training"
* remove qwen2
2025-04-24 16:44:34 -04:00
Peter Park
40e4ba3ecc
Update vLLM inference benchmark Docker guide ( #4653 )
...
* Remove JAIS 13B and 30B
* update Docker details - vLLM 0.8.3
* add previous version
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
* fix link to previous version
2025-04-24 15:59:13 -04:00
Peter Park
1f41ce26be
Add note for chai-1 benchmark Docker in pytorch-inference-benchmark.rst ( #4684 )
2025-04-24 15:48:53 -04:00
Peter Park
c3faa9670b
Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) ( #4654 )
...
* update vLLM links in deploy-your-model.rst
* add pytorch inference benchmark doc
* update toc and vLLM title
* remove previous versions
* update
* wording
* fix link and "applies to"
* add pytorch to wordlist
* add tunableop note to clip
* make tunableop note appear to all models
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
* fix incorrect links
* wording
* fix wrong docker pull tag
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2025-04-23 17:35:52 -04:00
Peter Park
b29b3592bd
Update ML framework Docker compatibility docs for 6.4.0 ( #4667 )
...
* update pytorch-compatibility.rst
* update tensorflow compat
fix
* update jax and jax-community docker versions
2025-04-22 16:16:16 -04:00
Pratik Basyal
fc162d11e0
6.1.5 column added to historical compatibility develop branch ( #4648 )
...
* 6.1.5 column added to historical compatibility
2025-04-17 11:55:32 -04:00
Peter Park
9ff3c2c885
Update PyTorch training Docker doc for 25.5 ( #4638 )
...
* update pytorch-training to 25.5
* remove llama 2
* Revert "remove llama 2"
This reverts commit dab672fa7bcbd8bff730382c14177df4301a537d.
* add previous version
* fix run cmd
* add link to docker hub
* fix linting issue
* add Llama 3.3 70B
* update
2025-04-15 18:16:22 -04:00
Peter Park
d057d49af1
Fix vllm Dockerfile.rocm path ( #4628 )
2025-04-15 11:26:54 -04:00
Peter Park
310864e653
fix link to Dockerfile.rocm ( #4573 )
2025-04-14 10:10:03 -04:00
Pratik Basyal
af18a170bc
Blog link update to 6.4.0 release notes #4596
...
Blog link update to 6.4.0 release notes
2025-04-11 17:48:42 -04:00
Peter Park
656db2bc84
Update KMD versions in compat matrix ( #4594 )
...
* update KMD versions in compat matrix
* update historical compat matrix
2025-04-11 16:48:21 -04:00
Parag Bhandari
493585dfbb
Merge branch 'develop' of github.com:ROCm/ROCm into develop
2025-04-11 15:15:43 -04:00
Parag Bhandari
e756d99f65
Merge branch 'develop-internal' into develop
2025-04-11 15:15:19 -04:00
Pratik Basyal
686fcece1d
PRE GA Day 640 update for resetting link and HPC application list ( #367 )
...
* Links reset to point to latest from stg, internal, RTD, and develop
* ROCm for HPC updated
* GA prep changes
2025-04-11 14:12:57 -05:00
pbhandar-amd
131e34f582
Update w6000-v620.md
2025-04-11 15:11:34 -04:00
Parag Bhandari
db3c46fccf
Merge branch 'develop-internal' into develop
2025-04-11 14:32:09 -04:00
pbhandar-amd
7d5ea2f2f9
Update versions.md
2025-04-11 13:16:06 -04:00
pbhandar-amd
18abbbda11
Update versions.md
2025-04-11 13:15:53 -04:00
Peter Park
03137e1146
Remove "preview support" for PyT 2.6 ( #368 )
...
* remove pytorch 2.6 preview support note
* update pytorch support release note
2025-04-11 09:12:41 -04:00
Peter Park
8a24176528
Update Thrust and CUB versions for 6.4 + fix compatibility table not displaying ( #364 )
...
* Update Thrust and CUB versions
* fix whitespace issue causing build error
* fix onnx runtime ver
2025-04-10 13:38:48 -04:00
Pratik Basyal
1e231b4b28
640 RN known issues batch 4 ( #365 )
...
* ROCProfiler deprecation notice updated
* RHEL 9.6 support removed and 9.5 EOS rejected
* Feedback to KV cache highlight added
* Wrong entry of ROCprofiler-SDK removed
* Additional known issues added
* GA Release date updated
* Consolidated changelog sync
2025-04-10 09:05:34 -04:00
Pratik Basyal
c26f470c8a
6.4.0 Known issues update to RN batch 3 ( #362 )
...
* ROCProfiler deprecation notice updated
* RHEL 9.6 support removed and 9.5 EOS rejected
* Feedback to KV cache highlight added
* Wrong entry of ROCprofiler-SDK removed
* ROCm debugger known issues added
* JAX known issues added
* Ordering fixed
* Compute partition known issues added
* TP sizes known issues added
* Highlight and compatibility matrix updated
* ONNX auto-update corrected
* ROCm systems profiler known issues removed
* Title update
2025-04-09 10:14:14 -04:00
Istvan Kiss
13bd184ec3
Add RDNA4 ISA guide
2025-04-08 13:57:32 +02:00
Istvan Kiss
6c7f167650
Fix broken torchserve link
2025-04-07 16:07:31 +02:00
dependabot[bot]
defb276d93
Build(deps): Bump rocm-docs-core from 1.18.1 to 1.18.2 in /docs/sphinx ( #4556 )
...
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.18.1 to 1.18.2.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.18.1...v1.18.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-version: 1.18.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-03 17:02:06 -06:00
Peter Park
fdf24a9c40
fix link to CLR license ( #4560 )
2025-04-03 13:09:59 -04:00