yugang-amd
404e91f2d9
Update compatibility-matrix.rst ( #4860 )
2025-05-30 17:50:33 -04:00
alexxu-amd
50cfc538ff
Change viewer link from latest to mainline in what-is-rocm page ( #4856 )
...
* change viewer link from latest to mainline
* correct format
(cherry picked from commit c1919faccd )
2025-05-30 17:18:40 -04:00
Swati Rawat
a9c323e596
Docs: Add rocprof-compute-viewer ( #4850 )
...
* Docs: Add rocprof-compute-viewer
* update requirements.txt
---------
Co-authored-by: Alex Xu <alex.xu@amd.com >
(cherry picked from commit 6142df329b )
2025-05-30 15:22:51 -04:00
Peter Park
7a81d10c1d
Add RHEL 9.6 to compat matrix ( #4839 )
...
* add RHEL 9.6 to compat matrix
* add os support note
(cherry picked from commit 2addcb0bca )
2025-05-30 14:57:24 -04:00
yugang-amd
00f74d2d8e
Add microsoft/phi-4 vllm-benchmark-models ( #4801 ) ( #4847 )
...
* add Phi-4 to vllm-benchmark-models.yaml
fix model_repo
* update model group names
Co-authored-by: Peter Park <peter.park@amd.com >
2025-05-30 09:20:17 -04:00
Peter Park
4963eeab00
Update ML framework Docker inventories for 6.4.1 ( #4841 )
...
* Update tensorflow Docker compatibility table
* update jax Docker compatibility table
* fix py versions
* update pytorch Docker compatibility table
(cherry picked from commit 93fd0ef1d4 )
2025-05-29 18:34:47 -04:00
Peter Park
7c25ce240b
Add Falcon-180B to vLLM benchmark Docker doc ( #4836 )
...
* add Falcon to vllm-benchmark-models.yaml
* update group name
(cherry picked from commit daf2e980d9 )
2025-05-29 18:34:47 -04:00
Peter Park
fdeaacd3cc
fix megatron-lm pull tags
2025-05-28 15:12:50 -04:00
Peter Park
8e61ba4f90
Fix rocm/vllm pull tag
...
fix
2025-05-28 14:42:35 -04:00
Peter Park
94ee445a8a
Add latest rocm/vllm Docker details in vLLM inference benchmark guide ( #4824 )
...
* update rocm/vllm Docker details to latest release
* Add previous vLLM version
* fix 'further reading' xrefs
* improve model grouping names
* fix links
* update model picker text
(cherry picked from commit cebf0f5975 )
2025-05-28 14:23:05 -04:00
Peter Park
2e5fe544a0
Add RDNA4 RX 9070 GRE to gpu-arch-specs.rst and RELEASE.md ( #4820 )
...
(cherry picked from commit 0acb457389 )
2025-05-28 10:21:50 -04:00
yugang-amd
4dae0ba84d
Update SGPR for RDNA3 and RDNA2 series ( #4815 )
2025-05-27 15:13:22 -04:00
yugang-amd
5ddab465c3
Bump up requirement version ( #4805 )
...
* bump up requirement version
* update requirements.txt
* Use Python 3.10
2025-05-27 11:08:55 -04:00
yugang-amd
151e563dcb
Merge pull request #4792 from yugang-amd/wavefront-size-6-4-1
...
Update wavefront size
2025-05-26 14:56:38 -04:00
yugang-amd
ae1a330fd7
fix links
2025-05-26 14:35:36 -04:00
yugang-amd
cab805674a
update wavefront size
...
(cherry picked from commit 230b01565f )
2025-05-26 13:56:14 -04:00
yugang-amd
387cfab91f
fix typo
2025-05-26 12:53:18 -04:00
yugang-amd
525703a5ab
update wavefront size
2025-05-22 17:41:36 -04:00
Peter Park
6d2b1595b3
Document specs for Radeon RX 9070 + small fix in megatron-lm doc ( #4780 )
...
* Document specs for Radeon RX 9070
* fix wrong version in megatron-lm.rst
(cherry picked from commit 505041d90a )
2025-05-22 16:30:56 -04:00
yugang-amd
31e9013bdc
update rocSHMEM xrefs
...
(cherry picked from commit 7697298f5d )
2025-05-22 15:19:09 -04:00
Peter Park
9b69755b99
Add Megatron-LM benchmark doc 5/2 ( #4778 )
...
* reorg files
* add tabs
* update template
* update template
* update wordlist and toc
* add previous version to doc
* add selector paragraph
* update wordlist.txt
(cherry picked from commit 9ed65a81c4 )
2025-05-22 14:29:40 -04:00
Peter Park
4f80043312
fix 9070 XT gfx target in gpu-arch-specs table ( #4775 )
...
(cherry picked from commit 6d9f430c70 )
2025-05-22 12:12:14 -04:00
Peter Park
98fde2bff1
Add RDNA4 OS support note in RELEASE.md and compat matrix ( #4764 )
...
* fix vllm link in release.md
* add RDNA4 note in compat matrix
* update hipcc github url to specific path in llvm-project repo
* remove non-existant HIP upcoming changes reference
* remove non-existant resolved issues internal link
* fix hip upcoming changes url
* duplicate amd smi known issue
2025-05-21 14:23:48 -04:00
Peter Park
0e8b745266
Fix toc ( #4762 )
2025-05-21 12:26:30 -04:00
Alex Xu
58a62bc00e
Merge remote-tracking branch 'external/develop' into sync-develop-from-external
2025-05-21 11:16:31 -04:00
Peter Park
8dc7016405
Add Radeon AI PRO R9700, Radeon RX 9070 XT, RX 9060 XT to gpu-arch-specs ( #411 )
...
* add Radeon AI PRO R7900, Radeon RX 9070 XT, Radeon RX 9060 XT to gpu-arch-specs.rst
* update compat matrices
* fix spacing in historical compat csv file
2025-05-21 11:04:46 -04:00
alexxu-amd
ddcad120a2
Update versions.md
2025-05-21 09:52:05 -04:00
Peter Park
ca5d0d0000
[6.4.1] update llvm-project version and add RCCL known issue ( #401 )
...
* update llvm-project version
* add RCCL known issue
2025-05-15 16:20:59 -04:00
Peter Park
0a77e7b3a5
docs: Add system health check doc under ROCm for AI ( #4736 )
...
* add initial draft
* add to toc and install page
* update wording
* improve documentation structure
* resturcture and expand content
* add to training section
* add to conf.py article_pages
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* update wordlist.txt
* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* inference --> AI workloads
* udpate toc
* update article_pages in conf.py
* Update system validation notes in training docs
* fix links in prerequisite-system-validation
* wording
* add note
* consistency
* remove extra files
* fix links
* add links to training index page
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-05-13 15:54:48 -04:00
Istvan Kiss
d1772b9ca3
Fix unsupported section structure on JAX ( #4733 )
2025-05-13 17:39:25 +02:00
Istvan Kiss
f65e1412df
Fix compatibility list ( #4731 )
2025-05-13 16:26:36 +02:00
Istvan Kiss
ea1072b11d
JAX compatibility page upate ( #4727 )
2025-05-08 19:31:13 +02:00
Peter Park
90a651d2b6
Merge pull request #4725 from peterjunpark/docs/quark-model-quantization
...
Add quark in model-quantization.rst
2025-05-08 10:34:39 -04:00
Peter Park
bb7af3351a
Fix incorrect throughput benchmark command in inference/vllm-benchmark.rst ( #4723 )
...
* update inference index to include pyt inference
* fix incorrect command in throughput benchmark
* wording
2025-05-08 09:24:51 -04:00
Wei Luo
d1debc7e45
[doc]: Add quark in model-quantization.rst ( #374 )
...
* Add quark in model-quantization.rst
---------
Co-authored-by: Peter Park <peter.park@amd.com >
Co-authored-by: Peter Park <git@peterjunpark.com >
2025-05-08 14:28:51 +08:00
Pratik Basyal
8ef1bb0139
rocSHMEM component added to ROCm 6.4.0 documentation ( #4719 )
...
* rocSHMEM added to ROCm 640
* Space removed
* link fixed
2025-05-07 15:31:38 -04:00
Pratik Basyal
169f3bbe5e
641 Release notes update post RC2 batch1 ( #387 )
...
* Release highlight updated
* TOC updated for internal
* RC3 manifest added
* clarify docker image highlight
* update doc highlights
* RC3 changes added
* RC3 manifest added
* ROCm SMI version update
---------
Co-authored-by: Peter Park <peter.park@amd.com >
2025-05-06 15:07:54 -04:00
Peter Park
186c281aba
fix links in pytorch-inference-benchmark.rst ( #4713 )
2025-05-06 13:34:55 -04:00
Pratik Basyal
e28eac2fe1
License typo fixed ( #384 )
2025-05-02 12:37:08 -04:00
Peter Park
d44ea40a0d
Add MPT-30B + LLM Foundry doc ( #4704 )
...
* add mpt-30b doc
* add tunableop note
* update MPT doc
* add section
* update wordlist
* fix flash attention version
* update "applies to"
* address review feedback
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* update docker details to pytorch-training-v25.5
* update
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-05-02 12:13:20 -04:00
Pratik Basyal
217fb452f8
Initial changes to 6.4.1 RN ( #379 )
...
* Initial changes added
* Changelogs for RCCL, hipblaslt, compute profiler, and systems added
* 6.4.0 GA manifest
* 6.4.1 RC1 manifest
* RC2 Manifest added
* Update RELEASE.md
Add CLR Changelog entry for HIP 6.4.1
* Release highlight added
* AMD SMI changelog added
* ROCr runtime changelog added
* RCCL resolved issue added
* Minor change
* Minor fixes
* Quick changes to version
* Offline installer update
* Istallation udpated
* added rocalution to release notes
* Updated changelogs for components
* Changes to changelog
* Update RELEASE.md
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
* Update RELEASE.md
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
* rocSHMEM related changes added
* Changelog updated with new changes
* Heading level fixed
* AMD SMI version bumped to 25.4.0
* Reordered
* Table zebra pattern updated
* Consolidated updated
* Zebra patter aligned
* Add ROCm SMI changes to 6.4.1
* Update CHANGELOG.md
Co-authored-by: Pratik Basyal <prbasyal@amd.com >
* update doc highlights
* Link to rocSHMEM
* update
* Minor changes
* Changelog feedback updated
---------
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com >
Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com >
Co-authored-by: Peter Park <peter.park@amd.com >
2025-05-01 13:54:31 -04:00
Peter Park
85778177a1
Update vLLM docker pull tag 20250415 in vllm-benchmark.rst ( #4702 )
2025-04-30 16:09:30 -04:00
Istvan Kiss
84177354de
Pytorch compatibility page update
2025-04-29 14:43:40 +02:00
Peter Park
7458fcb7ab
Update JAX MaxText benchmark doc to v25.5 ( #4695 )
...
* fix shell cmd formatting
* add previous versions section
* update docker details and add llama 3.3
* update missed docker image tags to 25.5
2025-04-28 17:52:53 -04:00
Peter Park
16d6e59003
fix link to pytorch-training v25.4 doc ( #4696 )
2025-04-28 17:52:33 -04:00
Peter Park
a66bc1d85e
fix link to previous version in vllm-benchmark.rst ( #4689 )
2025-04-24 17:54:04 -04:00
Peter Park
36b6ffaf7c
Add QwQ 32B to vllm-benchmark.rst ( #4685 )
...
* Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml
* Add QwQ-32B-Preview to vllm-benchmark-models.yaml
* add links to performance results
words
* change "performance validation" to "performance testing"
* remove "-Preview" from QwQ-32B
* move qwen2 MoE after qwen2
* add TunableOp section
* fix formatting
* add link to TunableOp doc
* add tunableop note
* fix vllm-benchmark template
* remove cmdline option for --tunableop on
* update docker details
* remove "training"
* remove qwen2
2025-04-24 16:44:34 -04:00
Peter Park
40e4ba3ecc
Update vLLM inference benchmark Docker guide ( #4653 )
...
* Remove JAIS 13B and 30B
* update Docker details - vLLM 0.8.3
* add previous version
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
* fix link to previous version
2025-04-24 15:59:13 -04:00
Peter Park
1f41ce26be
Add note for chai-1 benchmark Docker in pytorch-inference-benchmark.rst ( #4684 )
2025-04-24 15:48:53 -04:00
Peter Park
c3faa9670b
Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) ( #4654 )
...
* update vLLM links in deploy-your-model.rst
* add pytorch inference benchmark doc
* update toc and vLLM title
* remove previous versions
* update
* wording
* fix link and "applies to"
* add pytorch to wordlist
* add tunableop note to clip
* make tunableop note appear to all models
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* fix incorrect links
* wording
* fix wrong docker pull tag
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-04-23 17:35:52 -04:00