1157 Commits

Author SHA1 Message Date
randyh62
2b83a962a0 Use intersphinx links for deep learning (#5859)
* Use intersphinx links for deep learning

* Update deep-learning-rocm.rst

remove Taichi

* Update deep-learning-rocm.rst

Change Install link to "link"

* Apply suggestion from @randyh62

OK
2026-01-20 09:17:37 -08:00
Jeffrey Novotny
54bf4c0319 Add missing APU entries to GPU hardware specifications (#646) (#5862) (#5863)
* Add missing APU entries to GPU hardware specifications

* Move Ryzen APUs to new tab

* Add new column to Ryzen table and rename column elsewhere

---------

(cherry picked from commit 7ab402a3b3)


(cherry picked from commit 33fbde69db)

Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com>
2026-01-16 13:02:06 -05:00
peterjunpark
4347a11bc4 Doc update for vLLM refactor #5855 (#5856)
(cherry picked from commit a745e45dcb)
2026-01-15 11:34:02 -05:00
ROCm Docs Automation
2b7fde505f Update rocm-docs-core to 1.31.2 2026-01-14 11:26:11 -05:00
anisha-amd
a98d6a5777 Docs: Ray release 25.12 and compatibility version format standardization (#5845) (#5846) 2026-01-08 12:29:00 -05:00
Swati Rawat
4184d1ee1f Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-08 16:46:22 +05:30
Swati Rawat
0786c328c1 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-08 16:46:22 +05:30
Swati Rawat
88ea6072f5 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-08 16:46:22 +05:30
Swati Rawat
b32dcc8570 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-08 16:46:22 +05:30
Swati Rawat
0faa92e922 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-08 16:46:21 +05:30
Swati Rawat
26ae989602 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-08 16:46:21 +05:30
srawat
4402dc4147 Update single-gpu-fine-tuning-and-inference.rst 2026-01-08 16:46:21 +05:30
srawat
5eda438e0a Update multi-gpu-fine-tuning-and-inference.rst 2026-01-08 16:46:20 +05:30
srawat
049784e1a7 Update prerequisite-system-validation.rst 2026-01-08 16:42:18 +05:30
srawat
f12169c5b7 replace rocm-smi reference with amd-smi 2026-01-08 16:42:18 +05:30
peterjunpark
b35d1a0627 fix(primus-pytorch.rst): FP8 config instead of BF16 (#5839)
(cherry picked from commit 2dc22ca890)
2026-01-07 13:51:50 -05:00
Pratik Basyal
912618cb08 ROCM-core version fixed (#5827) (#5828) 2026-01-02 16:10:16 -05:00
peterjunpark
7d2feaa8b1 Fix inconsistency in xDiT doc (#5823)
Fix inconsistency in xDiT doc

(cherry picked from commit 172b0f7c08)
2025-12-29 10:29:59 -05:00
peterjunpark
2a65394e32 Update docs for xDiT diffusion inference 25.13 Docker release (#5820)
* archive previous version

* add xdit 25.13

* update history index

* add perf results section

(cherry picked from commit c67fac78bd)
2025-12-29 08:45:29 -05:00
peterjunpark
268c1332c9 Update training docs for Primus/25.11 (#5819)
* update conf and toc.yml.in

* archive previous versions

archive data files

update anchors

* primus pytorch: remove training batch size args

* update primus megatron run cmds

multi-node

* update primus pytorch

update

* update

update

* update docker tag

(cherry picked from commit e0b8ec4dfb)
2025-12-29 08:45:17 -05:00
Pratik Basyal
374e0944dc OS table removed from compatibility table [develop] (#5810) (#5811)
* OS table removed from compatibility table

* Feedback added

* Azure Linux 3.0 and compatibility version update

* Version fix

* Review feedback added

* Minor change
2025-12-23 16:38:03 -05:00
peterjunpark
512e311041 Update xdit diffusion inference history (#5808) (#5809)
* Update xdit diffusion inference history

* fix

(cherry picked from commit 3a43bacdda)
2025-12-22 11:14:57 -05:00
peterjunpark
ad4f486635 fix link to ROCm PyT docker image (#5803) (#5804)
(cherry picked from commit 48d8fe139b)
2025-12-19 15:51:20 -05:00
peterjunpark
485886712b clean up formatting in FA2 page (#5795) (#5796)
(cherry picked from commit 7455fe57b8)
2025-12-19 09:38:20 -05:00
peterjunpark
1cd6a14a22 Update Flash Attention guidance in "Model acceleration libraries" (#5793)
* flash attention update

Signed-off-by: seungrok.jung <seungrok.jung@amd.com>

flash attention update

Signed-off-by: seungrok.jung <seungrok.jung@amd.com>

flash attention update

Signed-off-by: seungrok.jung <seungrok.jung@amd.com>

sentence-case heading

* Update docs/how-to/rocm-for-ai/inference-optimization/model-acceleration-libraries.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

---------

Co-authored-by: seungrok.jung <seungrok.jung@amd.com>
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
(cherry picked from commit 52c0a47e84)
2025-12-19 09:00:40 -05:00
peterjunpark
a17f04a3b5 Update documentation for JAX training MaxText 25.11 release (#5789) (#5790)
(cherry picked from commit cbab9a465d)
2025-12-18 11:26:42 -05:00
peterjunpark
94de66ef3f [docs/7.1.1] Publish vLLM and xDiT doc updates (#5787)
* vLLM inference benchmark 1210 (#5776)

* Archive previous ver

fix anchors

* Update vllm.rst and data yaml for 20251210

(cherry picked from commit 1b4f25733d)

* xDiT diffusion inference v25.12 documentation update (#5786)

* Add xdit-diffusion ROCm docs page.

* Update template formatting and fix sphinx warnings

* Add System Validation section.

* Add sw component versions/commits.

* Update to use latest v25.10 image instead of v25.9

* Update commands and add FLUX instructions.

* Update Flux instructions. Change image tag. Describe as diffusion inference instead of specifically video.

* git rm xdit-video-diffusion.rst

* Docs for v25.12

* Add hyperlinks to components

* Command fixes

* -Diffusers suffix

* Simplify yaml file and cleanup main rst page.

* Spelling, added 'js'

* fix merge conflict

fix

---------

Co-authored-by: Kristoffer <kristoffer.torp@amd.com>
(cherry picked from commit 459283da3c)

---------

Co-authored-by: Kristoffer <kristoffer.torp@amd.com>
2025-12-17 10:28:30 -05:00
Pratik Basyal
e5cebe7b4e Taichi removed from ROCm docs [Develop] (#5779) (#5781)
* Taichi removed from ROCm docs

* Warnings fixed
2025-12-16 13:24:12 -05:00
Pratik Basyal
7047cfa19c Onnx and rocshmem version updated (#5760) (#5764) 2025-12-11 17:11:05 -05:00
Matt Williams
0d17c96f7f Fixing link redirects (#5758)
* Update multi-gpu-fine-tuning-and-inference.rst

* Update pytorch-training-v25.6.rst

* Update pytorch-compatibility.rst
2025-12-10 11:31:26 -05:00
anisha-amd
2f8c99f7f0 Docs: update verl compatibility - fix (#5755) 2025-12-09 19:52:12 -05:00
anisha-amd
982927e866 Docs: verl framework - compatibility - 25.11 release (#5752) (#5753) 2025-12-09 12:02:20 -05:00
peterjunpark
8f45b791fe Fix Primus PyTorch doc: training.batch_size -> training.local_batch_size (#5748) (#5749)
(cherry picked from commit bf74351e5a)
2025-12-08 13:59:00 -05:00
yugang-amd
f7c7587b10 xdit-diffusion v25.11 docs (#5743) 2025-12-05 17:08:21 -05:00
Pratik Basyal
96b3c0d4f3 PyTorch 2.7 support added (#5740) (#5741) 2025-12-04 17:00:34 -05:00
peterjunpark
d6d4d2ef92 fix docker hub links for primus:v25.10 (#5738)
(cherry picked from commit 453751a86f)
2025-12-04 09:21:53 -05:00
peterjunpark
8647ebcf76 Update training Docker docs for Primus 25.10 (#5737)
(cherry picked from commit fb644412d5)
2025-12-04 09:21:53 -05:00
Istvan Kiss
acbd671e99 JAX key features and enhancements (#5708)
Co-authored-by: Pratik Basyal <prbasyal@amd.com>
2025-12-01 19:52:07 +01:00
ROCm Docs Automation
5d7fdace0e Update rocm-docs-core to 1.30.0 2025-11-26 17:09:50 -05:00
Pratik Basyal
9ea8a48b3a Link and PyTorch version updated (#5700) (#5701) 2025-11-26 12:01:12 -05:00
Alex Xu
9956d72614 fix dependency 2025-11-26 11:42:22 -05:00
Alex Xu
305d24f486 Merge branch 'roc-7.1.x' into docs/7.1.1 2025-11-26 11:37:06 -05:00
Alex Xu
42cad29c04 re-compile requirements.txt 2025-11-26 11:35:00 -05:00
Alex Xu
26f6b6b3e1 Merge branch 'roc-7.1.x' into docs/7.1.1 2025-11-26 11:29:02 -05:00
Alex Xu
4490c57c6a resolve merge conflict 2025-11-26 10:33:02 -05:00
Alex Xu
007f24fe7b Merge remote-tracking branch 'external/develop' into sync-develop-from-external 2025-11-26 10:09:04 -05:00
Pratik Basyal
1b5a3e54c2 711 compatibility note update and review feedback added (#636)
* Leo's review feedback added

* rocshmem version bumped from 3.0.0 to 3.1.0

* Footnote cleaned

* Footnote updated

* Ram's feedback

* Link updated

* Footnote updated

* Link fixed
2025-11-26 09:46:57 -05:00
alexxu-amd
2c6eb9cf2a Update versions.md (#637)
* Update versions.md

* remove empty line
2025-11-26 09:03:54 -05:00
Alex Xu
d4cdbd79a3 Merge branch 'develop' into docs/7.1.1 2025-11-26 08:47:19 -05:00
Pratik Basyal
b93fdb811c 7.1.1 pre-GA public link reset (#627)
* 7.1.1 pre-GA public link reset

* Update CHANGELOG.md
2025-11-26 08:38:13 -05:00