Commit Graph

1172 Commits

Author SHA1 Message Date
Alex Xu
8c28f9ca9f Merge remote-tracking branch 'external/develop' into sync-devlop-from-external 2026-01-21 14:34:02 -05:00
alexxu-amd
6261b2c421 Add 7.2.0 to version list (#680) 2026-01-21 13:51:40 -05:00
Pratik Basyal
decd7e712c PyTorch 3 digits and 711 known issue added (#679) 2026-01-21 12:37:27 -05:00
Pratik Basyal
b7dd7e24ed 7.2.0 PLDM and Release date updated (#675)
* Release date updated

* vllm1 and GPU resiliency highlight removed

* Minor change

* Changelog synced
2026-01-21 09:54:47 -05:00
randyh62
45bd726f55 Use intersphinx links for deep learning (#5859)
* Use intersphinx links for deep learning

* Update deep-learning-rocm.rst

remove Taichi

* Update deep-learning-rocm.rst

Change Install link to "link"

* Apply suggestion from @randyh62

OK
2026-01-16 13:17:47 -08:00
Jeffrey Novotny
33fbde69db Add missing APU entries to GPU hardware specifications (#646) (#5862)
* Add missing APU entries to GPU hardware specifications

* Move Ryzen APUs to new tab

* Add new column to Ryzen table and rename column elsewhere

---------


(cherry picked from commit 7ab402a3b3)

Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com>
2026-01-16 12:55:55 -05:00
Jeffrey Novotny
7ab402a3b3 Add missing APU entries to GPU hardware specifications (#646)
* Add missing APU entries to GPU hardware specifications

* Move Ryzen APUs to new tab

* Add new column to Ryzen table and rename column elsewhere

---------

Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com>
2026-01-16 11:31:13 -06:00
Alex Xu
2851f89992 update rocm-docs-core version to 1.31.3 2026-01-16 10:25:40 -05:00
Pratik Basyal
7068119ae3 7.2.0 Build version updated (#668)
* Build version updated

* Changelog synced

* PLDM udpate
2026-01-15 11:35:54 -05:00
peterjunpark
a745e45dcb Doc update for vLLM refactor #5855 2026-01-15 11:21:38 -05:00
alexxu-amd
8beac1891f update requirements.txt (#5851) 2026-01-14 16:55:26 -05:00
Pratik Basyal
0bb5a15def hipblasLT and Profiler-SDK changelog added 7.2.0 (#667)
* hipblasLT and Profiler-SDK changelog added

* Minor changes

* Resolved issues added

* Minor rewording

* Feedback incoporated

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Changelog synced

* verl and Ray change included

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2026-01-14 12:50:31 -05:00
anisha-amd
773f5de407 Docs: Ray release 25.12 and compatibility version format standardization (#5845) 2026-01-08 12:09:11 -05:00
dependabot[bot]
b297ced032 Bump urllib3 from 2.5.0 to 2.6.3 in /docs/sphinx (#5842)
Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.5.0 to 2.6.3.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](https://github.com/urllib3/urllib3/compare/2.5.0...2.6.3)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-version: 2.6.3
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-08 08:22:01 -05:00
peterjunpark
2dc22ca890 fix(primus-pytorch.rst): FP8 config instead of BF16 (#5839) 2026-01-07 13:49:31 -05:00
Pratik Basyal
8d076740b8 720 RC2 update (#660)
* New GPUs listed

* GPU highlights updated

* OS table removed

* JAX 0.8.0 support added

* Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Azure Linux 3.0 removed

* Review feedback added

* Release and changelog synced

* Minor corrections and date change

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2026-01-07 11:20:08 -05:00
dependabot[bot]
ba95e0e689 Bump pynacl from 1.6.1 to 1.6.2 in /docs/sphinx (#5836)
Bumps [pynacl](https://github.com/pyca/pynacl) from 1.6.1 to 1.6.2.
- [Changelog](https://github.com/pyca/pynacl/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pyca/pynacl/compare/1.6.1...1.6.2)

---
updated-dependencies:
- dependency-name: pynacl
  dependency-version: 1.6.2
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-06 14:10:42 -05:00
Swati Rawat
5b12c9a80e Merge branch 'develop' into swraw/amd-smi-doc 2026-01-05 18:51:32 +05:30
Swati Rawat
61d2424ab7 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:35 +05:30
Swati Rawat
2e3500a111 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:25 +05:30
Swati Rawat
fa4bf5e9ba Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:17 +05:30
Swati Rawat
2e506f1ae7 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:00 +05:30
Swati Rawat
56b684fcae Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:17:40 +05:30
Swati Rawat
b3e78704f5 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:17:11 +05:30
Pratik Basyal
1691d369e9 ROCM-core version fixed (#5827) 2026-01-02 16:06:27 -05:00
peterjunpark
172b0f7c08 Fix inconsistency in xDiT doc
Fix inconsistency in xDiT doc
2025-12-29 10:26:25 -05:00
peterjunpark
c67fac78bd Update docs for xDiT diffusion inference 25.13 Docker release (#5820)
* archive previous version

* add xdit 25.13

* update history index

* add perf results section
2025-12-29 08:44:45 -05:00
peterjunpark
e0b8ec4dfb Update training docs for Primus/25.11 (#5819)
* update conf and toc.yml.in

* archive previous versions

archive data files

update anchors

* primus pytorch: remove training batch size args

* update primus megatron run cmds

multi-node

* update primus pytorch

update

* update

update

* update docker tag
2025-12-29 08:05:47 -05:00
Pratik Basyal
38f2d043dc OS table removed from compatibility table [develop] (#5810)
* OS table removed from compatibility table

* Feedback added

* Azure Linux 3.0 and compatibility version update

* Version fix

* Review feedback added

* Minor change
2025-12-23 16:28:19 -05:00
srawat
756fad8435 Update single-gpu-fine-tuning-and-inference.rst 2025-12-23 16:05:01 +05:30
peterjunpark
3a43bacdda Update xdit diffusion inference history (#5808)
* Update xdit diffusion inference history

* fix
2025-12-22 11:05:32 -05:00
srawat
f84d9574a8 Update multi-gpu-fine-tuning-and-inference.rst 2025-12-22 17:30:39 +05:30
peterjunpark
48d8fe139b fix link to ROCm PyT docker image (#5803) 2025-12-19 15:47:55 -05:00
peterjunpark
7455fe57b8 clean up formatting in FA2 page (#5795) 2025-12-19 09:21:41 -05:00
peterjunpark
52c0a47e84 Update Flash Attention guidance in "Model acceleration libraries" (#5793)
* flash attention update

Signed-off-by: seungrok.jung <seungrok.jung@amd.com>

flash attention update

Signed-off-by: seungrok.jung <seungrok.jung@amd.com>

flash attention update

Signed-off-by: seungrok.jung <seungrok.jung@amd.com>

sentence-case heading

* Update docs/how-to/rocm-for-ai/inference-optimization/model-acceleration-libraries.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

---------

Co-authored-by: seungrok.jung <seungrok.jung@amd.com>
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2025-12-19 08:48:52 -05:00
peterjunpark
cbab9a465d Update documentation for JAX training MaxText 25.11 release (#5789) 2025-12-18 11:23:58 -05:00
Pratik Basyal
377d2631e3 Initial changes to ROCm 7.2.0 (#648)
* Changes to 7.2.0

* Changelogs updated

* Highlights added

* Highlights added

* Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>

* ROCProfiler-SDK changelog added

* rocsparse commit added

* Changelog synced

* Hightlights updated

* TOC updated

* ONNX updated

* Highlights added

* ROCm documentatino updates added

* Highlight updated

* ROCShmem version updated

* Review and changelog synced

* Update RELEASE.md

* Update CHANGELOG.md

add llvm-project

* Update RELEASE.md

Add HIP highlights

* Inconsistencies fixed

* Update RELEASE.md

Changed bullet list to subheads

* Update RELEASE.md

add code format to HIP process

* Update CHANGELOG.md

Update format of HIP process

* llvm-update

* Minor change

* Minor changes

* Runfile and Offline installer added

* Changelog synced

* Changelog synced

* Changelog updated

* Changelogs updated

* Compatibility updated

* Minor correction

* Break addded

* Fixed sync

* Breaking added

* Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
Co-authored-by: yugang-amd <yugang.wang@amd.com>

* Editorial update

* Changelog synced

* Virtualization update

* ROCm resolved issue removed

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com>
Co-authored-by: yugang-amd <yugang.wang@amd.com>
2025-12-17 13:50:53 -05:00
peterjunpark
459283da3c xDiT diffusion inference v25.12 documentation update (#5786)
* Add xdit-diffusion ROCm docs page.

* Update template formatting and fix sphinx warnings

* Add System Validation section.

* Add sw component versions/commits.

* Update to use latest v25.10 image instead of v25.9

* Update commands and add FLUX instructions.

* Update Flux instructions. Change image tag. Describe as diffusion inference instead of specifically video.

* git rm xdit-video-diffusion.rst

* Docs for v25.12

* Add hyperlinks to components

* Command fixes

* -Diffusers suffix

* Simplify yaml file and cleanup main rst page.

* Spelling, added 'js'

* fix merge conflict

fix

---------

Co-authored-by: Kristoffer <kristoffer.torp@amd.com>
2025-12-17 10:20:10 -05:00
srawat
00683dc244 Update prerequisite-system-validation.rst 2025-12-17 19:59:10 +05:30
peterjunpark
1b4f25733d vLLM inference benchmark 1210 (#5776)
* Archive previous ver

fix anchors

* Update vllm.rst and data yaml for 20251210
2025-12-17 09:21:57 -05:00
srawat
535b051b8d replace rocm-smi reference with amd-smi 2025-12-17 19:42:50 +05:30
Pratik Basyal
78e8baf147 Taichi removed from ROCm docs [Develop] (#5779)
* Taichi removed from ROCm docs

* Warnings fixed
2025-12-16 13:12:40 -05:00
Matt Williams
c3f0b99cc0 Reverting Optiq note 2025-12-12 17:47:33 -05:00
dependabot[bot]
c9d1679486 Bump rocm-docs-core from 1.31.0 to 1.31.1 in /docs/sphinx
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.31.0 to 1.31.1.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.31.0...v1.31.1)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-version: 1.31.1
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2025-12-12 16:15:26 -05:00
Pratik Basyal
fdbef17d7b Onnx and rocshmem version updated (#5760) 2025-12-11 17:05:25 -05:00
Matt Williams
6592a41a7f Adding ROCm-Optiq note to What is ROCm page (#5709)
* Adding ROCm-Optiq note to What is ROCm page

Adding a note for a link to the Optiq docs

* Apply suggestion from @mattwill-amd

* Apply suggestion from @mattwill-amd

* Apply suggestion from @mattwill-amd

* Update what-is-rocm.rst

* Update what-is-rocm.rst

* Apply suggestion from @mattwill-amd

* Apply suggestion from @mattwill-amd

* Apply suggestion from @mattwill-amd

* Apply suggestion from @mattwill-amd
2025-12-10 12:56:33 -08:00
Matt Williams
65a936023b Fixing link redirects (#5758)
* Update multi-gpu-fine-tuning-and-inference.rst

* Update pytorch-training-v25.6.rst

* Update pytorch-compatibility.rst
2025-12-10 11:17:59 -05:00
anisha-amd
2a64949081 Docs: update verl compatibility - fix (#5756) 2025-12-09 19:51:37 -05:00
anisha-amd
0a17434517 Docs: update verl compatibility - fix (#5754) 2025-12-09 18:36:16 -05:00
anisha-amd
2be7e5ac1e Docs: verl framework - compatibility - 25.11 release (#5752) 2025-12-09 11:41:43 -05:00