Commit Graph

1188 Commits

Author SHA1 Message Date
peterjunpark
d5e8a6f7be [docs/7.2.0] Update docs for xDiT diffusion inference 26.1 (#5956)
* docs(jax-maxtext training): remove single-node for llama 3.1 405b

(cherry picked from commit a3a4440909)

* Update docs for xDiT diffusion inference 26.1 (#5955)

* archive previous version

* xDiT diffusion inference docker 26.1

(cherry picked from commit fe8dff691d)
2026-02-11 13:30:15 -05:00
peterjunpark
208443edec docs(jax-maxtext training): remove single-node for llama 3.1 405b (#5939)
(cherry picked from commit a3a4440909)
2026-02-06 13:50:03 -05:00
peterjunpark
b62e0546fd Add docs for Maxtext 26.1 Docker release (#5936)
* archive previous version

* update doc

* add multi node for llama3 405b

fix

(cherry picked from commit 1d5baf2c73)
2026-02-06 13:30:47 -05:00
anisha-amd
de99ee0fe2 Docs: FlashInfer compatibility - frameworks release 26.01 (#5929) (#5930) 2026-02-04 13:48:04 -05:00
peterjunpark
811188dc13 Update Primus docs for 26.1 release (#5911) (#5918)
* archive previous versions

update conf

fix

fix docker hub url

fix

* update history pages

* update docker info

* update configs

* update primus commit

(cherry picked from commit d8b6ee47e3)
2026-01-30 12:54:26 -05:00
peterjunpark
ec36bc9971 Publish vLLM / SGLang + MoRI distributed inference cookbooks (#5912) (#5913)
* add recipes

* clean up

update

clean up

fix

* update sglang docker instructions

docker image tag
add user to docker group

fix

* update pldm/bkc

* update pldm/bkc

* add bkc note

* update bkc notes

* update article info

* update wordlist

* fix linting issues

* fix linting issues

* fix linting

* fix ref

(cherry picked from commit d1165b7359)
2026-01-29 11:42:03 -05:00
Pratik Basyal
af8ea73581 720 reference link update and note fixes [Develop] (#5883) (#5884)
* Links updated to 7.2.0

* Compatibility note fixed
2026-01-22 12:21:46 -05:00
Alex Xu
370816001e Merge branch 'roc-7.2.x' into docs/7.2.0 2026-01-21 15:29:08 -05:00
Alex Xu
8c28f9ca9f Merge remote-tracking branch 'external/develop' into sync-devlop-from-external 2026-01-21 14:34:02 -05:00
alexxu-amd
6261b2c421 Add 7.2.0 to version list (#680) 2026-01-21 13:51:40 -05:00
Pratik Basyal
decd7e712c PyTorch 3 digits and 711 known issue added (#679) 2026-01-21 12:37:27 -05:00
Pratik Basyal
b7dd7e24ed 7.2.0 PLDM and Release date updated (#675)
* Release date updated

* vllm1 and GPU resiliency highlight removed

* Minor change

* Changelog synced
2026-01-21 09:54:47 -05:00
Swati Rawat
1980239b81 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-21 17:31:41 +05:30
Swati Rawat
c75fd6f532 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-21 17:31:05 +05:30
Swati Rawat
72cb598190 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-21 17:30:33 +05:30
Swati Rawat
9b55b77aaa Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-21 17:29:45 +05:30
Swati Rawat
8267303e1d Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-21 17:29:04 +05:30
Swati Rawat
86d2c4e891 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-21 17:28:23 +05:30
srawat
2977e35330 Update single-gpu-fine-tuning-and-inference.rst 2026-01-21 17:27:13 +05:30
srawat
e95955f572 Update multi-gpu-fine-tuning-and-inference.rst 2026-01-21 17:27:13 +05:30
randyh62
45bd726f55 Use intersphinx links for deep learning (#5859)
* Use intersphinx links for deep learning

* Update deep-learning-rocm.rst

remove Taichi

* Update deep-learning-rocm.rst

Change Install link to "link"

* Apply suggestion from @randyh62

OK
2026-01-16 13:17:47 -08:00
Jeffrey Novotny
33fbde69db Add missing APU entries to GPU hardware specifications (#646) (#5862)
* Add missing APU entries to GPU hardware specifications

* Move Ryzen APUs to new tab

* Add new column to Ryzen table and rename column elsewhere

---------


(cherry picked from commit 7ab402a3b3)

Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com>
2026-01-16 12:55:55 -05:00
Jeffrey Novotny
7ab402a3b3 Add missing APU entries to GPU hardware specifications (#646)
* Add missing APU entries to GPU hardware specifications

* Move Ryzen APUs to new tab

* Add new column to Ryzen table and rename column elsewhere

---------

Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com>
2026-01-16 11:31:13 -06:00
Alex Xu
2851f89992 update rocm-docs-core version to 1.31.3 2026-01-16 10:25:40 -05:00
Pratik Basyal
7068119ae3 7.2.0 Build version updated (#668)
* Build version updated

* Changelog synced

* PLDM update
2026-01-15 11:35:54 -05:00
peterjunpark
a745e45dcb Doc update for vLLM refactor #5855 2026-01-15 11:21:38 -05:00
alexxu-amd
8beac1891f update requirements.txt (#5851) 2026-01-14 16:55:26 -05:00
Pratik Basyal
0bb5a15def hipBLASLt and Profiler-SDK changelog added 7.2.0 (#667)
* hipBLASLt and Profiler-SDK changelog added

* Minor changes

* Resolved issues added

* Minor rewording

* Feedback incorporated

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Changelog synced

* verl and Ray change included

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2026-01-14 12:50:31 -05:00
anisha-amd
773f5de407 Docs: Ray release 25.12 and compatibility version format standardization (#5845) 2026-01-08 12:09:11 -05:00
dependabot[bot]
b297ced032 Bump urllib3 from 2.5.0 to 2.6.3 in /docs/sphinx (#5842)
Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.5.0 to 2.6.3.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](https://github.com/urllib3/urllib3/compare/2.5.0...2.6.3)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-version: 2.6.3
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-08 08:22:01 -05:00
peterjunpark
2dc22ca890 fix(primus-pytorch.rst): FP8 config instead of BF16 (#5839) 2026-01-07 13:49:31 -05:00
Pratik Basyal
8d076740b8 720 RC2 update (#660)
* New GPUs listed

* GPU highlights updated

* OS table removed

* JAX 0.8.0 support added

* Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Azure Linux 3.0 removed

* Review feedback added

* Release and changelog synced

* Minor corrections and date change

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2026-01-07 11:20:08 -05:00
dependabot[bot]
ba95e0e689 Bump pynacl from 1.6.1 to 1.6.2 in /docs/sphinx (#5836)
Bumps [pynacl](https://github.com/pyca/pynacl) from 1.6.1 to 1.6.2.
- [Changelog](https://github.com/pyca/pynacl/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pyca/pynacl/compare/1.6.1...1.6.2)

---
updated-dependencies:
- dependency-name: pynacl
  dependency-version: 1.6.2
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-06 14:10:42 -05:00
Swati Rawat
5b12c9a80e Merge branch 'develop' into swraw/amd-smi-doc 2026-01-05 18:51:32 +05:30
Swati Rawat
61d2424ab7 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:35 +05:30
Swati Rawat
2e3500a111 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:25 +05:30
Swati Rawat
fa4bf5e9ba Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:17 +05:30
Swati Rawat
2e506f1ae7 Update docs/how-to/rocm-for-ai/system-setup/prerequisite-system-validation.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:18:00 +05:30
Swati Rawat
56b684fcae Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:17:40 +05:30
Swati Rawat
b3e78704f5 Update docs/how-to/rocm-for-ai/training/benchmark-docker/previous-versions/megatron-lm-v24.12-dev.rst
Co-authored-by: peterjunpark <git@peterjunpark.com>
2026-01-05 18:17:11 +05:30
Pratik Basyal
1691d369e9 ROCM-core version fixed (#5827) 2026-01-02 16:06:27 -05:00
peterjunpark
172b0f7c08 Fix inconsistency in xDiT doc
2025-12-29 10:26:25 -05:00
peterjunpark
c67fac78bd Update docs for xDiT diffusion inference 25.13 Docker release (#5820)
* archive previous version

* add xdit 25.13

* update history index

* add perf results section
2025-12-29 08:44:45 -05:00
peterjunpark
e0b8ec4dfb Update training docs for Primus/25.11 (#5819)
* update conf and toc.yml.in

* archive previous versions

archive data files

update anchors

* primus pytorch: remove training batch size args

* update primus megatron run cmds

multi-node

* update primus pytorch

update

* update

update

* update docker tag
2025-12-29 08:05:47 -05:00
Pratik Basyal
38f2d043dc OS table removed from compatibility table [develop] (#5810)
* OS table removed from compatibility table

* Feedback added

* Azure Linux 3.0 and compatibility version update

* Version fix

* Review feedback added

* Minor change
2025-12-23 16:28:19 -05:00
srawat
756fad8435 Update single-gpu-fine-tuning-and-inference.rst 2025-12-23 16:05:01 +05:30
peterjunpark
3a43bacdda Update xdit diffusion inference history (#5808)
* Update xdit diffusion inference history

* fix
2025-12-22 11:05:32 -05:00
srawat
f84d9574a8 Update multi-gpu-fine-tuning-and-inference.rst 2025-12-22 17:30:39 +05:30
peterjunpark
48d8fe139b fix link to ROCm PyT docker image (#5803) 2025-12-19 15:47:55 -05:00
peterjunpark
7455fe57b8 clean up formatting in FA2 page (#5795) 2025-12-19 09:21:41 -05:00