149 Commits

Author SHA1 Message Date
peterjunpark
12f71b15d2 Update xDiT docs for 26.4 release (#6122)
* archive previous version

* Squashed commit of the following:

commit 45ec725e624e719272641ffd3e4f1d47a29c0b0f
Author: Mikko Lauri <mikko.lauri@amd.com>
Date:   Wed Mar 4 07:49:56 2026 -0600

    add mxfp4 note

commit 2f33052d0b9527efd7ded82579ed0a350c244361
Author: Mikko Lauri <mikko.lauri@amd.com>
Date:   Wed Mar 4 07:14:22 2026 -0600

    add qwen models

commit f67b47ba559edd1189d238d267568137e450a88d
Author: Mikko Lauri <mikko.lauri@amd.com>
Date:   Wed Mar 4 07:04:02 2026 -0600

    update news

commit ce4e497210215d41e9c3fdfa679b2e2ae33bcb1d
Author: Mikko Lauri <mikko.lauri@amd.com>
Date:   Wed Mar 4 07:02:15 2026 -0600

    update versions

commit 80032b7c585e95c67932cb3f7de859f7df73b379
Author: Mikko Lauri <mikko.lauri@amd.com>
Date:   Wed Mar 4 04:01:39 2026 -0600

    squashed changes from v26.2

* update confs

* add link to 7.12.0 preview

---------

Co-authored-by: nsakkine <niko.sakkinen@amd.com>
2026-04-08 09:14:35 -04:00
Jeffrey Novotny
60c55eeac7 Adding the draft of the landing page (#657) (#6106)
* Adding the draft of the landing page

* Fixing lint errors

* fix missed lint error

* mimic the selector tool

* try list approach

* Use torch as quick start

* Feedback from docs and installer teams

* Remove extra newline to fix linting error

* Remove quick start section

* Incorporate more feedback for quick start section

* Change 7.11 preview link to 7.12

---------


(cherry picked from commit df20cc3da9)

Co-authored-by: Andrei Kochin <andrei.kochin@amd.com>
2026-04-01 11:38:34 -04:00
peterjunpark
3647212e6e docs: Primus 26.2 fixes (#6063)
* Primus 26.2 (Megatron): fix extra model option

* remove known issue doc
2026-03-25 19:21:24 -04:00
peterjunpark
a30c96c7e3 Primus 26.2 documentation update (#6061)
* archive previous version

* update configs

* update megatron page

* update legacy configs

* update

* fix links
2026-03-25 18:13:34 -04:00
Alex Xu
2cb8b8c04e Merge remote-tracking branch 'external/develop' into sync-develop-from-external 2026-03-25 09:48:03 -04:00
peterjunpark
36b5a0354b docs: fix dbrx-instruct broken link (#6039) 2026-03-17 09:57:19 -04:00
peterjunpark
e6af1a41c0 docs: Update xDiT diffusion inference page for v26.3 image (#6037)
* archive previous version

* update versions

* update news

* add qwen models

* add mxfp4 note

---------

Co-authored-by: Mikko Lauri <mikko.lauri@amd.com>
2026-03-17 09:47:36 -04:00
Istvan Kiss
2994886568 Remove mention of ROCR and CLR from what is ROCm page (#633)
* Remove ROCr and CLR mention on what is ROCm page.
---------

Co-authored-by: Adel Johar <adel.johar@amd.com>
2026-03-10 09:04:04 +01:00
Pratik Basyal
ba27476656 721 ROCProfiler and ROCTracer removal reverted (#705)
* Revert "ROCProfiler and ROCTracer removal from 7.2.1 docs (#697)"

This reverts commit 81ab9413bf.

* ROCProfiler ROCTracer deprecation warning reintroduced

* Review feedback on Offline installer discontinuation added
2026-03-03 17:49:59 -05:00
peterjunpark
1aeb3c0df1 xDiT diffusion inference v26.2 (#6010)
* Rebase ontop of v26.1

* Update pull tag

* Update components

* Update 'whats new' section

* Change to xdit runner

* Update benchmark_commands

* Add Flux 2 klein model

* Add LTX-2 model

* Remove deprecated folders

* Rephrase output sentence.

* update history page

* archive previous version

* fix

fix

---------

Co-authored-by: Kristoffer <kristoffer.torp@amd.com>
2026-03-02 11:49:13 -05:00
Pratik Basyal
81ab9413bf ROCProfiler and ROCTracer removal from 7.2.1 docs (#697)
* ROCprofiler and ROCTracer removed from 7.2.1 release

* Review feedback added

* Minor change"

* Update RELEASE.md

Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>

---------

Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>
2026-03-02 10:42:16 -05:00
peterjunpark
474c6e4a70 Update JAX Maxtext training doc for 26.2 release (#5993) 2026-02-24 10:48:51 -05:00
peterjunpark
fe8dff691d Update docs for xDiT diffusion inference 26.1 (#5955)
* archive previous version

* xDiT diffusion inference docker 26.1
2026-02-11 13:27:36 -05:00
peterjunpark
a3a4440909 docs(jax-maxtext training): remove single-node for llama 3.1 405b 2026-02-06 13:47:55 -05:00
peterjunpark
1d5baf2c73 Add docs for Maxtext 26.1 Docker release (#5936)
* archive previous version

* update doc

* add multi node for llama3 405b

fix
2026-02-06 13:29:05 -05:00
peterjunpark
d8b6ee47e3 Update Primus docs for 26.1 release (#5911)
* archive previous versions

update conf

fix

fix docker hub url

fix

* update history pages

* update docker info

* update configs

* update primus commit
2026-01-30 12:51:13 -05:00
peterjunpark
a745e45dcb Doc update for vLLM refactor #5855 2026-01-15 11:21:38 -05:00
peterjunpark
c67fac78bd Update docs for xDiT diffusion inference 25.13 Docker release (#5820)
* archive previous version

* add xdit 25.13

* update history index

* add perf results section
2025-12-29 08:44:45 -05:00
peterjunpark
e0b8ec4dfb Update training docs for Primus/25.11 (#5819)
* update conf and toc.yml.in

* archive previous versions

archive data files

update anchors

* primus pytorch: remove training batch size args

* update primus megatron run cmds

multi-node

* update primus pytorch

update

* update

update

* update docker tag
2025-12-29 08:05:47 -05:00
peterjunpark
3a43bacdda Update xdit diffusion inference history (#5808)
* Update xdit diffusion inference history

* fix
2025-12-22 11:05:32 -05:00
peterjunpark
cbab9a465d Update documentation for JAX training MaxText 25.11 release (#5789) 2025-12-18 11:23:58 -05:00
peterjunpark
459283da3c xDiT diffusion inference v25.12 documentation update (#5786)
* Add xdit-diffusion ROCm docs page.

* Update template formatting and fix sphinx warnings

* Add System Validation section.

* Add sw component versions/commits.

* Update to use latest v25.10 image instead of v25.9

* Update commands and add FLUX instructions.

* Update Flux instructions. Change image tag. Describe as diffusion inference instead of specifically video.

* git rm xdit-video-diffusion.rst

* Docs for v25.12

* Add hyperlinks to components

* Command fixes

* -Diffusers suffix

* Simplify yaml file and cleanup main rst page.

* Spelling, added 'js'

* fix merge conflict

fix

---------

Co-authored-by: Kristoffer <kristoffer.torp@amd.com>
2025-12-17 10:20:10 -05:00
peterjunpark
1b4f25733d vLLM inference benchmark 1210 (#5776)
* Archive previous ver

fix anchors

* Update vllm.rst and data yaml for 20251210
2025-12-17 09:21:57 -05:00
Pratik Basyal
78e8baf147 Taichi removed from ROCm docs [Develop] (#5779)
* Taichi removed from ROCm docs

* Warnings fixed
2025-12-16 13:12:40 -05:00
yugang-amd
f2067767e0 xdit-diffusion v25.11 docs (#5744) 2025-12-05 17:09:48 -05:00
peterjunpark
453751a86f fix docker hub links for primus:v25.10 (#5738) 2025-12-04 09:17:33 -05:00
peterjunpark
fb644412d5 Update training Docker docs for Primus 25.10 (#5737) 2025-12-04 09:08:00 -05:00
yugang-amd
674dc355e4 vLLM 10/24 release (#5626)
* vLLM 10/24 release

* updates per SME inputs

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/vllm.rst

Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>

---------

Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>
2025-11-05 11:13:50 -05:00
peterjunpark
1515fb3779 Revert "Add xdit diffusion docs (#5576)" (#5580)
This reverts commit 4132a2609c.
2025-10-27 16:22:28 -04:00
Kristoffer
4132a2609c Add xdit diffusion docs (#5576)
* Add xdit video diffusion base page.

* Update supported accelerators.

* Remove dependency on mad-tags.

* Update docker pull section.

* Update container launch instructions.

* Improve launch instruction options and layout.

* Add benchmark result outputs.

* Fix wrong HunyuanVideo path

* Finalize instructions.

* Consistent title.

* Make page and side-bar titles the same.

* Updated wordlist. Removed note container reg HF.

* Remove fp8_gemms in command and add release notes.

* Update accelerators naming.

* Add note regarding OOB performance.

* Fix admonition box.

* Overall fixes.
2025-10-27 14:56:55 +01:00
peterjunpark
a613bd6824 JAX Maxtext v25.9 doc update (#5532)
* archive previous version (25.7)

* update docker components list for 25.9

* update template

* update docker pull tag

* update

* fix intro
2025-10-17 11:31:06 -04:00
peterjunpark
14bb59fca9 Update Megatron/PyTorch Primus 25.9 docs (#5528)
* add previous versions

* Fix heading levels in pages using embedded templates (#5468)

* update primus-megatron doc

update megatron-lm doc

update templates

fix tab

update primus-megatron model configs

Update primus-pytorch model configs

fix css class

add posttrain to pytorch-training template

update data sheets

update

update

update

update docker tags

* Add known issue and update Primus/Turbo versions

* add primus ver to histories

* update primus ver to 0.1.1

* fix leftovers from merge conflict
2025-10-16 12:51:30 -04:00
anisha-amd
a98236a4e3 Main Docs: references of accelerator removal and change to GPU (#5495)
* Docs: references of accelerator removal and change to GPU

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
2025-10-16 11:22:10 -04:00
peterjunpark
68e8453ca5 Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 (#5481)
* archive previous doc version

* update model/docker data and doc templates

* Update "Reproducing the Docker image"

* fix: truncated commit hash doesn't work for some reason

* bump rocm-docs-core to 1.26.0

* fix numbering

fix

* update docker tag

* update .wordlist.txt
2025-10-08 16:23:40 -04:00
Peter Park
d92e5b6c12 Update Primus Megatron doc v25.8 (#5396)
* megatron: update previous versions list

update

wording

* megatron: update rst and yaml

update primus repo link

update mig guide

* update headings and anchors

* megatron: update doc

* update docker hub urls
2025-09-19 08:09:21 -04:00
Peter Park
9827ba7ff2 docs: MaxText v25.7 patch update (#5372)
* remove jax 0.6.0 nanoo fp8 caveat note

* reorder maxtext docker images in data sheet
2025-09-17 16:25:46 -04:00
Peter Park
26f708da87 Add Stable Diffusion XL to PyT training benchmark doc and fix paths in SGLang Disagg Inference doc (#5282)
* add sdxl to pytorch-training

* fix sphinx warnings

fix links

* fix paths in cmds and links in sglang disagg

* fix col width

* update release highlights

* fix

quickfix
2025-09-16 16:49:33 -04:00
Peter Park
bab853a0d3 Add NCF to pytorch training benchmark doc (#5352)
* add previous version (25.6)

* fix template

* Formatting and wording fixes

* add caveats

* update yaml

* add note to pytorch-training

* fix template

* make model name shorter
2025-09-16 13:29:28 -04:00
Peter Park
d5101532f7 docs: Add SGLang disaggregated P/D inference w/ Mooncake guide (#5335)
* add main content

* Update content and format

add clarification

update

update data

* fix

fix

fix

* fix: deepseek v3

* add ki

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2025-09-16 10:33:58 -05:00
Peter Park
ef4e7ca1fe docs(PyTorch training v25.8): Add Primus and update PyTorch training benchmark docs (#5331)
* pyt: update previous versions list

update conf.py

* pyt: update yaml and rst

update

update toc

* update headings and anchors

* pyt: update doc

* update docker hub urls
2025-09-16 10:33:53 -05:00
Parag Bhandari
60e3a8107c Merge branch 'develop' into develop-internal 2025-09-16 05:12:42 -04:00
Peter Park
7098bdc03b Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) (#5289) 2025-09-11 15:01:17 -04:00
Peter Park
05a66f75fe add qwen3 30b a3b to vllm-benchmark-models (#5280) 2025-09-09 17:41:11 -04:00
Peter Park
4f53183696 docs: Add JAX MaxText benchmark v25.7 (#5182)
* Update previous versions

* Add data file

* fix filename and anchors

* add templates

* update .wordlist.txt

* Update template and data

add missing step

fix fmt

* update template

* fix data

* add jax 0.6.0

* update history

* update quantized training note
2025-09-08 21:42:56 -04:00
Peter Park
4bc1bf00c6 Update PyTorch training benchmark docker doc to 25.7 (#5255)
* Update PyTorch training benchmark docker doc to 25.7

* update .wordlist.txt

* update conf.py

* update data sheet

* fix sphinx warnings
2025-09-05 12:07:51 -04:00
Istvan Kiss
d476d09aff Update precision support page with missing libraries and RDNA2 and CDNA4 support 2025-08-28 17:09:34 +02:00
Pratik Basyal
ea8ff1b17d UCC and UCX version and release notes update for 7.0.0 (#521)
* Indentation and formatting updated

* UCC and UCX version udpated

* ROCm bandwidth test update

* MI350 series info added

* Changelog update

* ROCm systems Profiler highlight updated

* Redundant removed, pulled out from HIP changelog

* Known issues to Compute profiler added

* ONNX compatibility updtaed

* ROCm COmpute Profiler highlight added

* RN update

* ROCm 700 stack image updated

* ROCM Compute and System highlight updated

* Deep learning frameworks added

* removed BF16 support for MIGraphX -- already in 6.4 release notes; removed FP4 MIGraphX support

* ROCm Compute profiler highlight updated

* Formatting update

* AI framework update

* ROCm Systems Profiler udpate

* removed mention of CentOS of CentOS

* ROCm Compute Profiler update

* Feedback changes

* leo's feedback incorporated

* ampersand

* Changelog synced

* Changelog synced

* RHEL 10 removed

* Rocky Linux updated

---------

Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com>
2025-08-26 16:34:27 -04:00
Peter Park
98029db4ee docs: Add Primus (Megatron) training Docker documentation (#5218) 2025-08-21 23:50:55 -04:00
Istvan Kiss
ae734e7846 Add MI350X and MI355X to atomics operation page (#497)
Add MI350X and MI355X to atomics operation page
2025-08-18 15:37:19 +02:00
Peter Park
55d0a88ec5 vLLM inference benchmark doc: add missing data field (#5199) 2025-08-15 13:20:39 -04:00