peterjunpark
12f71b15d2
Update xDiT docs for 26.4 release ( #6122 )
...
* archive previous version
* Squashed commit of the following:
commit 45ec725e624e719272641ffd3e4f1d47a29c0b0f
Author: Mikko Lauri <mikko.lauri@amd.com >
Date: Wed Mar 4 07:49:56 2026 -0600
add mxfp4 note
commit 2f33052d0b9527efd7ded82579ed0a350c244361
Author: Mikko Lauri <mikko.lauri@amd.com >
Date: Wed Mar 4 07:14:22 2026 -0600
add qwen models
commit f67b47ba559edd1189d238d267568137e450a88d
Author: Mikko Lauri <mikko.lauri@amd.com >
Date: Wed Mar 4 07:04:02 2026 -0600
update news
commit ce4e497210215d41e9c3fdfa679b2e2ae33bcb1d
Author: Mikko Lauri <mikko.lauri@amd.com >
Date: Wed Mar 4 07:02:15 2026 -0600
update versions
commit 80032b7c585e95c67932cb3f7de859f7df73b379
Author: Mikko Lauri <mikko.lauri@amd.com >
Date: Wed Mar 4 04:01:39 2026 -0600
squashed changes from v26.2
* update confs
* add link to 7.12.0 preview
---------
Co-authored-by: nsakkine <niko.sakkinen@amd.com >
2026-04-08 09:14:35 -04:00
Jeffrey Novotny
60c55eeac7
Adding the draft of the landing page ( #657 ) ( #6106 )
...
* Adding the draft of the landing page
* Fixing lint errors
* fix missed lint error
* mimic the selector tool
* try list approach
* Use torch as quick start
* Feedback from docs and installer teams
* Remove extra newline to fix linting error
* Remove quick start section
* Incorporate more feedback for quick start section
* Change 7.11 preview link to 7.12
---------
(cherry picked from commit df20cc3da9 )
Co-authored-by: Andrei Kochin <andrei.kochin@amd.com >
2026-04-01 11:38:34 -04:00
peterjunpark
3647212e6e
docs: Primus 26.2 fixes ( #6063 )
...
* Primus 26.2 (Megatron): fix extra model option
* remove known issue doc
2026-03-25 19:21:24 -04:00
peterjunpark
a30c96c7e3
Primus 26.2 documentation update ( #6061 )
...
* archive previous version
* update configs
* update megatron page
* update legacy configs
* update
* fix links
2026-03-25 18:13:34 -04:00
Alex Xu
2cb8b8c04e
Merge remote-tracking branch 'external/develop' into sync-develop-from-external
2026-03-25 09:48:03 -04:00
peterjunpark
36b5a0354b
docs: fix dbrx-instruct broken link ( #6039 )
2026-03-17 09:57:19 -04:00
peterjunpark
e6af1a41c0
docs: Update xDiT diffusion inference page for v26.3 image ( #6037 )
...
* archive previous version
* update versions
* update news
* add qwen models
* add mxfp4 note
---------
Co-authored-by: Mikko Lauri <mikko.lauri@amd.com >
2026-03-17 09:47:36 -04:00
Istvan Kiss
2994886568
Remove mention of ROCR and CLR from what is ROCm page ( #633 )
...
* Remove ROCr and CLR mention on what is ROCm page.
---------
Co-authored-by: Adel Johar <adel.johar@amd.com >
2026-03-10 09:04:04 +01:00
Pratik Basyal
ba27476656
721 ROCProfiler and ROCTracer removal reverted ( #705 )
...
* Revert "ROCProfiler and ROCTracer removal from 7.2.1 docs (#697 )"
This reverts commit 81ab9413bf .
* ROCProfiler ROCTracer deprecation warning reintroduced
* Review feedback on Offline installer discontinuation added
2026-03-03 17:49:59 -05:00
peterjunpark
1aeb3c0df1
xDiT diffusion inference v26.2 ( #6010 )
...
* Rebase ontop of v26.1
* Update pull tag
* Update components
* Update 'whats new' section
* Change to xdit runner
* Update benchmark_commands
* Add Flux 2 klein model
* Add LTX-2 model
* Remove deprecated folders
* Rephrase output sentence.
* update history page
* archive previous version
* fix
fix
---------
Co-authored-by: Kristoffer <kristoffer.torp@amd.com >
2026-03-02 11:49:13 -05:00
Pratik Basyal
81ab9413bf
ROCProfiler and ROCTracer removal from 7.2.1 docs ( #697 )
...
* ROCprofiler and ROCTracer removed from 7.2.1 release
* Review feedback added
* Minor change"
* Update RELEASE.md
Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com >
---------
Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com >
2026-03-02 10:42:16 -05:00
peterjunpark
474c6e4a70
Update JAX Maxtext training doc for 26.2 release ( #5993 )
2026-02-24 10:48:51 -05:00
peterjunpark
fe8dff691d
Update docs for xDiT diffusion inference 26.1 ( #5955 )
...
* archive previous version
* xDiT diffusion inference docker 26.1
2026-02-11 13:27:36 -05:00
peterjunpark
a3a4440909
docs(jax-maxtext training): remove single-node for llama 3.1 405b
2026-02-06 13:47:55 -05:00
peterjunpark
1d5baf2c73
Add docs for Maxtext 26.1 Docker release ( #5936 )
...
* archive previous version
* update doc
* add multi node for llama3 405b
fix
2026-02-06 13:29:05 -05:00
peterjunpark
d8b6ee47e3
Update Primus docs for 26.1 release ( #5911 )
...
* archive previous versions
update conf
fix
fix docker hub url
fix
* update history pages
* update docker info
* update configs
* update primus commit
2026-01-30 12:51:13 -05:00
peterjunpark
a745e45dcb
Doc update for vLLM refactor #5855
2026-01-15 11:21:38 -05:00
peterjunpark
c67fac78bd
Update docs for xDiT diffusion inference 25.13 Docker release ( #5820 )
...
* archive previous version
* add xdit 25.13
* update history index
* add perf results section
2025-12-29 08:44:45 -05:00
peterjunpark
e0b8ec4dfb
Update training docs for Primus/25.11 ( #5819 )
...
* update conf and toc.yml.in
* archive previous versions
archive data files
update anchors
* primus pytorch: remove training batch size args
* update primus megatron run cmds
multi-node
* update primus pytorch
update
* update
update
* update docker tag
2025-12-29 08:05:47 -05:00
peterjunpark
3a43bacdda
Update xdit diffusion inference history ( #5808 )
...
* Update xdit diffusion inference history
* fix
2025-12-22 11:05:32 -05:00
peterjunpark
cbab9a465d
Update documentation for JAX training MaxText 25.11 release ( #5789 )
2025-12-18 11:23:58 -05:00
peterjunpark
459283da3c
xDiT diffusion inference v25.12 documentation update ( #5786 )
...
* Add xdit-diffusion ROCm docs page.
* Update template formatting and fix sphinx warnings
* Add System Validation section.
* Add sw component versions/commits.
* Update to use latest v25.10 image instead of v25.9
* Update commands and add FLUX instructions.
* Update Flux instructions. Change image tag. Describe as diffusion inference instead of specifically video.
* git rm xdit-video-diffusion.rst
* Docs for v25.12
* Add hyperlinks to components
* Command fixes
* -Diffusers suffix
* Simplify yaml file and cleanup main rst page.
* Spelling, added 'js'
* fix merge conflict
fix
---------
Co-authored-by: Kristoffer <kristoffer.torp@amd.com >
2025-12-17 10:20:10 -05:00
peterjunpark
1b4f25733d
vLLM inference benchmark 1210 ( #5776 )
...
* Archive previous ver
fix anchors
* Update vllm.rst and data yaml for 20251210
2025-12-17 09:21:57 -05:00
Pratik Basyal
78e8baf147
Taichi removed from ROCm docs [Develop] ( #5779 )
...
* Taichi removed from ROCm docs
* Warnings fixed
2025-12-16 13:12:40 -05:00
yugang-amd
f2067767e0
xdit-diffusion v25.11 docs ( #5744 )
2025-12-05 17:09:48 -05:00
peterjunpark
453751a86f
fix docker hub links for primus:v25.10 ( #5738 )
2025-12-04 09:17:33 -05:00
peterjunpark
fb644412d5
Update training Docker docs for Primus 25.10 ( #5737 )
2025-12-04 09:08:00 -05:00
yugang-amd
674dc355e4
vLLM 10/24 release ( #5626 )
...
* vLLM 10/24 release
* updates per SME inputs
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/vllm.rst
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
---------
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
2025-11-05 11:13:50 -05:00
peterjunpark
1515fb3779
Revert "Add xdit diffusion docs ( #5576 )" ( #5580 )
...
This reverts commit 4132a2609c .
2025-10-27 16:22:28 -04:00
Kristoffer
4132a2609c
Add xdit diffusion docs ( #5576 )
...
* Add xdit video diffusion base page.
* Update supported accelerators.
* Remove dependency on mad-tags.
* Update docker pull section.
* Update container launch instructions.
* Improve launch instruction options and layout.
* Add benchmark result outputs.
* Fix wrong HunyuanVideo path
* Finalize instructions.
* Consistent title.
* Make page and side-bar titles the same.
* Updated wordlist. Removed note container reg HF.
* Remove fp8_gemms in command and add release notes.
* Update accelerators naming.
* Add note regarding OOB performance.
* Fix admonition box.
* Overall fixes.
2025-10-27 14:56:55 +01:00
peterjunpark
a613bd6824
JAX Maxtext v25.9 doc update ( #5532 )
...
* archive previous version (25.7)
* update docker components list for 25.9
* update template
* update docker pull tag
* update
* fix intro
2025-10-17 11:31:06 -04:00
peterjunpark
14bb59fca9
Update Megatron/PyTorch Primus 25.9 docs ( #5528 )
...
* add previous versions
* Fix heading levels in pages using embedded templates (#5468 )
* update primus-megatron doc
update megatron-lm doc
update templates
fix tab
update primus-megatron model configs
Update primus-pytorch model configs
fix css class
add posttrain to pytorch-training template
update data sheets
update
update
update
update docker tags
* Add known issue and update Primus/Turbo versions
* add primus ver to histories
* update primus ver to 0.1.1
* fix leftovers from merge conflict
2025-10-16 12:51:30 -04:00
anisha-amd
a98236a4e3
Main Docs: references of accelerator removal and change to GPU ( #5495 )
...
* Docs: references of accelerator removal and change to GPU
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
2025-10-16 11:22:10 -04:00
peterjunpark
68e8453ca5
Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 ( #5481 )
...
* archive previous doc version
* update model/docker data and doc templates
* Update "Reproducing the Docker image"
* fix: truncated commit hash doesn't work for some reason
* bump rocm-docs-core to 1.26.0
* fix numbering
fix
* update docker tag
* update .wordlist.txt
2025-10-08 16:23:40 -04:00
Peter Park
d92e5b6c12
Update Primus Megatron doc v25.8 ( #5396 )
...
* megatron: update previous versions list
update
wording
* megatron: update rst and yaml
update primus repo link
update mig guide
* update headings and anchors
* megatron: update doc
* update docker hub urls
2025-09-19 08:09:21 -04:00
Peter Park
9827ba7ff2
docs: MaxText v25.7 patch update ( #5372 )
...
* remove jax 0.6.0 nanoo fp8 caveat note
* reorder maxtext docker images in data sheet
2025-09-17 16:25:46 -04:00
Peter Park
26f708da87
Add Stable Diffusion XL to PyT training benchmark doc and fix paths in SGLang Disagg Inference doc ( #5282 )
...
* add sdxl to pytorch-training
* fix sphinx warnings
fix links
* fix paths in cmds and links in sglang disagg
* fix col width
* update release highlights
* fix
quickfix
2025-09-16 16:49:33 -04:00
Peter Park
bab853a0d3
Add NCF to pytorch training benchmark doc ( #5352 )
...
* add previous version (25.6)
* fix template
* Formatting and wording fixes
* add caveats
* update yaml
* add note to pytorch-training
* fix template
* make model name shorter
2025-09-16 13:29:28 -04:00
Peter Park
d5101532f7
docs: Add SGLang disaggregated P/D inference w/ Mooncake guide ( #5335 )
...
* add main content
* Update content and format
add clarification
update
update data
* fix
fix
fix
* fix: deepseek v3
* add ki
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-09-16 10:33:58 -05:00
Peter Park
ef4e7ca1fe
docs(PyTorch training v25.8): Add Primus and update PyTorch training benchmark docs ( #5331 )
...
* pyt: update previous versions list
update conf.py
* pyt: update yaml and rst
update
update toc
* update headings and anchors
* pyt: update doc
* update docker hub urls
2025-09-16 10:33:53 -05:00
Parag Bhandari
60e3a8107c
Merge branch 'develop' into develop-internal
2025-09-16 05:12:42 -04:00
Peter Park
7098bdc03b
Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) ( #5289 )
2025-09-11 15:01:17 -04:00
Peter Park
05a66f75fe
add qwen3 30b a3b to vllm-benchmark-models ( #5280 )
2025-09-09 17:41:11 -04:00
Peter Park
4f53183696
docs: Add JAX MaxText benchmark v25.7 ( #5182 )
...
* Update previous versions
* Add data file
* fix filename and anchors
* add templates
* update .wordlist.txt
* Update template and data
add missing step
fix fmt
* update template
* fix data
* add jax 0.6.0
* update history
* update quantized training note
2025-09-08 21:42:56 -04:00
Peter Park
4bc1bf00c6
Update PyTorch training benchmark docker doc to 25.7 ( #5255 )
...
* Update PyTorch training benchmark docker doc to 25.7
* update .wordlist.txt
* update conf.py
* update data sheet
* fix sphinx warnings
2025-09-05 12:07:51 -04:00
Istvan Kiss
d476d09aff
Update precision support page with missing libraries and RDNA2 and CDNA4 support
2025-08-28 17:09:34 +02:00
Pratik Basyal
ea8ff1b17d
UCC and UCX version and release notes update for 7.0.0 ( #521 )
...
* Indentation and formatting updated
* UCC and UCX version udpated
* ROCm bandwidth test update
* MI350 series info added
* Changelog update
* ROCm systems Profiler highlight updated
* Redundant removed, pulled out from HIP changelog
* Known issues to Compute profiler added
* ONNX compatibility updtaed
* ROCm COmpute Profiler highlight added
* RN update
* ROCm 700 stack image updated
* ROCM Compute and System highlight updated
* Deep learning frameworks added
* removed BF16 support for MIGraphX -- already in 6.4 release notes; removed FP4 MIGraphX support
* ROCm Compute profiler highlight updated
* Formatting update
* AI framework update
* ROCm Systems Profiler udpate
* removed mention of CentOS of CentOS
* ROCm Compute Profiler update
* Feedback changes
* leo's feedback incorporated
* ampersand
* Changelog synced
* Changelog synced
* RHEL 10 removed
* Rocky Linux updated
---------
Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com >
2025-08-26 16:34:27 -04:00
Peter Park
98029db4ee
docs: Add Primus (Megatron) training Docker documentation ( #5218 )
2025-08-21 23:50:55 -04:00
Istvan Kiss
ae734e7846
Add MI350X and MI355X to atomics operation page ( #497 )
...
Add MI350X and MI355X to atomics operation page
2025-08-18 15:37:19 +02:00
Peter Park
55d0a88ec5
vLLM inference benchmark doc: add missing data field ( #5199 )
2025-08-15 13:20:39 -04:00