peterjunpark
e3704ad70e
Revert "Add xdit diffusion docs ( #5576 ) ( #5578 )" ( #5579 )
...
This reverts commit a38b2865f0 .
2025-10-27 16:21:10 -04:00
peterjunpark
a38b2865f0
Add xdit diffusion docs ( #5576 ) ( #5578 )
...
(cherry picked from commit 4132a2609c )
Co-authored-by: Kristoffer <kristoffer.torp@amd.com >
2025-10-27 15:41:29 -04:00
peterjunpark
dfdff755ef
Fix broken links under rocm-for-ai/ ( #5564 ) ( #5565 )
...
(cherry picked from commit 35ca027aa4 )
2025-10-23 15:18:08 -04:00
peterjunpark
8d2d5abdae
add xref to vllm v1 optimization guide in workload.rst ( #5560 ) ( #5561 )
...
(cherry picked from commit 90c1d9068f )
2025-10-23 11:51:55 -04:00
peterjunpark
b30b8b43e0
Updates to the vLLM optimization guide for MI300X/MI355X ( #5554 )
...
* Expand vLLM optimization guide for MI300X/MI355X with comprehensive AITER coverage. attention backend selection, environment variables (HIP/RCCL/Quick Reduce), parallelism strategies, quantization (FP8/FP4), engine tuning, CUDA graph modes, and multi-node scaling.
Co-authored-by: PinSiang <pinsiang.tan@embeddedllm.com >
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com >
Co-authored-by: pinsiangamd <pinsiang.tan@amd.com >
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
(cherry picked from commit cb8d21a0df )
2025-10-22 13:01:57 -04:00
peterjunpark
79acda6775
JAX Maxtext v25.9 doc update ( #5532 ) ( #5533 )
...
* archive previous version (25.7)
* update docker components list for 25.9
* update template
* update docker pull tag
* update
* fix intro
(cherry picked from commit a613bd6824 )
2025-10-17 11:54:39 -04:00
peterjunpark
811fa5c87a
Update Megatron/PyTorch Primus 25.9 docs ( #5528 ) ( #5529 )
...
* add previous versions
* Fix heading levels in pages using embedded templates (#5468 )
* update primus-megatron doc
update megatron-lm doc
update templates
fix tab
update primus-megatron model configs
Update primus-pytorch model configs
fix css class
add posttrain to pytorch-training template
update data sheets
update
update
update
update docker tags
* Add known issue and update Primus/Turbo versions
* add primus ver to histories
* update primus ver to 0.1.1
* fix leftovers from merge conflict
(cherry picked from commit 14bb59fca9 )
2025-10-16 13:27:40 -04:00
Pratik Basyal
0ada3a8fef
ROCm for HPC topic updated Develop ( #5504 ) ( #5505 )
...
* ROCm for HPC topic updated
* ROCm for HPC topic udpated
* Minor editorial
2025-10-10 22:39:31 -04:00
peterjunpark
68e8453ca5
Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 ( #5481 )
...
* archive previous doc version
* update model/docker data and doc templates
* Update "Reproducing the Docker image"
* fix: truncated commit hash doesn't work for some reason
* bump rocm-docs-core to 1.26.0
* fix numbering
fix
* update docker tag
* update .wordlist.txt
2025-10-08 16:23:40 -04:00
peterjunpark
eeea0d2180
Fix heading levels in pages using embedded templates ( #5468 )
2025-10-03 13:33:14 -04:00
anisha-amd
93c6d17922
Docs: frameworks 25.09 - compatibility - FlashInfer and llama.cpp ( #5462 )
2025-10-02 13:51:36 -04:00
peterjunpark
2e1b4dd5ee
Add multi-node setup instructions for training perf Dockers ( #5449 )
...
---------
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
2025-09-30 14:53:38 -04:00
Peter Park
fd59b5fbac
fix links in docs ( #5446 )
2025-09-29 15:27:32 -04:00
Pratik Basyal
d92d9268dc
Use of Radeon and Ryzen reference updated [Develop] ( #5432 )
...
* Use of Radeon and Ryzen reference updated
* Pytorch link update
2025-09-24 19:07:41 -05:00
Peter Park
442d7e4750
Add env var note to vllm.rst for MoE models and fix links in docs ( #5415 )
...
* docs(vllm.rst): add performance note for MoE models
* docs: fix links
update vllm readme link 20250521
fix links
2025-09-22 15:58:43 -04:00
Peter Park
d92e5b6c12
Update Primus Megatron doc v25.8 ( #5396 )
...
* megatron: update previous versions list
update
wording
* megatron: update rst and yaml
update primus repo link
update mig guide
* update headings and anchors
* megatron: update doc
* update docker hub urls
2025-09-19 08:09:21 -04:00
Peter Park
9827ba7ff2
docs: MaxText v25.7 patch update ( #5372 )
...
* remove jax 0.6.0 nanoo fp8 caveat note
* reorder maxtext docker images in data sheet
2025-09-17 16:25:46 -04:00
Peter Park
e8d104124f
Fix PyTorch training benchmark doc template ( #5357 )
...
* fix template
* update wordlist
2025-09-16 17:21:57 -04:00
Peter Park
26f708da87
Add Stable Diffusion XL to PyT training benchmark doc and fix paths in SGLang Disagg Inference doc ( #5282 )
...
* add sdxl to pytorch-training
* fix sphinx warnings
fix links
* fix paths in cmds and links in sglang disagg
* fix col width
* update release highlights
* fix
quickfix
2025-09-16 16:49:33 -04:00
Peter Park
bab853a0d3
Add NCF to pytorch training benchmark doc ( #5352 )
...
* add previous version (25.6)
* fix template
* Formatting and wording fixes
* add caveats
* update yaml
* add note to pytorch-training
* fix template
* make model name shorter
2025-09-16 13:29:28 -04:00
Peter Park
d5101532f7
docs: Add SGLang disaggregated P/D inference w/ Mooncake guide ( #5335 )
...
* add main content
* Update content and format
add clarification
update
update data
* fix
fix
fix
* fix: deepseek v3
* add ki
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-09-16 10:33:58 -05:00
Peter Park
ef4e7ca1fe
docs(PyTorch training v25.8): Add Primus and update PyTorch training benchmark docs ( #5331 )
...
* pyt: update previous versions list
update conf.py
* pyt: update yaml and rst
update
update toc
* update headings and anchors
* pyt: update doc
* update docker hub urls
2025-09-16 10:33:53 -05:00
Peter Park
76cb264f34
Update vllm-history.rst with missing 0909 entry ( #5308 )
2025-09-16 06:54:34 -04:00
Peter Park
7098bdc03b
Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) ( #5289 )
2025-09-11 15:01:17 -04:00
anisha-amd
db43d18c37
Docs: frameworks compatibility- ray and llama.cpp ( #5273 )
2025-09-09 11:02:30 -04:00
Peter Park
4f53183696
docs: Add JAX MaxText benchmark v25.7 ( #5182 )
...
* Update previous versions
* Add data file
* fix filename and anchors
* add templates
* update .wordlist.txt
* Update template and data
add missing step
fix fmt
* update template
* fix data
* add jax 0.6.0
* update history
* update quantized training note
2025-09-08 21:42:56 -04:00
Peter Park
4bc1bf00c6
Update PyTorch training benchmark docker doc to 25.7 ( #5255 )
...
* Update PyTorch training benchmark docker doc to 25.7
* update .wordlist.txt
* update conf.py
* update data sheet
* fix sphinx warnings
2025-09-05 12:07:51 -04:00
Matt Williams
76fd6b2290
Updating broken link ( #5258 )
2025-09-05 11:45:06 -04:00
Matt Williams
1d42f7cc62
Deep learning frameworks edits for scale ( #5189 )
...
* Deep learning frameworks edits for scale
Based on https://ontrack-internal.amd.com/browse/ROCDOC-1809
* update table
table
* leo comments
* formatting
* format
* update table based on feedback
* header
* Update machine learning page
* headers
* Apply suggestions from code review
Co-authored-by: anisha-amd <anisha.sankar@amd.com >
* Update .wordlist.txt
* formatting
* Update docs/how-to/deep-learning-rocm.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
---------
Co-authored-by: Matt Williams <Matt.Williams+amdeng@amd.com >
Co-authored-by: anisha-amd <anisha.sankar@amd.com >
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-08-22 11:46:07 -04:00
Peter Park
98029db4ee
docs: Add Primus (Megatron) training Docker documentation ( #5218 )
2025-08-21 23:50:55 -04:00
Peter Park
55d0a88ec5
vLLM inference benchmark doc: add missing data field ( #5199 )
2025-08-15 13:20:39 -04:00
Peter Park
7ee22790ce
docs: Update vLLM benchmark doc for 20250812 Docker release ( #5196 )
2025-08-14 15:43:36 -04:00
Peter Park
80f7dc79b9
Add Hunyuan Video to PyTorch inference benchmark models doc ( #5094 )
2025-08-12 11:54:59 -04:00
Dominic Widdows
9e055d92ce
Fix hyperlink syntax
2025-08-08 10:28:09 -07:00
Dominic Widdows
698d7f1d58
Updating old link that has been changed ( #5149 )
2025-08-05 15:23:55 -04:00
anisha-amd
266387d816
Docs: Adding frameworks compatibility for Megablocks and Taichi ( #5133 )
2025-07-31 13:00:31 -04:00
yugang-amd
cc5bc5a882
Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B ( #4870 )
2025-07-25 12:42:40 -04:00
Peter Park
14249f24d8
Use madengine instead of tools/run_models.py in docs ( #5095 )
2025-07-24 15:38:12 -04:00
Peter Park
984a91f008
Add DeepSeek Janus Pro 7B to PyTorch inference benchmark doc ( #5071 )
...
---------
Co-authored-by: yugang-amd <yugang.wang@amd.com >
2025-07-22 16:26:06 -04:00
Peter Park
2269e9d25d
Remove broken link to deprecated AMDGPU installer documentation ( #5078 )
...
* remove link to deprecated AMDGPU installation method
* add deep learning frameworks
2025-07-21 19:36:20 -04:00
Peter Park
5bcf3b0847
Update Megatron-LM training benchmark doc for v25.6 release ( #5064 )
2025-07-18 15:57:25 -04:00
Peter Park
7e7e15a201
Fix path to data file in vllm-0.9.1-20250702.rst ( #5066 )
2025-07-18 14:16:05 -04:00
Peter Park
b437a625b3
Update vLLM inference benchmark doc for 0715 release ( #5058 )
2025-07-17 15:00:02 -04:00
Jan Stephan
16f707d6c4
Merge pull request #5001 from j-stephan/fix-doc-warnings
...
Fix doc warnings
2025-07-16 07:10:54 -04:00
Jeffrey Novotny
b431415ade
Merge Verl, DGL, Megatron changes. ( #5047 )
...
* Verl compatibility
* verl compatibility
* add Supported features
Signed-off-by: Vicky Tsang <vtsang@amd.com >
* updated and edited verl compat doc
* added links to verl
* add future release for sglang and megatron inference eng.
Signed-off-by: Vicky Tsang <vtsang@amd.com >
* fix lint
Signed-off-by: Vicky Tsang <vtsang@amd.com >
* fixed a typo and a table
* Spolifroni amd/add to compat matrix (#430 )
* added verl to compatibility matrix
* small change
* fixed an error in csv
* edited the verl compat based on leo's recommendations
* updated compat matrix (#435 )
* Added a hardcoded link to the verl install
This is a link to an RTD build and MUST be removed before publishing.
* Update verl-compatibility.rst
* Added a hardcoded link to the verl install
This link is to an RTD build and it WILL break at publishing. It MUST be changed before publishing.
* Added version support note (#448 )
* small fixes
* Update verl-compatibility.rst
* Update verl-compatibility.rst
---------
Signed-off-by: Vicky Tsang <vtsang@amd.com >
Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com >
Co-authored-by: anisha-amd <anisha.sankar@amd.com >
(cherry picked from commit f9bd22626b )
* Stanford Megatron-LM Compatibility
* Create stanford-megatron-lm-compatibility.rst
* toc and wordlist
* Update deep-learning-rocm.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* fixes and adding to main compat matrix
* formatting fix
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
* Update stanford-megatron-lm-compatibility.rst
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
(cherry picked from commit f4f096b44e )
* Framework: DGL Compatability
* Introducing new file for DGL Compatability
* Update dgl-compatibility.rst
* Update .wordlist.txt
* Update .wordlist.txt
* Update deep-learning-rocm.rst
* compatibility fixes
* Update docs/compatibility/ml-compatibility/dgl-compatibility.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/compatibility/ml-compatibility/dgl-compatibility.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/compatibility/ml-compatibility/dgl-compatibility.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/compatibility/ml-compatibility/dgl-compatibility.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update dgl-compatibility.rst
* Update dgl-compatibility.rst
* Update dgl-compatibility.rst
* Update dgl-compatibility.rst
* additions to use-cases and system support
* wording and fixes
* Update dgl-compatibility.rst
* Update dgl-compatibility.rst
* remove table heading
* Update compatibility-matrix-historical-6.0.csv
---------
Co-authored-by: anisha-amd <anisha.sankar@amd.com >
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
(cherry picked from commit 2a7554c0b9 )
* Manually resolve merge conflict
* Further merge conflict adjustments
---------
Signed-off-by: Vicky Tsang <vtsang@amd.com >
Co-authored-by: vickytsang <vtsang@amd.com >
Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com >
Co-authored-by: anisha-amd <anisha.sankar@amd.com >
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: Mukhil M S <167260682+mukh1l@users.noreply.github.com >
2025-07-15 18:57:31 -04:00
Peter Park
548d31f990
fix broken image in megatron-lm-v24.12-dev.rst ( #5043 )
2025-07-15 10:57:12 -04:00
Pratik Basyal
544186aef8
ROCm for HPC table update for Develop ( #5015 ) ( #5016 ) ( #5019 )
...
* ROCm for HPC table update for 6.4.0 (#5015 ) (#5016 )
* 6.4.0 updates synced
* Minor change
* Link update
2025-07-09 14:57:53 -04:00
Peter Park
22524eeaa5
fix xrefs in vllm-0.9.0.1-20250605.rst ( #5017 )
2025-07-09 14:38:24 -04:00
Peter Park
d471b04cd5
Update vLLM Docker doc for 07/02
2025-07-09 11:38:27 -04:00
Peter Park
3b3fc4894b
Fix xrefs and Sphinx warnings in documentation
...
Fix xrefs and Sphinx warnings in documentation
2025-07-08 13:22:53 -04:00