yugang-amd
f2067767e0
xdit-diffusion v25.11 docs ( #5744 )
2025-12-05 17:09:48 -05:00
peterjunpark
453751a86f
fix docker hub links for primus:v25.10 ( #5738 )
2025-12-04 09:17:33 -05:00
peterjunpark
fb644412d5
Update training Docker docs for Primus 25.10 ( #5737 )
2025-12-04 09:08:00 -05:00
yugang-amd
674dc355e4
vLLM 10/24 release ( #5626 )
...
* vLLM 10/24 release
* updates per SME inputs
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/vllm.rst
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
---------
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
2025-11-05 11:13:50 -05:00
peterjunpark
1515fb3779
Revert "Add xdit diffusion docs ( #5576 )" ( #5580 )
...
This reverts commit 4132a2609c .
2025-10-27 16:22:28 -04:00
Kristoffer
4132a2609c
Add xdit diffusion docs ( #5576 )
...
* Add xdit video diffusion base page.
* Update supported accelerators.
* Remove dependency on mad-tags.
* Update docker pull section.
* Update container launch instructions.
* Improve launch instruction options and layout.
* Add benchmark result outputs.
* Fix wrong HunyuanVideo path
* Finalize instructions.
* Consistent title.
* Make page and side-bar titles the same.
* Updated wordlist. Removed note container reg HF.
* Remove fp8_gemms in command and add release notes.
* Update accelerators naming.
* Add note regarding OOB performance.
* Fix admonition box.
* Overall fixes.
2025-10-27 14:56:55 +01:00
peterjunpark
a613bd6824
JAX Maxtext v25.9 doc update ( #5532 )
...
* archive previous version (25.7)
* update docker components list for 25.9
* update template
* update docker pull tag
* update
* fix intro
2025-10-17 11:31:06 -04:00
peterjunpark
14bb59fca9
Update Megatron/PyTorch Primus 25.9 docs ( #5528 )
...
* add previous versions
* Fix heading levels in pages using embedded templates (#5468 )
* update primus-megatron doc
update megatron-lm doc
update templates
fix tab
update primus-megatron model configs
Update primus-pytorch model configs
fix css class
add posttrain to pytorch-training template
update data sheets
update
update
update
update docker tags
* Add known issue and update Primus/Turbo versions
* add primus ver to histories
* update primus ver to 0.1.1
* fix leftovers from merge conflict
2025-10-16 12:51:30 -04:00
anisha-amd
a98236a4e3
Main Docs: references of accelerator removal and change to GPU ( #5495 )
...
* Docs: references of accelerator removal and change to GPU
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
2025-10-16 11:22:10 -04:00
peterjunpark
68e8453ca5
Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 ( #5481 )
...
* archive previous doc version
* update model/docker data and doc templates
* Update "Reproducing the Docker image"
* fix: truncated commit hash doesn't work for some reason
* bump rocm-docs-core to 1.26.0
* fix numbering
fix
* update docker tag
* update .wordlist.txt
2025-10-08 16:23:40 -04:00
Peter Park
d92e5b6c12
Update Primus Megatron doc v25.8 ( #5396 )
...
* megatron: update previous versions list
update
wording
* megatron: update rst and yaml
update primus repo link
update mig guide
* update headings and anchors
* megatron: update doc
* update docker hub urls
2025-09-19 08:09:21 -04:00
Peter Park
9827ba7ff2
docs: MaxText v25.7 patch update ( #5372 )
...
* remove jax 0.6.0 nanoo fp8 caveat note
* reorder maxtext docker images in data sheet
2025-09-17 16:25:46 -04:00
Peter Park
26f708da87
Add Stable Diffusion XL to PyT training benchmark doc and fix paths in SGLang Disagg Inference doc ( #5282 )
...
* add sdxl to pytorch-training
* fix sphinx warnings
fix links
* fix paths in cmds and links in sglang disagg
* fix col width
* update release highlights
* fix
quickfix
2025-09-16 16:49:33 -04:00
Peter Park
bab853a0d3
Add NCF to pytorch training benchmark doc ( #5352 )
...
* add previous version (25.6)
* fix template
* Formatting and wording fixes
* add caveats
* update yaml
* add note to pytorch-training
* fix template
* make model name shorter
2025-09-16 13:29:28 -04:00
Peter Park
d5101532f7
docs: Add SGLang disaggregated P/D inference w/ Mooncake guide ( #5335 )
...
* add main content
* Update content and format
add clarification
update
update data
* fix
fix
fix
* fix: deepseek v3
* add ki
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-09-16 10:33:58 -05:00
Peter Park
ef4e7ca1fe
docs(PyTorch training v25.8): Add Primus and update PyTorch training benchmark docs ( #5331 )
...
* pyt: update previous versions list
update conf.py
* pyt: update yaml and rst
update
update toc
* update headings and anchors
* pyt: update doc
* update docker hub urls
2025-09-16 10:33:53 -05:00
Parag Bhandari
60e3a8107c
Merge branch 'develop' into develop-internal
2025-09-16 05:12:42 -04:00
Peter Park
7098bdc03b
Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) ( #5289 )
2025-09-11 15:01:17 -04:00
Peter Park
05a66f75fe
add qwen3 30b a3b to vllm-benchmark-models ( #5280 )
2025-09-09 17:41:11 -04:00
Peter Park
4f53183696
docs: Add JAX MaxText benchmark v25.7 ( #5182 )
...
* Update previous versions
* Add data file
* fix filename and anchors
* add templates
* update .wordlist.txt
* Update template and data
add missing step
fix fmt
* update template
* fix data
* add jax 0.6.0
* update history
* update quantized training note
2025-09-08 21:42:56 -04:00
Peter Park
4bc1bf00c6
Update PyTorch training benchmark docker doc to 25.7 ( #5255 )
...
* Update PyTorch training benchmark docker doc to 25.7
* update .wordlist.txt
* update conf.py
* update data sheet
* fix sphinx warnings
2025-09-05 12:07:51 -04:00
Istvan Kiss
d476d09aff
Update precision support page with missing libraries and RDNA2 and CDNA4 support
2025-08-28 17:09:34 +02:00
Pratik Basyal
ea8ff1b17d
UCC and UCX version and release notes update for 7.0.0 ( #521 )
...
* Indentation and formatting updated
* UCC and UCX version udpated
* ROCm bandwidth test update
* MI350 series info added
* Changelog update
* ROCm systems Profiler highlight updated
* Redundant removed, pulled out from HIP changelog
* Known issues to Compute profiler added
* ONNX compatibility updtaed
* ROCm COmpute Profiler highlight added
* RN update
* ROCm 700 stack image updated
* ROCM Compute and System highlight updated
* Deep learning frameworks added
* removed BF16 support for MIGraphX -- already in 6.4 release notes; removed FP4 MIGraphX support
* ROCm Compute profiler highlight updated
* Formatting update
* AI framework update
* ROCm Systems Profiler udpate
* removed mention of CentOS of CentOS
* ROCm Compute Profiler update
* Feedback changes
* leo's feedback incorporated
* ampersand
* Changelog synced
* Changelog synced
* RHEL 10 removed
* Rocky Linux updated
---------
Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com >
2025-08-26 16:34:27 -04:00
Peter Park
98029db4ee
docs: Add Primus (Megatron) training Docker documentation ( #5218 )
2025-08-21 23:50:55 -04:00
Istvan Kiss
ae734e7846
Add MI350X and MI355X to atomics operation page ( #497 )
...
Add MI350X and MI355X to atomics operation page
2025-08-18 15:37:19 +02:00
Peter Park
55d0a88ec5
vLLM inference benchmark doc: add missing data field ( #5199 )
2025-08-15 13:20:39 -04:00
Peter Park
7ee22790ce
docs: Update vLLM benchmark doc for 20250812 Docker release ( #5196 )
2025-08-14 15:43:36 -04:00
Peter Park
80f7dc79b9
Add Hunyuan Video to PyTorch inference benchmark models doc ( #5094 )
2025-08-12 11:54:59 -04:00
Pratik Basyal
f632f2879f
ROCm Software Stack image for 6.4.0 updated ( #5112 )
2025-07-28 14:51:19 -04:00
yugang-amd
cc5bc5a882
Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B ( #4870 )
2025-07-25 12:42:40 -04:00
Peter Park
984a91f008
Add DeepSeek Janus Pro 7B to PyTorch inference benchmark doc ( #5071 )
...
---------
Co-authored-by: yugang-amd <yugang.wang@amd.com >
2025-07-22 16:26:06 -04:00
Peter Park
5bcf3b0847
Update Megatron-LM training benchmark doc for v25.6 release ( #5064 )
2025-07-18 15:57:25 -04:00
Peter Park
b437a625b3
Update vLLM inference benchmark doc for 0715 release ( #5058 )
2025-07-17 15:00:02 -04:00
Peter Park
d471b04cd5
Update vLLM Docker doc for 07/02
2025-07-09 11:38:27 -04:00
Peter Park
d0c8ba0805
Add Wan2.1 to PyTorch inference Docker documentation ( #4984 )
...
* add wan2.1 to pyt inference models
* update group name
* fix container tag
* fix group name
* change documented data type to bfloat16
* fix col width
2025-07-02 09:58:37 -04:00
Peter Park
91a541f8b9
Update PyTorch training benchmark doc for v25.6 ( #4950 )
...
* update pytorch-training docker details
* add previous version
* add models data
* update models data id
* add models picker
* update data
* update fmt
fmt
* update data yaml
* update template
* update data
* fix
* fix vllm-0.6.4 broken link
* fix vllm history
2025-06-23 09:26:15 -04:00
Peter Park
34f8d57ece
Organize version histories in ROCm for AI benchmark Docker docs ( #4948 )
...
* add vllm 0.8.3 20250415
update prev versions table
* add vllm previous versions page
* move index to vllm-history
* add standalone megatron-lm version history
* add pytorch training version history
* fix
* add vllm-0.4.3
* add vllm-0.6.4
* update vllm-history
* add vllm-0.7.3
* add vllm-0.6.6
* add notes
* fix vllm readme links
fix main page link
* add latest version to previous versions list
* add jax-maxtext history
* fix jax-maxtext history
* add pytorch-training history
* add link in jax-maxtext 25.4
* add megatron-lm history
* fix datatemplate path for vllm 0.8.3
* fix jax-maxtext history link
* update note about performance measurements
* add vllm 0.8.5_20250521 previous version
* consistency fixes
2025-06-20 15:01:38 -04:00
yugang-amd
55f95adc7c
Update for vllm -06/10 ( #4943 )
2025-06-20 08:41:37 -04:00
Peter Park
cfb3504d77
Add Mochi Video to pytorch-inference-benchmark-models.yaml
...
Add Mochi Video to pytorch-inference-benchmark-models.yaml
2025-06-10 13:18:41 -04:00
yugang-amd
830f2d5edf
Update for vllm -05/27 ( #4886 )
...
* Update vLLM inference benchmark Docker page for rocm/vllm 5/27
* update repo for Pytorch
2025-06-05 13:30:20 -04:00
Peter Park
6999c24402
Add microsoft/phi-4 vllm-benchmark-models ( #4801 )
...
* add Phi-4 to vllm-benchmark-models.yaml
fix model_repo
* update model group names
2025-05-30 06:37:13 -04:00
Peter Park
daf2e980d9
Add Falcon-180B to vLLM benchmark Docker doc ( #4836 )
...
* add Falcon to vllm-benchmark-models.yaml
* update group name
2025-05-29 18:26:21 -04:00
Peter Park
9dbc10b4c5
Fix rocm/vllm pull tag
...
Fix rocm/vllm pull tag
2025-05-28 14:42:21 -04:00
Peter Park
cebf0f5975
Add latest rocm/vllm Docker details in vLLM inference benchmark guide ( #4824 )
...
* update rocm/vllm Docker details to latest release
* Add previous vLLM version
* fix 'further reading' xrefs
* improve model grouping names
* fix links
* update model picker text
2025-05-28 14:20:18 -04:00
Peter Park
9ed65a81c4
Add Megatron-LM benchmark doc 5/2 ( #4778 )
...
* reorg files
* add tabs
* update template
* update template
* update wordlist and toc
* add previous version to doc
* add selector paragraph
* update wordlist.txt
2025-05-22 14:28:18 -04:00
Pratik Basyal
8ef1bb0139
rocSHMEM component added to ROCm 6.4.0 documentation ( #4719 )
...
* rocSHMEM added to ROCm 640
* Space removed
* link fixed
2025-05-07 15:31:38 -04:00
Peter Park
85778177a1
Update vLLM docker pull tag 20250415 in vllm-benchmark.rst ( #4702 )
2025-04-30 16:09:30 -04:00
Peter Park
36b6ffaf7c
Add QwQ 32B to vllm-benchmark.rst ( #4685 )
...
* Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml
* Add QwQ-32B-Preview to vllm-benchmark-models.yaml
* add links to performance results
words
* change "performance validation" to "performance testing"
* remove "-Preview" from QwQ-32B
* move qwen2 MoE after qwen2
* add TunableOp section
* fix formatting
* add link to TunableOp doc
* add tunableop note
* fix vllm-benchmark template
* remove cmdline option for --tunableop on
* update docker details
* remove "training"
* remove qwen2
2025-04-24 16:44:34 -04:00
Peter Park
40e4ba3ecc
Update vLLM inference benchmark Docker guide ( #4653 )
...
* Remove JAIS 13B and 30B
* update Docker details - vLLM 0.8.3
* add previous version
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
* fix link to previous version
2025-04-24 15:59:13 -04:00
Peter Park
c3faa9670b
Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) ( #4654 )
...
* update vLLM links in deploy-your-model.rst
* add pytorch inference benchmark doc
* update toc and vLLM title
* remove previous versions
* update
* wording
* fix link and "applies to"
* add pytorch to wordlist
* add tunableop note to clip
* make tunableop note appear to all models
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* fix incorrect links
* wording
* fix wrong docker pull tag
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-04-23 17:35:52 -04:00