peterjunpark
1515fb3779
Revert "Add xdit diffusion docs ( #5576 )" ( #5580 )
...
This reverts commit 4132a2609c .
2025-10-27 16:22:28 -04:00
Kristoffer
4132a2609c
Add xdit diffusion docs ( #5576 )
...
* Add xdit video diffusion base page.
* Update supported accelerators.
* Remove dependency on mad-tags.
* Update docker pull section.
* Update container launch instructions.
* Improve launch instruction options and layout.
* Add benchmark result outputs.
* Fix wrong HunyuanVideo path
* Finalize instructions.
* Consistent title.
* Make page and side-bar titles the same.
* Updated wordlist. Removed note container reg HF.
* Remove fp8_gemms in command and add release notes.
* Update accelerators naming.
* Add note regarding OOB performance.
* Fix admonition box.
* Overall fixes.
2025-10-27 14:56:55 +01:00
peterjunpark
a613bd6824
JAX Maxtext v25.9 doc update ( #5532 )
...
* archive previous version (25.7)
* update docker components list for 25.9
* update template
* update docker pull tag
* update
* fix intro
2025-10-17 11:31:06 -04:00
peterjunpark
14bb59fca9
Update Megatron/PyTorch Primus 25.9 docs ( #5528 )
...
* add previous versions
* Fix heading levels in pages using embedded templates (#5468 )
* update primus-megatron doc
update megatron-lm doc
update templates
fix tab
update primus-megatron model configs
Update primus-pytorch model configs
fix css class
add posttrain to pytorch-training template
update data sheets
update
update
update
update docker tags
* Add known issue and update Primus/Turbo versions
* add primus ver to histories
* update primus ver to 0.1.1
* fix leftovers from merge conflict
2025-10-16 12:51:30 -04:00
anisha-amd
a98236a4e3
Main Docs: references of accelerator removal and change to GPU ( #5495 )
...
* Docs: references of accelerator removal and change to GPU
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
2025-10-16 11:22:10 -04:00
peterjunpark
68e8453ca5
Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 ( #5481 )
...
* archive previous doc version
* update model/docker data and doc templates
* Update "Reproducing the Docker image"
* fix: truncated commit hash doesn't work for some reason
* bump rocm-docs-core to 1.26.0
* fix numbering
fix
* update docker tag
* update .wordlist.txt
2025-10-08 16:23:40 -04:00
Peter Park
d92e5b6c12
Update Primus Megatron doc v25.8 ( #5396 )
...
* megatron: update previous versions list
update
wording
* megatron: update rst and yaml
update primus repo link
update mig guide
* update headings and anchors
* megatron: update doc
* update docker hub urls
2025-09-19 08:09:21 -04:00
Peter Park
9827ba7ff2
docs: MaxText v25.7 patch update ( #5372 )
...
* remove jax 0.6.0 nanoo fp8 caveat note
* reorder maxtext docker images in data sheet
2025-09-17 16:25:46 -04:00
Peter Park
26f708da87
Add Stable Diffusion XL to PyT training benchmark doc and fix paths in SGLang Disagg Inference doc ( #5282 )
...
* add sdxl to pytorch-training
* fix sphinx warnings
fix links
* fix paths in cmds and links in sglang disagg
* fix col width
* update release highlights
* fix
quickfix
2025-09-16 16:49:33 -04:00
Peter Park
bab853a0d3
Add NCF to pytorch training benchmark doc ( #5352 )
...
* add previous version (25.6)
* fix template
* Formatting and wording fixes
* add caveats
* update yaml
* add note to pytorch-training
* fix template
* make model name shorter
2025-09-16 13:29:28 -04:00
Peter Park
d5101532f7
docs: Add SGLang disaggregated P/D inference w/ Mooncake guide ( #5335 )
...
* add main content
* Update content and format
add clarification
update
update data
* fix
fix
fix
* fix: deepseek v3
* add ki
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-09-16 10:33:58 -05:00
Peter Park
ef4e7ca1fe
docs(PyTorch training v25.8): Add Primus and update PyTorch training benchmark docs ( #5331 )
...
* pyt: update previous versions list
update conf.py
* pyt: update yaml and rst
update
update toc
* update headings and anchors
* pyt: update doc
* update docker hub urls
2025-09-16 10:33:53 -05:00
Parag Bhandari
60e3a8107c
Merge branch 'develop' into develop-internal
2025-09-16 05:12:42 -04:00
Peter Park
7098bdc03b
Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) ( #5289 )
2025-09-11 15:01:17 -04:00
Peter Park
05a66f75fe
add qwen3 30b a3b to vllm-benchmark-models ( #5280 )
2025-09-09 17:41:11 -04:00
Peter Park
4f53183696
docs: Add JAX MaxText benchmark v25.7 ( #5182 )
...
* Update previous versions
* Add data file
* fix filename and anchors
* add templates
* update .wordlist.txt
* Update template and data
add missing step
fix fmt
* update template
* fix data
* add jax 0.6.0
* update history
* update quantized training note
2025-09-08 21:42:56 -04:00
Peter Park
4bc1bf00c6
Update PyTorch training benchmark docker doc to 25.7 ( #5255 )
...
* Update PyTorch training benchmark docker doc to 25.7
* update .wordlist.txt
* update conf.py
* update data sheet
* fix sphinx warnings
2025-09-05 12:07:51 -04:00
Istvan Kiss
d476d09aff
Update precision support page with missing libraries and RDNA2 and CDNA4 support
2025-08-28 17:09:34 +02:00
Pratik Basyal
ea8ff1b17d
UCC and UCX version and release notes update for 7.0.0 ( #521 )
...
* Indentation and formatting updated
* UCC and UCX version udpated
* ROCm bandwidth test update
* MI350 series info added
* Changelog update
* ROCm systems Profiler highlight updated
* Redundant removed, pulled out from HIP changelog
* Known issues to Compute profiler added
* ONNX compatibility updtaed
* ROCm COmpute Profiler highlight added
* RN update
* ROCm 700 stack image updated
* ROCM Compute and System highlight updated
* Deep learning frameworks added
* removed BF16 support for MIGraphX -- already in 6.4 release notes; removed FP4 MIGraphX support
* ROCm Compute profiler highlight updated
* Formatting update
* AI framework update
* ROCm Systems Profiler udpate
* removed mention of CentOS of CentOS
* ROCm Compute Profiler update
* Feedback changes
* leo's feedback incorporated
* ampersand
* Changelog synced
* Changelog synced
* RHEL 10 removed
* Rocky Linux updated
---------
Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com >
2025-08-26 16:34:27 -04:00
Peter Park
98029db4ee
docs: Add Primus (Megatron) training Docker documentation ( #5218 )
2025-08-21 23:50:55 -04:00
Istvan Kiss
ae734e7846
Add MI350X and MI355X to atomics operation page ( #497 )
...
Add MI350X and MI355X to atomics operation page
2025-08-18 15:37:19 +02:00
Peter Park
55d0a88ec5
vLLM inference benchmark doc: add missing data field ( #5199 )
2025-08-15 13:20:39 -04:00
Peter Park
7ee22790ce
docs: Update vLLM benchmark doc for 20250812 Docker release ( #5196 )
2025-08-14 15:43:36 -04:00
Peter Park
80f7dc79b9
Add Hunyuan Video to PyTorch inference benchmark models doc ( #5094 )
2025-08-12 11:54:59 -04:00
Pratik Basyal
f632f2879f
ROCm Software Stack image for 6.4.0 updated ( #5112 )
2025-07-28 14:51:19 -04:00
yugang-amd
cc5bc5a882
Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B ( #4870 )
2025-07-25 12:42:40 -04:00
Peter Park
984a91f008
Add DeepSeek Janus Pro 7B to PyTorch inference benchmark doc ( #5071 )
...
---------
Co-authored-by: yugang-amd <yugang.wang@amd.com >
2025-07-22 16:26:06 -04:00
Peter Park
5bcf3b0847
Update Megatron-LM training benchmark doc for v25.6 release ( #5064 )
2025-07-18 15:57:25 -04:00
Peter Park
b437a625b3
Update vLLM inference benchmark doc for 0715 release ( #5058 )
2025-07-17 15:00:02 -04:00
Peter Park
d471b04cd5
Update vLLM Docker doc for 07/02
2025-07-09 11:38:27 -04:00
Peter Park
d0c8ba0805
Add Wan2.1 to PyTorch inference Docker documentation ( #4984 )
...
* add wan2.1 to pyt inference models
* update group name
* fix container tag
* fix group name
* change documented data type to bfloat16
* fix col width
2025-07-02 09:58:37 -04:00
Peter Park
91a541f8b9
Update PyTorch training benchmark doc for v25.6 ( #4950 )
...
* update pytorch-training docker details
* add previous version
* add models data
* update models data id
* add models picker
* update data
* update fmt
fmt
* update data yaml
* update template
* update data
* fix
* fix vllm-0.6.4 broken link
* fix vllm history
2025-06-23 09:26:15 -04:00
Peter Park
34f8d57ece
Organize version histories in ROCm for AI benchmark Docker docs ( #4948 )
...
* add vllm 0.8.3 20250415
update prev versions table
* add vllm previous versions page
* move index to vllm-history
* add standalone megatron-lm version history
* add pytorch training version history
* fix
* add vllm-0.4.3
* add vllm-0.6.4
* update vllm-history
* add vllm-0.7.3
* add vllm-0.6.6
* add notes
* fix vllm readme links
fix main page link
* add latest version to previous versions list
* add jax-maxtext history
* fix jax-maxtext history
* add pytorch-training history
* add link in jax-maxtext 25.4
* add megatron-lm history
* fix datatemplate path for vllm 0.8.3
* fix jax-maxtext history link
* update note about performance measurements
* add vllm 0.8.5_20250521 previous version
* consistency fixes
2025-06-20 15:01:38 -04:00
yugang-amd
55f95adc7c
Update for vllm -06/10 ( #4943 )
2025-06-20 08:41:37 -04:00
Peter Park
cfb3504d77
Add Mochi Video to pytorch-inference-benchmark-models.yaml
...
Add Mochi Video to pytorch-inference-benchmark-models.yaml
2025-06-10 13:18:41 -04:00
yugang-amd
830f2d5edf
Update for vllm -05/27 ( #4886 )
...
* Update vLLM inference benchmark Docker page for rocm/vllm 5/27
* update repo for Pytorch
2025-06-05 13:30:20 -04:00
Peter Park
6999c24402
Add microsoft/phi-4 vllm-benchmark-models ( #4801 )
...
* add Phi-4 to vllm-benchmark-models.yaml
fix model_repo
* update model group names
2025-05-30 06:37:13 -04:00
Peter Park
daf2e980d9
Add Falcon-180B to vLLM benchmark Docker doc ( #4836 )
...
* add Falcon to vllm-benchmark-models.yaml
* update group name
2025-05-29 18:26:21 -04:00
Peter Park
9dbc10b4c5
Fix rocm/vllm pull tag
...
Fix rocm/vllm pull tag
2025-05-28 14:42:21 -04:00
Peter Park
cebf0f5975
Add latest rocm/vllm Docker details in vLLM inference benchmark guide ( #4824 )
...
* update rocm/vllm Docker details to latest release
* Add previous vLLM version
* fix 'further reading' xrefs
* improve model grouping names
* fix links
* update model picker text
2025-05-28 14:20:18 -04:00
Peter Park
9ed65a81c4
Add Megatron-LM benchmark doc 5/2 ( #4778 )
...
* reorg files
* add tabs
* update template
* update template
* update wordlist and toc
* add previous version to doc
* add selector paragraph
* update wordlist.txt
2025-05-22 14:28:18 -04:00
Pratik Basyal
8ef1bb0139
rocSHMEM component added to ROCm 6.4.0 documentation ( #4719 )
...
* rocSHMEM added to ROCm 640
* Space removed
* link fixed
2025-05-07 15:31:38 -04:00
Peter Park
85778177a1
Update vLLM docker pull tag 20250415 in vllm-benchmark.rst ( #4702 )
2025-04-30 16:09:30 -04:00
Peter Park
36b6ffaf7c
Add QwQ 32B to vllm-benchmark.rst ( #4685 )
...
* Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml
* Add QwQ-32B-Preview to vllm-benchmark-models.yaml
* add links to performance results
words
* change "performance validation" to "performance testing"
* remove "-Preview" from QwQ-32B
* move qwen2 MoE after qwen2
* add TunableOp section
* fix formatting
* add link to TunableOp doc
* add tunableop note
* fix vllm-benchmark template
* remove cmdline option for --tunableop on
* update docker details
* remove "training"
* remove qwen2
2025-04-24 16:44:34 -04:00
Peter Park
40e4ba3ecc
Update vLLM inference benchmark Docker guide ( #4653 )
...
* Remove JAIS 13B and 30B
* update Docker details - vLLM 0.8.3
* add previous version
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
* fix link to previous version
2025-04-24 15:59:13 -04:00
Peter Park
c3faa9670b
Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) ( #4654 )
...
* update vLLM links in deploy-your-model.rst
* add pytorch inference benchmark doc
* update toc and vLLM title
* remove previous versions
* update
* wording
* fix link and "applies to"
* add pytorch to wordlist
* add tunableop note to clip
* make tunableop note appear to all models
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* fix incorrect links
* wording
* fix wrong docker pull tag
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2025-04-23 17:35:52 -04:00
Istvan Kiss
635838e7ef
Add atomics operation support page
2025-03-20 17:11:02 +01:00
Peter Park
9b2ce2b634
Update vLLM performance Docker docs ( #4491 )
...
* add links to performance results
words
* change "performance validation" to "performance testing"
* update vLLM docker 3/11
* add previous versions
add previous versions
* fix llama 3.1 8b model repo name
* words
2025-03-13 10:04:21 -04:00
Peter Park
1fb42c2591
Update LLM inference performance validation on AMD Instinct MI300X guide to filter by desired model ( #4424 )
...
* WIP
(cherry picked from commit a06a5b5b959a9425e7384fb58b88c3716f380e48)
rm unneeded files
(cherry picked from commit f1d0c00056a83299bdea74a43cd17454999cf2d8)
* add sphinxcontrib.datatemplates
(cherry picked from commit d056b93a325d87b81f54f70c6eb4ae78f4fb0bc1)
* add template
(cherry picked from commit 0691d59f0a1efbda7908762b7a906e30a65c0ee1)
fix template
(cherry picked from commit 01e4bea5522aa5deeaade58c105ff850f449df8b)
WIPO
(cherry picked from commit 4d8daf7445e7be92cd9ee1d39dff564bd8de41f4)
WIP
(cherry picked from commit 9eefd1f5833bc4dc8de9d777ff65a5fe5f826dbd)
update models yaml schema
(cherry picked from commit a5f0fc1e6cc51104dc2d42029bfcf3eea276d270)
add model groups functionality
(cherry picked from commit 13f49f96dd3e5a160d37c52e48a4fbcccdcf4f9e)
add selector headings and fix template
(cherry picked from commit 35f7f2314bcf74b4fd0a8ca10aaabf0de7063bb0)
update template
(cherry picked from commit 9e2dcfe0c7f6e7c2c685866ea83375fbacbc5032)
fix
(cherry picked from commit be51e32791550ddc21785effccb889228394b242)
use classes instead of data tags
(cherry picked from commit cd52d68c504f7e7435d156ae70cf4bde1dfe703e)
update template
(cherry picked from commit 9ed89fee6874b39ee3535fbde54a0a59f346ea2b)
clean up extra wip files
(cherry picked from commit a9f965a104baa966c184054638e935b011526278)
update wordlist
(cherry picked from commit f783656814e896aedd21acd1c8c87b4700c14469)
remove unused template
(cherry picked from commit cac894bd9c2b1262c9c006e5fddbcb742dc6d882)
improve script
(cherry picked from commit ca20ffd4922916616e0924d625652a815f27c35f)
fix template
(cherry picked from commit 752c61fda856fd5b244734636c036c8877e823b9)
fix standalone benchmark output path in template
(cherry picked from commit d8c04203b5ec0f6c2e2307f7890304a3dc5687be)
fix toc
(cherry picked from commit 8df42faf53488ef29f5a263d25032f3d35cd58ed)
update script to prevent flash of unstyled content
import a11y
(cherry picked from commit 46c852717f223a1d8744fab035807cebab4c5404)
add tabindex to wordlist
(cherry picked from commit 11492593f9692f5453045e7ec52c8f8ae9624ae9)
text
update script
* remove unused config option
* reorganize assets
* fix linting warning
* move js from data/ to extension/
2025-02-28 12:39:02 -05:00
randyh62
32feb96819
Rocm azure linux ( #280 )
...
* Ad Software stack for 6.3.2
includes Azure Linux
* Update what-is-rocm.rst
add Azure Linux
2025-01-14 15:50:13 -08:00