peterjunpark
3a43bacdda
Update xdit diffusion inference history ( #5808 )
...
* Update xdit diffusion inference history
* fix
2025-12-22 11:05:32 -05:00
peterjunpark
459283da3c
xDiT diffusion inference v25.12 documentation update ( #5786 )
...
* Add xdit-diffusion ROCm docs page.
* Update template formatting and fix sphinx warnings
* Add System Validation section.
* Add sw component versions/commits.
* Update to use latest v25.10 image instead of v25.9
* Update commands and add FLUX instructions.
* Update Flux instructions. Change image tag. Describe as diffusion inference instead of specifically video.
* git rm xdit-video-diffusion.rst
* Docs for v25.12
* Add hyperlinks to components
* Command fixes
* -Diffusers suffix
* Simplify yaml file and cleanup main rst page.
* Spelling, added 'js'
* fix merge conflict
fix
---------
Co-authored-by: Kristoffer <kristoffer.torp@amd.com >
2025-12-17 10:20:10 -05:00
peterjunpark
1b4f25733d
vLLM inference benchmark 1210 ( #5776 )
...
* Archive previous ver
fix anchors
* Update vllm.rst and data yaml for 20251210
2025-12-17 09:21:57 -05:00
Pratik Basyal
78e8baf147
Taichi removed from ROCm docs [Develop] ( #5779 )
...
* Taichi removed from ROCm docs
* Warnings fixed
2025-12-16 13:12:40 -05:00
yugang-amd
f2067767e0
xdit-diffusion v25.11 docs ( #5744 )
2025-12-05 17:09:48 -05:00
yugang-amd
674dc355e4
vLLM 10/24 release ( #5626 )
...
* vLLM 10/24 release
* updates per SME inputs
* Update docs/how-to/rocm-for-ai/inference/benchmark-docker/vllm.rst
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
---------
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com >
2025-11-05 11:13:50 -05:00
anisha-amd
a98236a4e3
Main Docs: references of accelerator removal and change to GPU ( #5495 )
...
* Docs: references of accelerator removal and change to GPU
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com >
2025-10-16 11:22:10 -04:00
peterjunpark
68e8453ca5
Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 ( #5481 )
...
* archive previous doc version
* update model/docker data and doc templates
* Update "Reproducing the Docker image"
* fix: truncated commit hash doesn't work for some reason
* bump rocm-docs-core to 1.26.0
* fix numbering
fix
* update docker tag
* update .wordlist.txt
2025-10-08 16:23:40 -04:00
Peter Park
442d7e4750
Add env var note to vllm.rst for MoE models and fix links in docs ( #5415 )
...
* docs(vllm.rst): add performance note for MoE models
* docs: fix links
update vllm readme link 20250521
fix links
2025-09-22 15:58:43 -04:00
Peter Park
76cb264f34
Update vllm-history.rst with missing 0909 entry ( #5308 )
2025-09-16 06:54:34 -04:00
Peter Park
7098bdc03b
Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) ( #5289 )
2025-09-11 15:01:17 -04:00
Peter Park
4bc1bf00c6
Update PyTorch training benchmark docker doc to 25.7 ( #5255 )
...
* Update PyTorch training benchmark docker doc to 25.7
* update .wordlist.txt
* update conf.py
* update data sheet
* fix sphinx warnings
2025-09-05 12:07:51 -04:00
Peter Park
55d0a88ec5
vLLM inference benchmark doc: add missing data field ( #5199 )
2025-08-15 13:20:39 -04:00
Peter Park
7ee22790ce
docs: Update vLLM benchmark doc for 20250812 Docker release ( #5196 )
2025-08-14 15:43:36 -04:00
yugang-amd
cc5bc5a882
Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B ( #4870 )
2025-07-25 12:42:40 -04:00
Peter Park
5bcf3b0847
Update Megatron-LM training benchmark doc for v25.6 release ( #5064 )
2025-07-18 15:57:25 -04:00
Peter Park
7e7e15a201
Fix path to data file in vllm-0.9.1-20250702.rst ( #5066 )
2025-07-18 14:16:05 -04:00
Peter Park
b437a625b3
Update vLLM inference benchmark doc for 0715 release ( #5058 )
2025-07-17 15:00:02 -04:00
Jan Stephan
16f707d6c4
Merge pull request #5001 from j-stephan/fix-doc-warnings
...
Fix doc warnings
2025-07-16 07:10:54 -04:00
Peter Park
22524eeaa5
fix xrefs in vllm-0.9.0.1-20250605.rst ( #5017 )
2025-07-09 14:38:24 -04:00
Peter Park
d471b04cd5
Update vLLM Docker doc for 07/02
2025-07-09 11:38:27 -04:00
Peter Park
3b3fc4894b
Fix xrefs and Sphinx warnings in documentation
...
Fix xrefs and Sphinx warnings in documentation
2025-07-08 13:22:53 -04:00
Peter Park
91a541f8b9
Update PyTorch training benchmark doc for v25.6 ( #4950 )
...
* update pytorch-training docker details
* add previous version
* add models data
* update models data id
* add models picker
* update data
* update fmt
fmt
* update data yaml
* update template
* update data
* fix
* fix vllm-0.6.4 broken link
* fix vllm history
2025-06-23 09:26:15 -04:00
Peter Park
34f8d57ece
Organize version histories in ROCm for AI benchmark Docker docs ( #4948 )
...
* add vllm 0.8.3 20250415
update prev versions table
* add vllm previous versions page
* move index to vllm-history
* add standalone megatron-lm version history
* add pytorch training version history
* fix
* add vllm-0.4.3
* add vllm-0.6.4
* update vllm-history
* add vllm-0.7.3
* add vllm-0.6.6
* add notes
* fix vllm readme links
fix main page link
* add latest version to previous versions list
* add jax-maxtext history
* fix jax-maxtext history
* add pytorch-training history
* add link in jax-maxtext 25.4
* add megatron-lm history
* fix datatemplate path for vllm 0.8.3
* fix jax-maxtext history link
* update note about performance measurements
* add vllm 0.8.5_20250521 previous version
* consistency fixes
2025-06-20 15:01:38 -04:00
yugang-amd
55f95adc7c
Update for vllm -06/10 ( #4943 )
2025-06-20 08:41:37 -04:00
Peter Park
d69037bfcc
Fix Sphinx issue in vllm-benchmark 0.8.5-20250513 previous version ( #4924 )
...
* fix sphinx issue in vllm-benchmark 0.8.5-20250513 previous version
* update article_info in conf.py
* update rocm/vllm
2025-06-13 15:03:51 -04:00
yugang-amd
830f2d5edf
Update for vllm -05/27 ( #4886 )
...
* Update vLLM inference benchmark Docker page for rocm/vllm 5/27
* update repo for Pytorch
2025-06-05 13:30:20 -04:00