github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-02-13 16:05:07 -05:00

Author	SHA1	Message	Date
peterjunpark	e3704ad70e	Revert "Add xdit diffusion docs (#5576 ) (#5578 )" (#5579 ) This reverts commit `a38b2865f0`.	2025-10-27 16:21:10 -04:00
peterjunpark	a38b2865f0	Add xdit diffusion docs (#5576 ) (#5578 ) (cherry picked from commit `4132a2609c`) Co-authored-by: Kristoffer <kristoffer.torp@amd.com>	2025-10-27 15:41:29 -04:00
peterjunpark	dfdff755ef	Fix broken links under rocm-for-ai/ (#5564 ) (#5565 ) (cherry picked from commit `35ca027aa4`)	2025-10-23 15:18:08 -04:00
peterjunpark	8d2d5abdae	add xref to vllm v1 optimization guide in workload.rst (#5560 ) (#5561 ) (cherry picked from commit `90c1d9068f`)	2025-10-23 11:51:55 -04:00
peterjunpark	b30b8b43e0	Updates to the vLLM optimization guide for MI300X/MI355X (#5554 ) * Expand vLLM optimization guide for MI300X/MI355X with comprehensive AITER coverage. attention backend selection, environment variables (HIP/RCCL/Quick Reduce), parallelism strategies, quantization (FP8/FP4), engine tuning, CUDA graph modes, and multi-node scaling. Co-authored-by: PinSiang <pinsiang.tan@embeddedllm.com> Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by: pinsiangamd <pinsiang.tan@amd.com> Co-authored-by: Jeffrey Novotny <jnovotny@amd.com> (cherry picked from commit `cb8d21a0df`)	2025-10-22 13:01:57 -04:00
peterjunpark	79acda6775	JAX Maxtext v25.9 doc update (#5532 ) (#5533 ) * archive previous version (25.7) * update docker components list for 25.9 * update template * update docker pull tag * update * fix intro (cherry picked from commit `a613bd6824`)	2025-10-17 11:54:39 -04:00
peterjunpark	811fa5c87a	Update Megatron/PyTorch Primus 25.9 docs (#5528 ) (#5529 ) * add previous versions * Fix heading levels in pages using embedded templates (#5468) * update primus-megatron doc update megatron-lm doc update templates fix tab update primus-megatron model configs Update primus-pytorch model configs fix css class add posttrain to pytorch-training template update data sheets update update update update docker tags * Add known issue and update Primus/Turbo versions * add primus ver to histories * update primus ver to 0.1.1 * fix leftovers from merge conflict (cherry picked from commit `14bb59fca9`)	2025-10-16 13:27:40 -04:00
Pratik Basyal	0ada3a8fef	ROCm for HPC topic updated Develop (#5504 ) (#5505 ) * ROCm for HPC topic updated * ROCm for HPC topic udpated * Minor editorial	2025-10-10 22:39:31 -04:00
peterjunpark	68e8453ca5	Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 (#5481 ) * archive previous doc version * update model/docker data and doc templates * Update "Reproducing the Docker image" * fix: truncated commit hash doesn't work for some reason * bump rocm-docs-core to 1.26.0 * fix numbering fix * update docker tag * update .wordlist.txt	2025-10-08 16:23:40 -04:00
peterjunpark	eeea0d2180	Fix heading levels in pages using embedded templates (#5468 )	2025-10-03 13:33:14 -04:00
anisha-amd	93c6d17922	Docs: frameworks 25.09 - compatibility - FlashInfer and llama.cpp (#5462 )	2025-10-02 13:51:36 -04:00
peterjunpark	2e1b4dd5ee	Add multi-node setup instructions for training perf Dockers (#5449 ) --------- Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>	2025-09-30 14:53:38 -04:00
Peter Park	fd59b5fbac	fix links in docs (#5446 )	2025-09-29 15:27:32 -04:00
Pratik Basyal	d92d9268dc	Use of Radeon and Ryzen reference updated [Develop] (#5432 ) * Use of Radeon and Ryzen reference updated * Pytorch link update	2025-09-24 19:07:41 -05:00
Peter Park	442d7e4750	Add env var note to vllm.rst for MoE models and fix links in docs (#5415 ) * docs(vllm.rst): add performance note for MoE models * docs: fix links update vllm readme link 20250521 fix links	2025-09-22 15:58:43 -04:00
Peter Park	d92e5b6c12	Update Primus Megatron doc v25.8 (#5396 ) * megatron: update previous versions list update wording * megatron: update rst and yaml update primus repo link update mig guide * update headings and anchors * megatron: update doc * update docker hub urls	2025-09-19 08:09:21 -04:00
Peter Park	9827ba7ff2	docs: MaxText v25.7 patch update (#5372 ) * remove jax 0.6.0 nanoo fp8 caveat note * reorder maxtext docker images in data sheet	2025-09-17 16:25:46 -04:00
Peter Park	e8d104124f	Fix PyTorch training benchmark doc template (#5357 ) * fix template * update wordlist	2025-09-16 17:21:57 -04:00
Peter Park	26f708da87	Add Stable Diffusion XL to PyT training benchmark doc and fix paths in SGLang Disagg Inference doc (#5282 ) * add sdxl to pytorch-training * fix sphinx warnings fix links * fix paths in cmds and links in sglang disagg * fix col width * update release highlights * fix quickfix	2025-09-16 16:49:33 -04:00
Peter Park	bab853a0d3	Add NCF to pytorch training benchmark doc (#5352 ) * add previous version (25.6) * fix template * Formatting and wording fixes * add caveats * update yaml * add note to pytorch-training * fix template * make model name shorter	2025-09-16 13:29:28 -04:00
Peter Park	d5101532f7	docs: Add SGLang disaggregated P/D inference w/ Mooncake guide (#5335 ) * add main content * Update content and format add clarification update update data * fix fix fix * fix: deepseek v3 * add ki * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-09-16 10:33:58 -05:00
Peter Park	ef4e7ca1fe	docs(PyTorch training v25.8): Add Primus and update PyTorch training benchmark docs (#5331 ) * pyt: update previous versions list update conf.py * pyt: update yaml and rst update update toc * update headings and anchors * pyt: update doc * update docker hub urls	2025-09-16 10:33:53 -05:00
Peter Park	76cb264f34	Update vllm-history.rst with missing 0909 entry (#5308 )	2025-09-16 06:54:34 -04:00
Peter Park	7098bdc03b	Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) (#5289 )	2025-09-11 15:01:17 -04:00
anisha-amd	db43d18c37	Docs: frameworks compatibility- ray and llama.cpp (#5273 )	2025-09-09 11:02:30 -04:00
Peter Park	4f53183696	docs: Add JAX MaxText benchmark v25.7 (#5182 ) * Update previous versions * Add data file * fix filename and anchors * add templates * update .wordlist.txt * Update template and data add missing step fix fmt * update template * fix data * add jax 0.6.0 * update history * update quantized training note	2025-09-08 21:42:56 -04:00
Peter Park	4bc1bf00c6	Update PyTorch training benchmark docker doc to 25.7 (#5255 ) * Update PyTorch training benchmark docker doc to 25.7 * update .wordlist.txt * update conf.py * update data sheet * fix sphinx warnings	2025-09-05 12:07:51 -04:00
Matt Williams	76fd6b2290	Updating broken link (#5258 )	2025-09-05 11:45:06 -04:00
Matt Williams	1d42f7cc62	Deep learning frameworks edits for scale (#5189 ) * Deep learning frameworks edits for scale Based on https://ontrack-internal.amd.com/browse/ROCDOC-1809 * update table table * leo comments * formatting * format * update table based on feedback * header * Update machine learning page * headers * Apply suggestions from code review Co-authored-by: anisha-amd <anisha.sankar@amd.com> * Update .wordlist.txt * formatting * Update docs/how-to/deep-learning-rocm.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> --------- Co-authored-by: Matt Williams <Matt.Williams+amdeng@amd.com> Co-authored-by: anisha-amd <anisha.sankar@amd.com> Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-08-22 11:46:07 -04:00
Peter Park	98029db4ee	docs: Add Primus (Megatron) training Docker documentation (#5218 )	2025-08-21 23:50:55 -04:00
Peter Park	55d0a88ec5	vLLM inference benchmark doc: add missing data field (#5199 )	2025-08-15 13:20:39 -04:00
Peter Park	7ee22790ce	docs: Update vLLM benchmark doc for 20250812 Docker release (#5196 )	2025-08-14 15:43:36 -04:00
Peter Park	80f7dc79b9	Add Hunyuan Video to PyTorch inference benchmark models doc (#5094 )	2025-08-12 11:54:59 -04:00
Dominic Widdows	9e055d92ce	Fix hyperlink syntax	2025-08-08 10:28:09 -07:00
Dominic Widdows	698d7f1d58	Updating old link that has been changed (#5149 )	2025-08-05 15:23:55 -04:00
anisha-amd	266387d816	Docs: Adding frameworks compatibility for Megablocks and Taichi (#5133 )	2025-07-31 13:00:31 -04:00
yugang-amd	cc5bc5a882	Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B (#4870 )	2025-07-25 12:42:40 -04:00
Peter Park	14249f24d8	Use `madengine` instead of tools/run_models.py in docs (#5095 )	2025-07-24 15:38:12 -04:00
Peter Park	984a91f008	Add DeepSeek Janus Pro 7B to PyTorch inference benchmark doc (#5071 ) --------- Co-authored-by: yugang-amd <yugang.wang@amd.com>	2025-07-22 16:26:06 -04:00
Peter Park	2269e9d25d	Remove broken link to deprecated AMDGPU installer documentation (#5078 ) * remove link to deprecated AMDGPU installation method * add deep learning frameworks	2025-07-21 19:36:20 -04:00
Peter Park	5bcf3b0847	Update Megatron-LM training benchmark doc for v25.6 release (#5064 )	2025-07-18 15:57:25 -04:00
Peter Park	7e7e15a201	Fix path to data file in vllm-0.9.1-20250702.rst (#5066 )	2025-07-18 14:16:05 -04:00
Peter Park	b437a625b3	Update vLLM inference benchmark doc for 0715 release (#5058 )	2025-07-17 15:00:02 -04:00
Jan Stephan	16f707d6c4	Merge pull request #5001 from j-stephan/fix-doc-warnings Fix doc warnings	2025-07-16 07:10:54 -04:00
Jeffrey Novotny	b431415ade	Merge Verl, DGL, Megatron changes. (#5047 ) * Verl compatibility * verl compatibility * add Supported features Signed-off-by: Vicky Tsang <vtsang@amd.com> * updated and edited verl compat doc * added links to verl * add future release for sglang and megatron inference eng. Signed-off-by: Vicky Tsang <vtsang@amd.com> * fix lint Signed-off-by: Vicky Tsang <vtsang@amd.com> * fixed a typo and a table * Spolifroni amd/add to compat matrix (#430) * added verl to compatibility matrix * small change * fixed an error in csv * edited the verl compat based on leo's recommendations * updated compat matrix (#435) * Added a hardcoded link to the verl install This is a link to an RTD build and MUST be removed before publishing. * Update verl-compatibility.rst * Added a hardcoded link to the verl install This link is to an RTD build and it WILL break at publishing. It MUST be changed before publishing. * Added version support note (#448) * small fixes * Update verl-compatibility.rst * Update verl-compatibility.rst --------- Signed-off-by: Vicky Tsang <vtsang@amd.com> Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com> Co-authored-by: anisha-amd <anisha.sankar@amd.com> (cherry picked from commit `f9bd22626b`) * Stanford Megatron-LM Compatibility * Create stanford-megatron-lm-compatibility.rst * toc and wordlist * Update deep-learning-rocm.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * fixes and adding to main compat matrix * formatting fix * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/stanford-megatron-lm-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst * Update stanford-megatron-lm-compatibility.rst --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> (cherry picked from commit `f4f096b44e`) * Framework: DGL Compatability * Introducing new file for DGL Compatability * Update dgl-compatibility.rst * Update .wordlist.txt * Update .wordlist.txt * Update deep-learning-rocm.rst * compatibility fixes * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/compatibility/ml-compatibility/dgl-compatibility.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update dgl-compatibility.rst * Update dgl-compatibility.rst * Update dgl-compatibility.rst * Update dgl-compatibility.rst * additions to use-cases and system support * wording and fixes * Update dgl-compatibility.rst * Update dgl-compatibility.rst * remove table heading * Update compatibility-matrix-historical-6.0.csv --------- Co-authored-by: anisha-amd <anisha.sankar@amd.com> Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> (cherry picked from commit `2a7554c0b9`) * Manually resolve merge conflict * Further merge conflict adjustments --------- Signed-off-by: Vicky Tsang <vtsang@amd.com> Co-authored-by: vickytsang <vtsang@amd.com> Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com> Co-authored-by: anisha-amd <anisha.sankar@amd.com> Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Co-authored-by: Mukhil M S <167260682+mukh1l@users.noreply.github.com>	2025-07-15 18:57:31 -04:00
Peter Park	548d31f990	fix broken image in megatron-lm-v24.12-dev.rst (#5043 )	2025-07-15 10:57:12 -04:00
Pratik Basyal	544186aef8	ROCm for HPC table update for Develop (#5015 ) (#5016 ) (#5019 ) * ROCm for HPC table update for 6.4.0 (#5015) (#5016) * 6.4.0 updates synced * Minor change * Link update	2025-07-09 14:57:53 -04:00
Peter Park	22524eeaa5	fix xrefs in vllm-0.9.0.1-20250605.rst (#5017 )	2025-07-09 14:38:24 -04:00
Peter Park	d471b04cd5	Update vLLM Docker doc for 07/02	2025-07-09 11:38:27 -04:00
Peter Park	3b3fc4894b	Fix xrefs and Sphinx warnings in documentation Fix xrefs and Sphinx warnings in documentation	2025-07-08 13:22:53 -04:00

1 2 3 4

200 Commits