github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-01-09 14:48:06 -05:00

Author	SHA1	Message	Date
yugang-amd	f2067767e0	xdit-diffusion v25.11 docs (#5744 )	2025-12-05 17:09:48 -05:00
peterjunpark	453751a86f	fix docker hub links for primus:v25.10 (#5738 )	2025-12-04 09:17:33 -05:00
peterjunpark	fb644412d5	Update training Docker docs for Primus 25.10 (#5737 )	2025-12-04 09:08:00 -05:00
yugang-amd	674dc355e4	vLLM 10/24 release (#5626 ) * vLLM 10/24 release * updates per SME inputs * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/vllm.rst Co-authored-by: Jeffrey Novotny <jnovotny@amd.com> --------- Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>	2025-11-05 11:13:50 -05:00
peterjunpark	1515fb3779	Revert "Add xdit diffusion docs (#5576 )" (#5580 ) This reverts commit `4132a2609c`.	2025-10-27 16:22:28 -04:00
Kristoffer	4132a2609c	Add xdit diffusion docs (#5576 ) * Add xdit video diffusion base page. * Update supported accelerators. * Remove dependency on mad-tags. * Update docker pull section. * Update container launch instructions. * Improve launch instruction options and layout. * Add benchmark result outputs. * Fix wrong HunyuanVideo path * Finalize instructions. * Consistent title. * Make page and side-bar titles the same. * Updated wordlist. Removed note container reg HF. * Remove fp8_gemms in command and add release notes. * Update accelerators naming. * Add note regarding OOB performance. * Fix admonition box. * Overall fixes.	2025-10-27 14:56:55 +01:00
peterjunpark	a613bd6824	JAX Maxtext v25.9 doc update (#5532 ) * archive previous version (25.7) * update docker components list for 25.9 * update template * update docker pull tag * update * fix intro	2025-10-17 11:31:06 -04:00
peterjunpark	14bb59fca9	Update Megatron/PyTorch Primus 25.9 docs (#5528 ) * add previous versions * Fix heading levels in pages using embedded templates (#5468) * update primus-megatron doc update megatron-lm doc update templates fix tab update primus-megatron model configs Update primus-pytorch model configs fix css class add posttrain to pytorch-training template update data sheets update update update update docker tags * Add known issue and update Primus/Turbo versions * add primus ver to histories * update primus ver to 0.1.1 * fix leftovers from merge conflict	2025-10-16 12:51:30 -04:00
anisha-amd	a98236a4e3	Main Docs: references of accelerator removal and change to GPU (#5495 ) * Docs: references of accelerator removal and change to GPU Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>	2025-10-16 11:22:10 -04:00
peterjunpark	68e8453ca5	Update vLLM doc for 10/6 release and bump rocm-docs-core to 1.26.0 (#5481 ) * archive previous doc version * update model/docker data and doc templates * Update "Reproducing the Docker image" * fix: truncated commit hash doesn't work for some reason * bump rocm-docs-core to 1.26.0 * fix numbering fix * update docker tag * update .wordlist.txt	2025-10-08 16:23:40 -04:00
Peter Park	d92e5b6c12	Update Primus Megatron doc v25.8 (#5396 ) * megatron: update previous versions list update wording * megatron: update rst and yaml update primus repo link update mig guide * update headings and anchors * megatron: update doc * update docker hub urls	2025-09-19 08:09:21 -04:00
Peter Park	9827ba7ff2	docs: MaxText v25.7 patch update (#5372 ) * remove jax 0.6.0 nanoo fp8 caveat note * reorder maxtext docker images in data sheet	2025-09-17 16:25:46 -04:00
Peter Park	26f708da87	Add Stable Diffusion XL to PyT training benchmark doc and fix paths in SGLang Disagg Inference doc (#5282 ) * add sdxl to pytorch-training * fix sphinx warnings fix links * fix paths in cmds and links in sglang disagg * fix col width * update release highlights * fix quickfix	2025-09-16 16:49:33 -04:00
Peter Park	bab853a0d3	Add NCF to pytorch training benchmark doc (#5352 ) * add previous version (25.6) * fix template * Formatting and wording fixes * add caveats * update yaml * add note to pytorch-training * fix template * make model name shorter	2025-09-16 13:29:28 -04:00
Peter Park	d5101532f7	docs: Add SGLang disaggregated P/D inference w/ Mooncake guide (#5335 ) * add main content * Update content and format add clarification update update data * fix fix fix * fix: deepseek v3 * add ki * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/benchmark-docker/sglang-distributed.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-09-16 10:33:58 -05:00
Peter Park	ef4e7ca1fe	docs(PyTorch training v25.8): Add Primus and update PyTorch training benchmark docs (#5331 ) * pyt: update previous versions list update conf.py * pyt: update yaml and rst update update toc * update headings and anchors * pyt: update doc * update docker hub urls	2025-09-16 10:33:53 -05:00
Parag Bhandari	60e3a8107c	Merge branch 'develop' into develop-internal	2025-09-16 05:12:42 -04:00
Peter Park	7098bdc03b	Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) (#5289 )	2025-09-11 15:01:17 -04:00
Peter Park	05a66f75fe	add qwen3 30b a3b to vllm-benchmark-models (#5280 )	2025-09-09 17:41:11 -04:00
Peter Park	4f53183696	docs: Add JAX MaxText benchmark v25.7 (#5182 ) * Update previous versions * Add data file * fix filename and anchors * add templates * update .wordlist.txt * Update template and data add missing step fix fmt * update template * fix data * add jax 0.6.0 * update history * update quantized training note	2025-09-08 21:42:56 -04:00
Peter Park	4bc1bf00c6	Update PyTorch training benchmark docker doc to 25.7 (#5255 ) * Update PyTorch training benchmark docker doc to 25.7 * update .wordlist.txt * update conf.py * update data sheet * fix sphinx warnings	2025-09-05 12:07:51 -04:00
Istvan Kiss	d476d09aff	Update precision support page with missing libraries and RDNA2 and CDNA4 support	2025-08-28 17:09:34 +02:00
Pratik Basyal	ea8ff1b17d	UCC and UCX version and release notes update for 7.0.0 (#521 ) * Indentation and formatting updated * UCC and UCX version updated * ROCm bandwidth test update * MI350 series info added * Changelog update * ROCm systems Profiler highlight updated * Redundant removed, pulled out from HIP changelog * Known issues to Compute profiler added * ONNX compatibility updated * ROCm Compute Profiler highlight added * RN update * ROCm 700 stack image updated * ROCM Compute and System highlight updated * Deep learning frameworks added * removed BF16 support for MIGraphX -- already in 6.4 release notes; removed FP4 MIGraphX support * ROCm Compute profiler highlight updated * Formatting update * AI framework update * ROCm Systems Profiler update * removed mention of CentOS of CentOS * ROCm Compute Profiler update * Feedback changes * leo's feedback incorporated * ampersand * Changelog synced * Changelog synced * RHEL 10 removed * Rocky Linux updated --------- Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com>	2025-08-26 16:34:27 -04:00
Peter Park	98029db4ee	docs: Add Primus (Megatron) training Docker documentation (#5218 )	2025-08-21 23:50:55 -04:00
Istvan Kiss	ae734e7846	Add MI350X and MI355X to atomics operation page (#497 ) Add MI350X and MI355X to atomics operation page	2025-08-18 15:37:19 +02:00
Peter Park	55d0a88ec5	vLLM inference benchmark doc: add missing data field (#5199 )	2025-08-15 13:20:39 -04:00
Peter Park	7ee22790ce	docs: Update vLLM benchmark doc for 20250812 Docker release (#5196 )	2025-08-14 15:43:36 -04:00
Peter Park	80f7dc79b9	Add Hunyuan Video to PyTorch inference benchmark models doc (#5094 )	2025-08-12 11:54:59 -04:00
Pratik Basyal	f632f2879f	ROCm Software Stack image for 6.4.0 updated (#5112 )	2025-07-28 14:51:19 -04:00
yugang-amd	cc5bc5a882	Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B (#4870 )	2025-07-25 12:42:40 -04:00
Peter Park	984a91f008	Add DeepSeek Janus Pro 7B to PyTorch inference benchmark doc (#5071 ) --------- Co-authored-by: yugang-amd <yugang.wang@amd.com>	2025-07-22 16:26:06 -04:00
Peter Park	5bcf3b0847	Update Megatron-LM training benchmark doc for v25.6 release (#5064 )	2025-07-18 15:57:25 -04:00
Peter Park	b437a625b3	Update vLLM inference benchmark doc for 0715 release (#5058 )	2025-07-17 15:00:02 -04:00
Peter Park	d471b04cd5	Update vLLM Docker doc for 07/02	2025-07-09 11:38:27 -04:00
Peter Park	d0c8ba0805	Add Wan2.1 to PyTorch inference Docker documentation (#4984 ) * add wan2.1 to pyt inference models * update group name * fix container tag * fix group name * change documented data type to bfloat16 * fix col width	2025-07-02 09:58:37 -04:00
Peter Park	91a541f8b9	Update PyTorch training benchmark doc for v25.6 (#4950 ) * update pytorch-training docker details * add previous version * add models data * update models data id * add models picker * update data * update fmt fmt * update data yaml * update template * update data * fix * fix vllm-0.6.4 broken link * fix vllm history	2025-06-23 09:26:15 -04:00
Peter Park	34f8d57ece	Organize version histories in ROCm for AI benchmark Docker docs (#4948 ) * add vllm 0.8.3 20250415 update prev versions table * add vllm previous versions page * move index to vllm-history * add standalone megatron-lm version history * add pytorch training version history * fix * add vllm-0.4.3 * add vllm-0.6.4 * update vllm-history * add vllm-0.7.3 * add vllm-0.6.6 * add notes * fix vllm readme links fix main page link * add latest version to previous versions list * add jax-maxtext history * fix jax-maxtext history * add pytorch-training history * add link in jax-maxtext 25.4 * add megatron-lm history * fix datatemplate path for vllm 0.8.3 * fix jax-maxtext history link * update note about performance measurements * add vllm 0.8.5_20250521 previous version * consistency fixes	2025-06-20 15:01:38 -04:00
yugang-amd	55f95adc7c	Update for vllm -06/10 (#4943 )	2025-06-20 08:41:37 -04:00
Peter Park	cfb3504d77	Add Mochi Video to pytorch-inference-benchmark-models.yaml Add Mochi Video to pytorch-inference-benchmark-models.yaml	2025-06-10 13:18:41 -04:00
yugang-amd	830f2d5edf	Update for vllm -05/27 (#4886 ) * Update vLLM inference benchmark Docker page for rocm/vllm 5/27 * update repo for Pytorch	2025-06-05 13:30:20 -04:00
Peter Park	6999c24402	Add microsoft/phi-4 vllm-benchmark-models (#4801 ) * add Phi-4 to vllm-benchmark-models.yaml fix model_repo * update model group names	2025-05-30 06:37:13 -04:00
Peter Park	daf2e980d9	Add Falcon-180B to vLLM benchmark Docker doc (#4836 ) * add Falcon to vllm-benchmark-models.yaml * update group name	2025-05-29 18:26:21 -04:00
Peter Park	9dbc10b4c5	Fix rocm/vllm pull tag Fix rocm/vllm pull tag	2025-05-28 14:42:21 -04:00
Peter Park	cebf0f5975	Add latest rocm/vllm Docker details in vLLM inference benchmark guide (#4824 ) * update rocm/vllm Docker details to latest release * Add previous vLLM version * fix 'further reading' xrefs * improve model grouping names * fix links * update model picker text	2025-05-28 14:20:18 -04:00
Peter Park	9ed65a81c4	Add Megatron-LM benchmark doc 5/2 (#4778 ) * reorg files * add tabs * update template * update template * update wordlist and toc * add previous version to doc * add selector paragraph * update wordlist.txt	2025-05-22 14:28:18 -04:00
Pratik Basyal	8ef1bb0139	rocSHMEM component added to ROCm 6.4.0 documentation (#4719 ) * rocSHMEM added to ROCm 640 * Space removed * link fixed	2025-05-07 15:31:38 -04:00
Peter Park	85778177a1	Update vLLM docker pull tag 20250415 in vllm-benchmark.rst (#4702 )	2025-04-30 16:09:30 -04:00
Peter Park	36b6ffaf7c	Add QwQ 32B to vllm-benchmark.rst (#4685 ) * Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml * Add QwQ-32B-Preview to vllm-benchmark-models.yaml * add links to performance results words * change "performance validation" to "performance testing" * remove "-Preview" from QwQ-32B * move qwen2 MoE after qwen2 * add TunableOp section * fix formatting * add link to TunableOp doc * add tunableop note * fix vllm-benchmark template * remove cmdline option for --tunableop on * update docker details * remove "training" * remove qwen2	2025-04-24 16:44:34 -04:00
Peter Park	40e4ba3ecc	Update vLLM inference benchmark Docker guide (#4653 ) * Remove JAIS 13B and 30B * update Docker details - vLLM 0.8.3 * add previous version * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst * fix link to previous version	2025-04-24 15:59:13 -04:00
Peter Park	c3faa9670b	Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) (#4654 ) * update vLLM links in deploy-your-model.rst * add pytorch inference benchmark doc * update toc and vLLM title * remove previous versions * update * wording * fix link and "applies to" * add pytorch to wordlist * add tunableop note to clip * make tunableop note appear to all models * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * fix incorrect links * wording * fix wrong docker pull tag --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-04-23 17:35:52 -04:00

125 Commits