github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-02-12 07:25:22 -05:00

Author	SHA1	Message	Date
Peter Park	7098bdc03b	Update vLLM inference benchmark doc for 0909 release (and Sphinx fixes) (#5289 )	2025-09-11 15:01:17 -04:00
Peter Park	05a66f75fe	add qwen3 30b a3b to vllm-benchmark-models (#5280 )	2025-09-09 17:41:11 -04:00
Peter Park	4f53183696	docs: Add JAX MaxText benchmark v25.7 (#5182 ) * Update previous versions * Add data file * fix filename and anchors * add templates * update .wordlist.txt * Update template and data add missing step fix fmt * update template * fix data * add jax 0.6.0 * update history * update quantized training note	2025-09-08 21:42:56 -04:00
Peter Park	4bc1bf00c6	Update PyTorch training benchmark docker doc to 25.7 (#5255 ) * Update PyTorch training benchmark docker doc to 25.7 * update .wordlist.txt * update conf.py * update data sheet * fix sphinx warnings	2025-09-05 12:07:51 -04:00
Peter Park	98029db4ee	docs: Add Primus (Megatron) training Docker documentation (#5218 )	2025-08-21 23:50:55 -04:00
Peter Park	55d0a88ec5	vLLM inference benchmark doc: add missing data field (#5199 )	2025-08-15 13:20:39 -04:00
Peter Park	7ee22790ce	docs: Update vLLM benchmark doc for 20250812 Docker release (#5196 )	2025-08-14 15:43:36 -04:00
Peter Park	80f7dc79b9	Add Hunyuan Video to PyTorch inference benchmark models doc (#5094 )	2025-08-12 11:54:59 -04:00
Pratik Basyal	f632f2879f	ROCm Software Stack image for 6.4.0 updated (#5112 )	2025-07-28 14:51:19 -04:00
yugang-amd	cc5bc5a882	Add SGLang inference benchmark doc w/ initial support for DeepSeek-R1-Distill-Qwen-32B (#4870 )	2025-07-25 12:42:40 -04:00
Peter Park	984a91f008	Add DeepSeek Janus Pro 7B to PyTorch inference benchmark doc (#5071 ) --------- Co-authored-by: yugang-amd <yugang.wang@amd.com>	2025-07-22 16:26:06 -04:00
Peter Park	5bcf3b0847	Update Megatron-LM training benchmark doc for v25.6 release (#5064 )	2025-07-18 15:57:25 -04:00
Peter Park	b437a625b3	Update vLLM inference benchmark doc for 0715 release (#5058 )	2025-07-17 15:00:02 -04:00
Peter Park	d471b04cd5	Update vLLM Docker doc for 07/02	2025-07-09 11:38:27 -04:00
Peter Park	d0c8ba0805	Add Wan2.1 to PyTorch inference Docker documentation (#4984 ) * add wan2.1 to pyt inference models * update group name * fix container tag * fix group name * change documented data type to bfloat16 * fix col width	2025-07-02 09:58:37 -04:00
Peter Park	91a541f8b9	Update PyTorch training benchmark doc for v25.6 (#4950 ) * update pytorch-training docker details * add previous version * add models data * update models data id * add models picker * update data * update fmt fmt * update data yaml * update template * update data * fix * fix vllm-0.6.4 broken link * fix vllm history	2025-06-23 09:26:15 -04:00
Peter Park	34f8d57ece	Organize version histories in ROCm for AI benchmark Docker docs (#4948 ) * add vllm 0.8.3 20250415 update prev versions table * add vllm previous versions page * move index to vllm-history * add standalone megatron-lm version history * add pytorch training version history * fix * add vllm-0.4.3 * add vllm-0.6.4 * update vllm-history * add vllm-0.7.3 * add vllm-0.6.6 * add notes * fix vllm readme links fix main page link * add latest version to previous versions list * add jax-maxtext history * fix jax-maxtext history * add pytorch-training history * add link in jax-maxtext 25.4 * add megatron-lm history * fix datatemplate path for vllm 0.8.3 * fix jax-maxtext history link * update note about performance measurements * add vllm 0.8.5_20250521 previous version * consistency fixes	2025-06-20 15:01:38 -04:00
yugang-amd	55f95adc7c	Update for vllm -06/10 (#4943 )	2025-06-20 08:41:37 -04:00
Peter Park	cfb3504d77	Add Mochi Video to pytorch-inference-benchmark-models.yaml Add Mochi Video to pytorch-inference-benchmark-models.yaml	2025-06-10 13:18:41 -04:00
yugang-amd	830f2d5edf	Update for vllm -05/27 (#4886 ) * Update vLLM inference benchmark Docker page for rocm/vllm 5/27 * update repo for Pytorch	2025-06-05 13:30:20 -04:00
Peter Park	6999c24402	Add microsoft/phi-4 vllm-benchmark-models (#4801 ) * add Phi-4 to vllm-benchmark-models.yaml fix model_repo * update model group names	2025-05-30 06:37:13 -04:00
Peter Park	daf2e980d9	Add Falcon-180B to vLLM benchmark Docker doc (#4836 ) * add Falcon to vllm-benchmark-models.yaml * update group name	2025-05-29 18:26:21 -04:00
Peter Park	9dbc10b4c5	Fix rocm/vllm pull tag Fix rocm/vllm pull tag	2025-05-28 14:42:21 -04:00
Peter Park	cebf0f5975	Add latest rocm/vllm Docker details in vLLM inference benchmark guide (#4824 ) * update rocm/vllm Docker details to latest release * Add previous vLLM version * fix 'further reading' xrefs * improve model grouping names * fix links * update model picker text	2025-05-28 14:20:18 -04:00
Peter Park	9ed65a81c4	Add Megatron-LM benchmark doc 5/2 (#4778 ) * reorg files * add tabs * update template * update template * update wordlist and toc * add previous version to doc * add selector paragraph * update wordlist.txt	2025-05-22 14:28:18 -04:00
Pratik Basyal	8ef1bb0139	rocSHMEM component added to ROCm 6.4.0 documentation (#4719 ) * rocSHMEM added to ROCm 640 * Space removed * link fixed	2025-05-07 15:31:38 -04:00
Peter Park	85778177a1	Update vLLM docker pull tag 20250415 in vllm-benchmark.rst (#4702 )	2025-04-30 16:09:30 -04:00
Peter Park	36b6ffaf7c	Add QwQ 32B to vllm-benchmark.rst (#4685 ) * Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml * Add QwQ-32B-Preview to vllm-benchmark-models.yaml * add links to performance results words * change "performance validation" to "performance testing" * remove "-Preview" from QwQ-32B * move qwen2 MoE after qwen2 * add TunableOp section * fix formatting * add link to TunableOp doc * add tunableop note * fix vllm-benchmark template * remove cmdline option for --tunableop on * update docker details * remove "training" * remove qwen2	2025-04-24 16:44:34 -04:00
Peter Park	40e4ba3ecc	Update vLLM inference benchmark Docker guide (#4653 ) * Remove JAIS 13B and 30B * update Docker details - vLLM 0.8.3 * add previous version * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst * fix link to previous version	2025-04-24 15:59:13 -04:00
Peter Park	c3faa9670b	Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) (#4654 ) * update vLLM links in deploy-your-model.rst * add pytorch inference benchmark doc * update toc and vLLM title * remove previous versions * update * wording * fix link and "applies to" * add pytorch to wordlist * add tunableop note to clip * make tunableop note appear to all models * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * fix incorrect links * wording * fix wrong docker pull tag --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-04-23 17:35:52 -04:00
Istvan Kiss	635838e7ef	Add atomics operation support page	2025-03-20 17:11:02 +01:00
Peter Park	9b2ce2b634	Update vLLM performance Docker docs (#4491 ) * add links to performance results words * change "performance validation" to "performance testing" * update vLLM docker 3/11 * add previous versions add previous versions * fix llama 3.1 8b model repo name * words	2025-03-13 10:04:21 -04:00
Peter Park	1fb42c2591	Update LLM inference performance validation on AMD Instinct MI300X guide to filter by desired model (#4424 ) * WIP (cherry picked from commit a06a5b5b959a9425e7384fb58b88c3716f380e48) rm unneeded files (cherry picked from commit f1d0c00056a83299bdea74a43cd17454999cf2d8) * add sphinxcontrib.datatemplates (cherry picked from commit d056b93a325d87b81f54f70c6eb4ae78f4fb0bc1) * add template (cherry picked from commit 0691d59f0a1efbda7908762b7a906e30a65c0ee1) fix template (cherry picked from commit 01e4bea5522aa5deeaade58c105ff850f449df8b) WIPO (cherry picked from commit 4d8daf7445e7be92cd9ee1d39dff564bd8de41f4) WIP (cherry picked from commit 9eefd1f5833bc4dc8de9d777ff65a5fe5f826dbd) update models yaml schema (cherry picked from commit a5f0fc1e6cc51104dc2d42029bfcf3eea276d270) add model groups functionality (cherry picked from commit 13f49f96dd3e5a160d37c52e48a4fbcccdcf4f9e) add selector headings and fix template (cherry picked from commit 35f7f2314bcf74b4fd0a8ca10aaabf0de7063bb0) update template (cherry picked from commit 9e2dcfe0c7f6e7c2c685866ea83375fbacbc5032) fix (cherry picked from commit be51e32791550ddc21785effccb889228394b242) use classes instead of data tags (cherry picked from commit cd52d68c504f7e7435d156ae70cf4bde1dfe703e) update template (cherry picked from commit 9ed89fee6874b39ee3535fbde54a0a59f346ea2b) clean up extra wip files (cherry picked from commit a9f965a104baa966c184054638e935b011526278) update wordlist (cherry picked from commit f783656814e896aedd21acd1c8c87b4700c14469) remove unused template (cherry picked from commit cac894bd9c2b1262c9c006e5fddbcb742dc6d882) improve script (cherry picked from commit ca20ffd4922916616e0924d625652a815f27c35f) fix template (cherry picked from commit 752c61fda856fd5b244734636c036c8877e823b9) fix standalone benchmark output path in template (cherry picked from commit d8c04203b5ec0f6c2e2307f7890304a3dc5687be) fix toc (cherry picked from commit 8df42faf53488ef29f5a263d25032f3d35cd58ed) update script to prevent flash of unstyled content import a11y (cherry picked from commit 46c852717f223a1d8744fab035807cebab4c5404) add tabindex to wordlist (cherry picked from commit 11492593f9692f5453045e7ec52c8f8ae9624ae9) text update script * remove unused config option * reorganize assets * fix linting warning * move js from data/ to extension/	2025-02-28 12:39:02 -05:00
randyh62	32feb96819	Rocm azure linux (#280 ) * Ad Software stack for 6.3.2 includes Azure Linux * Update what-is-rocm.rst add Azure Linux	2025-01-14 15:50:13 -08:00
alexxu-amd	85bd6e98f5	Remove gpu-cluster-networking and 'Using MPI' page due to migration to Instinct Docs (#4201 ) * remove 'Using MPI' and 'gpu-cluster-networking' sections due to migration to dcgpu * remove gpu-cluster-networking from index page --------- Co-authored-by: Alex Xu <alex.xu@amd.com>	2024-12-30 09:39:46 -05:00
alexxu-amd	758e8a33db	Merge branch 'develop' into sync-develop-from-external	2024-12-19 09:48:30 -05:00
randyh62	4f10f22920	Rocm image 631 (#257 ) * Add files via upload Add Debian OS without TransferBench * Delete docs/data/rocm-software-stack-6_3_0.jpg remove 6-3-0 image * Update what-is-rocm.rst Remove link and description for TransferBench * Update rocm-tools.md remove TransferBench * Update what-is-rocm.rst update ROCm Software Stack image name * Add files via upload Add correct image	2024-12-18 15:12:33 -08:00
Alex Xu	0356ffd148	Merge remote-tracking branch 'external/develop' into sync-develop-from-external	2024-12-18 15:57:08 -05:00
randyh62	5c5b5cce73	Add files via upload (#250 ) Add ROCm image with TransferBench	2024-12-17 15:37:09 -08:00
randyh62	08d9286cd5	Update ROCm software stack image for 6.3.1 (#245 )	2024-12-17 11:55:51 -08:00
Peter Park	f9dbc1f21f	add megatron training doc (#4159 ) * add megatron training doc update toc add images update formatting and wording formatting update formatting update conf.py update formatting update docker img tweak formatting Fix stuff fix mock-data/data-path add specific commit hash to checkout update docker pull tag fix docker run cmd and examples path fix docker cmd * wording words words * improve title	2024-12-16 13:37:35 -05:00
randyh62	a591218531	ROCm software stack update for 6.3.1 (#242 )	2024-12-11 15:29:41 -08:00
Peter Park	b0722b3228	Add @hongxiayang updates to MI300X workload tuning guide (#4123 ) minor fixes to formatting fix spelling errors more spelling fixes quantization update fix format simplify wording in tunableops and format fix Apply suggestions from code review review feedback by Peter Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review addressing feedback Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review feedback again Co-authored-by: Peter Park <peter.park@amd.com> add hipblaslt yaml file figure feedback and minor formatting formatting update wordlist.txt remove outdated sentence regarding fsdp and rccl (cherry picked from commit 87fa9fd83a2e623f6cab4e69d65f49e3db0a45f6) update wordlist Co-authored-by: hongxyan <hongxyan@amd.com>	2024-12-06 12:10:57 -05:00
Peter Park	3b1d1fa5b7	fix stack image (#4112 )	2024-12-04 21:55:17 -05:00
Peter Park	8ea3ad51c4	Add GitHub issue links in known issues + update stack diagram (#4091 ) * add GitHub issue links in known issues * Update stack diagram * remove extra img	2024-12-03 15:49:45 -07:00
Sam Wu	f77e2dd7a7	Sync develop branch (#4078 )	2024-12-03 15:18:51 -07:00
randyh62	e9e9fa4ba5	Change Common Language Runtime to Compute Language Runtime (#200 ) * Change Common Language Runtime to Compute Language Runtime * change rocJPEG description * update ROCm Software Stack image	2024-11-22 15:45:26 -08:00
randyh62	c18694b0fe	add azure image (#214 )	2024-11-22 08:04:01 -08:00
spolifroni-amd	cf8fc95451	adding missing images (#4036 )	2024-11-21 14:43:24 -05:00
Peter Park	b7ecf6d552	Rename Omnitools to ROCm Compute/Systems Profiler (#183 ) * rename Omniperf and Omnitrace * rename labels rename more labels * update licenses and rocm-tools.md * fix rocprof-sys ref	2024-11-07 18:01:26 -05:00

1 2 3

105 Commits