github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-02-21 03:00:39 -05:00

Author	SHA1	Message	Date
yugang-amd	404e91f2d9	Update compatibility-matrix.rst (#4860 )	2025-05-30 17:50:33 -04:00
alexxu-amd	50cfc538ff	Change viewer link from latest to mainline in what-is-rocm page (#4856 ) * change viewer link from latest to mainline * correct format (cherry picked from commit `c1919faccd`)	2025-05-30 17:18:40 -04:00
Swati Rawat	a9c323e596	Docs: Add rocprof-compute-viewer (#4850 ) * Docs: Add rocprof-compute-viewer * update requirements.txt --------- Co-authored-by: Alex Xu <alex.xu@amd.com> (cherry picked from commit `6142df329b`)	2025-05-30 15:22:51 -04:00
Peter Park	7a81d10c1d	Add RHEL 9.6 to compat matrix (#4839 ) * add RHEL 9.6 to compat matrix * add os support note (cherry picked from commit `2addcb0bca`)	2025-05-30 14:57:24 -04:00
yugang-amd	00f74d2d8e	Add microsoft/phi-4 vllm-benchmark-models (#4801 ) (#4847 ) * add Phi-4 to vllm-benchmark-models.yaml fix model_repo * update model group names Co-authored-by: Peter Park <peter.park@amd.com>	2025-05-30 09:20:17 -04:00
Peter Park	4963eeab00	Update ML framework Docker inventories for 6.4.1 (#4841 ) * Update tensorflow Docker compatibility table * update jax Docker compatibility table * fix py versions * update pytorch Docker compatibility table (cherry picked from commit `93fd0ef1d4`)	2025-05-29 18:34:47 -04:00
Peter Park	7c25ce240b	Add Falcon-180B to vLLM benchmark Docker doc (#4836 ) * add Falcon to vllm-benchmark-models.yaml * update group name (cherry picked from commit `daf2e980d9`)	2025-05-29 18:34:47 -04:00
Peter Park	fdeaacd3cc	fix megatron-lm pull tags	2025-05-28 15:12:50 -04:00
Peter Park	8e61ba4f90	Fix rocm/vllm pull tag fix	2025-05-28 14:42:35 -04:00
Peter Park	94ee445a8a	Add latest rocm/vllm Docker details in vLLM inference benchmark guide (#4824 ) * update rocm/vllm Docker details to latest release * Add previous vLLM version * fix 'further reading' xrefs * improve model grouping names * fix links * update model picker text (cherry picked from commit `cebf0f5975`)	2025-05-28 14:23:05 -04:00
Peter Park	2e5fe544a0	Add RDNA4 RX 9070 GRE to gpu-arch-specs.rst and RELEASE.md (#4820 ) (cherry picked from commit `0acb457389`)	2025-05-28 10:21:50 -04:00
yugang-amd	4dae0ba84d	Update SGPR for RDNA3 and RDNA2 series (#4815 )	2025-05-27 15:13:22 -04:00
yugang-amd	5ddab465c3	Bump up requirement version (#4805 ) * bump up requirement version * update requirements.txt * Use Python 3.10	2025-05-27 11:08:55 -04:00
yugang-amd	151e563dcb	Merge pull request #4792 from yugang-amd/wavefront-size-6-4-1 Update wavefront size	2025-05-26 14:56:38 -04:00
yugang-amd	ae1a330fd7	fix links	2025-05-26 14:35:36 -04:00
yugang-amd	cab805674a	update wavefront size (cherry picked from commit `230b01565f`)	2025-05-26 13:56:14 -04:00
yugang-amd	387cfab91f	fix typo	2025-05-26 12:53:18 -04:00
yugang-amd	525703a5ab	update wavefront size	2025-05-22 17:41:36 -04:00
Peter Park	6d2b1595b3	Document specs for Radeon RX 9070 + small fix in megatron-lm doc (#4780 ) * Document specs for Radeon RX 9070 * fix wrong version in megatron-lm.rst (cherry picked from commit `505041d90a`)	2025-05-22 16:30:56 -04:00
yugang-amd	31e9013bdc	update rocSHMEM xrefs (cherry picked from commit `7697298f5d`)	2025-05-22 15:19:09 -04:00
Peter Park	9b69755b99	Add Megatron-LM benchmark doc 5/2 (#4778 ) * reorg files * add tabs * update template * update template * update wordlist and toc * add previous version to doc * add selector paragraph * update wordlist.txt (cherry picked from commit `9ed65a81c4`)	2025-05-22 14:29:40 -04:00
Peter Park	4f80043312	fix 9070 XT gfx target in gpu-arch-specs table (#4775 ) (cherry picked from commit `6d9f430c70`)	2025-05-22 12:12:14 -04:00
Peter Park	98fde2bff1	Add RDNA4 OS support note in RELEASE.md and compat matrix (#4764 ) * fix vllm link in release.md * add RDNA4 note in compat matrix * update hipcc github url to specific path in llvm-project repo * remove non-existant HIP upcoming changes reference * remove non-existant resolved issues internal link * fix hip upcoming changes url * duplicate amd smi known issue	2025-05-21 14:23:48 -04:00
Peter Park	0e8b745266	Fix toc (#4762 )	2025-05-21 12:26:30 -04:00
Alex Xu	58a62bc00e	Merge remote-tracking branch 'external/develop' into sync-develop-from-external	2025-05-21 11:16:31 -04:00
Peter Park	8dc7016405	Add Radeon AI PRO R9700, Radeon RX 9070 XT, RX 9060 XT to gpu-arch-specs (#411 ) * add Radeon AI PRO R7900, Radeon RX 9070 XT, Radeon RX 9060 XT to gpu-arch-specs.rst * update compat matrices * fix spacing in historical compat csv file	2025-05-21 11:04:46 -04:00
alexxu-amd	ddcad120a2	Update versions.md	2025-05-21 09:52:05 -04:00
Peter Park	ca5d0d0000	[6.4.1] update llvm-project version and add RCCL known issue (#401 ) * update llvm-project version * add RCCL known issue	2025-05-15 16:20:59 -04:00
Peter Park	0a77e7b3a5	docs: Add system health check doc under ROCm for AI (#4736 ) * add initial draft * add to toc and install page * update wording * improve documentation structure * resturcture and expand content * add to training section * add to conf.py article_pages * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update wordlist.txt * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * inference --> AI workloads * udpate toc * update article_pages in conf.py * Update system validation notes in training docs * fix links in prerequisite-system-validation * wording * add note * consistency * remove extra files * fix links * add links to training index page --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-05-13 15:54:48 -04:00
Istvan Kiss	d1772b9ca3	Fix unsupported section structure on JAX (#4733 )	2025-05-13 17:39:25 +02:00
Istvan Kiss	f65e1412df	Fix compatibility list (#4731 )	2025-05-13 16:26:36 +02:00
Istvan Kiss	ea1072b11d	JAX compatibility page upate (#4727 )	2025-05-08 19:31:13 +02:00
Peter Park	90a651d2b6	Merge pull request #4725 from peterjunpark/docs/quark-model-quantization Add quark in model-quantization.rst	2025-05-08 10:34:39 -04:00
Peter Park	bb7af3351a	Fix incorrect throughput benchmark command in inference/vllm-benchmark.rst (#4723 ) * update inference index to include pyt inference * fix incorrect command in throughput benchmark * wording	2025-05-08 09:24:51 -04:00
Wei Luo	d1debc7e45	[doc]: Add quark in model-quantization.rst (#374 ) * Add quark in model-quantization.rst --------- Co-authored-by: Peter Park <peter.park@amd.com> Co-authored-by: Peter Park <git@peterjunpark.com>	2025-05-08 14:28:51 +08:00
Pratik Basyal	8ef1bb0139	rocSHMEM component added to ROCm 6.4.0 documentation (#4719 ) * rocSHMEM added to ROCm 640 * Space removed * link fixed	2025-05-07 15:31:38 -04:00
Pratik Basyal	169f3bbe5e	641 Release notes update post RC2 batch1 (#387 ) * Release highlight updated * TOC updated for internal * RC3 manifest added * clarify docker image highlight * update doc highlights * RC3 changes added * RC3 manifest added * ROCm SMI version update --------- Co-authored-by: Peter Park <peter.park@amd.com>	2025-05-06 15:07:54 -04:00
Peter Park	186c281aba	fix links in pytorch-inference-benchmark.rst (#4713 )	2025-05-06 13:34:55 -04:00
Pratik Basyal	e28eac2fe1	License typo fixed (#384 )	2025-05-02 12:37:08 -04:00
Peter Park	d44ea40a0d	Add MPT-30B + LLM Foundry doc (#4704 ) * add mpt-30b doc * add tunableop note * update MPT doc * add section * update wordlist * fix flash attention version * update "applies to" * address review feedback * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update docker details to pytorch-training-v25.5 * update --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-05-02 12:13:20 -04:00
Pratik Basyal	217fb452f8	Initial changes to 6.4.1 RN (#379 ) * Initial changes added * Changelogs for RCCL, hipblaslt, compute profiler, and systems added * 6.4.0 GA manifest * 6.4.1 RC1 manifest * RC2 Manifest added * Update RELEASE.md Add CLR Changelog entry for HIP 6.4.1 * Release highlight added * AMD SMI changelog added * ROCr runtime changelog added * RCCL resolved issue added * Minor change * Minor fixes * Quick changes to version * Offline installer update * Istallation udpated * added rocalution to release notes * Updated changelogs for components * Changes to changelog * Update RELEASE.md Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update RELEASE.md Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * rocSHMEM related changes added * Changelog updated with new changes * Heading level fixed * AMD SMI version bumped to 25.4.0 * Reordered * Table zebra pattern updated * Consolidated updated * Zebra patter aligned * Add ROCm SMI changes to 6.4.1 * Update CHANGELOG.md Co-authored-by: Pratik Basyal <prbasyal@amd.com> * update doc highlights * Link to rocSHMEM * update * Minor changes * Changelog feedback updated --------- Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com> Co-authored-by: spolifroni-amd <sandra.polifroni@amd.com> Co-authored-by: Peter Park <peter.park@amd.com>	2025-05-01 13:54:31 -04:00
Peter Park	85778177a1	Update vLLM docker pull tag 20250415 in vllm-benchmark.rst (#4702 )	2025-04-30 16:09:30 -04:00
Istvan Kiss	84177354de	Pytorch compatibility page update	2025-04-29 14:43:40 +02:00
Peter Park	7458fcb7ab	Update JAX MaxText benchmark doc to v25.5 (#4695 ) * fix shell cmd formatting * add previous versions section * update docker details and add llama 3.3 * update missed docker image tags to 25.5	2025-04-28 17:52:53 -04:00
Peter Park	16d6e59003	fix link to pytorch-training v25.4 doc (#4696 )	2025-04-28 17:52:33 -04:00
Peter Park	a66bc1d85e	fix link to previous version in vllm-benchmark.rst (#4689 )	2025-04-24 17:54:04 -04:00
Peter Park	36b6ffaf7c	Add QwQ 32B to vllm-benchmark.rst (#4685 ) * Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml * Add QwQ-32B-Preview to vllm-benchmark-models.yaml * add links to performance results words * change "performance validation" to "performance testing" * remove "-Preview" from QwQ-32B * move qwen2 MoE after qwen2 * add TunableOp section * fix formatting * add link to TunableOp doc * add tunableop note * fix vllm-benchmark template * remove cmdline option for --tunableop on * update docker details * remove "training" * remove qwen2	2025-04-24 16:44:34 -04:00
Peter Park	40e4ba3ecc	Update vLLM inference benchmark Docker guide (#4653 ) * Remove JAIS 13B and 30B * update Docker details - vLLM 0.8.3 * add previous version * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst * fix link to previous version	2025-04-24 15:59:13 -04:00
Peter Park	1f41ce26be	Add note for chai-1 benchmark Docker in pytorch-inference-benchmark.rst (#4684 )	2025-04-24 15:48:53 -04:00
Peter Park	c3faa9670b	Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) (#4654 ) * update vLLM links in deploy-your-model.rst * add pytorch inference benchmark doc * update toc and vLLM title * remove previous versions * update * wording * fix link and "applies to" * add pytorch to wordlist * add tunableop note to clip * make tunableop note appear to all models * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * fix incorrect links * wording * fix wrong docker pull tag --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-04-23 17:35:52 -04:00

1 2 3 4 5 ...

873 Commits