github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-02-21 03:00:39 -05:00

Author	SHA1	Message	Date
Peter Park	7380c89985	docs: Add system health check doc under ROCm for AI (#4736 ) * add initial draft * add to toc and install page * update wording * improve documentation structure * resturcture and expand content * add to training section * add to conf.py article_pages * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update wordlist.txt * Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * inference --> AI workloads * udpate toc * update article_pages in conf.py * Update system validation notes in training docs * fix links in prerequisite-system-validation * wording * add note * consistency * remove extra files * fix links * add links to training index page --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> (cherry picked from commit `0a77e7b3a5`)	2025-05-13 15:55:36 -04:00
Istvan Kiss	165ea54e12	Jax and PyTorch compatibility page update 6.4 (#4732 ) * JAX compatibility page upate (#4727) * Fix compatibility list (#4731) * Pytorch compatibility page update * Fix unsupported section structure on JAX (#4733)	2025-05-13 18:24:19 +02:00
Peter Park	065d1cdc95	Merge pull request #4725 from peterjunpark/docs/quark-model-quantization Add quark in model-quantization.rst (cherry picked from commit `90a651d2b6`)	2025-05-08 10:35:33 -04:00
Peter Park	5b859352b2	Merge pull request #4724 from peterjunpark/docs/6.4.0 [docs/6.4.0] Fix incorrect throughput benchmark command in inference/vllm-benchmar…	2025-05-08 09:31:38 -04:00
Peter Park	f15a1e830e	Fix incorrect throughput benchmark command in inference/vllm-benchmark.rst (#4723 ) * update inference index to include pyt inference * fix incorrect command in throughput benchmark * wording (cherry picked from commit `bb7af3351a`)	2025-05-08 09:27:44 -04:00
Pratik Basyal	a2628dce5d	rocSHMEM component added to ROCm 6.4.0 documentation (#4719 ) (#4720 ) * rocSHMEM added to ROCm 640 * Space removed * link fixed	2025-05-07 15:42:38 -04:00
Peter Park	e0098d0668	fix links in pytorch-inference-benchmark.rst (#4713 ) (cherry picked from commit `186c281aba`)	2025-05-06 15:27:17 -04:00
Peter Park	71cffa9681	fix dynamic urls in toc.yml.in	2025-05-06 15:27:17 -04:00
Peter Park	94337a9887	Add MPT-30B + LLM Foundry doc (#4704 ) * add mpt-30b doc * add tunableop note * update MPT doc * add section * update wordlist * fix flash attention version * update "applies to" * address review feedback * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * update docker details to pytorch-training-v25.5 * update --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> (cherry picked from commit `d44ea40a0d`)	2025-05-02 12:13:56 -04:00
Peter Park	18d98ca692	Update vLLM docker pull tag 20250415 in vllm-benchmark.rst (#4702 ) (cherry picked from commit `85778177a1`)	2025-04-30 16:10:27 -04:00
Peter Park	c8144c4a60	Update JAX MaxText benchmark doc to v25.5 (#4695 ) * fix shell cmd formatting * add previous versions section * update docker details and add llama 3.3 * update missed docker image tags to 25.5 (cherry picked from commit `7458fcb7ab`)	2025-04-28 17:53:37 -04:00
Peter Park	ed45d6add9	fix link to pytorch-training v25.4 doc (#4696 ) (cherry picked from commit `16d6e59003`)	2025-04-28 17:53:37 -04:00
Peter Park	4f86b2801a	Update vLLM inference benchmark Docker guide (#4653 ) * Remove JAIS 13B and 30B * update Docker details - vLLM 0.8.3 * add previous version * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst * fix link to previous version (cherry picked from commit `40e4ba3ecc`)	2025-04-24 17:57:05 -04:00
Peter Park	9c07ed1726	fix link to previous version in vllm-benchmark.rst (#4689 ) (cherry picked from commit `a66bc1d85e`)	2025-04-24 17:54:30 -04:00
Peter Park	34ca259220	Add QwQ 32B to vllm-benchmark.rst (#4685 ) * Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml * Add QwQ-32B-Preview to vllm-benchmark-models.yaml * add links to performance results words * change "performance validation" to "performance testing" * remove "-Preview" from QwQ-32B * move qwen2 MoE after qwen2 * add TunableOp section * fix formatting * add link to TunableOp doc * add tunableop note * fix vllm-benchmark template * remove cmdline option for --tunableop on * update docker details * remove "training" * remove qwen2 (cherry picked from commit `36b6ffaf7c`)	2025-04-24 16:46:48 -04:00
Peter Park	d04443ac13	Add note for chai-1 benchmark Docker in pytorch-inference-benchmark.rst (#4684 ) (cherry picked from commit `1f41ce26be`)	2025-04-24 16:45:33 -04:00
Peter Park	311b4cd62b	Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) (#4654 ) * update vLLM links in deploy-your-model.rst * add pytorch inference benchmark doc * update toc and vLLM title * remove previous versions * update * wording * fix link and "applies to" * add pytorch to wordlist * add tunableop note to clip * make tunableop note appear to all models * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * fix incorrect links * wording * fix wrong docker pull tag --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> (cherry picked from commit `c3faa9670b`)	2025-04-23 17:36:25 -04:00
Peter Park	d2ccd706a5	Update ML framework Docker compatibility docs for 6.4.0 (#4667 ) * update pytorch-compatibility.rst * update tensorflow compat fix * update jax and jax-community docker versions (cherry picked from commit `b29b3592bd`)	2025-04-22 16:17:24 -04:00
Peter Park	699f668a2b	fix link to Dockerfile.rocm (#4573 ) (cherry picked from commit `310864e653`)	2025-04-22 14:09:35 -04:00
Pratik Basyal	3bc09b6faa	615 column added to historical compatibility matrix in ROCm 640 (#4655 ) * 6.1.5 column added and broken link fixed	2025-04-17 11:50:32 -04:00
Peter Park	824d760646	Update PyTorch training Docker doc for 25.5 (#4638 ) * update pytorch-training to 25.5 * remove llama 2 * Revert "remove llama 2" This reverts commit dab672fa7bcbd8bff730382c14177df4301a537d. * add previous version * fix run cmd * add link to docker hub * fix linting issue * add Llama 3.3 70B * update (cherry picked from commit `9ff3c2c885`)	2025-04-15 18:17:06 -04:00
Peter Park	cb412a7a7f	Fix vllm Dockerfile.rocm path (#4628 ) (cherry picked from commit `d057d49af1`)	2025-04-15 11:28:09 -04:00
Peter Park	d1b426f2d0	Update KMD versions in compat matrix (#4594 ) * update KMD versions in compat matrix * update historical compat matrix (cherry picked from commit `656db2bc84`)	2025-04-11 16:49:12 -04:00
Pratik Basyal	639e2dc232	Release notes Link update 640 branch (#4593 ) * Link update (#4591) * Date updated	2025-04-11 16:26:26 -04:00
Parag Bhandari	5104389ab3	Merge branch 'develop' into docs/6.4.0	2025-04-11 15:15:54 -04:00
Parag Bhandari	493585dfbb	Merge branch 'develop' of github.com:ROCm/ROCm into develop	2025-04-11 15:15:43 -04:00
Parag Bhandari	e756d99f65	Merge branch 'develop-internal' into develop	2025-04-11 15:15:19 -04:00
Pratik Basyal	686fcece1d	PRE GA Day 640 update for resetting link and HPC application list (#367 ) * Links reset to point to latest from stg, internal, RTD, and develop * ROCm for HPC updated * GA prep changes	2025-04-11 14:12:57 -05:00
pbhandar-amd	131e34f582	Update w6000-v620.md	2025-04-11 15:11:34 -04:00
Parag Bhandari	6b71afe8a2	Merge branch 'develop' into docs/6.4.0	2025-04-11 14:36:57 -04:00
Parag Bhandari	db3c46fccf	Merge branch 'develop-internal' into develop	2025-04-11 14:32:09 -04:00
pbhandar-amd	7d5ea2f2f9	Update versions.md	2025-04-11 13:16:06 -04:00
pbhandar-amd	18abbbda11	Update versions.md	2025-04-11 13:15:53 -04:00
pbhandar-amd	d2c914d477	Update documentation requirements	2025-04-11 10:28:37 -04:00
Peter Park	03137e1146	Remove "preview support" for PyT 2.6 (#368 ) * remove pytorch 2.6 preview support note * update pytorch support release note	2025-04-11 09:12:41 -04:00
Peter Park	8a24176528	Update Thrust and CUB versions for 6.4 + fix compatibility table not displaying (#364 ) * Update Thrust and CUB versions * fix whitespace issue causing build error * fix onnx runtime ver	2025-04-10 13:38:48 -04:00
Pratik Basyal	1e231b4b28	640 RN known issues batch 4 (#365 ) * ROCProfiler deprecation notice udpated * RHEL 9.6 support removed and 9.5 EOS rejected * Feedback to KV cache highlight added * Wrong entry of ROCprofiler-SDK removed * Additional known issues added * GA Release date updated * Consolidated changelog sync	2025-04-10 09:05:34 -04:00
Pratik Basyal	c26f470c8a	6.4.0 Known issues update to RN batch 3 (#362 ) * ROCProfiler deprecation notice udpated * RHEL 9.6 support removed and 9.5 EOS rejected * Feedback to KV cache highlight added * Wrong entry of ROCprofiler-SDK removed * ROCm debugger known issues added * JAX known issues added * Ordering fixed * Compute partition known issues added * TP sizes known issues added * Highlight and compatibility matrix updated * ONNX auto-update corrected * ROCm systems profiler known issues removed * Title update	2025-04-09 10:14:14 -04:00
Istvan Kiss	13bd184ec3	Add RDNA4 ISA guide	2025-04-08 13:57:32 +02:00
Istvan Kiss	6c7f167650	Fix broken torchserve link	2025-04-07 16:07:31 +02:00
dependabot[bot]	defb276d93	Build(deps): Bump rocm-docs-core from 1.18.1 to 1.18.2 in /docs/sphinx (#4556 ) Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.18.1 to 1.18.2. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.18.1...v1.18.2) --- updated-dependencies: - dependency-name: rocm-docs-core dependency-version: 1.18.2 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-04-03 17:02:06 -06:00
Peter Park	fdf24a9c40	fix link to CLR license (#4560 )	2025-04-03 13:09:59 -04:00
Dominic Widdows	715cce53de	Update workload.rst with small export fix (#4425 ) Tiny fix that removes the "export" directive. ` export HIP_FORCE_DEV_KERNARG=1 hipblaslt-bench ...` leads to bash: export: `hipblaslt-bench': not a valid identifier whereas just starting with HIP_FORCE_DEV_KERNARG=1 passes this env var to the hipblaslt-bench process, which I think is the intention here.	2025-04-03 13:01:26 -04:00
Jeffrey Novotny	c71201b801	Add Radeon PRO W7800 48GB to GPU hardware specs (#356 ) * Add Radeon PRO W7800 48GB to GPU hardware specs * Adjust row order	2025-04-01 16:44:56 -04:00
Peter Park	ea66bf386a	Fix more links in documentation (#4551 ) * fix vllm engine args link * remove RDNA subtree in under system optimization in toc * fix RDNA 2 architecture PDF link * fix CLR LICENSE.txt link * fix rocPyDecode license link	2025-04-01 15:56:34 -04:00
Peter Park	ac2c5e72d4	Fix links in documentation	2025-04-01 15:39:20 -04:00
Peter Park	53eb4f6edb	Change AMD SMI ver to 25.3.0 from 25.2.0 (#345 )	2025-04-01 13:02:27 -04:00
amitkumar-amd	b178a7ca78	Update the TOC (#355 ) * remove 1200 * update link on TOC * Update docs/sphinx/_toc.yml.in Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> --------- Co-authored-by: Pratik Basyal <prbasyal@amd.com> Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>	2025-03-28 15:59:27 -05:00
Peter Park	424e6148bd	Add MaxText training Docker doc Add MaxText training Docker doc	2025-03-28 11:25:06 -04:00
Peter Park	15aca4be9d	Fix ML framework compatible versions for 6.4 (#347 ) * Fix ML framework compatible versions for 6.4 * add footnote to historical compat matrix	2025-03-28 10:55:36 -04:00

1 2 3 4 5 ...

843 Commits