github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-04-05 03:01:17 -04:00

Author	SHA1	Message	Date
Peter Park	23e78c8d55	fix tab sync and nested tab Megatron-LM doc (#4409 ) (cherry picked from commit `1ea1c5c6e0`)	2025-02-21 17:20:15 -05:00
Peter Park	4af488e27d	Update docs on Megatron-LM and PyTorch training Dockers (#4407 ) * Update Megatron-LM and PyTorch Training Docker docs Also restructure TOC * Apply suggestions from code review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> update "start training" text Apply suggestions from code review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> update conf.py fix spacing fix branding issue add disable numa reorg remove extra text (cherry picked from commit `389fa7071b`)	2025-02-21 13:10:42 -05:00
pbhandar-amd	cd79403931	Update vllm-benchmark.rst	2025-02-19 17:21:29 -05:00
Parag Bhandari	065fe8b138	Merge branch 'develop' into roc-6.3.x	2025-02-19 16:30:33 -05:00
Peter Park	618b44ed23	add vllm docker to release highlights (#306 )	2025-02-13 12:01:08 -05:00
Peter Park	2751a17cf0	Update vLLM benchmarking guide (#4347 ) * update vllm-benchmark fix hlist overflow update standalone benchmarking options update list of models fix typo and model name unnecessary duplicate info update formatting update vllm benchmark guide - remove Llama 2 FP8 - add Jais 13B - update commands update docker pull tag update MAD available models remove extra mad models not relevant to vllm update PyTorch version add changelog add model names to .wordlist.txt * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst Co-authored-by: Pratik Basyal <pratik.basyal@amd.com> * fix typo * update link * fix link text * change changelog to previous versions * fix typo * remove "for" --------- Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>	2025-02-05 17:18:35 -05:00
Pratik Basyal	f885b5df6e	Updated ROCm install on Linux installation method link (#4313 )	2025-01-31 16:48:33 -05:00
Jeffrey Novotny	ddfb5bda12	Add ToC and index links to the AI Developer Tutorials (#4312 ) * Add ToC and index links to the AI Developer Tutorials * Change link positioning * Change wording (cherry picked from commit `d401b5f152`)	2025-01-29 14:45:32 -05:00
Jeffrey Novotny	d401b5f152	Add ToC and index links to the AI Developer Tutorials (#4312 ) * Add ToC and index links to the AI Developer Tutorials * Change link positioning * Change wording	2025-01-29 14:43:32 -05:00
Alex Xu	e55b9f2a33	Merge branch 'develop' into roc-6.3.x	2025-01-28 14:18:28 -05:00
Pratik Basyal	353d2fe1c1	2nd POC for How to Use ROCm for AI (#282 ) (#4299 ) * New TOC for ROCm for AI developed Co-authored-by: Peter Park <peter.park@amd.com>	2025-01-27 15:49:21 -05:00
Peter Park	8dd99fe3a4	fix link to llama cookbook (#4269 )	2025-01-17 14:53:36 -05:00
Adel Johar	7754fc4b9d	Docs: resolve warnings from sphinx build output	2025-01-16 14:36:47 +01:00
Peter Park	d534f755e4	Add metadata to docs (#3688 ) * add missing metadata add metadata to mi300 arch doc add metadata to contributing guide add metadata to mi300x tuning guides * update meta to yaml frontmatter * update to md metadata to myst frontmatter * remove extra file * fix spelling	2025-01-14 08:55:45 -05:00
Peter Park	26553d725b	Add TensorFlow compatibility docs (#4247 ) * Add Tensorflow * WIP * WIP * minor fmt * PR feedbacks * fix missed inconsistent formatting * WIP WIP WIP WIP * minor formatting update tensorflow-rocm docker images to rocm6.3.1 fix urls * WIP * fix typo and update wordlist * fix tables not rendering * fix table headings * add period * update tf dockers * fix link * fix link * wording * update historical compat * fix tensile link --------- Co-authored-by: Mátyás Aradi <matyas@streamhpc.com> Co-authored-by: Istvan Kiss <neon60@gmail.com>	2025-01-09 14:24:58 -05:00
Pratik Basyal	8caa54caca	HPC application list updated (#4066 ) * PETSc added * List of HPC applications updated for 6.2.4 * Leo's feedback incorporated Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Review feedback incorporated * vllm removed --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-01-08 08:35:42 -05:00
Peter Park	ff1393142b	Add JAX compatibility doc (#4234 ) * Add JAX compatibility (cherry picked from commit 99215ab6b4cf6a1209d6c5fc781b5855251dcba5) * WIP (cherry picked from commit 54564a85d340b4149ed80a33377cf54c1eb48713) * Fix docker table (cherry picked from commit 8115a905764c869b390de2561e5f1356ec7e9743) * WIP (cherry picked from commit 45076e1fd20fd2c43f7a0ab6d8d5d246c498d801) * add minor formatting (cherry picked from commit c75706841092006c26766611b0407b79a13c7345) * PR feedbacks (cherry picked from commit 236b5daae4251c26cd697c6e20d5982771b05754) * fix inconsistent formatting (cherry picked from commit 0c6a2e3627f9e6159e3f400ab18769904c18097e) * Rename file (cherry picked from commit f17239aa8a9fa1ecdf8dab08c0348dc9216c5311) * jax_triton supported (cherry picked from commit fa56d697fbaa44c0c480df71dc236be8584291c0) * WIP (cherry picked from commit e8f0c5741fe96bb1e3272365906334d911a9a849) * WIP (cherry picked from commit 8ee4f3c62da8e11eea591340dc7c9fc1be8b7035) * WIP (cherry picked from commit 58c6bf441054fe3a21ba2d86808279e90de847b7) * WIP (cherry picked from commit 368ddf6925215a9bfd75a43c7c33def12238f81d) * update .wordlist.txt (cherry picked from commit 78ac332c8d6eba93e2b3e57440da3f60054bbadb) * update .wordlist.txt (cherry picked from commit 8d9492399f4b73b0c3c5359684d5b7faa328ba0f) * Fix typos (cherry picked from commit 394dede13b6de087237832fe3c693c11da7d733b) * update jax note (cherry picked from commit ceacc713c4295f8bbd20fc622579de9053b73337) * Update docs/compatibility/ml-compatibility/jax-compatibility.rst (cherry picked from commit b0613e914a2ba639fddea62eb495f97beaa8ba49) * Update docs/compatibility/ml-compatibility/jax-compatibility.rst (cherry picked from commit 8aac4344b6fd4120a3b8a31878f5316df99f3f99) * Add back hipGraph support (cherry picked from commit 028ddb3535073e0cd668c24614a0a73a491b5948) * WIP (cherry picked from commit 2e0ff9c5e3f88ceea6b0ca770bb4edb52ce08a47) * WIP (cherry picked from commit 186802585de5b7d58f9ac2a7947a83c037df1617) * add blurb about docker icon (cherry picked from commit aef650d4072578f75e7549151613f390f6545ce1) * update pytorch-compatibility path in conf.py * words --------- Co-authored-by: Mátyás Aradi <matyas@streamhpc.com> Co-authored-by: Istvan Kiss <neon60@gmail.com>	2025-01-07 09:57:19 -05:00
alexxu-amd	85bd6e98f5	Remove gpu-cluster-networking and 'Using MPI' page due to migration to Instinct Docs (#4201 ) * remove 'Using MPI' and 'gpu-cluster-networking' sections due to migration to dcgpu * remove gpu-cluster-networking from index page --------- Co-authored-by: Alex Xu <alex.xu@amd.com>	2024-12-30 09:39:46 -05:00
Peter Park	9250e1ba28	Fix PyTorch Compatibility link and remove incomplete rows (#4195 ) * fix pytorch-compatibility filename fix links * remove incomplete rows in pytorch-compatibility * fix broken refs	2024-12-24 13:51:33 -05:00
Peter Park	44aaf1b57c	Add PyTorch compatibility doc (#4193 ) * Add compatibility framework pages * update formatting * WIP * satisfy spellcheck linter * PR feedbacks * caps * remove jax and tensorflow pages * comment out "?"s * update wordlist * fix toc and table * update toc and deep-learning-rocm.rst --------- Co-authored-by: Istvan Kiss <neon60@gmail.com>	2024-12-24 13:51:33 -05:00
Peter Park	f76145c2ad	Fix PyTorch Compatibility link and remove incomplete rows (#4195 ) * fix pytorch-compatibility filename fix links * remove incomplete rows in pytorch-compatibility * fix broken refs	2024-12-24 11:13:54 -05:00
Peter Park	76d6e892bb	Add PyTorch compatibility doc (#4193 ) * Add compatibility framework pages * update formatting * WIP * satisfy spellcheck linter * PR feedbacks * caps * remove jax and tensorflow pages * comment out "?"s * update wordlist * fix toc and table * update toc and deep-learning-rocm.rst --------- Co-authored-by: Istvan Kiss <neon60@gmail.com>	2024-12-23 18:06:22 -05:00
Pratik Basyal	33891d6fd0	Bar memory update develop (#4168 ) * Bar Memory page added * Leo's feedback incorporated * Spell check fixed * SME review feedback incorporated * Feedback updated * Indentation fixed --------- Co-authored-by: prbasyal <prbasyal@amd.com>	2024-12-20 16:05:49 -05:00
Peter Park	f9dbc1f21f	add megatron training doc (#4159 ) * add megatron training doc update toc add images update formatting and wording formatting update formatting update conf.py update formatting update docker img tweak formatting Fix stuff fix mock-data/data-path add specific commit hash to checkout update docker pull tag fix docker run cmd and examples path fix docker cmd * wording words words * improve title	2024-12-16 13:37:35 -05:00
Peter Park	78f9adc6ec	fix rccl hip streams section in workload tuning guide (#4140 )	2024-12-09 11:06:12 -05:00
Peter Park	b0722b3228	Add @hongxiayang updates to MI300X workload tuning guide (#4123 ) minor fixes to formatting fix spelling errors more spelling fixes quantization update fix format simplify wording in tunableops and format fix Apply suggestions from code review review feedback by Peter Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review addressing feedback Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review feedback again Co-authored-by: Peter Park <peter.park@amd.com> add hipblaslt yaml file figure feedback and minor formatting formatting update wordlist.txt remove outdated sentence regarding fsdp and rccl (cherry picked from commit 87fa9fd83a2e623f6cab4e69d65f49e3db0a45f6) update wordlist Co-authored-by: hongxyan <hongxyan@amd.com>	2024-12-06 12:10:57 -05:00
Sam Wu	f77e2dd7a7	Sync develop branch (#4078 )	2024-12-03 15:18:51 -07:00
Istvan Kiss	0f6f4b7d57	Update docs/how-to/programming_guide.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2024-11-07 17:36:38 +01:00
Istvan Kiss	b0629fdd29	Apply suggestions from code review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2024-11-07 17:36:38 +01:00
randyh62	493cd06641	Update docs/how-to/programming_guide.rst	2024-11-07 17:36:38 +01:00
Istvan Kiss	385dbc30d3	Update docs/how-to/hip_programming_guide.rst Co-authored-by: MKKnorr <MKKnorr@web.de>	2024-11-07 17:36:38 +01:00
Istvan Kiss	6fe63192a9	WIP	2024-11-07 17:36:38 +01:00
randyh62	fb5c08138e	Update docs/how-to/hip_programming_guide.rst Co-authored-by: MKKnorr <MKKnorr@web.de>	2024-11-07 17:36:38 +01:00
Istvan Kiss	d41f1362de	Apply suggestions from code review Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com>	2024-11-07 17:36:38 +01:00
Istvan Kiss	ba3e128246	Increase visibility of programming guide	2024-11-07 17:36:38 +01:00
Peter Park	0fe08d93d7	Update links to vllm perf validation doc (#3971 ) * update links to vllm perf validation doc * add PagedAttention to wordlist	2024-10-30 18:24:18 -04:00
Peter Park	f1fb476f6f	add suggestions to vllm perf validation doc (#3968 )	2024-10-30 14:25:58 -04:00
Peter Park	35b16c1293	add vllm performance validation doc (#3964 )	2024-10-30 13:24:11 -04:00
Kent Russell	74333b667d	docs: Remove invalid amd_iommu=on parameter Per kernel-parameters.txt, there is no "on" option for amd_iommu. While intel_iommu has it, amd_iommu is automatically on unless specified otherwise. For more info, see these 2 links: https://www.kernel.org/doc/Documentation/admin-guide/kernel-parameters.txt `75aa74d52f/drivers/iommu/amd/init.c (L3481)` Signed-off-by: Kent Russell <kent.russell@amd.com>	2024-10-18 10:44:07 -04:00
Peter Park	b541be7bcb	Update bitsandbytes branch in docs (#3898 )	2024-10-15 10:47:56 -04:00
Jeffrey Novotny	4efcaad13d	Fix Radeon link and point at R6.1.3 as absolute link (#3757 )	2024-09-19 15:57:43 -04:00
Peter Park	1a26868cce	add missing make command to bitsandbytes install commands (#3722 )	2024-09-13 13:21:44 -04:00
Jeffrey Novotny	4992db3e6c	Add FBGEMM/FBGEMM_GPU to the Model acceleration libraries page (#3659 ) * Add FBGEMM/FBGEMM_GPU to the Model acceleration libraries page * Add words to wordlist and fix a typo * Add new sections for Docker and testing * Incorporate comments from the external review * Some minor edits and clarifications * Incorporate further review coments and fix test section * Add comment to test section * Change git clone command for FBGEMM repo * Change Docker command * Changes from internal review * Fix linting issue	2024-09-09 11:20:50 -04:00
Jeffrey Novotny	23a67a3abf	Add introduction and links to the new guide to the vLLM optimized Doc… (#3637 ) * Add introduction and links to the new guide to the vLLM optimized Docker image on AMD Infinity Hub * Update target link for the Docker vLLM guide * Change target URL * Change link target URL again	2024-09-04 17:07:46 -04:00
Peter Park	bc64c7b425	Fix intersphinx links (#3668 ) * fix links in install.rst * fix links in sys opt guides	2024-09-03 12:28:24 -04:00
ozziemoreno	b91522afbb	Update model-quantization.rst to import `BitsAndBytesConfig` from transformers library (#3638 )	2024-09-03 10:35:35 -04:00
Jeffrey Novotny	66211e27b6	Expand the section on changing thread affinity (#3653 ) * Expand the section on changing thread affinity * Clarify the methods for configuring allocatable memory settings * Small correction	2024-08-29 09:45:50 -04:00
Chris Kime	a19fe8bb31	Correct ttm to amdttm (#3648 )	2024-08-27 14:23:04 -04:00
Jeffrey Novotny	91d4a7e0c9	Add a section on increasing memory allocation to the MI300A system op… (#3587 ) * Add a section on increasing memory allocation to the MI300A system optimization guide * Addition to wordlist * Change GB to GiB for consistency * Standardize GiB/KiB spacing * Minor wording changes	2024-08-16 08:35:08 -04:00
Peter Park	27f5d9ad7d	Fix intersphinx links (#3546 ) * update fw install links * fix more intersphinx links * fix more links	2024-08-08 15:20:57 -04:00

1 2 3

108 Commits