
mirror of https://github.com/ROCm/ROCm.git synced 2026-02-05 03:45:12 -05:00

Author	SHA1	Message	Date
Peter Park	c3faa9670b	Add PyTorch inference benchmark Docker guide (+ CLIP and Chai-1) (#4654) * update vLLM links in deploy-your-model.rst * add pytorch inference benchmark doc * update toc and vLLM title * remove previous versions * update * wording * fix link and "applies to" * add pytorch to wordlist * add tunableop note to clip * make tunableop note appear to all models * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * Update docs/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> * fix incorrect links * wording * fix wrong docker pull tag --------- Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>	2025-04-23 17:35:52 -04:00
Peter Park	9b2ce2b634	Update vLLM performance Docker docs (#4491) * add links to performance results words * change "performance validation" to "performance testing" * update vLLM docker 3/11 * add previous versions add previous versions * fix llama 3.1 8b model repo name * words	2025-03-13 10:04:21 -04:00
Peter Park	1fb42c2591	Update LLM inference performance validation on AMD Instinct MI300X guide to filter by desired model (#4424) * WIP (cherry picked from commit a06a5b5b959a9425e7384fb58b88c3716f380e48) rm unneeded files (cherry picked from commit f1d0c00056a83299bdea74a43cd17454999cf2d8) * add sphinxcontrib.datatemplates (cherry picked from commit d056b93a325d87b81f54f70c6eb4ae78f4fb0bc1) * add template (cherry picked from commit 0691d59f0a1efbda7908762b7a906e30a65c0ee1) fix template (cherry picked from commit 01e4bea5522aa5deeaade58c105ff850f449df8b) WIPO (cherry picked from commit 4d8daf7445e7be92cd9ee1d39dff564bd8de41f4) WIP (cherry picked from commit 9eefd1f5833bc4dc8de9d777ff65a5fe5f826dbd) update models yaml schema (cherry picked from commit a5f0fc1e6cc51104dc2d42029bfcf3eea276d270) add model groups functionality (cherry picked from commit 13f49f96dd3e5a160d37c52e48a4fbcccdcf4f9e) add selector headings and fix template (cherry picked from commit 35f7f2314bcf74b4fd0a8ca10aaabf0de7063bb0) update template (cherry picked from commit 9e2dcfe0c7f6e7c2c685866ea83375fbacbc5032) fix (cherry picked from commit be51e32791550ddc21785effccb889228394b242) use classes instead of data tags (cherry picked from commit cd52d68c504f7e7435d156ae70cf4bde1dfe703e) update template (cherry picked from commit 9ed89fee6874b39ee3535fbde54a0a59f346ea2b) clean up extra wip files (cherry picked from commit a9f965a104baa966c184054638e935b011526278) update wordlist (cherry picked from commit f783656814e896aedd21acd1c8c87b4700c14469) remove unused template (cherry picked from commit cac894bd9c2b1262c9c006e5fddbcb742dc6d882) improve script (cherry picked from commit ca20ffd4922916616e0924d625652a815f27c35f) fix template (cherry picked from commit 752c61fda856fd5b244734636c036c8877e823b9) fix standalone benchmark output path in template (cherry picked from commit d8c04203b5ec0f6c2e2307f7890304a3dc5687be) fix toc (cherry picked from commit 8df42faf53488ef29f5a263d25032f3d35cd58ed) update script to prevent flash of unstyled content import a11y (cherry picked from commit 46c852717f223a1d8744fab035807cebab4c5404) add tabindex to wordlist (cherry picked from commit 11492593f9692f5453045e7ec52c8f8ae9624ae9) text update script * remove unused config option * reorganize assets * fix linting warning * move js from data/ to extension/	2025-02-28 12:39:02 -05:00
alexxu-amd	85bd6e98f5	Remove gpu-cluster-networking and 'Using MPI' page due to migration to Instinct Docs (#4201) * remove 'Using MPI' and 'gpu-cluster-networking' sections due to migration to dcgpu * remove gpu-cluster-networking from index page --------- Co-authored-by: Alex Xu <alex.xu@amd.com>	2024-12-30 09:39:46 -05:00
Peter Park	f9dbc1f21f	add megatron training doc (#4159) * add megatron training doc update toc add images update formatting and wording formatting update formatting update conf.py update formatting update docker img tweak formatting Fix stuff fix mock-data/data-path add specific commit hash to checkout update docker pull tag fix docker run cmd and examples path fix docker cmd * wording words words * improve title	2024-12-16 13:37:35 -05:00
Peter Park	b0722b3228	Add @hongxiayang updates to MI300X workload tuning guide (#4123) minor fixes to formatting fix spelling errors more spelling fixes quantization update fix format simplify wording in tunableops and format fix Apply suggestions from code review review feedback by Peter Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review addressing feedback Co-authored-by: Peter Park <peter.park@amd.com> Apply suggestions from code review feedback again Co-authored-by: Peter Park <peter.park@amd.com> add hipblaslt yaml file figure feedback and minor formatting formatting update wordlist.txt remove outdated sentence regarding fsdp and rccl (cherry picked from commit 87fa9fd83a2e623f6cab4e69d65f49e3db0a45f6) update wordlist Co-authored-by: hongxyan <hongxyan@amd.com>	2024-12-06 12:10:57 -05:00
Sam Wu	f77e2dd7a7	Sync develop branch (#4078)	2024-12-03 15:18:51 -07:00
Jeffrey Novotny	bdcb82372b	MI300A system optimization guide internal draft (#117) * MI300A system optimization guide internal draft * Small changes to System BIOS paragraph * Some minor edits * Changes after external review feedback * Add CPU Affinity debug setting * Edit CPU Affinity debug setting * Changes from external discussion * Add glossary and other small fixes * Additional changes from the review * Update the IOMMU guidance * Change description of CPU affinity setting * Slight rewording * Change Debian to Red Hat-based * A few changes from the second internal review	2024-07-31 13:29:49 -04:00
Peter Park	7b883f3af4	Add MI300X tuning guides (#3448 ) * Add MI300X tuning guides Add mi300x doc (pandoc conversion) fix headings add metadata move images to shared/ move images to shared/ convert tuning-guides.md to rst using pandoc add mi300x to tuning-guides.rst landing page update h1s, toc, and landing page fix spelling fix fmt format code blocks add tensilelite imgs fix formatting fix formatting some more fix formatting more formatting spelling remove --enforce-eager note satisfy spellcheck linter more spelling add fixes from hongxia fix env var in D5 add fixes to PyTorch inductor section fix fix Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update docs/how-to/tuning-guides/mi300x.rst 
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update 'torch_compile_debug' suggestion based on Hongxia's feedback fix PyTorch inductor env vars minor formatting fixes Apply suggestions from code review Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Update vllm path Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> disable numfig in Sphinx configuration fix formatting and capitalization add words to wordlist update index update wordlist update optimizing-triton-kernel convert cards to table fix link in index.md add @lpaoletti's feedback Add system tuning guide add images add system section add os settings and sys management remove pcie=noats recommendation reorg add blurb to developer section impr formatting remove windows os from tuning guides pages in conf.py add suggestions from review fix typo and link remove os windows from relevant pages in conf mi300x add suggestions from review fix toc fix index links reorg update vLLM vars Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> update vLLM vars Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> reorganize add warnings add text to system tuning add filler text on index pages reorg tuning pages fix links fix vars * rm old pages fix toc * add suggestions from review small change add more suggestions rewrite intro * add 'workload tuning philosophy' * refactor * fix broken links * black format conf.py * simplify cmd and update doc structure * add higher-level heading for consistency (mi300x.rst) * add fixes from review fix url add fixes fix formatting fix fmt fix hipBLASLt section change words fix tensilelite section fix fix fix fmt * style guide * fix some formatting * satisfy spellcheck linter * update wordlist * fix bad conflict resolution	2024-07-22 17:24:14 -04:00
randyh62	091fa3ef8e	update AI framework image (#3406) * update AI framework image * remove old image	2024-07-16 11:02:07 -07:00
randyh62	356ad4ab47	remove Magma (#3361) * remove Magma * missed one	2024-06-26 10:00:39 -07:00
Peter Park	22e9f6f373	Add "Using ROCm for HPC" guide (#3302 ) * Add ROCm for HPC * Update index and toc * Add TMs in other tutorials * Add hpc apps table Spellcheck add stack image and fix links Add descriptions update copy Update copy add ref Finish adding app descriptions tweak descs fix line lengths * Revert "Add TMs in other tutorials" This reverts commit `08a1a80e57`. * Add links to install and compat matrix * Update HPC stack graphic and add some links Add hpc and td to wordlist fix links * Apply suggestions from Leo's review Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Update docs/how-to/rocm-for-hpc/index.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Update docs/how-to/rocm-for-hpc/index.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Update docs/how-to/rocm-for-hpc/index.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Update docs/how-to/rocm-for-hpc/index.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Update docs/how-to/rocm-for-hpc/index.rst Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> fix formatting Update words * update wordlist * Update hpc app descriptions with content from InfinityHub catalog	2024-06-21 16:15:18 -04:00
Peter Park	6494885359	Rename fine-tuning and optimization guide directory and fix index.md (#3242) * Mv fine-tuning and optimization files * Reorder index.md * Rename images directory * Fix internal links	2024-06-05 11:11:00 -04:00
Peter Park	fed33835a0	Add "Fine Tuning LLMs" how to guide (#3124) * Add Fine Tuning LLMs how to guide * Reorg and refactor Fine-tuning LLMs with ROCm Update index and headings Fix formatting and update toc Split out content from index to overview.rst Add metadata Clean up overview Add inference sections, fix rst errors, clean up single-gpu-fine-tuning Combine fine-tuning and inference guides Fix some links and formatting Update toc and add formatting fixes Add ck kernel fusion content Update toc Clean up model quantization and acceleration Add CK images Clean up profiling Update triton kernel performance optimization Update llm inference frameworks guide Disable automatic number of figures and tables in Sphinx conf Change tabs to spaces Change heading to end with -ing Add link fixes and heading updates Add rocprof/Omniperf/Omnitrace section Update profiling and debugging guide Add formatting fixes Satisfy spellcheck Fix words Delete unused file Finish overview Clean up first 4 sections Multi-gpu fine-tuning guide: slight fixes Update toc Remove tabs Formatting fixes * Minor wording updates * Add some clean-up * Update profiling and debugging gudie * Fix Omnitrace link * Update ck kernel fusion with latest * Update CK formatting * Fix perfetto link syntax * Fix typos and add blurbs * Add fixes to Triton optimization doc * Tabify saving adapters / models section * Fix linting errors - spellcheck Fix spelling and grammar Satisfy linter Update wording in profiling guide Add fixes to satisfy linter More fixes for linting in Triton guide More linting fixes Spellcheck in CK guide * Improve triton guide Fix linting errors and optics * Add occupancy / vgpr table Change some wording * Re-add tunableop * Add missing indent in _toc.yml * Remove ckProfiler references * Add links to resources * Add refs in CK optimization guide * Rename files and fix internal links * Organize tuning guides Reorg triton * Add compute unit diagram * Remove AutoAWQ * Add higher res image for Perfetto trace example * Update link text * Update fig nums * Update some formatting * Update "Inductor" * Change "Inductor" to TorchInductor * Add link to official TorchInductor docs	2024-06-03 14:04:33 -04:00
Peter Park	61d18252ab	Remove unused images and add link to usage in Deep Learning install guide (#3196)	2024-05-30 19:28:13 -04:00
Peter Park	6a5defb825	Add "How to use ROCm for AI" (#3117 ) * Add Using ROCm for AI:wq Add PyTorch Docker installation images Split doc into subtopics Add metadata Clean up index Clean up hugging face guide Clean up installation guide Fix rST formatting Clean up install and train-a-model Clean up MAD Delete unused file Add ref anchors and clean up MAD doc Add formatting fixes Update toc and section index Format some code blocks Remove install guide and update toc Chop installation guide Clean up deployment and hugging face sections Change headings to end in -ing Fix spelling in Training a model Delete MAD and split out install content Fix formatting Change words to satisfy spellcheck linter * Add review suggestions and add helpful links Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com> Add helpful links and add review suggestions Remove fine-tuning link and links to D5 and MAGMA Update docs/how-to/rocm-for-ai/deploy-your-model.rst Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com> Update DeepSpeed link Add subheading to ML framework installation and closing blurb to hugging face models guide * Reorder topics	2024-05-30 16:17:44 -04:00
Peter Park	3a68f43df7	Reorg 'Deep learning' and 'Tuning guides' docs (#3153) * Rename 'Tuning guides' to 'Hardware optimization' * Move deep learning to Install section * Change 'Hardware' to 'System' to align with index.md * Satisfy spellcheck linter * adding new framework install graphic with JAX * Fix link to ROCm libraries list * crop framework_install graphic * Reset .wordlist.txt update * Prettify deep learning framework installation page * Change spacing in list of frameworks --------- Co-authored-by: Young Hui <young.hui@amd.com>	2024-05-29 14:12:43 -04:00
Lisa	a44f6d1efc	link updates (#2861)	2024-02-08 17:24:12 -07:00
Lisa	d0d4eed1a6	Update titles to sentence case (#2455)	2023-09-18 12:26:31 -06:00
Lisa	7c5976004f	ROCm A-Z page & link cleanup (#2450)	2023-09-13 13:00:50 -06:00

20 Commits