* remove 'Using MPI' and 'gpu-cluster-networking' sections due to migration to dcgpu
* remove gpu-cluster-networking from index page
---------
Co-authored-by: Alex Xu <alex.xu@amd.com>
add mitigiation steps
add to toc
move page for build
move doc
fix spelling
update doc
update oversubscription
update order
fix spelling
add oversubscription to wordlist
move oversubscription topic to bottom of toc and index
* Add links to gpu cluster network guides
* Add newline character to eof
* Make link absolute
* add dynamic branch in toc
* remove unnecessary page
clean up
* clean up index/toc
* make multi-node topics adjacent
---------
Co-authored-by: Peter Park <peter.park@amd.com>
* adding preliminary compatibility matrix data for 6.2.1
* bump up some version numbers from 6.2.0 to 6.2.1
* adding kernel versions to compatibility matrix. I hate it
* add kernel version lookup table, in dropdown list
* add KFD and User space support. Also adjust some meta data keywords
* update 6.2.1 RC2 versions
* make spelling linter happy
* remove kernel versions from table, just reference LUT below
* Leave kenerel Lookup table expanded
* update kernel version table
* remove kernels from historical matrix, update footnotes
* move historical matrix into compatibility folder
* update historical matrix paths
* version bumps for RC3
* RC4 has no other version bumps. Reorder RPP alphabetically
* change How-To card hue to purple
* move precision_support to reference
* add rocPyDecode to AI
* Use CSS style sheets for Card titles
* remove temp folder and files
* add card hues
* shuffle hues
* update requirements
* add hues test
* add hues test2
* select hues
* remove hues test
* use hues and add gutters
* sync TOC and index titles
* once more through the TOC
* MI300A system optimization guide internal draft
* Small changes to System BIOS paragraph
* Some minor edits
* Changes after external review feedback
* Add CPU Affinity debug setting
* Edit CPU Affinity debug setting
* Changes from external discussion
* Add glossary and other small fixes
* Additional changes from the review
* Update the IOMMU guidance
* Change description of CPU affinity setting
* Slight rewording
* Change Debian to Red Hat-based
* A few changes from the second internal review
* Add MI300X tuning guides
Add mi300x doc (pandoc conversion)
fix headings
add metadata
move images to shared/
move images to shared/
convert tuning-guides.md to rst using pandoc
add mi300x to tuning-guides.rst landing page
update h1s, toc, and landing page
fix spelling
fix fmt
format code blocks
add tensilelite imgs
fix formatting
fix formatting some more
fix formatting
more formatting
spelling
remove --enforce-eager note
satisfy spellcheck linter
more spelling
add fixes from hongxia
fix env var in D5
add fixes to PyTorch inductor section
fix
fix
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update 'torch_compile_debug' suggestion based on Hongxia's feedback
fix PyTorch inductor env vars
minor formatting fixes
Apply suggestions from code review
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
Update vllm path
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
disable numfig in Sphinx configuration
fix formatting and capitalization
add words to wordlist
update index
update wordlist
update optimizing-triton-kernel
convert cards to table
fix link in index.md
add @lpaoletti's feedback
Add system tuning guide
add images
add system section
add os settings and sys management
remove pcie=noats recommendation
reorg
add blurb to developer section
impr formatting
remove windows os from tuning guides pages in conf.py
add suggestions from review
fix typo and link
remove os windows from relevant pages in conf
mi300x
add suggestions from review
fix toc
fix index links
reorg
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com>
reorganize
add warnings
add text to system tuning
add filler text on index pages
reorg tuning pages
fix links
fix vars
* rm old pages
fix toc
* add suggestions from review
small change
add more suggestions
rewrite intro
* add 'workload tuning philosophy'
* refactor
* fix broken links
* black format conf.py
* simplify cmd and update doc structure
* add higher-level heading for consistency (mi300x.rst)
* add fixes from review
fix url
add fixes
fix formatting
fix fmt
fix hipBLASLt section
change words
fix tensilelite section
fix
fix
fix fmt
* style guide
* fix some formatting
* satisfy spellcheck linter
* update wordlist
* fix bad conflict resolution
* Add Fine Tuning LLMs how to guide
* Reorg and refactor Fine-tuning LLMs with ROCm
Update index and headings
Fix formatting and update toc
Split out content from index to overview.rst
Add metadata
Clean up overview
Add inference sections, fix rst errors, clean up single-gpu-fine-tuning
Combine fine-tuning and inference guides
Fix some links and formatting
Update toc and add formatting fixes
Add ck kernel fusion content
Update toc
Clean up model quantization and acceleration
Add CK images
Clean up profiling
Update triton kernel performance optimization
Update llm inference frameworks guide
Disable automatic number of figures and tables in Sphinx conf
Change tabs to spaces
Change heading to end with -ing
Add link fixes and heading updates
Add rocprof/Omniperf/Omnitrace section
Update profiling and debugging guide
Add formatting fixes
Satisfy spellcheck
Fix words
Delete unused file
Finish overview
Clean up first 4 sections
Multi-gpu fine-tuning guide: slight fixes
Update toc
Remove tabs
Formatting fixes
* Minor wording updates
* Add some clean-up
* Update profiling and debugging gudie
* Fix Omnitrace link
* Update ck kernel fusion with latest
* Update CK formatting
* Fix perfetto link syntax
* Fix typos and add blurbs
* Add fixes to Triton optimization doc
* Tabify saving adapters / models section
* Fix linting errors - spellcheck
Fix spelling and grammar
Satisfy linter
Update wording in profiling guide
Add fixes to satisfy linter
More fixes for linting in Triton guide
More linting fixes
Spellcheck in CK guide
* Improve triton guide
Fix linting errors and optics
* Add occupancy / vgpr table
Change some wording
* Re-add tunableop
* Add missing indent in _toc.yml
* Remove ckProfiler references
* Add links to resources
* Add refs in CK optimization guide
* Rename files and fix internal links
* Organize tuning guides
Reorg triton
* Add compute unit diagram
* Remove AutoAWQ
* Add higher res image for Perfetto trace example
* Update link text
* Update fig nums
* Update some formatting
* Update "Inductor"
* Change "Inductor" to TorchInductor
* Add link to official TorchInductor docs
* Add Using ROCm for AI:wq
Add PyTorch Docker installation images
Split doc into subtopics
Add metadata
Clean up index
Clean up hugging face guide
Clean up installation guide
Fix rST formatting
Clean up install and train-a-model
Clean up MAD
Delete unused file
Add ref anchors and clean up MAD doc
Add formatting fixes
Update toc and section index
Format some code blocks
Remove install guide and update toc
Chop installation guide
Clean up deployment and hugging face sections
Change headings to end in -ing
Fix spelling in Training a model
Delete MAD and split out install content
Fix formatting
Change words to satisfy spellcheck linter
* Add review suggestions and add helpful links
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
Add helpful links and add review suggestions
Remove fine-tuning link and links to D5 and MAGMA
Update docs/how-to/rocm-for-ai/deploy-your-model.rst
Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com>
Update DeepSpeed link
Add subheading to ML framework installation and closing blurb to hugging face models guide
* Reorder topics
* Rename 'Tuning guides' to 'Hardware optimization'
* Move deep learning to Install section
* Change 'Hardware' to 'System' to align with index.md
* Satisfy spellcheck linter
* adding new framework install graphic with JAX
* Fix link to ROCm libraries list
* crop framework_install graphic
* Reset .wordlist.txt update
* Prettify deep learning framework installation page
* Change spacing in list of frameworks
---------
Co-authored-by: Young Hui <young.hui@amd.com>