Sam Wu
a1518ffa94
Merge develop into roc-6.1.x ( #3440 )
...
* Bump rocm-docs-core from 1.4.1 to 1.5.0 in /docs/sphinx (#3396 )
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core ) from 1.4.1 to 1.5.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases )
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md )
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.4.1...v1.5.0 )
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump certifi from 2024.2.2 to 2024.7.4 in /docs/sphinx (#3399 )
Bumps [certifi](https://github.com/certifi/python-certifi ) from 2024.2.2 to 2024.7.4.
- [Commits](https://github.com/certifi/python-certifi/compare/2024.02.02...2024.07.04 )
---
updated-dependencies:
- dependency-name: certifi
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* External CI: build hipBLASLt external dependencies (#3405 )
* External CI: Increase composable_kernel pipeline time limit (#3407 )
* [Changelog/release notes] Fix and add custom templates for autotag script (#3408 )
* Update custom templates
* Add custom templates
* Fix custom template for hipfort
* Fix custom template for hipify
* Fix custom template for rvs
* External CI: Change composable_kernel pipeline to build for specific GPUs with tests and examples (#3412 )
* increase task time limit
* test building CK for multiple architectures
* Update composable_kernel.yml
* Update composable_kernel.yml
* gfx90a build
* gfx941;gfx1100;gfx1030 build
* hipTensor gfx941 build
* hipTensor gfx941 build
* reduce CK timeout to 100 minutes
* change all gfx90a targets to gfx942
* Bump sphinx-reredirects from 0.1.4 to 0.1.5 in /docs/sphinx (#3419 )
Bumps [sphinx-reredirects](https://github.com/documatt/sphinx-reredirects ) from 0.1.4 to 0.1.5.
- [Commits](https://github.com/documatt/sphinx-reredirects/compare/v0.1.4...v0.1.5 )
---
updated-dependencies:
- dependency-name: sphinx-reredirects
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Removed TransferBench from the tools list (#3421 )
* update AI framework image (#3406 )
* update AI framework image
* remove old image
* Update system optimization guides headings (#3422 )
* update headings to system optimization
* update index
* conv tuning-guides.md to rst
* shorten system optimization landing page
* update conf.py
update toc order
add space
* Update docs/how-to/tuning-guides.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* update keywords
* update intro
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* External CI: move hipBLASLt build directory to ephemeral storage (#3433 )
* build hipblaslt in /mnt instead
* rm checkoutref
* remove debug step
* Update using-gpu-sanitizer.md with new known issues (#3423 )
* External CI: move hipBLASLt to new large disk pool
* Remove unused custom template for ck (#3438 )
* External CI: ROCm nightly builds (#3435 )
* ROCm nightly builds
* remove branch trigger, enable develop
* Remove unused configurations in conf.py (#3444 )
* External CI: Switch all pipeline GPU_TARGETS to gfx942 (#3443 )
* Switch all pipeline gpu targets to gfx942
* Change more pipelines target to gfx942
* set variables for manual testing
* Switch all pipeline gpu targets to gfx942
* Change more pipelines target to gfx942
* set variables for manual testing
* add test pipeline id
* revert test changes
* correct gpu target name
* remove unused flags; change hipSPARSELt target to be gfx942
* Add MI300X tuning guides (#3448 )
* Add MI300X tuning guides
Add mi300x doc (pandoc conversion)
fix headings
add metadata
move images to shared/
move images to shared/
convert tuning-guides.md to rst using pandoc
add mi300x to tuning-guides.rst landing page
update h1s, toc, and landing page
fix spelling
fix fmt
format code blocks
add tensilelite imgs
fix formatting
fix formatting some more
fix formatting
more formatting
spelling
remove --enforce-eager note
satisfy spellcheck linter
more spelling
add fixes from hongxia
fix env var in D5
add fixes to PyTorch inductor section
fix
fix
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update 'torch_compile_debug' suggestion based on Hongxia's feedback
fix PyTorch inductor env vars
minor formatting fixes
Apply suggestions from code review
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update vllm path
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
disable numfig in Sphinx configuration
fix formatting and capitalization
add words to wordlist
update index
update wordlist
update optimizing-triton-kernel
convert cards to table
fix link in index.md
add @lpaoletti's feedback
Add system tuning guide
add images
add system section
add os settings and sys management
remove pcie=noats recommendation
reorg
add blurb to developer section
impr formatting
remove windows os from tuning guides pages in conf.py
add suggestions from review
fix typo and link
remove os windows from relevant pages in conf
mi300x
add suggestions from review
fix toc
fix index links
reorg
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
reorganize
add warnings
add text to system tuning
add filler text on index pages
reorg tuning pages
fix links
fix vars
* rm old pages
fix toc
* add suggestions from review
small change
add more suggestions
rewrite intro
* add 'workload tuning philosophy'
* refactor
* fix broken links
* black format conf.py
* simplify cmd and update doc structure
* add higher-level heading for consistency (mi300x.rst)
* add fixes from review
fix url
add fixes
fix formatting
fix fmt
fix hipBLASLt section
change words
fix tensilelite section
fix
fix
fix fmt
* style guide
* fix some formatting
* satisfy spellcheck linter
* update wordlist
* fix bad conflict resolution
---------
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: danielsu-amd <danielsu@amd.com >
Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com >
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com >
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com >
Co-authored-by: Peter Park <peter.park@amd.com >
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: b-sumner <brian.sumner@amd.com >
2024-07-22 15:39:48 -06:00
randyh62
356ad4ab47
remove Magma ( #3361 )
...
* remove Magma
* missed one
2024-06-26 10:00:39 -07:00
Peter Park
22e9f6f373
Add "Using ROCm for HPC" guide ( #3302 )
...
* Add ROCm for HPC
* Update index and toc
* Add TMs in other tutorials
* Add hpc apps table
Spellcheck
add stack image and fix links
Add descriptions
update copy
Update copy
add ref
Finish adding app descriptions
tweak descs
fix line lengths
* Revert "Add TMs in other tutorials"
This reverts commit 08a1a80e57 .
* Add links to install and compat matrix
* Update HPC stack graphic and add some links
Add hpc and td to wordlist
fix links
* Apply suggestions from Leo's review
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Update docs/how-to/rocm-for-hpc/index.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Update docs/how-to/rocm-for-hpc/index.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Update docs/how-to/rocm-for-hpc/index.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Update docs/how-to/rocm-for-hpc/index.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Update docs/how-to/rocm-for-hpc/index.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
fix formatting
Update words
* update wordlist
* Update hpc app descriptions with content from InfinityHub catalog
2024-06-21 16:15:18 -04:00
Peter Park
6494885359
Rename fine-tuning and optimization guide directory and fix index.md ( #3242 )
...
* Mv fine-tuning and optimization files
* Reorder index.md
* Rename images directory
* Fix internal links
2024-06-05 11:11:00 -04:00
Peter Park
fed33835a0
Add "Fine Tuning LLMs" how to guide ( #3124 )
...
* Add Fine Tuning LLMs how to guide
* Reorg and refactor Fine-tuning LLMs with ROCm
Update index and headings
Fix formatting and update toc
Split out content from index to overview.rst
Add metadata
Clean up overview
Add inference sections, fix rst errors, clean up single-gpu-fine-tuning
Combine fine-tuning and inference guides
Fix some links and formatting
Update toc and add formatting fixes
Add ck kernel fusion content
Update toc
Clean up model quantization and acceleration
Add CK images
Clean up profiling
Update triton kernel performance optimization
Update llm inference frameworks guide
Disable automatic number of figures and tables in Sphinx conf
Change tabs to spaces
Change heading to end with -ing
Add link fixes and heading updates
Add rocprof/Omniperf/Omnitrace section
Update profiling and debugging guide
Add formatting fixes
Satisfy spellcheck
Fix words
Delete unused file
Finish overview
Clean up first 4 sections
Multi-gpu fine-tuning guide: slight fixes
Update toc
Remove tabs
Formatting fixes
* Minor wording updates
* Add some clean-up
* Update profiling and debugging gudie
* Fix Omnitrace link
* Update ck kernel fusion with latest
* Update CK formatting
* Fix perfetto link syntax
* Fix typos and add blurbs
* Add fixes to Triton optimization doc
* Tabify saving adapters / models section
* Fix linting errors - spellcheck
Fix spelling and grammar
Satisfy linter
Update wording in profiling guide
Add fixes to satisfy linter
More fixes for linting in Triton guide
More linting fixes
Spellcheck in CK guide
* Improve triton guide
Fix linting errors and optics
* Add occupancy / vgpr table
Change some wording
* Re-add tunableop
* Add missing indent in _toc.yml
* Remove ckProfiler references
* Add links to resources
* Add refs in CK optimization guide
* Rename files and fix internal links
* Organize tuning guides
Reorg triton
* Add compute unit diagram
* Remove AutoAWQ
* Add higher res image for Perfetto trace example
* Update link text
* Update fig nums
* Update some formatting
* Update "Inductor"
* Change "Inductor" to TorchInductor
* Add link to official TorchInductor docs
2024-06-03 14:04:33 -04:00
Peter Park
61d18252ab
Remove unused images and add link to usage in Deep Learning install guide ( #3196 )
2024-05-30 19:28:13 -04:00
Peter Park
6a5defb825
Add "How to use ROCm for AI" ( #3117 )
...
* Add Using ROCm for AI:wq
Add PyTorch Docker installation images
Split doc into subtopics
Add metadata
Clean up index
Clean up hugging face guide
Clean up installation guide
Fix rST formatting
Clean up install and train-a-model
Clean up MAD
Delete unused file
Add ref anchors and clean up MAD doc
Add formatting fixes
Update toc and section index
Format some code blocks
Remove install guide and update toc
Chop installation guide
Clean up deployment and hugging face sections
Change headings to end in -ing
Fix spelling in Training a model
Delete MAD and split out install content
Fix formatting
Change words to satisfy spellcheck linter
* Add review suggestions and add helpful links
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Add helpful links and add review suggestions
Remove fine-tuning link and links to D5 and MAGMA
Update docs/how-to/rocm-for-ai/deploy-your-model.rst
Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com >
Update DeepSpeed link
Add subheading to ML framework installation and closing blurb to hugging face models guide
* Reorder topics
2024-05-30 16:17:44 -04:00
Peter Park
3a68f43df7
Reorg 'Deep learning' and 'Tuning guides' docs ( #3153 )
...
* Rename 'Tuning guides' to 'Hardware optimization'
* Move deep learning to Install section
* Change 'Hardware' to 'System' to align with index.md
* Satisfy spellcheck linter
* adding new framework install graphic with JAX
* Fix link to ROCm libraries list
* crop framework_install graphic
* Reset .wordlist.txt update
* Prettify deep learning framework installation page
* Change spacing in list of frameworks
---------
Co-authored-by: Young Hui <young.hui@amd.com >
2024-05-29 14:12:43 -04:00
peter
da18980f63
Reorganize "What is ROCm?" page ( #3006 )
...
* add rocm software stack diagram to What is ROCm landing page
* restructure ROCm project list table
* clean up unnecessary hyphenation
* update What is ROCm stack diagram filename
* reorder rocm project list to reflect diagram
* update "What is ROCm?" image metadata
* change 'project list' to 'components'
* change 'project' to 'component'
2024-04-12 14:01:41 -04:00
Istvan Kiss
47d06cb492
Precision support ( #2815 )
...
* Precision support page initial commit
Move to rst file
Fix details of Mi100
Update docs/about/compatibility/precission-support.md Co-authored-by: MKKnorr <MKKnorr@web.de >
* Update precission-support page
Co-authored-by: MKKnorr <MKKnorr@web.de >
* PR fix based on feedbackcs
* Rename precision-support.rst to data-type_support.rts
* Update rocThrust library data type support
* PR findings fixes
* Update data-type-support page
Co-authored-by: MKKnorr <MKKnorr@web.de >
* Update docs/about/compatibility/data-type-support.rst
Co-authored-by: MKKnorr <MKKnorr@web.de >
* lisa edits
---------
Co-authored-by: MKKnorr <MKKnorr@web.de >
Co-authored-by: Lisa Delaney <lisa.delaney@amd.com >
2024-03-05 09:48:14 -07:00
Sam Wu
89110a1662
Merge pull request #2910 from LisaDelaney/image-cleanup
...
add alt text
2024-02-22 17:26:17 -07:00
Istvan Kiss
67e3fc994b
MI300 documentation ( #2779 )
...
---------
Co-authored-by: Nagy-Egri Máté Ferenc <mate@streamhpc.com >
Co-authored-by: Lisa Delaney <lisa.delaney@amd.com >
Co-authored-by: Davide Teixeira <77169625+daviteix@users.noreply.github.com >
2024-02-20 17:02:36 -07:00
Lisa Delaney
70ea915709
reduce image sizes
2024-02-20 10:34:04 -07:00
Lisa
a44f6d1efc
link updates ( #2861 )
2024-02-08 17:24:12 -07:00
Lisa
3c94962813
new banner images ( #2884 )
2024-02-08 11:53:48 -07:00
Lisa
8bbd51376d
update contributing section & update card images ( #2865 )
2024-02-07 09:31:45 -07:00
Lisa
c6e2856822
Update style guidelines ( #2542 )
2023-10-12 13:50:15 -06:00
Lisa
e87dba01c6
ROCm restructuring ( #2521 )
...
Flattened out page structure for improved navigability.
* Change Table of Contents
* Update the install guides for windows and linux
* Removed extraneous index pages
* GPU architecture pages duplicate entries removed
* spack page cleanup
---------
Co-authored-by: Sam Wu <samwu103@amd.com >
Co-authored-by: Saad Rahim (AMD) <44449863+saadrahim@users.noreply.github.com >
2023-10-06 15:42:11 -06:00
Sam Wu
786b44d8eb
Remove 404.md from ROCm ( #2487 )
...
* rm 404 img
* remove gitignore file
* remove 404 page on rocm
2023-09-20 11:51:31 -06:00
Lisa
d0d4eed1a6
Update titles to sentence case ( #2455 )
2023-09-18 12:26:31 -06:00
Lisa
7c5976004f
ROCm A-Z page & link cleanup ( #2450 )
2023-09-13 13:00:50 -06:00
Lisa
890c735f53
site restructure phase 1 - file reorganization ( #2428 )
2023-09-08 10:02:17 -06:00
Lisa
b963f7fa05
404 updates ( #2406 )
...
add 404 page image
---------
Co-authored-by: Saad Rahim <44449863+saadrahim@users.noreply.github.com >
Co-authored-by: Sam Wu <sam.wu2@amd.com >
2023-08-24 17:35:44 -06:00
Saad Rahim
445432da13
Merge branch 'develop' into roc-5.6.x
2023-08-21 15:11:36 -06:00
Saad Rahim
e13e1d31c3
Adding Windows Installation Instructions ( #2339 )
2023-07-27 11:00:44 -06:00
srawat
253f69b445
Adding openmp image ( #2323 )
...
Co-authored-by: Sam Wu <sam.wu2@amd.com >
2023-07-25 11:05:09 -06:00
Nagy-Egri Máté Ferenc
6c1fff6692
RDNA2 Virtualization Guide ( #2149 )
2023-05-18 09:39:37 -06:00
Nagy-Egri Máté Ferenc
d9f272a505
MI100 and MI200 extra content ( #2112 )
2023-05-11 09:34:11 -06:00
Nagy-Egri Máté Ferenc
62ed404058
Initial GPU-aware MPI port ( #2086 )
...
* Initial GPU-aware MPI port
* Remove trailing spaces
* Allowlist word in gpu_aware_mpi
2023-05-04 09:42:22 -06:00
Sam Wu
57c601262b
HPC cleanup - Clean up the deployment related pages ( #2080 )
...
* Clean up the deployment related pages
- Add an index page for the linux deployment submenu
- Remove deployment options that are not yet completed (i.e. spack,
from source installation)
- remove the general deployment index page
- various cleanups and clarifications in the rest of the pages
* Move all deploy pages to deploy folder
---------
Co-authored-by: Gergely Meszaros <gergely@streamhpc.com >
2023-04-24 12:07:17 -06:00
Sam Wu
b897bddf38
Linkcheck and prepare alpha ( #2078 )
2023-04-24 11:25:31 -06:00
Ehud Sharlin
7bbd5bc79d
Deep Learning Training - Troubleshooting & References ( #2033 )
2023-04-12 07:37:52 -06:00
Nagy-Egri Máté Ferenc
1ec7e1c933
Port installation guide ( #2018 )
2023-04-06 09:42:07 -06:00
Nagy-Egri Máté Ferenc
2e7266c829
1908-uninstall-guide-linux ( #2000 )
2023-03-31 07:33:22 -06:00
Ehud Sharlin
415f3b93ad
Inception V3 Example, Deep Learning Guide Decomposed and OpenMP Guide ( #1937 )
2023-03-30 08:01:06 -06:00
Nagy-Egri Máté Ferenc
286f120d9a
MI100 architecture guide ( #1994 )
...
* Initial MI100 docs
* Try changing style to fix MD004
* Disable MD004
* Disable MD005
* Move to {table} from {list-table}
* Don't disable few MD styles
2023-03-29 07:14:23 -06:00
Nagy-Egri Máté Ferenc
e9ee6b9874
Initial MI250 Guide ( #1976 )
...
* Initial MI250 Guide
* Limit line length to 80 columns
* References using MyST
* Move to figure-md and numref
* Add MI250 to TOC
2023-03-22 15:45:00 +01:00
Alex Voicu
bcba7ed752
Rtd alexv feedback ( #1945 )
2023-03-15 12:22:25 -06:00
Saad Rahim
b19681711c
Pitchfork Standard for Docs ( #1918 )
2023-03-09 14:03:04 -07:00