Sam Wu
a1518ffa94
Merge develop into roc-6.1.x ( #3440 )
...
* Bump rocm-docs-core from 1.4.1 to 1.5.0 in /docs/sphinx (#3396 )
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core ) from 1.4.1 to 1.5.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases )
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md )
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.4.1...v1.5.0 )
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump certifi from 2024.2.2 to 2024.7.4 in /docs/sphinx (#3399 )
Bumps [certifi](https://github.com/certifi/python-certifi ) from 2024.2.2 to 2024.7.4.
- [Commits](https://github.com/certifi/python-certifi/compare/2024.02.02...2024.07.04 )
---
updated-dependencies:
- dependency-name: certifi
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* External CI: build hipBLASLt external dependencies (#3405 )
* External CI: Increase composable_kernel pipeline time limit (#3407 )
* [Changelog/release notes] Fix and add custom templates for autotag script (#3408 )
* Update custom templates
* Add custom templates
* Fix custom template for hipfort
* Fix custom template for hipify
* Fix custom template for rvs
* External CI: Change composable_kernel pipeline to build for specific GPUs with tests and examples (#3412 )
* increase task time limit
* test building CK for multiple architectures
* Update composable_kernel.yml
* Update composable_kernel.yml
* gfx90a build
* gfx941;gfx1100;gfx1030 build
* hipTensor gfx941 build
* hipTensor gfx941 build
* reduce CK timeout to 100 minutes
* change all gfx90a targets to gfx942
* Bump sphinx-reredirects from 0.1.4 to 0.1.5 in /docs/sphinx (#3419 )
Bumps [sphinx-reredirects](https://github.com/documatt/sphinx-reredirects ) from 0.1.4 to 0.1.5.
- [Commits](https://github.com/documatt/sphinx-reredirects/compare/v0.1.4...v0.1.5 )
---
updated-dependencies:
- dependency-name: sphinx-reredirects
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Removed TransferBench from the tools list (#3421 )
* update AI framework image (#3406 )
* update AI framework image
* remove old image
* Update system optimization guides headings (#3422 )
* update headings to system optimization
* update index
* conv tuning-guides.md to rst
* shorten system optimization landing page
* update conf.py
update toc order
add space
* Update docs/how-to/tuning-guides.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* update keywords
* update intro
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* External CI: move hipBLASLt build directory to ephemeral storage (#3433 )
* build hipblaslt in /mnt instead
* rm checkoutref
* remove debug step
* Update using-gpu-sanitizer.md with new known issues (#3423 )
* External CI: move hipBLASLt to new large disk pool
* Remove unused custom template for ck (#3438 )
* External CI: ROCm nightly builds (#3435 )
* ROCm nightly builds
* remove branch trigger, enable develop
* Remove unused configurations in conf.py (#3444 )
* External CI: Switch all pipeline GPU_TARGETS to gfx942 (#3443 )
* Switch all pipeline gpu targets to gfx942
* Change more pipelines target to gfx942
* set variables for manual testing
* Switch all pipeline gpu targets to gfx942
* Change more pipelines target to gfx942
* set variables for manual testing
* add test pipeline id
* revert test changes
* correct gpu target name
* remove unused flags; change hipSPARSELt target to be gfx942
* Add MI300X tuning guides (#3448 )
* Add MI300X tuning guides
Add mi300x doc (pandoc conversion)
fix headings
add metadata
move images to shared/
move images to shared/
convert tuning-guides.md to rst using pandoc
add mi300x to tuning-guides.rst landing page
update h1s, toc, and landing page
fix spelling
fix fmt
format code blocks
add tensilelite imgs
fix formatting
fix formatting some more
fix formatting
more formatting
spelling
remove --enforce-eager note
satisfy spellcheck linter
more spelling
add fixes from hongxia
fix env var in D5
add fixes to PyTorch inductor section
fix
fix
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update 'torch_compile_debug' suggestion based on Hongxia's feedback
fix PyTorch inductor env vars
minor formatting fixes
Apply suggestions from code review
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update vllm path
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
disable numfig in Sphinx configuration
fix formatting and capitalization
add words to wordlist
update index
update wordlist
update optimizing-triton-kernel
convert cards to table
fix link in index.md
add @lpaoletti's feedback
Add system tuning guide
add images
add system section
add os settings and sys management
remove pcie=noats recommendation
reorg
add blurb to developer section
impr formatting
remove windows os from tuning guides pages in conf.py
add suggestions from review
fix typo and link
remove os windows from relevant pages in conf
mi300x
add suggestions from review
fix toc
fix index links
reorg
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
reorganize
add warnings
add text to system tuning
add filler text on index pages
reorg tuning pages
fix links
fix vars
* rm old pages
fix toc
* add suggestions from review
small change
add more suggestions
rewrite intro
* add 'workload tuning philosophy'
* refactor
* fix broken links
* black format conf.py
* simplify cmd and update doc structure
* add higher-level heading for consistency (mi300x.rst)
* add fixes from review
fix url
add fixes
fix formatting
fix fmt
fix hipBLASLt section
change words
fix tensilelite section
fix
fix
fix fmt
* style guide
* fix some formatting
* satisfy spellcheck linter
* update wordlist
* fix bad conflict resolution
---------
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: danielsu-amd <danielsu@amd.com >
Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com >
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com >
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com >
Co-authored-by: Peter Park <peter.park@amd.com >
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: b-sumner <brian.sumner@amd.com >
2024-07-22 15:39:48 -06:00
Istvan Kiss
78fdcdf48d
Update docs/conceptual/setting-cus.rst
...
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2024-06-12 16:17:42 +02:00
randyh62
f500c32989
add quarantine_size_mb ( #3264 )
...
* add quarantine_size_mb
* Update docs/conceptual/using-gpu-sanitizer.md
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* Update docs/conceptual/using-gpu-sanitizer.md
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* format fix
* format fix again
* ASAN capitalization
* remove particular
* indent bullets
* Leo comments
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2024-06-10 11:59:47 -07:00
Bence Parajdi
d86c23a847
remove unnecessary comma
2024-05-14 10:08:44 +02:00
Bence Parajdi
06c960aa97
Update docs/conceptual/setting-cus.rst
...
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
2024-05-13 16:27:05 +02:00
Bence Parajdi
41da494ef0
fix review comments
2024-05-13 16:26:16 +02:00
Bence Parajdi
c0fbd1ca5b
fix typos
2024-05-13 16:25:33 +02:00
Bence Parajdi
7f38465770
add cu setting page
2024-05-13 16:25:31 +02:00
randyh62
7ebd810f7a
updates for SWDEV-459863 ( #3113 )
2024-05-09 12:42:48 -07:00
Roopa Malavally
13ad427c8e
Update using-gpu-sanitizer.md ( #2991 )
...
* Update using-gpu-sanitizer.md
Minor OpenMP update
* Update using-gpu-sanitizer.md
Updated note with additional information.
* Update using-gpu-sanitizer.md
* Update using-gpu-sanitizer.md
Moved the note to another section
* Update using-gpu-sanitizer.md
2024-04-09 11:10:55 -07:00
Roopa Malavally
f298d60976
Update using-gpu-sanitizer.md ( #2970 )
...
* Update using-gpu-sanitizer.md
added the link text
Added the example
---------
Co-authored-by: Sam Wu <sam.wu2@amd.com >
2024-03-22 15:33:50 -06:00
Istvan Kiss
e4c0cf9044
Branch rebase fix ( #2916 )
2024-02-29 15:05:49 -07:00
MKKnorr
cd586348f5
Add instinct gpu architectures information ( #2859 )
...
* Add instinct gpu architectures information
* Improve gpu architecture table
Move table to "reference" instead of "conceptual"
* Add HIP terminology to GPU Arch glossary
2024-02-29 15:03:23 -07:00
Istvan Kiss
67e3fc994b
MI300 documentation ( #2779 )
...
---------
Co-authored-by: Nagy-Egri Máté Ferenc <mate@streamhpc.com >
Co-authored-by: Lisa Delaney <lisa.delaney@amd.com >
Co-authored-by: Davide Teixeira <77169625+daviteix@users.noreply.github.com >
2024-02-20 17:02:36 -07:00
Lisa
8dc8a0eb62
Update docs/conceptual/cmake-packages.rst
...
Co-authored-by: Sam Wu <sam.wu2@amd.com >
2024-02-09 15:58:38 -07:00
Lisa Delaney
122705c1f4
remove broken links
2024-02-09 13:32:06 -07:00
Istvan Kiss
02cc970a75
Update github links to ROCm organization
2024-02-09 17:03:40 +01:00
Lisa
a44f6d1efc
link updates ( #2861 )
2024-02-08 17:24:12 -07:00
randyh62
c9425c6d19
corrections for Issue #2753 ( #2819 )
2024-01-18 09:31:45 -07:00
Lisa
d399b13c88
add keywords ( #2799 )
2024-01-11 14:07:30 -07:00
Sam Wu
7ffc622039
docs(gpu-enabled-mpi.rst): Fix links to 3rd party support matrices ( #2775 )
...
* docs(gpu-enabled-mpi.rst): Fix links to 3rd party support matrices
* docs: Directly link for RST instead of using intersphinx
2024-01-08 16:34:45 -07:00
Lisa
5f9842db8f
link fixes & consistency ( #2761 )
2023-12-20 12:42:15 -07:00
Lisa
bcc8603454
update links, remove windows ( #2706 )
2023-12-14 09:21:50 -07:00
srawat
7889220f04
Mi200 counters ( #2622 )
2023-12-12 11:25:57 -07:00
Lisa
3aa7072fc2
metadata test ( #2656 )
2023-11-30 14:37:12 -07:00
Nagy-Egri Máté Ferenc
3b9cd77b93
Clarify mixing C++ and HIP sources via CMake ( #2618 )
...
* Carify mixing C++ and HIP sources via CMake
* Designate code blocks
* Simplify lang around host-only use of the HIP API
* Remove superfluous wording.
* Note LINKER_LANGUAGE of mixed sources
* Space after code-block
* Single space in code-block
2023-11-29 07:03:44 -07:00
Istvan Kiss
f8446befd2
Remove disable spellchecks of cmake-packages.rst ( #2676 )
2023-11-27 11:17:13 -07:00
Lisa
33f110e354
update ROCm name ( #2660 )
...
* update ROCm name
* update version history page
2023-11-22 10:30:10 -07:00
Saad Rahim (AMD)
9a9cf073b4
spelling check fix ( #2649 )
2023-11-20 10:12:39 -07:00
Lisa
c326a64381
Acronym update ( #2637 )
2023-11-14 08:54:13 -07:00
Lisa
37c48060f7
update release note files ( #2617 )
...
---------
Co-authored-by: Sam Wu <sam.wu2@amd.com >
Co-authored-by: Saad Rahim (AMD) <44449863+saadrahim@users.noreply.github.com >
2023-11-10 15:14:59 -07:00
Lisa Delaney
7585e9b165
merge conflict
2023-10-25 13:52:44 -06:00
Istvan Kiss
2dd6923ab9
Fix warnings ( #2548 )
...
* Fixed most of the warnings
* Temporary fix of copied files links
2023-10-17 07:05:58 -06:00
Lisa
e87dba01c6
ROCm restructuring ( #2521 )
...
Flattened out page structure for improved navigability.
* Change Table of Contents
* Update the install guides for windows and linux
* Removed extraneous index pages
* GPU architecture pages duplicate entries removed
* spack page cleanup
---------
Co-authored-by: Sam Wu <samwu103@amd.com >
Co-authored-by: Saad Rahim (AMD) <44449863+saadrahim@users.noreply.github.com >
2023-10-06 15:42:11 -06:00
Lisa
940d2933ff
Link and formatting fixes ( #2482 )
2023-09-20 09:55:21 -06:00
Saad Rahim
03f78be781
Merge remote-tracking branch 'origin/develop' into 5.7.0-merge-to-develop
2023-09-18 15:29:06 -06:00
Lisa
d0d4eed1a6
Update titles to sentence case ( #2455 )
2023-09-18 12:26:31 -06:00
Nara
006546e9e6
GPU memory model ( #2379 )
2023-09-18 07:16:50 -06:00
Sam Wu
1e92ef9a2d
update using gpu sanitizer ( #2462 )
2023-09-15 09:03:41 -07:00
Lisa
7c5976004f
ROCm A-Z page & link cleanup ( #2450 )
2023-09-13 13:00:50 -06:00
Lisa
890c735f53
site restructure phase 1 - file reorganization ( #2428 )
2023-09-08 10:02:17 -06:00