Commit Graph

97 Commits

Author SHA1 Message Date
Sam Wu
a1518ffa94 Merge develop into roc-6.1.x (#3440)
* Bump rocm-docs-core from 1.4.1 to 1.5.0 in /docs/sphinx (#3396)

Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.4.1 to 1.5.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.4.1...v1.5.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump certifi from 2024.2.2 to 2024.7.4 in /docs/sphinx (#3399)

Bumps [certifi](https://github.com/certifi/python-certifi) from 2024.2.2 to 2024.7.4.
- [Commits](https://github.com/certifi/python-certifi/compare/2024.02.02...2024.07.04)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* External CI: build hipBLASLt external dependencies (#3405)

* External CI: Increase composable_kernel pipeline time limit (#3407)

* [Changelog/release notes] Fix and add custom templates for autotag script (#3408)

* Update custom templates

* Add custom templates

* Fix custom template for hipfort

* Fix custom template for hipify

* Fix custom template for rvs

* External CI: Change composable_kernel pipeline to build for specific GPUs with tests and examples (#3412)

* increase task time limit

* test building CK for multiple architectures

* Update composable_kernel.yml

* Update composable_kernel.yml

* gfx90a build

* gfx941;gfx1100;gfx1030 build

* hipTensor gfx941 build

* hipTensor gfx941 build

* reduce CK timeout to 100 minutes

* change all gfx90a targets to gfx942

* Bump sphinx-reredirects from 0.1.4 to 0.1.5 in /docs/sphinx (#3419)

Bumps [sphinx-reredirects](https://github.com/documatt/sphinx-reredirects) from 0.1.4 to 0.1.5.
- [Commits](https://github.com/documatt/sphinx-reredirects/compare/v0.1.4...v0.1.5)

---
updated-dependencies:
- dependency-name: sphinx-reredirects
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Removed TransferBench from the tools list (#3421)

* update AI framework image (#3406)

* update AI framework image

* remove old image

* Update system optimization guides headings (#3422)

* update headings to system optimization

* update index

* conv tuning-guides.md to rst

* shorten system optimization landing page

* update conf.py

update toc order

add space

* Update docs/how-to/tuning-guides.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* update keywords

* update intro

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* External CI: move hipBLASLt build directory to ephemeral storage (#3433)

* build hipblaslt in /mnt instead

* rm checkoutref

* remove debug step

* Update using-gpu-sanitizer.md with new known issues (#3423)

* External CI: move hipBLASLt to new large disk pool

* Remove unused custom template for ck (#3438)

* External CI: ROCm nightly builds (#3435)

* ROCm nightly builds

* remove branch trigger, enable develop

* Remove unused configurations in conf.py (#3444)

* External CI: Switch all pipeline GPU_TARGETS to gfx942 (#3443)

* Switch all pipeline gpu targets to gfx942

* Change more pipelines target to gfx942

* set variables for manual testing

* Switch all pipeline gpu targets to gfx942

* Change more pipelines target to gfx942

* set variables for manual testing

* add test pipeline id

* revert test changes

* correct gpu target name

* remove unused flags; change hipSPARSELt target to be gfx942

* Add MI300X tuning guides (#3448)

* Add MI300X tuning guides

Add mi300x doc (pandoc conversion)

fix headings

add metadata

move images to shared/

move images to shared/

convert tuning-guides.md to rst using pandoc

add mi300x to tuning-guides.rst landing page

update h1s, toc, and landing page

fix spelling

fix fmt

format code blocks

add tensilelite imgs

fix formatting

fix formatting some more

fix formatting

more formatting

spelling

remove --enforce-eager note

satisfy spellcheck linter

more spelling

add fixes from hongxia

fix env var in D5

add fixes to PyTorch inductor section

fix

fix

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update docs/how-to/tuning-guides/mi300x.rst

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update 'torch_compile_debug' suggestion based on Hongxia's feedback

fix PyTorch inductor env vars

minor formatting fixes

Apply suggestions from code review

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

Update vllm path

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

disable numfig in Sphinx configuration

fix formatting and capitalization

add words to wordlist

update index

update wordlist

update optimizing-triton-kernel

convert cards to table

fix link in index.md

add @lpaoletti's feedback

Add system tuning guide

add images

add system section

add os settings and sys management

remove pcie=noats recommendation

reorg

add blurb to developer section

impr formatting

remove windows os from tuning guides pages in conf.py

add suggestions from review

fix typo and link

remove os windows from relevant pages in conf

mi300x

add suggestions from review

fix toc

fix index links

reorg

update vLLM vars

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

update vLLM vars

Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>

reorganize

add warnings

add text to system tuning

add filler text on index pages

reorg tuning pages

fix links

fix vars

* rm old pages

fix toc

* add suggestions from review

small change

add more suggestions

rewrite intro

* add 'workload tuning philosophy'

* refactor

* fix broken links

* black format conf.py

* simplify cmd and update doc structure

* add higher-level heading for consistency (mi300x.rst)

* add fixes from review

fix url

add fixes

fix formatting

fix fmt

fix hipBLASLt section

change words

fix tensilelite section

fix

fix

fix fmt

* style guide

* fix some formatting

* satisfy spellcheck linter

* update wordlist

* fix bad conflict resolution

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: danielsu-amd <danielsu@amd.com>
Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com>
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com>
Co-authored-by: Peter Park <peter.park@amd.com>
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
Co-authored-by: b-sumner <brian.sumner@amd.com>
2024-07-22 15:39:48 -06:00
alexxu-amd
1733902f7c External CI: Add symlink to lib/llvm folder for ROCmValidationSuite (#3390)
* add CXX flag

* add CXX flag

* Update ROCmValidationSuite.yml

* Change googletest to libgtest-dev

* Update ROCmValidationSuite.yml

* Update ROCmValidationSuite.yml

* add ROCM_PATH as env var

* add HIP_INC_DIR

* remove manual test variables

* set variables for manual test

* remove CMAKE_CXX_COMPILER flag

* Set link to redirect llvm folder

* correct indentation

* remove manual test variables

* rename task
2024-07-03 17:02:53 -04:00
alexxu-amd
14dbb44056 Update ROCmValidationSuite pipeline according to the latest change (#3387)
* add CXX flag

* Change googletest to libgtest-dev

* add ROCM_PATH as env var

* add HIP_INC_DIR
2024-07-03 10:32:11 -04:00
danielsu-amd
813972b62b External CI: Add hipBLAS to MIOpen (#3386) 2024-07-02 15:40:44 -04:00
danielsu-amd
5e64d851cb External CI: Add all current component pipeline IDs (#3385) 2024-07-02 13:51:10 -04:00
Brian Harrison
2ee59acf20 External CI: updated MIOpen dependencies 2024-07-02 16:02:49 +00:00
alexxu-amd
325a2fd54c External CI: Fix a typo from composable_kernel pipeline (#3373)
* add libdrm-dev lib to CK dependency list

* change INSTANCE_ONLY to INSTANCES_ONLY
2024-06-28 15:39:08 -04:00
Joseph Macaranas
accb1347ea External CI: Add initial support for rocAL (#3365) 2024-06-27 13:58:10 -04:00
alexxu-amd
699b604f00 Add INSTANCE_ONLY cmake flag; change pool to ultra; increase time limit to 3.5hr (#3275) 2024-06-27 10:01:43 -04:00
abhimeda
217830fe25 added matrices artifact uploading code from rocSPARSE (#3356) 2024-06-25 15:04:52 -04:00
danielsu-amd
8b95ab0a02 External CI: remove redundant rocm-examples build flags (#3331) 2024-06-19 13:08:31 -04:00
danielsu-amd
e74245fbe4 External CI: Latest source pipeline for rocm-examples (#3317) 2024-06-19 09:59:02 -04:00
Joseph Macaranas
e903ffa952 External CI: Update aqlprofile binary used for rocprofiler (#3304) 2024-06-17 14:23:36 -04:00
Joseph Macaranas
923141f300 External CI: Fixes for two repos to work with latest source (#3293)
With MIOpen now building with latest source on External CI, this unblocked AMDMIGraphX from building with latest source.

Determined rocMLIR also needed to be built with latest source as a dependency.
2024-06-13 11:55:40 -04:00
Joseph Macaranas
13e14363cc External CI: updated MIOpen dependencies (#3278) 2024-06-12 11:23:21 -04:00
Joseph Macaranas
664c047311 External CI: Package rocSPARSE matrices for testers to consume (#3276) 2024-06-12 11:22:46 -04:00
alexxu-amd
7a13a6ee86 Merge pull request #3274 from ROCm/amd/alexxu12/fixStagingCI
Fix hipTensor build error on develop branch
2024-06-11 11:02:26 -04:00
Joseph Macaranas
ace708935d External CI: updated rocr_debug_agent dependencies (#3277) 2024-06-11 10:59:13 -04:00
alexxu-amd
cff1b2b021 revert changes for manual test 2024-06-11 10:39:28 -04:00
alexxu-amd
d7eacf56e3 adjust variables for manual test 2024-06-11 10:20:54 -04:00
alexxu-amd
bddbc6b444 revert changes to see if the build still fails 2024-06-11 10:07:20 -04:00
alexxu-amd
67f04977fb Move double dash to parameter for generic use case 2024-06-11 09:53:14 -04:00
alexxu-amd
3c1d39f251 revert changes to rdc 2024-06-10 14:02:57 -04:00
alexxu-amd
93f524586b revert changes made for manual tests 2024-06-10 14:02:04 -04:00
alexxu-amd
b36de1d3d4 delete space 2024-06-10 13:59:33 -04:00
alexxu-amd
627d38412a Revert changes to CK 2024-06-10 13:58:44 -04:00
alexxu-amd
1be99075e2 Change thread number to 32 2024-06-10 13:53:23 -04:00
alexxu-amd
05d7992361 change multithread flag 2024-06-10 13:03:53 -04:00
alexxu-amd
98f2e183a2 change pool back to MEDIUM before merge 2024-06-10 11:56:25 -04:00
alexxu-amd
ab1c62464a change pool to high 2024-06-10 11:38:32 -04:00
alexxu-amd
2e73c56275 Update hipTensor.yml 2024-06-10 11:37:22 -04:00
Joseph Macaranas
f8151b6cb5 rocprofiler-register: Add unit testing (#3272)
Since this component uses the base pool, does not need GPU for testing and is very quick to run, unit testing can be done within the same job.
2024-06-10 11:29:47 -04:00
alexxu-amd
52bccc1819 add variable declaration 2024-06-10 10:51:38 -04:00
alexxu-amd
2b492056ec add multithread flag to build-cmake to allow hipTensor to pass -j16 2024-06-10 10:46:33 -04:00
alexxu-amd
b12e5c32ca Restore hipTensor's original flag, remove GNinja 2024-06-10 10:15:05 -04:00
Joseph Macaranas
8db9220935 External CI: non-interactive apt upgrades (#3271) 2024-06-08 22:20:11 -04:00
alexxu-amd
fdd0ed080b fix a typo 2024-06-07 13:29:14 -04:00
Joseph Macaranas
d3f634ea33 Remove branch filter for aomp pipeline trigger (#3258)
Previous filter was not triggering this CI pipeline when ROCm-Runtime build was triggered from a pipeline completion trigger of llvm-project.
2024-06-07 11:14:32 -04:00
alexxu-amd
8c3eaa1fda Update hipTensor.yml 2024-06-06 11:56:08 -04:00
alexxu-amd
acca214a29 Update hipTensor.yml 2024-06-06 11:43:07 -04:00
alexxu-amd
6eb6a5bd90 change compiler from hipcc to amdclang++ 2024-06-05 14:14:24 -04:00
abhimeda
bf08674992 Built rccl using latest source code (#3230) 2024-06-04 17:50:36 -04:00
alexxu-amd
8826b10b92 Updates cmake flag to run CK with instance_only on all gpu targets 2024-06-04 17:40:48 -04:00
alexxu-amd
a96ec80cb0 Increase timeout limits to a day for CK 2024-06-04 13:05:41 -04:00
alexxu-amd
57506ba947 upgrade pool to HIGH for CK 2024-06-04 11:59:16 -04:00
alexxu-amd
4b67c8725b change compiler to clang++ and build for instance only 2024-06-04 11:57:18 -04:00
alexxu-amd
258e504595 change pool to medium 2024-06-04 09:52:36 -04:00
alexxu-amd
156215efcc Upgrade pool to HIGH 2024-06-04 09:38:50 -04:00
alexxu-amd
7c448eec8f add MI250 target to CK 2024-06-04 09:38:05 -04:00
alexxu-amd
29f9b4ab23 change gpu target to gfx90a 2024-06-03 15:39:41 -04:00