Sam Wu
a1518ffa94
Merge develop into roc-6.1.x ( #3440 )
...
* Bump rocm-docs-core from 1.4.1 to 1.5.0 in /docs/sphinx (#3396 )
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core ) from 1.4.1 to 1.5.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases )
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md )
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.4.1...v1.5.0 )
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump certifi from 2024.2.2 to 2024.7.4 in /docs/sphinx (#3399 )
Bumps [certifi](https://github.com/certifi/python-certifi ) from 2024.2.2 to 2024.7.4.
- [Commits](https://github.com/certifi/python-certifi/compare/2024.02.02...2024.07.04 )
---
updated-dependencies:
- dependency-name: certifi
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* External CI: build hipBLASLt external dependencies (#3405 )
* External CI: Increase composable_kernel pipeline time limit (#3407 )
* [Changelog/release notes] Fix and add custom templates for autotag script (#3408 )
* Update custom templates
* Add custom templates
* Fix custom template for hipfort
* Fix custom template for hipify
* Fix custom template for rvs
* External CI: Change composable_kernel pipeline to build for specific GPUs with tests and examples (#3412 )
* increase task time limit
* test building CK for multiple architectures
* Update composable_kernel.yml
* Update composable_kernel.yml
* gfx90a build
* gfx941;gfx1100;gfx1030 build
* hipTensor gfx941 build
* hipTensor gfx941 build
* reduce CK timeout to 100 minutes
* change all gfx90a targets to gfx942
* Bump sphinx-reredirects from 0.1.4 to 0.1.5 in /docs/sphinx (#3419 )
Bumps [sphinx-reredirects](https://github.com/documatt/sphinx-reredirects ) from 0.1.4 to 0.1.5.
- [Commits](https://github.com/documatt/sphinx-reredirects/compare/v0.1.4...v0.1.5 )
---
updated-dependencies:
- dependency-name: sphinx-reredirects
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Removed TransferBench from the tools list (#3421 )
* update AI framework image (#3406 )
* update AI framework image
* remove old image
* Update system optimization guides headings (#3422 )
* update headings to system optimization
* update index
* conv tuning-guides.md to rst
* shorten system optimization landing page
* update conf.py
update toc order
add space
* Update docs/how-to/tuning-guides.rst
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* update keywords
* update intro
---------
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
* External CI: move hipBLASLt build directory to ephemeral storage (#3433 )
* build hipblaslt in /mnt instead
* rm checkoutref
* remove debug step
* Update using-gpu-sanitizer.md with new known issues (#3423 )
* External CI: move hipBLASLt to new large disk pool
* Remove unused custom template for ck (#3438 )
* External CI: ROCm nightly builds (#3435 )
* ROCm nightly builds
* remove branch trigger, enable develop
* Remove unused configurations in conf.py (#3444 )
* External CI: Switch all pipeline GPU_TARGETS to gfx942 (#3443 )
* Switch all pipeline gpu targets to gfx942
* Change more pipelines target to gfx942
* set variables for manual testing
* Switch all pipeline gpu targets to gfx942
* Change more pipelines target to gfx942
* set variables for manual testing
* add test pipeline id
* revert test changes
* correct gpu target name
* remove unused flags; change hipSPARSELt target to be gfx942
* Add MI300X tuning guides (#3448 )
* Add MI300X tuning guides
Add mi300x doc (pandoc conversion)
fix headings
add metadata
move images to shared/
move images to shared/
convert tuning-guides.md to rst using pandoc
add mi300x to tuning-guides.rst landing page
update h1s, toc, and landing page
fix spelling
fix fmt
format code blocks
add tensilelite imgs
fix formatting
fix formatting some more
fix formatting
more formatting
spelling
remove --enforce-eager note
satisfy spellcheck linter
more spelling
add fixes from hongxia
fix env var in D5
add fixes to PyTorch inductor section
fix
fix
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update docs/how-to/tuning-guides/mi300x.rst
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update 'torch_compile_debug' suggestion based on Hongxia's feedback
fix PyTorch inductor env vars
minor formatting fixes
Apply suggestions from code review
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
Update vllm path
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
disable numfig in Sphinx configuration
fix formatting and capitalization
add words to wordlist
update index
update wordlist
update optimizing-triton-kernel
convert cards to table
fix link in index.md
add @lpaoletti's feedback
Add system tuning guide
add images
add system section
add os settings and sys management
remove pcie=noats recommendation
reorg
add blurb to developer section
impr formatting
remove windows os from tuning guides pages in conf.py
add suggestions from review
fix typo and link
remove os windows from relevant pages in conf
mi300x
add suggestions from review
fix toc
fix index links
reorg
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
update vLLM vars
Co-authored-by: Hongxia Yang
<62075498+hongxiayang@users.noreply.github.com >
reorganize
add warnings
add text to system tuning
add filler text on index pages
reorg tuning pages
fix links
fix vars
* rm old pages
fix toc
* add suggestions from review
small change
add more suggestions
rewrite intro
* add 'workload tuning philosophy'
* refactor
* fix broken links
* black format conf.py
* simplify cmd and update doc structure
* add higher-level heading for consistency (mi300x.rst)
* add fixes from review
fix url
add fixes
fix formatting
fix fmt
fix hipBLASLt section
change words
fix tensilelite section
fix
fix
fix fmt
* style guide
* fix some formatting
* satisfy spellcheck linter
* update wordlist
* fix bad conflict resolution
---------
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: danielsu-amd <danielsu@amd.com >
Co-authored-by: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com >
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com >
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com >
Co-authored-by: Peter Park <peter.park@amd.com >
Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com >
Co-authored-by: b-sumner <brian.sumner@amd.com >
2024-07-22 15:39:48 -06:00
alexxu-amd
1733902f7c
External CI: Add symlink to lib/llvm folder for ROCmValidationSuite ( #3390 )
...
* add CXX flag
* add CXX flag
* Update ROCmValidationSuite.yml
* Change googletest to libgtest-dev
* Update ROCmValidationSuite.yml
* Update ROCmValidationSuite.yml
* add ROCM_PATH as env var
* add HIP_INC_DIR
* remove manual test variables
* set variables for manual test
* remove CMAKE_CXX_COMPILER flag
* Set link to redirect llvm folder
* correct indentation
* remove manual test variables
* rename task
2024-07-03 17:02:53 -04:00
alexxu-amd
14dbb44056
Update ROCmValidationSuite pipeline according to the latest change ( #3387 )
...
* add CXX flag
* Change googletest to libgtest-dev
* add ROCM_PATH as env var
* add HIP_INC_DIR
2024-07-03 10:32:11 -04:00
danielsu-amd
813972b62b
External CI: Add hipBLAS to MIOpen ( #3386 )
2024-07-02 15:40:44 -04:00
danielsu-amd
5e64d851cb
External CI: Add all current component pipeline IDs ( #3385 )
2024-07-02 13:51:10 -04:00
Brian Harrison
2ee59acf20
External CI: updated MIOpen dependencies
2024-07-02 16:02:49 +00:00
alexxu-amd
325a2fd54c
External CI: Fix a typo from composable_kernel pipeline ( #3373 )
...
* add libdrm-dev lib to CK dependency list
* change INSTANCE_ONLY to INSTANCES_ONLY
2024-06-28 15:39:08 -04:00
Joseph Macaranas
accb1347ea
External CI: Add initial support for rocAL ( #3365 )
2024-06-27 13:58:10 -04:00
alexxu-amd
699b604f00
Add INSTANCE_ONLY cmake flag; change pool to ultra; increase time limit to 3.5hr ( #3275 )
2024-06-27 10:01:43 -04:00
abhimeda
217830fe25
added matrices artifact uploading code from rocSPARSE ( #3356 )
2024-06-25 15:04:52 -04:00
danielsu-amd
8b95ab0a02
External CI: remove redundant rocm-examples build flags ( #3331 )
2024-06-19 13:08:31 -04:00
danielsu-amd
e74245fbe4
External CI: Latest source pipeline for rocm-examples ( #3317 )
2024-06-19 09:59:02 -04:00
Joseph Macaranas
e903ffa952
External CI: Update aqlprofile binary used for rocprofiler ( #3304 )
2024-06-17 14:23:36 -04:00
Joseph Macaranas
923141f300
External CI: Fixes for two repos to work with latest source ( #3293 )
...
With MIOpen now building with latest source on External CI, this unblocked AMDMIGraphX from building with latest source.
Determined rocMLIR also needed to be built with latest source as a dependency.
2024-06-13 11:55:40 -04:00
Joseph Macaranas
13e14363cc
External CI: updated MIOpen dependencies ( #3278 )
2024-06-12 11:23:21 -04:00
Joseph Macaranas
664c047311
External CI: Package rocSPARSE matrices for testers to consume ( #3276 )
2024-06-12 11:22:46 -04:00
alexxu-amd
7a13a6ee86
Merge pull request #3274 from ROCm/amd/alexxu12/fixStagingCI
...
Fix hipTensor build error on develop branch
2024-06-11 11:02:26 -04:00
Joseph Macaranas
ace708935d
External CI: updated rocr_debug_agent dependencies ( #3277 )
2024-06-11 10:59:13 -04:00
alexxu-amd
cff1b2b021
revert changes for manual test
2024-06-11 10:39:28 -04:00
alexxu-amd
d7eacf56e3
adjust variables for manual test
2024-06-11 10:20:54 -04:00
alexxu-amd
bddbc6b444
revert changes to see if the build still fails
2024-06-11 10:07:20 -04:00
alexxu-amd
67f04977fb
Move double dash to parameter for generic use case
2024-06-11 09:53:14 -04:00
alexxu-amd
3c1d39f251
revert changes to rdc
2024-06-10 14:02:57 -04:00
alexxu-amd
93f524586b
revert changes made for manual tests
2024-06-10 14:02:04 -04:00
alexxu-amd
b36de1d3d4
delete space
2024-06-10 13:59:33 -04:00
alexxu-amd
627d38412a
Revert changes to CK
2024-06-10 13:58:44 -04:00
alexxu-amd
1be99075e2
Change thread number to 32
2024-06-10 13:53:23 -04:00
alexxu-amd
05d7992361
change multithread flag
2024-06-10 13:03:53 -04:00
alexxu-amd
98f2e183a2
change pool back to MEDIUM before merge
2024-06-10 11:56:25 -04:00
alexxu-amd
ab1c62464a
change pool to high
2024-06-10 11:38:32 -04:00
alexxu-amd
2e73c56275
Update hipTensor.yml
2024-06-10 11:37:22 -04:00
Joseph Macaranas
f8151b6cb5
rocprofiler-register: Add unit testing ( #3272 )
...
Since this component uses the base pool, does not need GPU for testing and is very quick to run, unit testing can be done within the same job.
2024-06-10 11:29:47 -04:00
alexxu-amd
52bccc1819
add variable declaration
2024-06-10 10:51:38 -04:00
alexxu-amd
2b492056ec
add multithread Flag to build-cmake to allow hipTensor pass -j16
2024-06-10 10:46:33 -04:00
alexxu-amd
b12e5c32ca
Restore hipTensor's original flag, remove GNinja
2024-06-10 10:15:05 -04:00
Joseph Macaranas
8db9220935
External CI: non-interactive apt upgrades ( #3271 )
2024-06-08 22:20:11 -04:00
alexxu-amd
fdd0ed080b
fix a typo
2024-06-07 13:29:14 -04:00
Joseph Macaranas
d3f634ea33
Remove branch filter for aomp pipeline trigger ( #3258 )
...
Previous filter was not triggering this CI pipeline when ROCm-Runtime build was triggered from a pipeline completion trigger of llvm-project.
2024-06-07 11:14:32 -04:00
alexxu-amd
8c3eaa1fda
Update hipTensor.yml
2024-06-06 11:56:08 -04:00
alexxu-amd
acca214a29
Update hipTensor.yml
2024-06-06 11:43:07 -04:00
alexxu-amd
6eb6a5bd90
change compiler from hipcc to amdclang++
2024-06-05 14:14:24 -04:00
abhimeda
bf08674992
Built rccl using latest source code ( #3230 )
2024-06-04 17:50:36 -04:00
alexxu-amd
8826b10b92
Updates cmake flag to run CK with instance_only on all gpu targets
2024-06-04 17:40:48 -04:00
alexxu-amd
a96ec80cb0
Increase timeout limites to a day for CK
2024-06-04 13:05:41 -04:00
alexxu-amd
57506ba947
upgrade pool to HIGH for CK
2024-06-04 11:59:16 -04:00
alexxu-amd
4b67c8725b
change compiler to clang++ and build for instance only
2024-06-04 11:57:18 -04:00
alexxu-amd
258e504595
change pool to medium
2024-06-04 09:52:36 -04:00
alexxu-amd
156215efcc
Upgrade pool to HIGH
2024-06-04 09:38:50 -04:00
alexxu-amd
7c448eec8f
add MI250 target to CK
2024-06-04 09:38:05 -04:00
alexxu-amd
29f9b4ab23
chang gpu target to gfx90a
2024-06-03 15:39:41 -04:00