Commit Graph

327 Commits

Author SHA1 Message Date
darren-amd
dc648ad764 External CI: Disable rocprof-system openmp target example (#4166) 2024-12-16 14:38:55 -05:00
Joseph Macaranas
bacb49681e External CI: Typo in rocPyDecode Pipeline Parameter (#4157) 2024-12-13 14:35:35 -05:00
darren-amd
1b33f1d7da External CI: llvm project comgr disable spirv (#4154)
External CI: add flag to disable SPIRV from comgr build in llvm-project
2024-12-13 10:51:36 -05:00
Joseph Macaranas
fd067f7b3b External CI: HIP shared library symlinks for rocPyDecode (#4153)
- Modifying the HIP shared libraries installed to follow the Linux symbolic link convention resolves test failures in rocPyDecode.
2024-12-13 09:58:25 -05:00
Joseph Macaranas
5271c2c82d External CI: MIOpen Test Parameters (#4148)
- Exclude lone, consistently failing MIOpen test.
- test_rnn_seq_api is the only ctest failure, so let's filter it out for now to easily identify new failures.
2024-12-11 13:31:12 -05:00
darren-amd
3dffe1998a External CI: add aomp dependency to rocprofiler-sdk (#4135)
External CI: Add aomp dependency for rocprofiler-sdk
2024-12-06 16:20:09 -05:00
Daniel Su
73e21c82c0 External CI: finalize rocJPEG enablement (#4125) 2024-12-06 11:47:45 -05:00
Daniel Su
788796bfe1 External CI: create pipeline files for rocJPEG (#4117) 2024-12-05 16:17:42 -05:00
Daniel Su
922209e5c9 External CI: change rocm-core staging branch to master (#4115) 2024-12-05 14:45:38 -05:00
Daniel Su
3690edc79c External CI: give manifest files same naming scheme as build artifacts (#4100) 2024-12-04 17:45:31 -05:00
Daniel Su
6d8fd0cf85 External CI: remove omniperf files again (#4096) 2024-12-04 10:06:13 -05:00
Sam Wu
f77e2dd7a7 Sync develop branch (#4078) 2024-12-03 15:18:51 -07:00
Daniel Su
ca6c6d8464 External CI: AOMP symlink fix and manifest additions (#4077) 2024-12-03 14:06:41 -05:00
Daniel Su
4c7781a809 External CI: AOMP rename flang/llvm-legacy to flang/llvm-classic (#4076) 2024-12-03 10:17:27 -05:00
Daniel Su
6fa23decf5 External CI: install jq in manifest template (#4075) 2024-12-02 14:38:57 -05:00
Daniel Su
30cce6094d External CI: fix manifests when triggered by component repo (#4074) 2024-12-02 13:12:24 -05:00
Daniel Su
b0bf8915bb External CI: pipeline manifests (#4073) 2024-12-02 11:08:16 -05:00
Joseph Macaranas
f3bd764cf9 External CI: aomp and rocWMMA build fixes (#4069)
- aomp: Account for path changes due to LLVM_INSTALL_LOC from aomp PR #1012
- aomp: Add llvm-legacy build script step for aomp PR #1062
- rocWMMA: Fix rpath issue when using ninja.
2024-12-02 10:29:43 -05:00
Daniel Su
01f2d756a7 External CI: set MIOpen dependencies install path (#4060) 2024-11-28 10:12:07 -05:00
Daniel Su
a3b316dd41 External CI: ROCR build release, hipBLAS test fix (#4058) 2024-11-26 16:55:05 -05:00
Daniel Su
a44d1d8187 External CI: rocprofiler-compute test fixes (#4057) 2024-11-26 13:50:34 -05:00
Daniel Su
75c2943831 External CI: add ROCM_PATH to rocprofiler-systems/compute (#4052) 2024-11-22 16:59:29 -05:00
Daniel Su
9ef3bc00f2 External CI: use MIOpen's requirements.txt (#4050) 2024-11-22 10:51:53 -05:00
Kiriti Gowda
0346076cd8 MIVisionX - Updates (#4044)
Add OpenMP dev package
2024-11-21 14:41:12 -05:00
Daniel Su
a4030f4ed9 External CI: add new RDC dependencies (#4029) 2024-11-14 11:49:17 -05:00
Daniel Su
0347e914f1 External CI: increase Tensile test timeout to 90 mins (#4027) 2024-11-12 15:39:48 -05:00
Daniel Su
b548d2bf2c External CI: fix comgr build (#4025) 2024-11-11 11:34:37 -05:00
Daniel Su
aac0f87c03 External CI: disable rdmatest and rocrtstFunc.Memory_Max_Mem (#4016) 2024-11-08 09:21:08 -05:00
Daniel Su
80741cf81d External CI: allow test failures to present as failures on Github (#3993) 2024-11-07 14:26:07 -05:00
Daniel Su
d1e1b53f87 External CI: remove omniperf from nightly (#4000) 2024-11-06 15:10:49 -05:00
Daniel Su
d80747b432 External CI: add aomp as rocprofiler-systems dependency (#3996) 2024-11-06 09:13:26 -05:00
Daniel Su
3fef03bc61 External CI: finish rocprofiler-compute enablement (#3995) 2024-11-06 09:13:17 -05:00
Joseph Macaranas
1356cf8a71 External CI: decode test changes (#3983)
- Only target container with access to first device
- Ensure pybind11-dev is uninstalled before the package manager install steps
2024-11-01 12:57:29 -04:00
Daniel Su
37521226ab External CI: add roctracer to rocBLAS deps (#3982) 2024-11-01 11:31:32 -04:00
Joseph Macaranas
92a0020d4c External CI: Change test setup for rocPyDecode (#3978)
- Use multiple potential locations for pybind11 to be found by cmake.
2024-10-31 09:53:20 -04:00
Daniel Su
b233053e54 External CI: disable reload AMDGPU (#3974) 2024-10-30 17:36:45 -04:00
Daniel Su
8f025d6321 External CI: rocprofiler-compute pipeline files (#3973) 2024-10-30 17:36:30 -04:00
Daniel Su
750484ab57 External CI: suppress GPU diag warnings (#3972) 2024-10-30 16:29:02 -04:00
Daniel Su
75f7dfbac3 External CI: move allowPartiallySucceededBuilds to library variable (#3970) 2024-10-30 16:16:03 -04:00
Daniel Su
21c79587b1 External CI: various fixes (#3963) 2024-10-30 13:27:14 -04:00
Joseph Macaranas
c6dcba1fcc External CI: Add aqlprofile to Tensile test dependencies (#3961) 2024-10-30 13:14:05 -04:00
Joseph Macaranas
2960fbbbd6 External CI: Always force rocPyDecode cleanup step 2024-10-30 10:50:44 -04:00
Joseph Macaranas
0298a79c97 External CI: do not assume python is python3 for rocpydecode (#3955) 2024-10-29 13:28:43 -04:00
Joseph Macaranas
ae2b197fc8 External CI: rocpydecode dependency installation change (#3954)
- Install pybind11 through pip instead of apt
- Add pip-installed pybind11 path to CMAKE_PREFIX_PATH
- Tested against source of PR 122
2024-10-29 11:01:54 -04:00
Joseph Macaranas
bce439ecac External CI: pytorch pipeline updates (#3948)
To support recent upstream changes and issues observed.
2024-10-25 17:17:16 -04:00
alexxu-amd
8e3d51c31d Sync change from ROCm to ROCm-internal (#194)
* Fix Radeon link and point at R6.1.3 as absolute link (#3757)

* Update ROCm manifest to 6.2.1

* Update ROCm branch name

* Add 6.2.1 to version list (#3770)

* Add links to GH issues in 6.2.1 release notes (#3769)

* add MAD page

* link to GitHub issues in release notes known issues

* update templates for 6.2.1

* Revert "add MAD page"

This reverts commit 9cce72bba3.

* update wordlist for spellcheck linter

* add rccl note

* update rocal version change heading to be more obvious

* make rocal note more specific

* fix missing space

* fix capitalization

* Update RCCL known issue wording (#3775)

* add MAD page

* fix wording in RCCL known issue

* Revert "add MAD page"

This reverts commit c81d0f3b0a.

* update llvm version for 6.2.1 (#3779)

* Fix broken links in 6.2.1 release notes (#3782)

* External CI: Replace libomp dependencies with aomp (#3781)

Add roctracer dependency for hipBLAS and rocWMMA testing

* External CI: Add rocprofiler v1 and v2 smoke tests (#3784)

* External CI: ROCgdb smoke tests (#3785)

- Since this is an autotools project and not cmake, build and test on gfx942 system instead of separating into two jobs. Pipeline time is short anyway.
- Follow build instructions to update build flags and to incorporate the ROCdbgapi.
- Results are not parsed and graphed, but the log contents are printed at the end. This was helpful for debugging and will be kept in the pipeline, as the make check-gdb command's output was not helpful on its own.

* External CI: rocPyDecode Smoke Test (#3786)

* External CI: omniperf pipeline (#3788)

- Referred to public documentation, source, and iterative attempts to create and improve build and test pipeline.
- ctest failures are due to the test node not having expected marketing name string and override not working.
- The fix should be on the omniperf repo side of things, so this pull request should be fine as is.

* External CI: create omniperf pipeline IDs, update nightly build (#3790)

* Fixed greater than to be less than in rocFFT changes

* fix footnote for 6.1.0 (#3791)

* fix footnote for 6.1.0

* fix empty columns in historical KFD title

* External CI: Publish wheel as artifact for rocPyDecode (#3796)

* External CI: fix hip-tests symlink creation (#3799)

* Docs: Add Ubuntu 24.04.1 (#3801)

* add ubuntu 24.04.1

* add 24.04.1 to bottom os section

* fix heading and template

* Update compatibility-matrix.rst for OpenMP version

* Update compatibility-matrix-historical-6.0.csv for OpenMP version

* rm ubuntu 24.04.1 from 6.2.0

* Update docs/compatibility/compatibility-matrix.rst

Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com>

* rm duplicate ubuntu in historical

---------

Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com>

* External CI: fixes for rocMLIR and nightly build (#3800)

* External CI: fix symlinks for rocMLIR and nightly build

* add pipeline IDs for hip-tests

* fix hip-test ID typo

* remove llvm-alt license (#3727)

* remove llvm-alt license

* fix linting error

* External CI: enable ROCR-Runtime tests (#3809)

* External CI: default branches for hip-tests, omniperf (#3811)

* External CI: torch and torchvision smoke tests (#3810)

* External CI: torch and torchvision smoke tests

- Fixed issues with package name and version for the vision wheel that prevented it from installing. A patch is used until my pull request in vision repo is merged.
- Referred to rocAutomation scripts to pick which test scripts to run out of the many in the torch and vision repo, and iteratively tested suggested scripts to see which ones completed in a timely manner.
- Leveraging pytest-azurepipelines module to automatically parse and graph results from these tests.

* External CI: omnitrace build pipeline (#3812)

* External CI: omnitrace build pipeline starter

- Adding initial set of dependencies and build flags.

* External CI: omnitrace build pipeline

- Add bison, rccl, texinfo dependencies based on build failures.
- Add AMDGPU_TARGETS flag
- Add ROCm binaries to PATH for clang-format and other tools used.

* Fix indentation

---------

Co-authored-by: Daniel Su <danielsu@amd.com>

* External CI: AMDMIGraphX Build Fix (#3814)

- Swap to default gcc on OS to resolve build errors from recent commits.
- Added libdnnl-dev dependency from iterative attempts with compiler change.
- Referred to the passing GitHub checks to observe the compilers that was used.
- Build CK jit lib and include in AMDMIGraphX build.

* External CI: test fixes w/ roctracer, list omniperf as partially succeeding (#3815)

* External CI: rpp tests (#3816)

* External CI: Build pipeline for rocprofiler-sdk (#3819)

* External CI: Pipeline for rocprofiler-sdk

* Add rocprofiler dependency

* External CI: rocprofiler-sdk build pipeline

---------

Co-authored-by: Daniel Su <danielsu@amd.com>

* External CI: Fix/add missing pipeline IDs (#3818)

* External CI: omnitrace tests (#3822)

* Update tags to 6.2.2 (#3827)

* External CI: add roctracer to roc/hipSOLVER test deps (#3825)

* External CI: add rocprofiler-sdk pipeline IDs (#3824)

* External CI: AMDMIGraphX Smoke Tests (#3830)

Co-authored-by: Daniel Su <danielsu@amd.com>

* External CI: MIOpen tests (#3837)

* Point to release history instead of deprecated changelog (#3836)

* External CI: filter out hipTensor extended tests (#3838)

* added revised note re. radeon gpus (#3839)

* Restructured the contributions section. (#3715)

* testing if this file is editable

* changed 'kebob-case' to 'dash-case'

* Restructured the page to be more straightforward and provide additional repo information

* forgot to save

* Moved the topic sentence

* Wrong accent on the a in diataxis

* Removed the feedback info from contributing and moved it to Feedback

* fixed spelling errors

* fixed some wording and removed second person text

* consolidated Build and Structure into Contribute; edited toolchai to (hopefully) conform to style guide; updated toc

* updated the titles in the toc

* made changes based on feedback

* it's better when you save

* removed structure and build; fixed something for the linter

* added rst to wordlist

* added customizations to wordlist

* Add links to gpu cluster network guides (#3763)

* Add links to gpu cluster network guides

* Add newline character to eof

* Make link absolute

* add dynamic branch in toc

* remove unnecessary page

clean up

* clean up index/toc

* make multi-node topics adjacent

---------

Co-authored-by: Peter Park <peter.park@amd.com>

* updated the radeon note (#3850)

* External CI: Fix rocPyDecode wheel creation (#3852)

- Set values for expected environment variables.
- Accompanying changes required in rocPyDecode repo. Pull request will be made.

* External CI: pytorch vision patch removal (#3855)

My pull request applying this patch was merged upstream, so this is no longer needed and will break the pipeline since it can no longer be applied.

* Build(deps): Bump rocm-docs-core from 1.8.1 to 1.8.2 in /docs/sphinx (#3807)

Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.8.1 to 1.8.2.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.8.2/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.8.1...v1.8.2)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* updated the radeon note, as it were (#3857)

* updated the radeon note, as it were

* updated the note again

* Set devops team as codeowners for rocm-build (#3860)

* Set ext CI as codeowners for rocm-build

* Update CODEOWNERS to rocm-devops

* External CI: Add option to pull mainline branch for dependencies (#3689)

* External CI: Add option to pull mainline branch for dependencies

* Missing parameter for mainline branch dependencies.

* External CI: mainline branch definitions

* Removed MIGraphX optimization page (#3848)

* External CI: add a global variable to control gfx942 tests (#3864)

* External CI: update component default/mainline branches (#3871)

* External CI: Stop building gfx90a (#3872)

Save on VM resources until infrastructure has test targets.

* External CI: add libstdc++-12 to rocMLIR (#3874)

* Add building doc section (#3873)

* External CI: programmatically get latest aqlprofile (#3876)

* External CI: use ctest for rocm-examples (#3877)

* External CI: Tensile pipeline (#3884)

* add oversubscription conceptual doc (#3885)

add mitigiation steps

add to toc

move page for build

move doc

fix spelling

update doc

update oversubscription

update order

fix spelling

add oversubscription to wordlist

move oversubscription topic to bottom of toc and index

* add oversubscription conceptual doc (#3885)

(cherry picked from commit d0ecf51b0c)

* External CI: Add pipeline to build upstream boost (#3896)

* Update bitsandbytes branch in docs (#3898)

* Documentation: Add reference to precision-support floating-point types (#3899)

* External CI: use Boost template for MIOpen (#3903)

* External CI: create rocprofiler-systems pipeline (#3906)

* External CI: omnitrace/rocprof-sys pipeline IDs (#3908)

* External CI: MIOpen parse test results (#3913)

* External CI: Use pip to install latest cmake on test system (#3915)

* added a link to the compatibility matrix (#3904)

* added a link to the compatibility matrix

* removed quotes

* docs: Remove invalid amd_iommu=on parameter

Per kernel-parameters.txt, there is no "on" option for amd_iommu. While
intel_iommu has it, amd_iommu is automatically on unless specified
otherwise. For more info, see these 2 links:

https://www.kernel.org/doc/Documentation/admin-guide/kernel-parameters.txt
75aa74d52f/drivers/iommu/amd/init.c (L3481)

Signed-off-by: Kent Russell <kent.russell@amd.com>

* External CI: hipBLASLt build now requires python packaging module (#3926)

https://github.com/ROCm/hipBLASLt/pull/1250/files#diff-fee2e6f068b33fca3a1dc49392de8848dbf05c3f4632b680abb1052523e5a30fR35

* External CI: Moved location of upstream pytorch build scripts (#3930)

https://github.com/pytorch/pytorch/pull/138103

* External CI: disable rocMLIR tests (#3931)

* External CI: disable rocMLIR tests

* roctracer AMDGPU_TARGETS flag

* External CI: create a GPU diagnostics template (#3932)

* External CI: Add CK into pytorch build environment (#3934)

* External CI: add support to disable individual component tests (#3938)

* External CI: AMDMIGraphX greater-equal pip dependencies (#3939)

* Build(deps): Bump rocm-docs-core from 1.8.2 to 1.8.3 in /docs/sphinx (#3933)

Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.8.2 to 1.8.3.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.8.2...v1.8.3)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* External CI: rocDecode add libva-amdgpu-dev dependency (#3940)

* External CI: enumerate GPUs in gpu-diagnostics (#3942)

* External CI: move gpu-diag directly before tests (#3943)

* External CI: fix HIP_PIPELINE_ID (#3944)

---------

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: Kent Russell <kent.russell@amd.com>
Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>
Co-authored-by: Sam Wu <22262939+samjwu@users.noreply.github.com>
Co-authored-by: Wang, Yanyao <yanyao.wang@amd.com>
Co-authored-by: Yanyao Wang <yanywang@amd.com>
Co-authored-by: Peter Park <peter.park@amd.com>
Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com>
Co-authored-by: Joseph Macaranas <145489236+amd-jmacaran@users.noreply.github.com>
Co-authored-by: Daniel Su <danielsu@amd.com>
Co-authored-by: Sandra Polifroni <sandra.polifroni@amd.com>
Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com>
Co-authored-by: Michael Benavidez <michael.benavidez@amd.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: MKKnorr <MKKnorr@web.de>
Co-authored-by: Kent Russell <kent.russell@amd.com>
Co-authored-by: Joseph Greathouse <jlgreathouse@users.noreply.github.com>
2024-10-25 14:41:40 -04:00
Daniel Su
9fdd785979 External CI: fix HIP_PIPELINE_ID (#3944) 2024-10-25 11:23:32 -04:00
Daniel Su
bc7ab7707d External CI: move gpu-diag directly before tests (#3943) 2024-10-25 11:23:23 -04:00
Daniel Su
55239688b4 External CI: enumerate GPUs in gpu-diagnostics (#3942) 2024-10-24 16:56:12 -04:00
Daniel Su
d90775066f External CI: rocDecode add libva-amdgpu-dev dependency (#3940) 2024-10-24 12:06:51 -04:00