Commit Graph

28 Commits

Author SHA1 Message Date
amd-hsivasun
99f0875e70 [Ex CI] amdsmi monorepo enablement (#5677)
* [Ex CI] amdsmi monorepo enablement

* Fix amdsmi yaml
2025-11-20 13:52:01 -05:00
Daniel Su
b65996587f [Ex CI] remove ALLOWED_PARTIAL_SUCCEED_BUILDS library variable (#4937) 2025-06-18 12:10:04 -04:00
Daniel Su
9c38a9cf71 [Ex CI] move from almalinux pool to manylinux containers (#4892) 2025-06-06 13:40:03 -04:00
idass1990
1416c355e3 remove trailing whitespaces 2025-06-05 16:42:57 -04:00
idass1990
769eee92bf fixed formatting 2025-06-05 16:38:53 -04:00
idass
1dd4b4230d [External CI] Add support for Ubuntu 24.04 OS 2025-06-05 12:29:09 -04:00
Joseph Macaranas
ff11bd392e [External CI] AlmaLinux 8 Support Refactor (#4869)
GCC Toolset 14 Environment
- source /opt/rh/gcc-toolset-14/enable only lasts for the shell session, so run at the beginning of relevant build and test tasks when the OS is AlmaLinux 8.
- CMake tasks set env to behave as if source /opt/rh/gcc-toolset-14/enable command was run.
- Observed that the built ROCm libraries can either be installed on lib or lib64 directories in this OS profile, so ldconfig step is adjusted to look at additional directories. This won't impact usage in ubuntu22 if the lib64 directories don't exist in the custom ROCm build.
- For the llvm linking step we cannot assume the ROCm lib directory exists, as only ROCm lib64 might be present on the build environment.
- libatomic package was added to the gcc toolset setup.

yaml-based Changes
- base set of dnf packages now defined in an array for dependencies that already come pre-installed on the ubuntu22 VMs.
- Changed format of the job matrix for readability.

New Features
- AlmaLinux 8 pipelines for roctracer and ROCdbgapi.
- roctracer pipeline expanded to support compilation for gfx1030 and gfx1100.
- AlmaLinux 8 llvm-project pipeline now builds flang and flang-rt, so re-enabled for ubuntu 22.04 pipeline as well.

TODO
- Revisit why ninja-build is not used for comgr, device-libs, and hipcc.
2025-06-02 11:10:09 -04:00
Joseph Macaranas
21a4565e02 [External CI] Add support for other OS profile builds, starting with AlmaLinux OS 8.10 (#4823)
- Modified job matrices and templates to support a second OS.
- Included creation of Virtual Machine Scale Sets running AlmaLinux OS 8.10 with GCC toolset 14 to match manylinux 2_28.
- Dependency download algorithm modified so that only a single array of package manager (apt) packages need to be provided as input and then the other package managers have a mapping of equivalent packages.
- Cleaned up python3-pip in the arrays as those should already be on the VMs.
- This will be an iterative process of getting components to build on this OS profile, and starting with the components that don't have interdependencies.
- Highest priority is to get the rocm-libraries working.
2025-05-28 16:12:53 -04:00
Daniel Su
ec36f39665 Ex CI: fix comparison in aggregatePipeline test condition (#4622) 2025-04-14 12:22:29 -04:00
Joseph Macaranas
87b9fac022 External CI: Boilerplate code for aggregate build pipeline (#4496)
- Add knobs to toggle aggregate build options.
- Aggregate build pipeline will pull ROCm dependencies from earlier in the same pipeline.
- Changing build pool of some components for more compute power.
- Deleting deprecated component.
- Add Ninja to dependency compilation in MIOpen.
- Add retries to wget for MIOpen CK build case.

---------

Co-authored-by: Daniel Su <danielsu@amd.com>
2025-04-11 17:13:58 -04:00
Daniel Su
09a3cd9a46 Ex CI: convert job strategy matrices into compiletime parameters (#4553) 2025-04-02 11:43:52 -04:00
Daniel Su
a6232d89f2 Ex CI: add Ninja build gen for 12 components (#4544) 2025-03-28 13:40:57 -04:00
Daniel Su
1d9ecdef44 Ex CI: temporarily change from low pool to base pool (#4463) 2025-03-07 17:15:32 -05:00
Daniel Su
c83677f41c Ex CI: enable gfx90a tests (#4450) 2025-03-06 13:50:12 -05:00
Joseph Macaranas
5fabb45bd1 External CI: Print out artifact download links (#4297)
- Also, fixes to rocprofiler-sdk when incorporating recent features.
- URL encoding algorithm converts trailing '=' in the base64 string to an integer representing the number of those trailing '=' characters.
2025-01-27 09:50:22 -05:00
Joseph Macaranas
9121d6b12a External CI: Remove aotriton from pytorch build (#4290)
- pip update click module for test failures.
- Test results are at 99.8% with these fixes.
- Missing cmake dependency from last PR for ROCR-Runtime
- Missing pkg-config dependency for amdsmi
- Modify PATH to find pip's cmake for rocprofiler-sdk
2025-01-23 10:22:08 -05:00
Joseph Macaranas
5e7b87c4ad External CI: Install cmake on LOW_BUILD_POOL pipelines (#4292) 2025-01-23 00:42:48 -05:00
Joseph Macaranas
ac583e4240 External CI: Install pip on LOW_BUILD_POOL pipelines (#4291)
- pip might not be installed on LOW_BUILD_POOL VMs.
- Ensure it is installed to not fail the 'pip list' step.
2025-01-23 00:23:15 -05:00
Joseph Macaranas
dade3d7642 External CI: Docker Containers for Job Failures (#4288)
- Dynamically write a Dockerfile based on the environment for the failing job.
- Account for additional dependencies that need to be installed and setup.
- Build and push a custom container based on that dynamic Dockerfile to capture that failing environment.
- Documenting additional setup to install Docker on VMSS during provisioning.
2025-01-22 17:31:27 -05:00
Daniel Su
b0bf8915bb External CI: pipeline manifests (#4073) 2024-12-02 11:08:16 -05:00
Daniel Su
bc7ab7707d External CI: move gpu-diag directly before tests (#3943) 2024-10-25 11:23:23 -04:00
Daniel Su
55239688b4 External CI: enumerate GPUs in gpu-diagnostics (#3942) 2024-10-24 16:56:12 -04:00
Daniel Su
fd2f2dc77a External CI: add support to disable individual component tests (#3938) 2024-10-23 14:48:34 -04:00
Daniel Su
93be8ca70f External CI: create a GPU diagnostics template (#3932) 2024-10-22 09:33:35 -04:00
Daniel Su
f72e28afbb External CI: add a global variable to control gfx942 tests (#3864) 2024-10-04 18:57:19 -04:00
Daniel Su
26e6732e74 External CI: integrate first batch of component tests (#3482) 2024-08-29 13:50:39 -04:00
Joseph Macaranas
d18b2339de Add build pools, change gpu targets, and build tests (#3115)
To best determine hardware specs per repo, added more build pool options with varying number of vCPUs, RAM size, etc. and will kick off builds with test targets enabled to determine long-term cost values.

Co-authored-by: alexxu-amd <alexxu12@amd.com>
2024-05-15 11:38:49 -04:00
Joseph Macaranas
8f4237df61 More external CI pipelines (#3083)
Changing default behaviour for PRs with tag-builds.
Changing build system for some jobs based on execution time.

Co-authored-by: abhimeda <abhinav.meda@amd.com>
Co-authored-by: alexxu-amd <alex.xu@amd.com>
2024-05-04 01:14:26 -04:00