From e183eee87bf126c6c68b16e7235616846168378c Mon Sep 17 00:00:00 2001 From: alexxu-amd <159800977+alexxu-amd@users.noreply.github.com> Date: Wed, 6 Nov 2024 16:42:28 -0500 Subject: [PATCH] Sync from internal develop 6.2.4 (#4002) * add radeon pro v710 to gpu arch specs (#192) * Add V710 specs gpg: using RSA key 22223038B47B3ED4B3355AB11B54779B4780494E gpg: Good signature from "Peter Park (MKMPETEPARK01) " [ultimate] add some specs add cols clean up extra line * fix graphics l1 cache description * update SGPR for RDNA2 and RDNA3 archs * update VGPR * Apply suggestions from code review * change l2 cache to 4 * Update docs/reference/gpu-arch-specs.rst * ROCm 6.2.4 compatibility matrix (#186) * prep compat column (historical) and mi300x column * update historical compat matrix for 6.2.4 * update compat matrix for 6.2.4 * fix compat * fix thunk version * fix hipify ver * ROCm 6.2.4 release notes (#184) * prep 6.2.4 release notes * add mathlibs * add detail component changes * rm non-updated linnks * fix sentence * fix rocthrust v * rm offline installer * condense * add leo/ram fdback words * update documentation section * add rocm on radeon note * update os support note wording * update release * update version and GA date to 10-17 * update 6.2.4 rn * update wording * add link to v710 * update wording * update templ * simplify note * words os note words * change URLs to latest * update link to supported GPUs * Update versions.md 6.2.4 date to Oct 18 * Update conf.py release note date to Oct 18 --------- Co-authored-by: Sam Wu <22262939+samjwu@users.noreply.github.com> * Sync change from ROCm to ROCm-internal (#194) * Fix Radeon link and point at R6.1.3 as absolute link (#3757) * Update ROCm manifest to 6.2.1 * Update ROCm branch name * Add 6.2.1 to version list (#3770) * Add links to GH issues in 6.2.1 release notes (#3769) * add MAD page * link to GitHub issues in release notes known issues * update templates for 6.2.1 * Revert "add MAD page" This reverts commit 9cce72bba306286c7eb317d592645d4e0e1b27aa. * update wordlist for spellcheck linter * add rccl note * update rocal version change heading to be more obvious * make rocal note more specific * fix missing space * fix capitalization * Update RCCL known issue wording (#3775) * add MAD page * fix wording in RCCL known issue * Revert "add MAD page" This reverts commit c81d0f3b0a3620305b11de8745686c86b060b006. * update llvm version for 6.2.1 (#3779) * Fix broken links in 6.2.1 release notes (#3782) * External CI: Replace libomp dependencies with aomp (#3781) Add roctracer dependency for hipBLAS and rocWMMA testing * External CI: Add rocprofiler v1 and v2 smoke tests (#3784) * External CI: ROCgdb smoke tests (#3785) - Since this is an autotools project and not cmake, build and test on gfx942 system instead of separating into two jobs. Pipeline time is short anyway. - Follow build instructions to update build flags and to incorporate the ROCdbgapi. - Results are not parsed and graphed, but the log contents are printed at the end. This was helpful for debugging and will be kept in the pipeline, as the make check-gdb command's output was not helpful on its own. * External CI: rocPyDecode Smoke Test (#3786) * External CI: omniperf pipeline (#3788) - Referred to public documentation, source, and iterative attempts to create and improve build and test pipeline. - ctest failures are due to the test node not having expected marketing name string and override not working. - The fix should be on the omniperf repo side of things, so this pull request should be fine as is. * External CI: create omniperf pipeline IDs, update nightly build (#3790) * Fixed greater than to be less than in rocFFT changes * fix footnote for 6.1.0 (#3791) * fix footnote for 6.1.0 * fix empty columns in historical KFD title * External CI: Publish wheel as artifact for rocPyDecode (#3796) * External CI: fix hip-tests symlink creation (#3799) * Docs: Add Ubuntu 24.04.1 (#3801) * add ubuntu 24.04.1 * add 24.04.1 to bottom os section * fix heading and template * Update compatibility-matrix.rst for OpenMP version * Update compatibility-matrix-historical-6.0.csv for OpenMP version * rm ubuntu 24.04.1 from 6.2.0 * Update docs/compatibility/compatibility-matrix.rst Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com> * rm duplicate ubuntu in historical --------- Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com> * External CI: fixes for rocMLIR and nightly build (#3800) * External CI: fix symlinks for rocMLIR and nightly build * add pipeline IDs for hip-tests * fix hip-test ID typo * remove llvm-alt license (#3727) * remove llvm-alt license * fix linting error * External CI: enable ROCR-Runtime tests (#3809) * External CI: default branches for hip-tests, omniperf (#3811) * External CI: torch and torchvision smoke tests (#3810) * External CI: torch and torchvision smoke tests - Fixed issues with package name and version for the vision wheel that prevented it from installing. A patch is used until my pull request in vision repo is merged. - Referred to rocAutomation scripts to pick which test scripts to run out of the many in the torch and vision repo, and iteratively tested suggested scripts to see which ones completed in a timely manner. - Leveraging pytest-azurepipelines module to automatically parse and graph results from these tests. * External CI: omnitrace build pipeline (#3812) * External CI: omnitrace build pipeline starter - Adding initial set of dependencies and build flags. * External CI: omnitrace build pipeline - Add bison, rccl, texinfo dependencies based on build failures. - Add AMDGPU_TARGETS flag - Add ROCm binaries to PATH for clang-format and other tools used. * Fix indentation --------- Co-authored-by: Daniel Su * External CI: AMDMIGraphX Build Fix (#3814) - Swap to default gcc on OS to resolve build errors from recent commits. - Added libdnnl-dev dependency from iterative attempts with compiler change. - Referred to the passing GitHub checks to observe the compilers that was used. - Build CK jit lib and include in AMDMIGraphX build. * External CI: test fixes w/ roctracer, list omniperf as partially succeeding (#3815) * External CI: rpp tests (#3816) * External CI: Build pipeline for rocprofiler-sdk (#3819) * External CI: Pipeline for rocprofiler-sdk * Add rocprofiler dependency * External CI: rocprofiler-sdk build pipeline --------- Co-authored-by: Daniel Su * External CI: Fix/add missing pipeline IDs (#3818) * External CI: omnitrace tests (#3822) * Update tags to 6.2.2 (#3827) * External CI: add roctracer to roc/hipSOLVER test deps (#3825) * External CI: add rocprofiler-sdk pipeline IDs (#3824) * External CI: AMDMIGraphX Smoke Tests (#3830) Co-authored-by: Daniel Su * External CI: MIOpen tests (#3837) * Point to release history instead of deprecated changelog (#3836) * External CI: filter out hipTensor extended tests (#3838) * added revised note re. radeon gpus (#3839) * Restructured the contributions section. (#3715) * testing if this file is editable * changed 'kebob-case' to 'dash-case' * Restructured the page to be more straightforward and provide additional repo information * forgot to save * Moved the topic sentence * Wrong accent on the a in diataxis * Removed the feedback info from contributing and moved it to Feedback * fixed spelling errors * fixed some wording and removed second person text * consolidated Build and Structure into Contribute; edited toolchai to (hopefully) conform to style guide; updated toc * updated the titles in the toc * made changes based on feedback * it's better when you save * removed structure and build; fixed something for the linter * added rst to wordlist * added customizations to wordlist * Add links to gpu cluster network guides (#3763) * Add links to gpu cluster network guides * Add newline character to eof * Make link absolute * add dynamic branch in toc * remove unnecessary page clean up * clean up index/toc * make multi-node topics adjacent --------- Co-authored-by: Peter Park * updated the radeon note (#3850) * External CI: Fix rocPyDecode wheel creation (#3852) - Set values for expected environment variables. - Accompanying changes required in rocPyDecode repo. Pull request will be made. * External CI: pytorch vision patch removal (#3855) My pull request applying this patch was merged upstream, so this is no longer needed and will break the pipeline since it can no longer be applied. * Build(deps): Bump rocm-docs-core from 1.8.1 to 1.8.2 in /docs/sphinx (#3807) Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.8.1 to 1.8.2. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.8.2/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.8.1...v1.8.2) --- updated-dependencies: - dependency-name: rocm-docs-core dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * updated the radeon note, as it were (#3857) * updated the radeon note, as it were * updated the note again * Set devops team as codeowners for rocm-build (#3860) * Set ext CI as codeowners for rocm-build * Update CODEOWNERS to rocm-devops * External CI: Add option to pull mainline branch for dependencies (#3689) * External CI: Add option to pull mainline branch for dependencies * Missing parameter for mainline branch dependencies. * External CI: mainline branch definitions * Removed MIGraphX optimization page (#3848) * External CI: add a global variable to control gfx942 tests (#3864) * External CI: update component default/mainline branches (#3871) * External CI: Stop building gfx90a (#3872) Save on VM resources until infrastructure has test targets. * External CI: add libstdc++-12 to rocMLIR (#3874) * Add building doc section (#3873) * External CI: programmatically get latest aqlprofile (#3876) * External CI: use ctest for rocm-examples (#3877) * External CI: Tensile pipeline (#3884) * add oversubscription conceptual doc (#3885) add mitigiation steps add to toc move page for build move doc fix spelling update doc update oversubscription update order fix spelling add oversubscription to wordlist move oversubscription topic to bottom of toc and index * add oversubscription conceptual doc (#3885) (cherry picked from commit d0ecf51b0c9202475e2abe90a45b50df0de6d7ae) * External CI: Add pipeline to build upstream boost (#3896) * Update bitsandbytes branch in docs (#3898) * Documentation: Add reference to precision-support floating-point types (#3899) * External CI: use Boost template for MIOpen (#3903) * External CI: create rocprofiler-systems pipeline (#3906) * External CI: omnitrace/rocprof-sys pipeline IDs (#3908) * External CI: MIOpen parse test results (#3913) * External CI: Use pip to install latest cmake on test system (#3915) * added a link to the compatibility matrix (#3904) * added a link to the compatibility matrix * removed quotes * docs: Remove invalid amd_iommu=on parameter Per kernel-parameters.txt, there is no "on" option for amd_iommu. While intel_iommu has it, amd_iommu is automatically on unless specified otherwise. For more info, see these 2 links: https://www.kernel.org/doc/Documentation/admin-guide/kernel-parameters.txt https://github.com/torvalds/linux/blob/75aa74d52f43e75d0beb20572f98529071b700e5/drivers/iommu/amd/init.c#L3481 Signed-off-by: Kent Russell * External CI: hipBLASLt build now requires python packaging module (#3926) https://github.com/ROCm/hipBLASLt/pull/1250/files#diff-fee2e6f068b33fca3a1dc49392de8848dbf05c3f4632b680abb1052523e5a30fR35 * External CI: Moved location of upstream pytorch build scripts (#3930) https://github.com/pytorch/pytorch/pull/138103 * External CI: disable rocMLIR tests (#3931) * External CI: disable rocMLIR tests * roctracer AMDGPU_TARGETS flag * External CI: create a GPU diagnostics template (#3932) * External CI: Add CK into pytorch build environment (#3934) * External CI: add support to disable individual component tests (#3938) * External CI: AMDMIGraphX greater-equal pip dependencies (#3939) * Build(deps): Bump rocm-docs-core from 1.8.2 to 1.8.3 in /docs/sphinx (#3933) Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.8.2 to 1.8.3. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.8.2...v1.8.3) --- updated-dependencies: - dependency-name: rocm-docs-core dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * External CI: rocDecode add libva-amdgpu-dev dependency (#3940) * External CI: enumerate GPUs in gpu-diagnostics (#3942) * External CI: move gpu-diag directly before tests (#3943) * External CI: fix HIP_PIPELINE_ID (#3944) --------- Signed-off-by: dependabot[bot] Signed-off-by: Kent Russell Co-authored-by: Jeffrey Novotny Co-authored-by: Sam Wu <22262939+samjwu@users.noreply.github.com> Co-authored-by: Wang, Yanyao Co-authored-by: Yanyao Wang Co-authored-by: Peter Park Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com> Co-authored-by: Joseph Macaranas <145489236+amd-jmacaran@users.noreply.github.com> Co-authored-by: Daniel Su Co-authored-by: Sandra Polifroni Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com> Co-authored-by: Michael Benavidez Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: MKKnorr Co-authored-by: Kent Russell Co-authored-by: Joseph Greathouse * 6.2.4 release notes: add known/fixed issues (#193) * add "for compute workloads" wording for clarity * add AMDSMI resolved issue * add dlm known issue intro text wording * update wording rm bullet point update wording * fix spellcheck due to spacing * rm s * rm gfx1151 * remove dlm known issue * update list of updated docs; note for Radeon users fmt * update GA date for 6.2.4 * fix rdc version * fix RDC version strings (#196) * revert outdataed change for .azuredevops * Fix 6.2.4 date in versions.md Co-authored-by: Sam Wu <22262939+samjwu@users.noreply.github.com> --------- Signed-off-by: dependabot[bot] Signed-off-by: Kent Russell Co-authored-by: Peter Park Co-authored-by: Sam Wu <22262939+samjwu@users.noreply.github.com> Co-authored-by: Jeffrey Novotny Co-authored-by: Wang, Yanyao Co-authored-by: Yanyao Wang Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com> Co-authored-by: Joseph Macaranas <145489236+amd-jmacaran@users.noreply.github.com> Co-authored-by: Daniel Su Co-authored-by: Sandra Polifroni Co-authored-by: randyh62 <42045079+randyh62@users.noreply.github.com> Co-authored-by: Michael Benavidez Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: MKKnorr Co-authored-by: Kent Russell Co-authored-by: Joseph Greathouse --- RELEASE.md | 518 +++++++----------- .../compatibility-matrix-historical-6.0.csv | 234 ++++---- docs/compatibility/compatibility-matrix.rst | 55 +- docs/conf.py | 6 +- docs/reference/gpu-arch-specs.rst | 116 ++-- docs/release/versions.md | 2 + .../templates/extra_components/6.2.4.md | 35 ++ tools/autotag/templates/highlights/6.2.4.md | 55 ++ tools/autotag/templates/support/6.2.4.md | 5 + .../templates/upcoming_changes/6.2.4.md | 9 + 10 files changed, 509 insertions(+), 526 deletions(-) create mode 100644 tools/autotag/templates/extra_components/6.2.4.md create mode 100644 tools/autotag/templates/highlights/6.2.4.md create mode 100644 tools/autotag/templates/support/6.2.4.md create mode 100644 tools/autotag/templates/upcoming_changes/6.2.4.md diff --git a/RELEASE.md b/RELEASE.md index 5ba7126db..726a599d4 100644 --- a/RELEASE.md +++ b/RELEASE.md @@ -1,104 +1,69 @@ -# ROCm 6.2.2 release notes +# ROCm 6.2.4 release notes -These release notes provide a summary of notable changes since the previous ROCm release. +The release notes provide a summary of notable changes since the previous ROCm release. + +- [Release highlights](#release-highlights) + +- [Operating system and hardware support changes](#operating-system-and-hardware-support-changes) + +- [ROCm components versioning](#rocm-components) + +- [Detailed component changes](#detailed-component-changes) + +- [ROCm known issues](#rocm-known-issues) + +- [ROCm upcoming changes](#rocm-upcoming-changes) ```{note} -As ROCm 6.2.2 was released shortly after 6.2.1, the changes between these versions -are minimal. For a comprehensive overview of recent updates, the ROCm 6.2.1 release -notes are appended to the end of this document. - -For detailed information about the changes in ROCm 6.2.1, refer to the appended -section: [ROCm 6.2.1 release notes](rocm-6-2-1-release-notes). +If you’re using Radeon™ PRO or Radeon GPUs in a workstation setting with a +display connected, continue to use ROCm 6.2.3. See the [Use ROCm on Radeon +GPUs](https://rocm.docs.amd.com/projects/radeon/en/latest/index.html) +documentation to verify compatibility and system requirements. ``` -The [Compatibility matrix](https://rocm.docs.amd.com/en/docs-6.2.2/compatibility/compatibility-matrix.html) -provides the full list of supported hardware, operating systems, ecosystems, third-party components, and ROCm components -for each ROCm release. - -Release notes for previous ROCm releases are available in earlier versions of the documentation. -See the [ROCm documentation release history](https://rocm.docs.amd.com/en/latest/release/versions.html). - ## Release highlights -The following is a significant fix introduced in ROCm 6.2.2. +The following are notable new features and improvements in ROCm 6.2.4. For changes to individual components, see +[Detailed component changes](#detailed-component-changes). -### Fixed Instinct MI300X error recovery failure +#### ROCm documentation updates -Improved the reliability of AMD Instinct MI300X accelerators in scenarios involving -uncorrectable errors. Previously, error recovery did not occur as expected, -potentially leaving the system in an undefined state. This fix ensures that error -recovery functions as expected, maintaining system stability. +ROCm documentation continues to be updated to provide clearer and more comprehensive guidance for +a wider variety of user needs and use cases. -See the [original issue](#instinct-mi300x-gpu-recovery-failure-on-uncorrectable-errors) -noted in the ROCm 6.2.1 release notes. +* Added a new GPU cluster networking guide. See + [Cluster network performance validation for AMD Instinct accelerators](https://rocm.docs.amd.com/projects/gpu-cluster-networking/en/latest/index.html). + This documentation provides guidelines on validating network configurations + in single-node and multi-node environments to attain optimal speed and bandwidth + in AMD Instinct-powered clusters. ---- +* Updated the HIP runtime documentation. -## ROCm 6.2.1 release notes + * Added a new section on how to use [HIP graphs](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hipgraph.html). -The ROCm 6.2.1 release notes document newly added ecosystem support, ROCm Offline Installer Creator updates, -and improvements to several ROCm libraries and tools. + * Added a new section about the [Stream ordered memory allocator (SOMA)](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/stream_ordered_allocator.html). -- [Release highlights](release-highlights) + * Updated the [Porting CUDA driver API](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hip_porting_driver_api.html) section. -- [Operating system and hardware support changes](operating-system-and-hardware-support-changes) +* Updated the [Post-installation instructions](https://rocm.docs.amd.com/projects/install-on-linux/en/docs-6.2.4/install/post-install.html) + with guidance on using the `update-alternatives` utility and environment modules to help you manage multiple ROCm + versions and streamline PATH configuration. -- [ROCm components versioning](rocm-components) +* Updated the [LLM inference performance validation on AMD Instinct + MI300X](https://rocm.docs.amd.com/en/docs-6.2.4/how-to/performance-validation/mi300x/vllm-benchmark.html) + documentation with more detailed guidance, new models, and the `float8` data type. -- [Detailed component changes](detailed-component-changes) +## Operating system and hardware support changes -- [ROCm known issues](rocm-known-issues) +ROCm 6.2.4 adds support for the [AMD Radeon PRO V710](https://www.amd.com/en/products/accelerators/radeon-pro/amd-radeon-pro-v710.html) GPU for compute workloads. See +[Supported GPUs](https://advanced-micro-devices-demo--287.com.readthedocs.build/projects/install-on-linux-internal/en/287/reference/system-requirements.html) +for more information. -- [ROCm upcoming changes](rocm-upcoming-changes) +This release maintains the same operating system support as 6.2.2. -### Release highlights +## ROCm components -The following are notable new features and improvements in ROCm 6.2.1. For changes to individual components, see [Detailed component changes](#detailed-component-changes). - -#### rocAL major version change - -The new version of rocAL introduces many new features, but does not modify any of the existing public API functions. However, the version number was incremented from 1.3 to 2.0. -Applications linked to version 1.3 must be recompiled to link against version 2.0. - -See [the rocAL detailed changes](#rocal-2-0-0) for more information. - -#### New support for FBGEMM (Facebook General Matrix Multiplication) - -As of ROCm 6.2.1, ROCm supports Facebook General Matrix Multiplication (FBGEMM) and the related FBGEMM_GPU library. - -FBGEMM is a low-precision, high-performance CPU kernel library for convolution and matrix multiplication. It is used for server-side inference and as a back end for PyTorch quantized operators. FBGEMM_GPU includes a collection of PyTorch GPU operator libraries for training and inference. For more information, see the ROCm [Model acceleration libraries guide](https://rocm.docs.amd.com/en/docs-6.2.1/how-to/llm-fine-tuning-optimization/model-acceleration-libraries.html) -and [PyTorch's FBGEMM GitHub repository](https://github.com/pytorch/FBGEMM). - -#### ROCm Offline Installer Creator changes - -The [ROCm Offline Installer Creator 6.2.1](https://rocm.docs.amd.com/projects/install-on-linux/en/docs-6.2.1/install/rocm-offline-installer.html) introduces several new features and improvements including: - -* Logging support for create and install logs -* More stringent checks for Linux versions and distributions -* Updated prerequisite repositories -* Fixed CTest issues - -#### ROCm documentation changes - -There have been no changes to supported hardware or operating systems from ROCm 6.2.0 to ROCm 6.2.1. - -* The Programming Model Reference and Understanding the Programming Model topics in HIP have been consolidated into one topic, -[HIP programming model (conceptual)](https://rocm.docs.amd.com/projects/HIP/en/docs-6.2.1/understand/programming_model.html). -* The [HIP virtual memory management](https://rocm.docs.amd.com/projects/HIP/en/docs-6.2.1/how-to/virtual_memory.html) and [HIP virtual memory management API](https://rocm.docs.amd.com/projects/HIP/en/docs-6.2.1/reference/virtual_memory_reference.html) topics have been added. - -```{note} -The ROCm documentation, like all ROCm projects, is open source and available on GitHub. To contribute to ROCm documentation, see the [ROCm documentation contribution guidelines](https://rocm.docs.amd.com/en/latest/contribute/contributing.html). -``` - -### Operating system and hardware support changes - -ROCm 6.2.1 adds support for Ubuntu 24.04.1 (kernel: 6.8 [GA]). - -See the [Compatibility matrix](https://rocm.docs.amd.com/en/docs-6.2.1/compatibility/compatibility-matrix.html) for the full list of supported operating systems and hardware architectures. - -### ROCm components - -The following table lists the versions of ROCm components for ROCm 6.2.1, including any version changes from 6.2.0 to 6.2.1. +The following table lists the versions of ROCm components for ROCm 6.2.4, including any version changes from 6.2.2 to 6.2.4. Click the component's updated version to go to a detailed list of its changes. Click to go to the component's source code on GitHub. @@ -121,51 +86,51 @@ Click the component's updated version to go to a detailed list of its changes. C Libraries Machine learning and computer vision - Composable Kernel + Composable Kernel 1.1.0 - - MIGraphX + MIGraphX 2.10 - + - MIOpen + MIOpen 3.2.0 - - MIVisionX + MIVisionX 3.0.0 - - rocAL - 1.0.0 ⇒ 2.0.0 - rocAL + 2.0.0 + - rocDecode + rocDecode 0.6.0 - - rocPyDecode + rocPyDecode 0.1.0 - - RPP + RPP 1.8.0 - @@ -173,9 +138,9 @@ Click the component's updated version to go to a detailed list of its changes. C Communication - RCCL - 2.20.5 ⇒ 2.20.5 - RCCL + 2.20.5 + @@ -183,99 +148,99 @@ Click the component's updated version to go to a detailed list of its changes. C Math - hipBLAS + hipBLAS 2.2.0 - - hipBLASLt + hipBLASLt 0.8.0 - - hipFFT - 1.0.15 - hipFFT + 1.0.15 ⇒ 1.0.16 + - hipfort + hipfort 0.4.0 - - hipRAND - 2.11.0 - hipRAND + 2.11.0 ⇒ 2.11.1 + - hipSOLVER + hipSOLVER 2.2.0 - - hipSPARSE + hipSPARSE 3.1.1 - - hipSPARSELt + hipSPARSELt 0.2.1 - - rocALUTION - 3.2.0 - rocALUTION + 3.2.0 ⇒ 3.2.1 + - rocBLAS - 4.1.2 ⇒ 4.2.1 - rocBLAS + 4.2.1 ⇒ 4.2.4 + - rocFFT - 1.0.28 ⇒ 1.0.29 - rocFFT + 1.0.29 ⇒ 1.0.30 + - rocRAND - 3.1.0 - rocRAND + 3.1.0 ⇒ 3.1.1 + - rocSOLVER - 3.26.0 - rocSOLVER + 3.26.0 ⇒ 3.26.2 + - rocSPARSE - 3.2.0 - rocSPARSE + 3.2.0 ⇒ 3.2.1 + - rocWMMA + rocWMMA 1.5.0 - Tensile 4.41.0 - @@ -283,27 +248,27 @@ Click the component's updated version to go to a detailed list of its changes. C Primitives - hipCUB - 3.2.0 - hipCUB + 3.2.0 ⇒ 3.2.1 + - hipTensor + hipTensor 1.3.0 - - rocPRIM - 3.2.0 ⇒ 3.2.1 - rocPRIM + 3.2.1 ⇒ 3.2.2 + - rocThrust - 3.1.0 - rocThrust + 3.1.0 ⇒ 3.1.1 + @@ -311,33 +276,33 @@ Click the component's updated version to go to a detailed list of its changes. C Tools System management - AMD SMI - 24.6.2 ⇒ 24.6.3 - AMD SMI + 24.6.3 ⇒ 24.6.3 + - rocminfo + rocminfo 1.0.0 - - ROCm Data Center Tool + ROCm Data Center Tool + 0.3.0 + + + + ROCm SMI + 7.3.0 + + + + ROCm Validation Suite 1.0.0 - - - - ROCm SMI - 7.3.0 ⇒ 7.3.0 - - - - ROCm Validation Suite - 1.0.0 - @@ -345,40 +310,40 @@ Click the component's updated version to go to a detailed list of its changes. C Performance - Omniperf + Omniperf 2.0.1 - - Omnitrace - 1.11.2 ⇒ 1.11.2 - Omnitrace + 1.11.2 + - ROCm Bandwidth + ROCm Bandwidth Test 1.4.0 - - ROCProfiler + ROCProfiler 2.0.0 - - ROCprofiler-SDK + ROCprofiler-SDK 0.4.0 - - ROCTracer + ROCTracer 4.1.0 - @@ -386,219 +351,114 @@ Click the component's updated version to go to a detailed list of its changes. C Development - HIPIFY - 18.0.0 ⇒ 18.0.0 - HIPIFY + 18.0.0 + - ROCdbgapi + ROCdbgapi 0.76.0 - - ROCm CMake + ROCm CMake 0.13.0 - - ROCm Debugger (ROCgdb) + ROCm Debugger (ROCgdb) 14.2 - - ROCr Debug Agent + ROCr Debug Agent 2.0.3 - Compilers - HIPCC + HIPCC 1.1.1 - - llvm-project + llvm-project 18.0.0 - Runtimes - HIP - 6.2 ⇒ 6.2.1 - HIP + 6.2.4 + - ROCr Runtime + ROCr Runtime 1.14.0 - -### Detailed component changes +## Detailed component changes The following sections describe key changes to ROCm components. -#### **AMD SMI** (24.6.3) +### **AMD SMI** (24.6.3) -##### Changes +#### Resolved issues -* Added `amd-smi static --ras` on Guest VMs. Guest VMs can view enabled/disabled RAS features on Host cards. +* Fixed support for the API calls `amdsmi_get_gpu_process_isolation` and + `amdsmi_clean_gpu_local_data`, along with the `amd-smi set + --process-isolation <0 or 1>` command. See issue + [#3500](https://github.com/ROCm/ROCm/issues/3500) on GitHub. -##### Removals +### **rocFFT** (1.0.30) -* Removed `amd-smi metric --ecc` & `amd-smi metric --ecc-blocks` on Guest VMs. Guest VMs do not support getting current ECC counts from the Host cards. +#### Optimized -##### Resolved issues +* Implemented 1D kernels for factorizable sizes greater than 1024 and less than 2048. -* Fixed TypeError in `amd-smi process -G`. -* Updated CLI error strings to handle empty and invalid GPU/CPU inputs. -* Fixed Guest VM showing passthrough options. -* Fixed firmware formatting where leading 0s were missing. +#### Resolved issues -#### **HIP** (6.2.1) +* Fixed plan creation failure on some even-length real-complex transforms that use Bluestein's algorithm. -##### Resolved issues +### **rocSOLVER** (3.26.2) -* Soft hang when using `AMD_SERIALIZE_KERNEL` -* Memory leak in `hipIpcCloseMemHandle` +#### Resolved issues -#### **HIPIFY** (18.0.0) +* Fixed synchronization issue in STEIN. -##### Changes +## ROCm known issues -* Added CUDA 12.5.1 support -* Added cuDNN 9.2.1 support -* Added LLVM 18.1.8 support -* Added `hipBLAS` 64-bit APIs support -* Added Support for math constants `math_constants.h` +ROCm known issues are tracked on [GitHub](https://github.com/ROCm/ROCm/labels/Verified%20Issue). +Known issues related to individual components are listed in the [Detailed component changes](#detailed-component-changes) +section. -#### **Omnitrace** (1.11.2) - -##### Known issues - -Perfetto can no longer open Omnitrace proto files. Loading Perfetto trace output `.proto` files in the latest version of `ui.perfetto.dev` can result in a dialog with the message, "Oops, something went wrong! Please file a bug." The information in the dialog will refer to an "Unknown field type." The workaround is to open the files with the previous version of the Perfetto UI found at [https://ui.perfetto.dev/v46.0-35b3d9845/#!/](https://ui.perfetto.dev/v46.0-35b3d9845/#!/). - -See [issue #3767](https://github.com/ROCm/ROCm/issues/3767) on GitHub. - -#### **RCCL** (2.20.5) - -##### Known issues - -On systems running Linux kernel 6.8.0, such as Ubuntu 24.04, Direct Memory Access (DMA) transfers between the GPU and NIC are disabled and impacts multi-node RCCL performance. -This issue was reproduced with RCCL 2.20.5 (ROCm 6.2.0 and 6.2.1) on systems with Broadcom Thor-2 NICs and affects other systems with RoCE networks using Linux 6.8.0 or newer. -Older RCCL versions are also impacted. - -This issue will be addressed in a future ROCm release. - -See [issue #3772](https://github.com/ROCm/ROCm/issues/3772) on GitHub. - -#### **rocAL** (2.0.0) - -##### Changes - -* The new version of rocAL introduces many new features, but does not modify any of the existing public API functions.However, the version number was incremented from 1.3 to 2.0. - Applications linked to version 1.3 must be recompiled to link against version 2.0. -* Added development and test packages. -* Added C++ rocAL audio unit test and Python script to run and compare the outputs. -* Added Python support for audio decoders. -* Added Pytorch iterator for audio. -* Added Python audio unit test and support to verify outputs. -* Added rocDecode for HW decode. -* Added support for: - * Audio loader and decoder, which uses libsndfile library to decode wav files - * Audio augmentation - PreEmphasis filter, Spectrogram, ToDecibels, Resample, NonSilentRegionDetection, MelFilterBank - * Generic augmentation - Slice, Normalize - * Reading from file lists in file reader - * Downmixing audio channels during decoding - * TensorTensorAdd and TensorScalarMultiply operations - * Uniform and Normal distribution nodes -* Image to tensor updates -* ROCm install - use case graphics removed - -##### Known issues - -* Dependencies are not installed with the rocAL package installer. Dependencies must be installed with the prerequisite setup script provided. See the [rocAL README on GitHub](https://github.com/ROCm/rocAL/blob/docs/6.2.1/README.md#prerequisites-setup-script) for details. - -#### **rocBLAS** (4.2.1) - -##### Removals - -* Removed Device_Memory_Allocation.pdf link in documentation. - -##### Resolved issues - -* Fixed error/warning message during `rocblas_set_stream()` call. - -#### **rocFFT** (1.0.29) - -##### Optimizations - -* Implemented 1D kernels for factorizable sizes less than 1024. - -#### **ROCm SMI** (7.3.0) - -##### Optimizations - -* Improved handling of UnicodeEncodeErrors with non UTF-8 locales. Non UTF-8 locales were causing crashes on UTF-8 special characters. - -##### Resolved issues - -* Fixed an issue where the Compute Partition tests segfaulted when AMDGPU was loaded with optional parameters. - -##### Known issues - -* When setting CPX as a partition mode, there is a DRM node limit of 64. This is a known limitation when multiple drivers are using the DRM nodes. The `ls /sys/class/drm` command can be used to see the number of DRM nodes, and the following steps can be used to remove unnecessary drivers: - - 1. Unload AMDGPU: `sudo rmmod amdgpu`. - 2. Remove any unnecessary drivers using `rmmod`. For example, to remove an AST driver, run `sudo rmmod ast`. - 3. Reload AMDGPU using `modprobe`: `sudo modprobe amdgpu`. - -#### **rocPRIM** (3.2.1) - -##### Optimizations - -* Improved performance of `block_reduce_warp_reduce` when warp size equals block size. - -### ROCm known issues - -ROCm known issues are tracked on [GitHub](https://github.com/ROCm/ROCm/labels/Verified%20Issue). Known issues related to -individual components are listed in the [Detailed component changes](detailed-component-changes) section. - -#### Instinct MI300X GPU recovery failure on uncorrectable errors - -For the AMD Instinct MI300X accelerator, GPU recovery resets triggered by uncorrectable errors (UE) might not complete -successfully, which can result in the system being left in an undefined state. A system reboot is needed to recover from -this state. Additionally, error logging might fail in these situations, hindering diagnostics. - -This issue is under investigation and will be resolved in a future ROCm release. - -See [issue #3766](https://github.com/ROCm/ROCm/issues/3766) on GitHub. - -### ROCm upcoming changes +## ROCm upcoming changes The following changes to the ROCm software stack are anticipated for future releases. -#### rocm-llvm-alt +### rocm-llvm-alt The `rocm-llvm-alt` package will be removed in an upcoming release. Users relying on the functionality provided by the closed-source compiler should transition to the open-source compiler. Once the `rocm-llvm-alt` package is removed, any compilation requesting functionality provided by the closed-source compiler will result in a Clang warning: "*[AMD] proprietary optimization compiler has been removed*". -#### rccl-rdma-sharp-plugins +### rccl-rdma-sharp-plugins The RCCL plugin package, `rccl-rdma-sharp-plugins`, will be removed in an upcoming ROCm release. diff --git a/docs/compatibility/compatibility-matrix-historical-6.0.csv b/docs/compatibility/compatibility-matrix-historical-6.0.csv index c19f1ed9b..1af1eca26 100644 --- a/docs/compatibility/compatibility-matrix-historical-6.0.csv +++ b/docs/compatibility/compatibility-matrix-historical-6.0.csv @@ -1,117 +1,117 @@ -ROCm Version,6.2.2,6.2.1,6.2.0, 6.1.2, 6.1.1, 6.1.0, 6.0.2, 6.0.0 - :ref:`Operating systems & kernels `,"Ubuntu 24.04.1, 24.04","Ubuntu 24.04.1, 24.04",Ubuntu 24.04,,,,, - ,"Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.4, 22.04.3","Ubuntu 22.04.4, 22.04.3","Ubuntu 22.04.4, 22.04.3","Ubuntu 22.04.4, 22.04.3, 22.04.2","Ubuntu 22.04.4, 22.04.3, 22.04.2" - ,,,,"Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5" - ,"RHEL 9.4, 9.3","RHEL 9.4, 9.3","RHEL 9.4, 9.3","RHEL 9.4 [#red-hat94-past-60]_, 9.3, 9.2","RHEL 9.4 [#red-hat94-past-60]_, 9.3, 9.2","RHEL 9.4 [#red-hat94-past-60]_, 9.3, 9.2","RHEL 9.3, 9.2","RHEL 9.3, 9.2" - ,"RHEL 8.10, 8.9","RHEL 8.10, 8.9","RHEL 8.10, 8.9","RHEL 8.9, 8.8","RHEL 8.9, 8.8","RHEL 8.9, 8.8","RHEL 8.9, 8.8","RHEL 8.9, 8.8" - ,"SLES 15 SP6, SP5","SLES 15 SP6, SP5","SLES 15 SP6, SP5","SLES 15 SP5, SP4","SLES 15 SP5, SP4","SLES 15 SP5, SP4","SLES 15 SP5, SP4","SLES 15 SP5, SP4" - ,,,,CentOS 7.9,CentOS 7.9,CentOS 7.9,CentOS 7.9,CentOS 7.9 - ,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,,, - ,.. _architecture-support-compatibility-matrix-past-60:,,,,,,, - :doc:`Architecture `,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3 - ,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2 - ,CDNA,CDNA,CDNA,CDNA,CDNA,CDNA,CDNA,CDNA - ,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3 - ,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2 - ,.. _gpu-support-compatibility-matrix-past-60:,,,,,,, - :doc:`GPU / LLVM target `,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100 - ,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030 - ,gfx942 [#mi300_622-past-60]_,gfx942 [#mi300_621-past-60]_,gfx942 [#mi300_620-past-60]_, gfx942 [#mi300_612-past-60]_, gfx942 [#mi300_611-past-60]_, gfx942 [#mi300_610-past-60]_, gfx942 [#mi300_602-past-60]_, gfx942 [#mi300_600-past-60]_ - ,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a - ,gfx908,gfx908,gfx908,gfx908,gfx908,gfx908,gfx908,gfx908 - ,,,,,,,, - FRAMEWORK SUPPORT,.. _framework-support-compatibility-matrix-past-60:,,,,,,, - :doc:`PyTorch `,"2.3, 2.2, 2.1, 2.0, 1.13","2.3, 2.2, 2.1, 2.0, 1.13","2.3, 2.2, 2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13" - :doc:`TensorFlow `,"2.16.1, 2.15.1, 2.14.1","2.16.1, 2.15.1, 2.14.1","2.16.1, 2.15.1, 2.14.1","2.15.0, 2.14.0, 2.13.1","2.15.0, 2.14.0, 2.13.1","2.15.0, 2.14.0, 2.13.1","2.14.0, 2.13.1, 2.12.1","2.14.0, 2.13.1, 2.12.1" - :doc:`JAX `,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26 - `ONNX Runtime `_,1.17.3,1.17.3,1.17.3,1.17.3,1.17.3,1.17.3,1.14.1,1.14.1 - ,,,,,,,, - THIRD PARTY COMMS,.. _thirdpartycomms-support-compatibility-matrix-past-60:,,,,,,, - `UCC `_,>=1.3.0,>=1.3.0,>=1.3.0,>=1.3.0,>=1.3.0,>=1.3.0,>=1.2.0,>=1.2.0 - `UCX `_,>=1.15.0,>=1.15.0,>=1.15.0,>=1.14.1,>=1.14.1,>=1.14.1,>=1.14.1,>=1.14.1 - ,,,,,,,, - THIRD PARTY ALGORITHM,.. _thirdpartyalgorithm-support-compatibility-matrix-past-60:,,,,,,, - Thrust,2.2.0,2.2.0,2.2.0,2.1.0,2.1.0,2.1.0,2.0.1,2.0.1 - CUB,2.2.0,2.2.0,2.2.0,2.1.0,2.1.0,2.1.0,2.0.1,2.0.1 - ,,,,,,,, - KFD & USER SPACE [#kfd_support-past-60]_,.. _kfd-userspace-support-compatibility-matrix-past-60:,,,,,,, - Tested user space versions,"6.1.x, 6.0.x","6.1.x, 6.0.x","6.1.x, 6.0.x","6.2.x, 6.0.x, 5.7.x","6.2.x, 6.0.x, 5.7.x","6.2.x, 6.0.x, 5.7.x","6.2.x, 6.0.x, 5.7.x, 5.6.x","6.2.x, 6.0.x, 5.7.x, 5.6.x" - ,,,,,,,, - ML & COMPUTER VISION,.. _mllibs-support-compatibility-matrix-past-60:,,,,,,, - :doc:`Composable Kernel `,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0 - :doc:`MIGraphX `,2.10.0,2.10.0,2.10.0,2.9.0,2.9.0,2.9.0,2.8.0,2.8.0 - :doc:`MIOpen `,3.2.0,3.2.0,3.2.0,3.1.0,3.1.0,3.1.0,3.0.0,3.0.0 - :doc:`MIVisionX `,3.0.0,3.0.0,3.0.0,2.5.0,2.5.0,2.5.0,2.5.0,2.5.0 - :doc:`rocAL `,2.0.0,2.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0 - :doc:`rocDecode `,0.6.0,0.6.0,0.6.0,0.6.0,0.5.0,0.5.0,N/A,N/A - :doc:`rocPyDecode `,0.1.0,0.1.0,0.1.0,N/A,N/A,N/A,N/A,N/A - :doc:`RPP `,1.8.0,1.8.0,1.8.0,1.5.0,1.5.0,1.5.0,1.4.0,1.4.0 - ,,,,,,,, - COMMUNICATION,.. _commlibs-support-compatibility-matrix-past-60:,,,,,,, - :doc:`RCCL `,2.20.5,2.20.5,2.20.5,2.18.6,2.18.6,2.18.6,2.18.3,2.18.3 - ,,,,,,,, - MATH LIBS,.. _mathlibs-support-compatibility-matrix-past-60:,,,,,,, - `half `_ ,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0 - :doc:`hipBLAS `,2.2.0,2.2.0,2.2.0,2.1.0,2.1.0,2.1.0,2.0.0,2.0.0 - :doc:`hipBLASLt `,0.8.0,0.8.0,0.8.0,0.7.0,0.7.0,0.7.0,0.6.0,0.6.0 - :doc:`hipFFT `,1.0.15,1.0.15,1.0.14,1.0.14,1.0.14,1.0.14,1.0.13,1.0.13 - :doc:`hipFORT `,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0 - :doc:`hipRAND `,2.11.0,2.11.0,2.11.0,2.10.16,2.10.16,2.10.16,2.10.16,2.10.16 - :doc:`hipSOLVER `,2.2.0,2.2.0,2.2.0,2.1.1,2.1.1,2.1.0,2.0.0,2.0.0 - :doc:`hipSPARSE `,3.1.1,3.1.1,3.1.1,3.0.1,3.0.1,3.0.1,3.0.0,3.0.0 - :doc:`hipSPARSELt `,0.2.1,0.2.1,0.2.1,0.2.0,0.1.0,0.1.0,0.1.0,0.1.0 - :doc:`rocALUTION `,3.2.0,3.2.0,3.2.0,3.1.1,3.1.1,3.1.1,3.0.3,3.0.3 - :doc:`rocBLAS `,4.2.1,4.2.1,4.2.0,4.1.2,4.1.0,4.1.0,4.0.0,4.0.0 - :doc:`rocFFT `,1.0.29,1.0.29,1.0.28,1.0.27,1.0.27,1.0.26,1.0.25,1.0.23 - :doc:`rocRAND `,3.1.0,3.1.0,3.1.0,3.0.1,3.0.1,3.0.1,3.0.0,2.10.17 - :doc:`rocSOLVER `,3.26.0,3.26.0,3.26.0,3.25.0,3.25.0,3.25.0,3.24.0,3.24.0 - :doc:`rocSPARSE `,3.2.0,3.2.0,3.2.0,3.1.2,3.1.2,3.1.2,3.0.2,3.0.2 - :doc:`rocWMMA `,1.5.0,1.5.0,1.5.0,1.4.0,1.4.0,1.4.0,1.3.0,1.3.0 - `Tensile `_,4.40.0,4.40.0,4.40.0,4.40.0,4.40.0,4.40.0,4.39.0,4.39.0 - ,,,,,,,, - PRIMITIVES,.. _primitivelibs-support-compatibility-matrix-past-60:,,,,,,, - :doc:`hipCUB `,3.2.0,3.2.0,3.2.0,3.1.0,3.1.0,3.1.0,3.0.0,3.0.0 - :doc:`hipTensor `,1.3.0,1.3.0,1.3.0,1.2.0,1.2.0,1.2.0,1.1.0,1.1.0 - :doc:`rocPRIM `,3.2.0,3.2.0,3.2.0,3.1.0,3.1.0,3.1.0,3.0.0,3.0.0 - :doc:`rocThrust `,3.1.0,3.1.0,3.0.1,3.0.1,3.0.1,3.0.1,3.0.0,3.0.0 - ,,,,,,,, - SUPPORT LIBS,,,,,,,, - `hipother `_,6.2.41134,6.2.41134,6.2.41133,6.1.40093,6.1.40092,6.1.40091,6.1.32831,6.1.32830 - `rocm-core `_,6.2.2,6.2.1,6.2.0,6.1.2,6.1.1,6.1.0,6.0.2,6.0.0 - `ROCT-Thunk-Interface `_,20240607.4.05,20240607.4.05,20240607.1.4246,20240125.5.08,20240125.5.08,20240125.3.30,20231016.2.245,20231016.2.245 - ,,,,,,,, - SYSTEM MGMT TOOLS,.. _tools-support-compatibility-matrix-past-60:,,,,,,, - :doc:`AMD SMI `,24.6.3,24.6.3,24.6.2,24.5.1,24.5.1,24.4.1,23.4.2,23.4.2 - :doc:`ROCm Data Center Tool `,1.0.0,1.0.0,1.0.0,0.3.0,0.3.0,0.3.0,0.3.0,0.3.0 - :doc:`rocminfo `,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0 - :doc:`ROCm SMI `,7.3.0,7.3.0,7.3.0,7.2.0,7.0.0,7.0.0,6.0.2,6.0.0 - :doc:`ROCm Validation Suite `,rocm-6.2.2,rocm-6.2.1,rocm-6.2.0,rocm-6.1.2,rocm-6.1.1,rocm-6.1.0,rocm-6.0.2,rocm-6.0.0 - ,,,,,,,, - PERFORMANCE TOOLS,,,,,,,, - :doc:`Omniperf `,2.0.1,2.0.1,2.0.1,N/A,N/A,N/A,N/A,N/A - :doc:`Omnitrace `,1.11.2,1.11.2,1.11.2,N/A,N/A,N/A,N/A,N/A - :doc:`ROCm Bandwidth Test `,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0 - :doc:`ROCProfiler `,2.0.60202,2.0.60201,2.0.60200,2.0.60102,2.0.60101,2.0.60100,2.0.60002,2.0.60000 - :doc:`ROCprofiler-SDK `,0.4.0,0.4.0,0.4.0,N/A,N/A,N/A,N/A,N/A - :doc:`ROCTracer `,4.1.60202,4.1.60201,4.1.60200,4.1.60102,4.1.60101,4.1.60100,4.1.60002,4.1.60000 - ,,,,,,,, - DEVELOPMENT TOOLS,,,,,,,, - :doc:`HIPIFY `,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 - :doc:`ROCm CMake `,0.13.0,0.13.0,0.13.0,0.12.0,0.12.0,0.12.0,0.11.0,0.11.0 - :doc:`ROCdbgapi `,0.76.0,0.76.0,0.76.0,0.71.0,0.71.0,0.71.0,0.71.0,0.71.0 - :doc:`ROCm Debugger (ROCgdb) `,14.2.0,14.2.0,14.2.0,14.1.0,14.1.0,14.1.0,13.2.0,13.2.0 - `rocprofiler-register `_,0.4.0,0.4.0,0.4.0,0.3.0,0.3.0,0.3.0,N/A,N/A - :doc:`ROCr Debug Agent `,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3 - ,,,,,,,, - COMPILERS,.. _compilers-support-compatibility-matrix-past-60:,,,,,,, - `clang-ocl `_,N/A,N/A,N/A,0.5.0,0.5.0,0.5.0,0.5.0,0.5.0 - :doc:`hipCC `,1.1.1,1.1.1,1.1.1,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0 - `Flang `_,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 - :doc:`llvm-project `,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 - `OpenMP `_,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 - ,,,,,,,, - RUNTIMES,.. _runtime-support-compatibility-matrix-past-60:,,,,,,, - :doc:`AMD CLR `,6.2.41134,6.2.41134,6.2.41133,6.1.40093,6.1.40092,6.1.40091,6.1.32831,6.1.32830 - :doc:`HIP `,6.2.41134,6.2.41134,6.2.41133,6.1.40093,6.1.40092,6.1.40091,6.1.32831,6.1.32830 - `OpenCL Runtime `_,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0 - :doc:`ROCR-Runtime `,1.14.0,1.14.0,1.13.0,1.13.0,1.13.0,1.13.0,1.12.0,1.12.0 +ROCm Version,6.2.4,6.2.2,6.2.1,6.2.0, 6.1.2, 6.1.1, 6.1.0, 6.0.2, 6.0.0 + :ref:`Operating systems & kernels `,"Ubuntu 24.04.1, 24.04","Ubuntu 24.04.1, 24.04","Ubuntu 24.04.1, 24.04",Ubuntu 24.04,,,,, + ,"Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.4, 22.04.3","Ubuntu 22.04.4, 22.04.3","Ubuntu 22.04.4, 22.04.3","Ubuntu 22.04.4, 22.04.3, 22.04.2","Ubuntu 22.04.4, 22.04.3, 22.04.2" + ,,,,,"Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5","Ubuntu 20.04.6, 20.04.5" + ,"RHEL 9.4, 9.3","RHEL 9.4, 9.3","RHEL 9.4, 9.3","RHEL 9.4, 9.3","RHEL 9.4 [#red-hat94-past-60]_, 9.3, 9.2","RHEL 9.4 [#red-hat94-past-60]_, 9.3, 9.2","RHEL 9.4 [#red-hat94-past-60]_, 9.3, 9.2","RHEL 9.3, 9.2","RHEL 9.3, 9.2" + ,"RHEL 8.10, 8.9","RHEL 8.10, 8.9","RHEL 8.10, 8.9","RHEL 8.10, 8.9","RHEL 8.9, 8.8","RHEL 8.9, 8.8","RHEL 8.9, 8.8","RHEL 8.9, 8.8","RHEL 8.9, 8.8" + ,"SLES 15 SP6, SP5","SLES 15 SP6, SP5","SLES 15 SP6, SP5","SLES 15 SP6, SP5","SLES 15 SP5, SP4","SLES 15 SP5, SP4","SLES 15 SP5, SP4","SLES 15 SP5, SP4","SLES 15 SP5, SP4" + ,,,,,CentOS 7.9,CentOS 7.9,CentOS 7.9,CentOS 7.9,CentOS 7.9 + ,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,Oracle Linux 8.9 [#oracle89-past-60]_,,, + ,.. _architecture-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`Architecture `,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3,CDNA3 + ,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2,CDNA2 + ,CDNA,CDNA,CDNA,CDNA,CDNA,CDNA,CDNA,CDNA,CDNA + ,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3,RDNA3 + ,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2,RDNA2 + ,.. _gpu-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`GPU / LLVM target `,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100,gfx1100 + ,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030,gfx1030 + ,gfx942 [#mi300_624-past-60]_,gfx942 [#mi300_622-past-60]_,gfx942 [#mi300_621-past-60]_,gfx942 [#mi300_620-past-60]_, gfx942 [#mi300_612-past-60]_, gfx942 [#mi300_611-past-60]_, gfx942 [#mi300_610-past-60]_, gfx942 [#mi300_602-past-60]_, gfx942 [#mi300_600-past-60]_ + ,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a,gfx90a + ,gfx908,gfx908,gfx908,gfx908,gfx908,gfx908,gfx908,gfx908,gfx908 + ,,,,,,,,, + FRAMEWORK SUPPORT,.. _framework-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`PyTorch `,"2.3, 2.2, 2.1, 2.0, 1.13","2.3, 2.2, 2.1, 2.0, 1.13","2.3, 2.2, 2.1, 2.0, 1.13","2.3, 2.2, 2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13","2.1, 2.0, 1.13" + :doc:`TensorFlow `,"2.16.1, 2.15.1, 2.14.1","2.16.1, 2.15.1, 2.14.1","2.16.1, 2.15.1, 2.14.1","2.16.1, 2.15.1, 2.14.1","2.15.0, 2.14.0, 2.13.1","2.15.0, 2.14.0, 2.13.1","2.15.0, 2.14.0, 2.13.1","2.14.0, 2.13.1, 2.12.1","2.14.0, 2.13.1, 2.12.1" + :doc:`JAX `,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26,0.4.26 + `ONNX Runtime `_,1.17.3,1.17.3,1.17.3,1.17.3,1.17.3,1.17.3,1.17.3,1.14.1,1.14.1 + ,,,,,,,,, + THIRD PARTY COMMS,.. _thirdpartycomms-support-compatibility-matrix-past-60:,,,,,,,, + `UCC `_,>=1.3.0,>=1.3.0,>=1.3.0,>=1.3.0,>=1.3.0,>=1.3.0,>=1.3.0,>=1.2.0,>=1.2.0 + `UCX `_,>=1.15.0,>=1.15.0,>=1.15.0,>=1.15.0,>=1.14.1,>=1.14.1,>=1.14.1,>=1.14.1,>=1.14.1 + ,,,,,,,,, + THIRD PARTY ALGORITHM,.. _thirdpartyalgorithm-support-compatibility-matrix-past-60:,,,,,,,, + Thrust,2.2.0,2.2.0,2.2.0,2.2.0,2.1.0,2.1.0,2.1.0,2.0.1,2.0.1 + CUB,2.2.0,2.2.0,2.2.0,2.2.0,2.1.0,2.1.0,2.1.0,2.0.1,2.0.1 + ,,,,,,,,, + KFD & USER SPACE [#kfd_support-past-60]_,.. _kfd-userspace-support-compatibility-matrix-past-60:,,,,,,,, + Tested user space versions,"6.1.x, 6.0.x","6.1.x, 6.0.x","6.1.x, 6.0.x","6.1.x, 6.0.x","6.2.x, 6.0.x, 5.7.x","6.2.x, 6.0.x, 5.7.x","6.2.x, 6.0.x, 5.7.x","6.2.x, 6.0.x, 5.7.x, 5.6.x","6.2.x, 6.0.x, 5.7.x, 5.6.x" + ,,,,,,,,, + ML & COMPUTER VISION,.. _mllibs-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`Composable Kernel `,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0,1.1.0 + :doc:`MIGraphX `,2.10.0,2.10.0,2.10.0,2.10.0,2.9.0,2.9.0,2.9.0,2.8.0,2.8.0 + :doc:`MIOpen `,3.2.0,3.2.0,3.2.0,3.2.0,3.1.0,3.1.0,3.1.0,3.0.0,3.0.0 + :doc:`MIVisionX `,3.0.0,3.0.0,3.0.0,3.0.0,2.5.0,2.5.0,2.5.0,2.5.0,2.5.0 + :doc:`rocAL `,2.0.0,2.0.0,2.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0 + :doc:`rocDecode `,0.6.0,0.6.0,0.6.0,0.6.0,0.6.0,0.5.0,0.5.0,N/A,N/A + :doc:`rocPyDecode `,0.1.0,0.1.0,0.1.0,0.1.0,N/A,N/A,N/A,N/A,N/A + :doc:`RPP `,1.8.0,1.8.0,1.8.0,1.8.0,1.5.0,1.5.0,1.5.0,1.4.0,1.4.0 + ,,,,,,,,, + COMMUNICATION,.. _commlibs-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`RCCL `,2.20.5,2.20.5,2.20.5,2.20.5,2.18.6,2.18.6,2.18.6,2.18.3,2.18.3 + ,,,,,,,,, + MATH LIBS,.. _mathlibs-support-compatibility-matrix-past-60:,,,,,,,, + `half `_ ,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0,1.12.0 + :doc:`hipBLAS `,2.2.0,2.2.0,2.2.0,2.2.0,2.1.0,2.1.0,2.1.0,2.0.0,2.0.0 + :doc:`hipBLASLt `,0.8.0,0.8.0,0.8.0,0.8.0,0.7.0,0.7.0,0.7.0,0.6.0,0.6.0 + :doc:`hipFFT `,1.0.16,1.0.15,1.0.15,1.0.14,1.0.14,1.0.14,1.0.14,1.0.13,1.0.13 + :doc:`hipFORT `,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0,0.4.0 + :doc:`hipRAND `,2.11.1,2.11.0,2.11.0,2.11.0,2.10.16,2.10.16,2.10.16,2.10.16,2.10.16 + :doc:`hipSOLVER `,2.2.0,2.2.0,2.2.0,2.2.0,2.1.1,2.1.1,2.1.0,2.0.0,2.0.0 + :doc:`hipSPARSE `,3.1.1,3.1.1,3.1.1,3.1.1,3.0.1,3.0.1,3.0.1,3.0.0,3.0.0 + :doc:`hipSPARSELt `,0.2.1,0.2.1,0.2.1,0.2.1,0.2.0,0.1.0,0.1.0,0.1.0,0.1.0 + :doc:`rocALUTION `,3.2.1,3.2.0,3.2.0,3.2.0,3.1.1,3.1.1,3.1.1,3.0.3,3.0.3 + :doc:`rocBLAS `,4.2.4,4.2.1,4.2.1,4.2.0,4.1.2,4.1.0,4.1.0,4.0.0,4.0.0 + :doc:`rocFFT `,1.0.30,1.0.29,1.0.29,1.0.28,1.0.27,1.0.27,1.0.26,1.0.25,1.0.23 + :doc:`rocRAND `,3.1.1,3.1.0,3.1.0,3.1.0,3.0.1,3.0.1,3.0.1,3.0.0,2.10.17 + :doc:`rocSOLVER `,3.26.2,3.26.0,3.26.0,3.26.0,3.25.0,3.25.0,3.25.0,3.24.0,3.24.0 + :doc:`rocSPARSE `,3.2.1,3.2.0,3.2.0,3.2.0,3.1.2,3.1.2,3.1.2,3.0.2,3.0.2 + :doc:`rocWMMA `,1.5.0,1.5.0,1.5.0,1.5.0,1.4.0,1.4.0,1.4.0,1.3.0,1.3.0 + `Tensile `_,4.40.0,4.40.0,4.40.0,4.40.0,4.40.0,4.40.0,4.40.0,4.39.0,4.39.0 + ,,,,,,,,, + PRIMITIVES,.. _primitivelibs-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`hipCUB `,3.2.1,3.2.0,3.2.0,3.2.0,3.1.0,3.1.0,3.1.0,3.0.0,3.0.0 + :doc:`hipTensor `,1.3.0,1.3.0,1.3.0,1.3.0,1.2.0,1.2.0,1.2.0,1.1.0,1.1.0 + :doc:`rocPRIM `,3.2.2,3.2.0,3.2.0,3.2.0,3.1.0,3.1.0,3.1.0,3.0.0,3.0.0 + :doc:`rocThrust `,3.1.1,3.1.0,3.1.0,3.0.1,3.0.1,3.0.1,3.0.1,3.0.0,3.0.0 + ,,,,,,,,, + SUPPORT LIBS,,,,,,,,, + `hipother `_,6.2.41134,6.2.41134,6.2.41134,6.2.41133,6.1.40093,6.1.40092,6.1.40091,6.1.32831,6.1.32830 + `rocm-core `_,6.2.4,6.2.2,6.2.1,6.2.0,6.1.2,6.1.1,6.1.0,6.0.2,6.0.0 + `ROCT-Thunk-Interface `_,20240607.5.7,20240607.5.7,20240607.4.05,20240607.1.4246,20240125.5.08,20240125.5.08,20240125.3.30,20231016.2.245,20231016.2.245 + ,,,,,,,,, + SYSTEM MGMT TOOLS,.. _tools-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`AMD SMI `,24.6.3,24.6.3,24.6.3,24.6.2,24.5.1,24.5.1,24.4.1,23.4.2,23.4.2 + :doc:`ROCm Data Center Tool `,0.3.0,0.3.0,0.3.0,0.3.0,0.3.0,0.3.0,0.3.0,0.3.0,0.3.0 + :doc:`rocminfo `,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0 + :doc:`ROCm SMI `,7.3.0,7.3.0,7.3.0,7.3.0,7.2.0,7.0.0,7.0.0,6.0.2,6.0.0 + :doc:`ROCm Validation Suite `,rocm-6.2.4,rocm-6.2.2,rocm-6.2.1,rocm-6.2.0,rocm-6.1.2,rocm-6.1.1,rocm-6.1.0,rocm-6.0.2,rocm-6.0.0 + ,,,,,,,,, + PERFORMANCE TOOLS,,,,,,,,, + :doc:`Omniperf `,2.0.1,2.0.1,2.0.1,2.0.1,N/A,N/A,N/A,N/A,N/A + :doc:`Omnitrace `,1.11.2,1.11.2,1.11.2,1.11.2,N/A,N/A,N/A,N/A,N/A + :doc:`ROCm Bandwidth Test `,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0,1.4.0 + :doc:`ROCProfiler `,2.0.60204,2.0.60202,2.0.60201,2.0.60200,2.0.60102,2.0.60101,2.0.60100,2.0.60002,2.0.60000 + :doc:`ROCprofiler-SDK `,0.4.0,0.4.0,0.4.0,0.4.0,N/A,N/A,N/A,N/A,N/A + :doc:`ROCTracer `,4.1.60204,4.1.60202,4.1.60201,4.1.60200,4.1.60102,4.1.60101,4.1.60100,4.1.60002,4.1.60000 + ,,,,,,,,, + DEVELOPMENT TOOLS,,,,,,,,, + :doc:`HIPIFY `,18.0.0.24392,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 + :doc:`ROCm CMake `,0.13.0,0.13.0,0.13.0,0.13.0,0.12.0,0.12.0,0.12.0,0.11.0,0.11.0 + :doc:`ROCdbgapi `,0.76.0,0.76.0,0.76.0,0.76.0,0.71.0,0.71.0,0.71.0,0.71.0,0.71.0 + :doc:`ROCm Debugger (ROCgdb) `,14.2.0,14.2.0,14.2.0,14.2.0,14.1.0,14.1.0,14.1.0,13.2.0,13.2.0 + `rocprofiler-register `_,0.4.0,0.4.0,0.4.0,0.4.0,0.3.0,0.3.0,0.3.0,N/A,N/A + :doc:`ROCr Debug Agent `,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3,2.0.3 + ,,,,,,,,, + COMPILERS,.. _compilers-support-compatibility-matrix-past-60:,,,,,,,, + `clang-ocl `_,N/A,N/A,N/A,N/A,0.5.0,0.5.0,0.5.0,0.5.0,0.5.0 + :doc:`hipCC `,1.1.1,1.1.1,1.1.1,1.1.1,1.0.0,1.0.0,1.0.0,1.0.0,1.0.0 + `Flang `_,18.0.0.24392,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 + :doc:`llvm-project `,18.0.0.24392,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 + `OpenMP `_,18.0.0.24392,18.0.0.24355,18.0.0.24355,18.0.0.24232,17.0.0.24193,17.0.0.24154,17.0.0.24103,17.0.0.24012,17.0.0.23483 + ,,,,,,,,, + RUNTIMES,.. _runtime-support-compatibility-matrix-past-60:,,,,,,,, + :doc:`AMD CLR `,6.2.41134,6.2.41134,6.2.41134,6.2.41133,6.1.40093,6.1.40092,6.1.40091,6.1.32831,6.1.32830 + :doc:`HIP `,6.2.41134,6.2.41134,6.2.41134,6.2.41133,6.1.40093,6.1.40092,6.1.40091,6.1.32831,6.1.32830 + `OpenCL Runtime `_,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0,2.0.0 + :doc:`ROCR-Runtime `,1.14.0,1.14.0,1.14.0,1.13.0,1.13.0,1.13.0,1.13.0,1.12.0,1.12.0 diff --git a/docs/compatibility/compatibility-matrix.rst b/docs/compatibility/compatibility-matrix.rst index 9dfc45d1e..40a301437 100644 --- a/docs/compatibility/compatibility-matrix.rst +++ b/docs/compatibility/compatibility-matrix.rst @@ -18,11 +18,11 @@ Accelerators and GPUs listed in the following table support compute workloads (n .. container:: format-big-table - .. csv-table:: - :header: "ROCm Version", "6.2.2", "6.2.1", "6.1.0" + .. csv-table:: + :header: "ROCm Version", "6.2.4", "6.2.2", "6.1.0" :stub-columns: 1 - :ref:`Operating systems & kernels `,"Ubuntu 24.04.1, 24.04","Ubuntu 24.04.1, 24.04",Ubuntu 24.04 + :ref:`Operating systems & kernels `,"Ubuntu 24.04.1, 24.04","Ubuntu 24.04.1, 24.04", ,"Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.5, 22.04.4","Ubuntu 22.04.4, 22.04.3" ,,,"Ubuntu 20.04.6, 20.04.5" ,"RHEL 9.4, 9.3","RHEL 9.4, 9.3","RHEL 9.4 [#red-hat94]_, 9.3, 9.2" @@ -39,13 +39,13 @@ Accelerators and GPUs listed in the following table support compute workloads (n ,.. _gpu-support-compatibility-matrix:,, :doc:`GPU / LLVM target `,gfx1100,gfx1100,gfx1100 ,gfx1030,gfx1030,gfx1030 - ,gfx942 [#mi300_622]_,gfx942 [#mi300_621]_, gfx942 [#mi300_610]_ + ,gfx942 [#mi300_624]_,gfx942 [#mi300_622]_, gfx942 [#mi300_610]_ ,gfx90a,gfx90a,gfx90a ,gfx908,gfx908,gfx908 ,,, FRAMEWORK SUPPORT,.. _framework-support-compatibility-matrix:,, :doc:`PyTorch `,"2.3, 2.2, 2.1, 2.0, 1.13","2.3, 2.2, 2.1, 2.0, 1.13","2.1, 2.0, 1.13" - :doc:`TensorFlow `,"2.16.1, 2.15.1, 2.14.1","2.16.1, 2.15.1, 2.14.1","2.15, 2.14, 2.13" + :doc:`TensorFlow `,"2.16.1, 2.15.1, 2.14.1","2.16.1, 2.15.1, 2.14.1","2.15.0, 2.14.0, 2.13.1" :doc:`JAX `,0.4.26,0.4.26,0.4.26 `ONNX Runtime `_,1.17.3,1.17.3,1.17.3 ,,, @@ -77,49 +77,49 @@ Accelerators and GPUs listed in the following table support compute workloads (n `half `_ ,1.12.0,1.12.0,1.12.0 :doc:`hipBLAS `,2.2.0,2.2.0,2.1.0 :doc:`hipBLASLt `,0.8.0,0.8.0,0.7.0 - :doc:`hipFFT `,1.0.15,1.0.15,1.0.14 + :doc:`hipFFT `,1.0.16,1.0.15,1.0.14 :doc:`hipFORT `,0.4.0,0.4.0,0.4.0 - :doc:`hipRAND `,2.11.0,2.11.0,2.10.16 + :doc:`hipRAND `,2.11.1,2.11.0,2.10.16 :doc:`hipSOLVER `,2.2.0,2.2.0,2.1.0 :doc:`hipSPARSE `,3.1.1,3.1.1,3.0.1 :doc:`hipSPARSELt `,0.2.1,0.2.1,0.1.0 - :doc:`rocALUTION `,3.2.0,3.2.0,3.1.1 - :doc:`rocBLAS `,4.2.1,4.2.1,4.1.0 - :doc:`rocFFT `,1.0.29,1.0.29,1.0.26 - :doc:`rocRAND `,3.1.0,3.1.0,3.0.1 - :doc:`rocSOLVER `,3.26.0,3.26.0,3.25.0 - :doc:`rocSPARSE `,3.2.0,3.2.0,3.1.2 + :doc:`rocALUTION `,3.2.1,3.2.0,3.1.1 + :doc:`rocBLAS `,4.2.4,4.2.1,4.1.0 + :doc:`rocFFT `,1.0.30,1.0.29,1.0.26 + :doc:`rocRAND `,3.1.1,3.1.0,3.0.1 + :doc:`rocSOLVER `,3.26.2,3.26.0,3.25.0 + :doc:`rocSPARSE `,3.2.1,3.2.0,3.1.2 :doc:`rocWMMA `,1.5.0,1.5.0,1.4.0 `Tensile `_,4.40.0,4.40.0,4.40.0 ,,, PRIMITIVES,.. _primitivelibs-support-compatibility-matrix:,, - :doc:`hipCUB `,3.2.0,3.2.0,3.1.0 + :doc:`hipCUB `,3.2.1,3.2.0,3.1.0 :doc:`hipTensor `,1.3.0,1.3.0,1.2.0 - :doc:`rocPRIM `,3.2.0,3.2.0,3.1.0 - :doc:`rocThrust `,3.1.0,3.1.0,3.0.1 + :doc:`rocPRIM `,3.2.2,3.2.0,3.1.0 + :doc:`rocThrust `,3.1.1,3.1.0,3.0.1 ,,, SUPPORT LIBS,,, `hipother `_,6.2.41134,6.2.41134,6.1.40091 - `rocm-core `_,6.2.2,6.2.1,6.1.0 - `ROCT-Thunk-Interface `_,20240607.4.05,20240607.4.05,20240125.3.30 + `rocm-core `_,6.2.4,6.2.2,6.1.0 + `ROCT-Thunk-Interface `_,20240607.5.7,20240607.5.7,20240125.3.30 ,,, SYSTEM MGMT TOOLS,.. _tools-support-compatibility-matrix:,, :doc:`AMD SMI `,24.6.3,24.6.3,24.4.1 - :doc:`ROCm Data Center Tool `,1.0.0,1.0.0,0.3.0 + :doc:`ROCm Data Center Tool `,0.3.0,0.3.0,0.3.0 :doc:`rocminfo `,1.0.0,1.0.0,1.0.0 :doc:`ROCm SMI `,7.3.0,7.3.0,7.0.0 - :doc:`ROCm Validation Suite `,rocm-6.2.2,rocm-6.2.1,rocm-6.1.0 + :doc:`ROCm Validation Suite `,rocm-6.2.4,rocm-6.2.2,rocm-6.1.0 ,,, PERFORMANCE TOOLS,,, :doc:`Omniperf `,2.0.1,2.0.1,N/A :doc:`Omnitrace `,1.11.2,1.11.2,N/A :doc:`ROCm Bandwidth Test `,1.4.0,1.4.0,1.4.0 - :doc:`ROCProfiler `,2.0.60202,2.0.60201,2.0.60100 + :doc:`ROCProfiler `,2.0.60204,2.0.60202,2.0.60100 :doc:`ROCprofiler-SDK `,0.4.0,0.4.0,N/A - :doc:`ROCTracer `,4.1.60202,4.1.60201,4.1.60100 + :doc:`ROCTracer `,4.1.60204,4.1.60202,4.1.60100 ,,, DEVELOPMENT TOOLS,,, - :doc:`HIPIFY `,18.0.0.24355,18.0.0.24355,17.0.0.24103 + :doc:`HIPIFY `,18.0.0.24392,18.0.0.24355,17.0.0.24103 :doc:`ROCm CMake `,0.13.0,0.13.0,0.12.0 :doc:`ROCdbgapi `,0.76.0,0.76.0,0.71.0 :doc:`ROCm Debugger (ROCgdb) `,14.2.0,14.2.0,14.1.0 @@ -129,9 +129,9 @@ Accelerators and GPUs listed in the following table support compute workloads (n COMPILERS,.. _compilers-support-compatibility-matrix:,, `clang-ocl `_,N/A,N/A,0.5.0 :doc:`hipCC `,1.1.1,1.1.1,1.0.0 - `Flang `_,18.0.0.24355,18.0.0.24355,17.0.0.24103 - :doc:`llvm-project `,18.0.0.24355,18.0.0.24355,17.0.0.24103 - `OpenMP `_,18.0.0.24355,18.0.0.24355,17.0.0.24103 + `Flang `_,18.0.0.24392,18.0.0.24355,17.0.0.24103 + :doc:`llvm-project `,18.0.0.24392,18.0.0.24355,17.0.0.24103 + `OpenMP `_,18.0.0.24392,18.0.0.24355,17.0.0.24103 ,,, RUNTIMES,.. _runtime-support-compatibility-matrix:,, :doc:`AMD CLR `,6.2.41134,6.2.41134,6.1.40091 @@ -144,8 +144,8 @@ Accelerators and GPUs listed in the following table support compute workloads (n .. [#red-hat94] RHEL 9.4 is supported only on AMD Instinct MI300A. .. [#oracle89] Oracle Linux is supported only on AMD Instinct MI300X. +.. [#mi300_624] **For ROCm 6.2.4** - MI300X (gfx942) is supported on listed operating systems *except* Ubuntu 22.04.5 [6.8 HWE] and Ubuntu 22.04.4 [6.5 HWE]. .. [#mi300_622] **For ROCm 6.2.2** - MI300X (gfx942) is supported on listed operating systems *except* Ubuntu 22.04.5 [6.8 HWE] and Ubuntu 22.04.4 [6.5 HWE]. -.. [#mi300_621] **For ROCm 6.2.1** - MI300X (gfx942) is supported on listed operating systems *except* Ubuntu 22.04.5 [6.8 HWE] and Ubuntu 22.04.4 [6.5 HWE]. .. [#mi300_610] **For ROCm 6.1.0** - MI300A (gfx942) is supported on Ubuntu 22.04.4, RHEL 9.4, RHEL 9.3, RHEL 8.9, and SLES 15 SP5. MI300X (gfx942) is only supported on Ubuntu 22.04.4. .. [#kfd_support] ROCm provides forward and backward compatibility between the Kernel Fusion Driver (KFD) and its user space software for +/- 2 releases. These are the compatibility combinations that are currently supported. @@ -216,6 +216,7 @@ Expand for full historical view of: .. [#red-hat94-past-60] RHEL 9.4 is supported only on AMD Instinct MI300A. .. [#oracle89-past-60] Oracle Linux is supported only on AMD Instinct MI300X. + .. [#mi300_624-past-60] **For ROCm 6.2.4** - MI300X (gfx942) is supported on listed operating systems *except* Ubuntu 22.04.5 [6.8 HWE] and Ubuntu 22.04.4 [6.5 HWE]. .. [#mi300_622-past-60] **For ROCm 6.2.2** - MI300X (gfx942) is supported on listed operating systems *except* Ubuntu 22.04.5 [6.8 HWE] and Ubuntu 22.04.4 [6.5 HWE]. .. [#mi300_621-past-60] **For ROCm 6.2.1** - MI300X (gfx942) is supported on listed operating systems *except* Ubuntu 22.04.5 [6.8 HWE] and Ubuntu 22.04.4 [6.5 HWE]. .. [#mi300_620-past-60] **For ROCm 6.2.0** - MI300X (gfx942) is supported on listed operating systems *except* Ubuntu 22.04.5 [6.8 HWE] and Ubuntu 22.04.4 [6.5 HWE]. diff --git a/docs/conf.py b/docs/conf.py index 7b870c54c..7c74b94f6 100644 --- a/docs/conf.py +++ b/docs/conf.py @@ -30,15 +30,15 @@ if os.environ.get("READTHEDOCS", "") == "True": project = "ROCm Documentation" author = "Advanced Micro Devices, Inc." copyright = "Copyright (c) 2024 Advanced Micro Devices, Inc. All rights reserved." -version = "6.2.2" -release = "6.2.2" +version = "6.2.4" +release = "6.2.4" setting_all_article_info = True all_article_info_os = ["linux", "windows"] all_article_info_author = "" # pages with specific settings article_pages = [ - {"file": "about/release-notes", "os": ["linux", "windows"], "date": "2024-09-27"}, + {"file": "about/release-notes", "os": ["linux", "windows"], "date": "2024-11-06"}, {"file": "how-to/deep-learning-rocm", "os": ["linux"]}, {"file": "how-to/rocm-for-ai/index", "os": ["linux"]}, {"file": "how-to/rocm-for-ai/install", "os": ["linux"]}, diff --git a/docs/reference/gpu-arch-specs.rst b/docs/reference/gpu-arch-specs.rst index fdc3896c9..73c5b7359 100644 --- a/docs/reference/gpu-arch-specs.rst +++ b/docs/reference/gpu-arch-specs.rst @@ -235,6 +235,22 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - L0 Instruction Cache (KiB) - VGPR File (KiB) - SGPR File (KiB) + * + - Radeon PRO V710 + - RDNA3 + - gfx1101 + - 28 + - 54 + - 32 + - 128 + - 56 + - 4 + - 256 + - 32 + - 16 + - 32 + - 768 + - 16 * - Radeon PRO W7900 Dual Slot - RDNA3 @@ -249,8 +265,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon PRO W7900 - RDNA3 @@ -265,8 +281,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon PRO W7800 - RDNA3 @@ -281,8 +297,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon PRO W7700 - RDNA3 @@ -297,8 +313,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon PRO W6800 - RDNA2 @@ -313,8 +329,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon PRO W6600 - RDNA2 @@ -329,8 +345,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon PRO V620 - RDNA2 @@ -345,8 +361,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon Pro W5500 - RDNA @@ -361,7 +377,7 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 + - 512 - 20 * - Radeon Pro VII @@ -416,8 +432,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon RX 7900 XT - RDNA3 @@ -432,8 +448,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon RX 7900 GRE - RDNA3 @@ -448,8 +464,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon RX 7800 XT - RDNA3 @@ -464,8 +480,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon RX 7700 XT - RDNA3 @@ -480,8 +496,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 384 - - 20 + - 768 + - 16 * - Radeon RX 7600 - RDNA3 @@ -496,8 +512,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 32 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6950 XT - RDNA2 @@ -512,8 +528,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6900 XT - RDNA2 @@ -528,8 +544,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6800 XT - RDNA2 @@ -544,8 +560,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6800 - RDNA2 @@ -560,8 +576,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6750 XT - RDNA2 @@ -576,8 +592,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6700 XT - RDNA2 @@ -592,8 +608,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6700 - RDNA2 @@ -608,8 +624,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6650 XT - RDNA2 @@ -624,8 +640,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6600 XT - RDNA2 @@ -640,8 +656,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon RX 6600 - RDNA2 @@ -656,8 +672,8 @@ For more information about ROCm hardware compatibility, see the ROCm `Compatibil - 16 - 16 - 32 - - 256 - - 20 + - 512 + - 16 * - Radeon VII - GCN5.1 @@ -684,7 +700,7 @@ For more information about the terms used, see the **LLVM target name** -Argument to pass to clang in `--offload-arch` to compile code for the given +Argument to pass to clang in ``--offload-arch`` to compile code for the given architecture. **VRAM** @@ -719,13 +735,13 @@ data and instructions. Similar to the L3 Cache on CDNA/GCN architectures. **L2 Cache** -Size of the level 3 cache. Shared by all compute units on the same GCD. Caches +Size of the level 2 cache. Shared by all compute units on the same GCD. Caches data and instructions. **Graphics L1 Cache (RDNA only)** An additional cache level that only exists in RDNA architectures. Local to a -work group processor. +shader array. **L1 Vector Cache (CDNA/GCN only)** diff --git a/docs/release/versions.md b/docs/release/versions.md index 6d1e5b5fd..fbc8170a6 100644 --- a/docs/release/versions.md +++ b/docs/release/versions.md @@ -8,6 +8,8 @@ | Version | Release date | | ------- | ------------ | +| [6.2.4](https://rocm.docs.amd.com/en/docs-6.2.4/) | November 6, 2024 | + | [6.2.2](https://rocm.docs.amd.com/en/docs-6.2.2/) | September 27, 2024 | | [6.2.1](https://rocm.docs.amd.com/en/docs-6.2.1/) | September 20, 2024 | | [6.2.0](https://rocm.docs.amd.com/en/docs-6.2.0/) | August 2, 2024 | diff --git a/tools/autotag/templates/extra_components/6.2.4.md b/tools/autotag/templates/extra_components/6.2.4.md new file mode 100644 index 000000000..37f6d2962 --- /dev/null +++ b/tools/autotag/templates/extra_components/6.2.4.md @@ -0,0 +1,35 @@ +### Hardware architecture support updates + +Updated the following math and primitives libraries to pre-enable support for +an upcoming hardware architecture. + +* hipCUB (3.2.1) + +* hipFFT (1.0.16) + +* hipRAND (2.11.1) + +* rocALUTION (3.2.1) + +* rocBLAS (4.2.4) + +* rocFFT (1.0.30) + +* rocPRIM (3.2.2) + +* rocRAND (3.1.1) + +* rocSOLVER (3.26.2) + +* rocSPARSE (3.2.1) + +* rocThrust (3.1.1) + +### **AMD SMI** (24.6.3) + +#### Resolved issues + +* Fixed support for the API calls `amdsmi_get_gpu_process_isolation` and + `amdsmi_clean_gpu_local_data`, along with the + `amd-smi set --process-isolation <0 or 1>` command. See issue + [#3500](https://github.com/ROCm/ROCm/issues/3500) on GitHub. diff --git a/tools/autotag/templates/highlights/6.2.4.md b/tools/autotag/templates/highlights/6.2.4.md new file mode 100644 index 000000000..4d8af7762 --- /dev/null +++ b/tools/autotag/templates/highlights/6.2.4.md @@ -0,0 +1,55 @@ +# ROCm 6.2.4 release notes + +The release notes provide a summary of notable changes since the previous ROCm release. + +- [Release highlights](#release-highlights) + +- [Operating system and hardware support changes](#operating-system-and-hardware-support-changes) + +- [ROCm components versioning](#rocm-components) + +- [Detailed component changes](#detailed-component-changes) + +- [ROCm known issues](#rocm-known-issues) + +- [ROCm upcoming changes](#rocm-upcoming-changes) + +```{note} +If you’re using Radeon™ PRO or Radeon GPUs in a workstation setting with a +display connected, continue to use ROCm 6.2.3. See the [Use ROCm on Radeon +GPUs](https://rocm.docs.amd.com/projects/radeon/en/latest/index.html) +documentation to verify compatibility and system requirements. +``` + +## Release highlights + +The following are notable new features and improvements in ROCm 6.2.4. For changes to individual components, see +[Detailed component changes](#detailed-component-changes). + +#### ROCm documentation updates + +ROCm documentation continues to be updated to provide clearer and more comprehensive guidance for +a wider variety of user needs and use cases. + +* Added a new GPU cluster networking guide. See + [Cluster network performance validation for AMD Instinct accelerators](https://rocm.docs.amd.com/projects/gpu-cluster-networking/en/latest/index.html). + + This documentation provides guidelines on validating network configurations + in single-node and multi-node environments to attain optimal speed and bandwidth + in AMD Instinct-powered clusters. + +* Updated the HIP runtime documentation. + + * Added a new section on how to use [HIP graphs](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hipgraph.html). + + * Added a new section about the [Stream ordered memory allocator (SOMA)](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/stream_ordered_allocator.html). + + * Updated the [Porting CUDA driver API](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hip_porting_driver_api.html) section. + +* Updated the [Post-installation instructions](https://rocm.docs.amd.com/projects/install-on-linux/en/docs-6.2.4/install/post-install.html) + with guidance on using the `update-alternatives` utility and environment modules to help you manage multiple ROCm + versions and streamline PATH configuration. + +* Updated [LLM inference performance validation on AMD Instinct + MI300X](https://rocm.docs.amd.com/en/docs-6.2.4/how-to/performance-validation/mi300x/vllm-benchmark.html) + documentation with more detailed guidance, new models, and the `float8` data type. diff --git a/tools/autotag/templates/support/6.2.4.md b/tools/autotag/templates/support/6.2.4.md new file mode 100644 index 000000000..cb5b1f3a2 --- /dev/null +++ b/tools/autotag/templates/support/6.2.4.md @@ -0,0 +1,5 @@ +## Operating system and hardware support changes + +ROCm 6.2.4 adds support for the AMD Radeon PRO V710 GPU. See +[Supported GPUs](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/docs-6.2.4/reference/system-requirements.html#supported-gpus) +for more information. diff --git a/tools/autotag/templates/upcoming_changes/6.2.4.md b/tools/autotag/templates/upcoming_changes/6.2.4.md new file mode 100644 index 000000000..24e7e6696 --- /dev/null +++ b/tools/autotag/templates/upcoming_changes/6.2.4.md @@ -0,0 +1,9 @@ +The following changes to the ROCm software stack are anticipated for future releases. + +### rocm-llvm-alt + +The `rocm-llvm-alt` package will be removed in an upcoming release. Users relying on the functionality provided by the closed-source compiler should transition to the open-source compiler. Once the `rocm-llvm-alt` package is removed, any compilation requesting functionality provided by the closed-source compiler will result in a Clang warning: "*[AMD] proprietary optimization compiler has been removed*". + +### rccl-rdma-sharp-plugins + +The RCCL plugin package, `rccl-rdma-sharp-plugins`, will be removed in an upcoming ROCm release.