mirror of
https://github.com/ROCm/ROCm.git
synced 2026-01-10 15:18:11 -05:00
Compare commits
9 Commits
docs/5.3.0
...
docs/5.4.3
| Author | SHA1 | Date | |
|---|---|---|---|
|
|
b3d51b08e4 | ||
|
|
e215bc3545 | ||
|
|
8b1b3048e3 | ||
|
|
18e68a7aac | ||
|
|
c3ff8fd3c7 | ||
|
|
100e37a80f | ||
|
|
79efb6705d | ||
|
|
c4c902d3a8 | ||
|
|
9f24b4168a |
954
CHANGELOG.md
954
CHANGELOG.md
@@ -15,6 +15,960 @@ The release notes for the ROCm platform.
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.4.3
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
### Deprecations and Warnings
|
||||
|
||||
#### HIP Perl Scripts Deprecation
|
||||
|
||||
The `hipcc` and `hipconfig` Perl scripts are deprecated. In a future release, compiled binaries will be available as `hipcc.bin` and `hipconfig.bin` as replacements for the Perl scripts.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> There will be a transition period where the Perl scripts and compiled binaries are available before the scripts are removed. There will be no functional difference between the Perl scripts and their compiled binary counterpart. No user action is required. Once these are available, users can optionally switch to `hipcc.bin` and `hipconfig.bin`. The `hipcc`/`hipconfig` soft link will be assimilated to point from `hipcc`/`hipconfig` to the respective compiled binaries as the default option.
|
||||
|
||||
##### Linux Filesystem Hierarchy Standard for ROCm
|
||||
|
||||
ROCm packages have adopted the Linux foundation filesystem hierarchy standard in this release to ensure ROCm components follow open source conventions for Linux-based distributions. While moving to a new filesystem hierarchy, ROCm ensures backward compatibility with its 5.1 version or older filesystem hierarchy. See below for a detailed explanation of the new filesystem hierarchy and backward compatibility.
|
||||
|
||||
##### New Filesystem Hierarchy
|
||||
|
||||
The following is the new filesystem hierarchy:4
|
||||
|
||||
```text
|
||||
/opt/rocm-<ver>
|
||||
| --bin
|
||||
| --All externally exposed Binaries
|
||||
| --libexec
|
||||
| --<component>
|
||||
| -- Component specific private non-ISA executables (architecture independent)
|
||||
| --include
|
||||
| -- <component>
|
||||
| --<header files>
|
||||
| --lib
|
||||
| --lib<soname>.so -> lib<soname>.so.major -> lib<soname>.so.major.minor.patch
|
||||
(public libraries linked with application)
|
||||
| --<component> (component specific private library, executable data)
|
||||
| --<cmake>
|
||||
| --components
|
||||
| --<component>.config.cmake
|
||||
| --share
|
||||
| --html/<component>/*.html
|
||||
| --info/<component>/*.[pdf, md, txt]
|
||||
| --man
|
||||
| --doc
|
||||
| --<component>
|
||||
| --<licenses>
|
||||
| --<component>
|
||||
| --<misc files> (arch independent non-executable)
|
||||
| --samples
|
||||
|
||||
```
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> ROCm will not support backward compatibility with the v5.1(old) file system hierarchy in its next major release.
|
||||
|
||||
For more information, refer to <https://refspecs.linuxfoundation.org/fhs.shtml>.
|
||||
|
||||
##### Backward Compatibility with Older Filesystems
|
||||
|
||||
ROCm has moved header files and libraries to its new location as indicated in the above structure and included symbolic-link and wrapper header files in its old location for backward compatibility.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> ROCm will continue supporting backward compatibility until the next major release.
|
||||
|
||||
##### Wrapper header files
|
||||
|
||||
Wrapper header files are placed in the old location (`/opt/rocm-xxx/<component>/include`) with a warning message to include files from the new location (`/opt/rocm-xxx/include`) as shown in the example below:
|
||||
|
||||
```h
|
||||
// Code snippet from hip_runtime.h
|
||||
#pragma message “This file is deprecated. Use file from include path /opt/rocm-ver/include/ and prefix with hip”.
|
||||
#include "hip/hip_runtime.h"
|
||||
```
|
||||
|
||||
The wrapper header files’ backward compatibility deprecation is as follows:
|
||||
|
||||
- `#pragma` message announcing deprecation -- ROCm v5.2 release
|
||||
- `#pragma` message changed to `#warning` -- Future release
|
||||
- `#warning` changed to `#error` -- Future release
|
||||
- Backward compatibility wrappers removed -- Future release
|
||||
|
||||
##### Library files
|
||||
|
||||
Library files are available in the `/opt/rocm-xxx/lib` folder. For backward compatibility, the old library location (`/opt/rocm-xxx/<component>/lib`) has a soft link to the library at the new location.
|
||||
|
||||
Example:
|
||||
|
||||
```log
|
||||
$ ls -l /opt/rocm/hip/lib/
|
||||
total 4
|
||||
drwxr-xr-x 4 root root 4096 May 12 10:45 cmake
|
||||
lrwxrwxrwx 1 root root 24 May 10 23:32 libamdhip64.so -> ../../lib/libamdhip64.so
|
||||
```
|
||||
|
||||
##### CMake Config files
|
||||
|
||||
All CMake configuration files are available in the `/opt/rocm-xxx/lib/cmake/<component>` folder. For backward compatibility, the old CMake locations (`/opt/rocm-xxx/<component>/lib/cmake`) consist of a soft link to the new CMake config.
|
||||
|
||||
Example:
|
||||
|
||||
```log
|
||||
$ ls -l /opt/rocm/hip/lib/cmake/hip/
|
||||
total 0
|
||||
lrwxrwxrwx 1 root root 42 May 10 23:32 hip-config.cmake -> ../../../../lib/cmake/hip/hip-config.cmake
|
||||
```
|
||||
|
||||
### Fixed Defects
|
||||
|
||||
#### Compiler Improvements
|
||||
|
||||
In ROCm v5.4.3, improvements to the compiler address errors with the following signatures:
|
||||
|
||||
- "error: unhandled SGPR spill to memory"
|
||||
- "cannot scavenge register without an emergency spill slot!"
|
||||
- "error: ran out of registers during register allocation"
|
||||
|
||||
### Known Issues
|
||||
|
||||
#### Compiler Option Error at Runtime
|
||||
|
||||
Some users may encounter a “Cannot find Symbol” error at runtime when using -save-temps. While most -save-temps use cases work correctly, this error may appear occasionally.
|
||||
|
||||
This issue is under investigation, and the known workaround is not to use -save-temps when the error appears.
|
||||
|
||||
### Library Changes in ROCM 5.4.3
|
||||
|
||||
| Library | Version |
|
||||
|---------|---------|
|
||||
| hipBLAS | [0.53.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.4.3) |
|
||||
| hipCUB | [2.13.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.4.3) |
|
||||
| hipFFT | [1.0.10](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.4.3) |
|
||||
| hipSOLVER | [1.6.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.4.3) |
|
||||
| hipSPARSE | [2.3.3](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.4.3) |
|
||||
| rccl | [2.13.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.4.3) |
|
||||
| rocALUTION | [2.1.3](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.4.3) |
|
||||
| rocBLAS | [2.46.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.4.3) |
|
||||
| rocFFT | 1.0.20 ⇒ [1.0.21](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.4.3) |
|
||||
| rocPRIM | [2.12.0](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.4.3) |
|
||||
| rocRAND | [2.10.16](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.4.3) |
|
||||
| rocSOLVER | [3.20.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.4.3) |
|
||||
| rocSPARSE | [2.4.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.4.3) |
|
||||
| rocThrust | [2.17.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.4.3) |
|
||||
| rocWMMA | [0.9](https://github.com/ROCmSoftwarePlatform/rocWMMA/releases/tag/rocm-5.4.3) |
|
||||
| Tensile | [4.35.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.4.3) |
|
||||
|
||||
#### rocFFT 1.0.21
|
||||
|
||||
rocFFT 1.0.21 for ROCm 5.4.3
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Removed source directory from rocm_install_targets call to prevent installation of rocfft.h in an unintended location.
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.4.2
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
### Deprecations and Warnings
|
||||
|
||||
#### HIP Perl Scripts Deprecation
|
||||
|
||||
The `hipcc` and `hipconfig` Perl scripts are deprecated. In a future release, compiled binaries will be available as `hipcc.bin` and `hipconfig.bin` as replacements for the Perl scripts.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> There will be a transition period where the Perl scripts and compiled binaries are available before the scripts are removed. There will be no functional difference between the Perl scripts and their compiled binary counterpart. No user action is required. Once these are available, users can optionally switch to `hipcc.bin` and `hipconfig.bin`. The `hipcc`/`hipconfig` soft link will be assimilated to point from `hipcc`/`hipconfig` to the respective compiled binaries as the default option.
|
||||
|
||||
#### `hipcc` Options Deprecation
|
||||
|
||||
The following hipcc options are being deprecated and will be removed in a future release:
|
||||
|
||||
- The `--amdgpu-target` option is being deprecated, and user must use the `–offload-arch` option to specify the GPU architecture.
|
||||
- The `--amdhsa-code-object-version` option is being deprecated. Users can use the Clang/LLVM option `-mllvm -mcode-object-version` to debug issues related to code object versions.
|
||||
- The `--hipcc-func-supp`/`--hipcc-no-func-supp` options are being deprecated, as the function calls are already supported in production on AMD GPUs.
|
||||
|
||||
### Known Issues
|
||||
|
||||
Under certain circumstances typified by high register pressure, users may encounter a compiler abort with one of the following error messages:
|
||||
|
||||
- > `error: unhandled SGPR spill to memory`
|
||||
|
||||
- > `cannot scavenge register without an emergency spill slot!`
|
||||
|
||||
- > `error: ran out of registers during register allocation`
|
||||
|
||||
This is a known issue and will be fixed in a future release.
|
||||
|
||||
### Library Changes in ROCM 5.4.2
|
||||
|
||||
| Library | Version |
|
||||
|---------|---------|
|
||||
| hipBLAS | [0.53.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.4.2) |
|
||||
| hipCUB | [2.13.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.4.2) |
|
||||
| hipFFT | [1.0.10](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.4.2) |
|
||||
| hipSOLVER | [1.6.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.4.2) |
|
||||
| hipSPARSE | [2.3.3](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.4.2) |
|
||||
| rccl | [2.13.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.4.2) |
|
||||
| rocALUTION | [2.1.3](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.4.2) |
|
||||
| rocBLAS | [2.46.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.4.2) |
|
||||
| rocFFT | [1.0.20](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.4.2) |
|
||||
| rocPRIM | [2.12.0](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.4.2) |
|
||||
| rocRAND | [2.10.16](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.4.2) |
|
||||
| rocSOLVER | [3.20.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.4.2) |
|
||||
| rocSPARSE | [2.4.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.4.2) |
|
||||
| rocThrust | [2.17.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.4.2) |
|
||||
| rocWMMA | [0.9](https://github.com/ROCmSoftwarePlatform/rocWMMA/releases/tag/rocm-5.4.2) |
|
||||
| Tensile | [4.35.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.4.2) |
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.4.1
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
### What's New in This Release
|
||||
|
||||
#### HIP Enhancements
|
||||
|
||||
The ROCm v5.4.1 release consists of the following new HIP API:
|
||||
|
||||
##### New HIP API - hipLaunchHostFunc
|
||||
|
||||
The following new HIP API is introduced in the ROCm v5.4.1 release.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> This is a pre-official version (beta) release of the new APIs.
|
||||
|
||||
```h
|
||||
hipError_t hipLaunchHostFunc(hipStream_t stream, hipHostFn_t fn, void* userData);
|
||||
```
|
||||
|
||||
This swaps the stream capture mode of a thread.
|
||||
|
||||
```text
|
||||
@param [in] mode - Pointer to mode value to swap with the current mode
|
||||
```
|
||||
|
||||
This parameter returns `#hipSuccess`, `#hipErrorInvalidValue`.
|
||||
|
||||
For more information, refer to the HIP API documentation at /bundle/HIP_API_Guide/page/modules.html.
|
||||
|
||||
### Deprecations and Warnings
|
||||
|
||||
#### HIP Perl Scripts Deprecation
|
||||
|
||||
The `hipcc` and `hipconfig` Perl scripts are deprecated. In a future release, compiled binaries will be available as `hipcc.bin` and `hipconfig.bin` as replacements for the Perl scripts.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> There will be a transition period where the Perl scripts and compiled binaries are available before the scripts are removed. There will be no functional difference between the Perl scripts and their compiled binary counterpart. No user action is required. Once these are available, users can optionally switch to `hipcc.bin` and `hipconfig.bin`. The `hipcc`/`hipconfig` soft link will be assimilated to point from `hipcc`/`hipconfig` to the respective compiled binaries as the default option.
|
||||
|
||||
### IFWI Fixes
|
||||
|
||||
These defects were identified and documented as known issues in previous ROCm releases and are fixed in this release.
|
||||
AMD Instinct™ MI200 Firmware IFWI Maintenance Update #3
|
||||
|
||||
This IFWI release fixes the following issue in AMD Instinct™ MI210/MI250 Accelerators.
|
||||
|
||||
After prolonged periods of operation, certain MI200 Instinct™ Accelerators may perform in a degraded way resulting in application failures.
|
||||
|
||||
In this package, AMD delivers a new firmware version for MI200 GPU accelerators and a firmware installation tool – AMD FW FLASH 1.2.
|
||||
|
||||
| GPU | Production Part Number | SKU | IFWI Name |
|
||||
|-------|------------|--------|---------------|
|
||||
| MI210 | 113-D673XX | D67302 | D6730200V.110 |
|
||||
| MI210 | 113-D673XX | D67301 | D6730100V.073 |
|
||||
| MI250 | 113-D652XX | D65209 | D6520900.073 |
|
||||
| MI250 | 113-D652XX | D65210 | D6521000.073 |
|
||||
|
||||
Instructions on how to download and apply MI200 maintenance updates are available at:
|
||||
|
||||
<https://www.amd.com/en/support/server-accelerators/amd-instinct/amd-instinct-mi-series/amd-instinct-mi210>
|
||||
|
||||
#### AMD Instinct™ MI200 SRIOV Virtualization Support
|
||||
|
||||
Maintenance update #3, combined with ROCm 5.4.1, now provides SRIOV virtualization support for all AMD Instinct™ MI200 devices.
|
||||
|
||||
### Library Changes in ROCM 5.4.1
|
||||
|
||||
| Library | Version |
|
||||
|---------|---------|
|
||||
| hipBLAS | [0.53.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.4.1) |
|
||||
| hipCUB | [2.13.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.4.1) |
|
||||
| hipFFT | [1.0.10](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.4.1) |
|
||||
| hipSOLVER | [1.6.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.4.1) |
|
||||
| hipSPARSE | [2.3.3](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.4.1) |
|
||||
| rccl | [2.13.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.4.1) |
|
||||
| rocALUTION | [2.1.3](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.4.1) |
|
||||
| rocBLAS | [2.46.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.4.1) |
|
||||
| rocFFT | 1.0.19 ⇒ [1.0.20](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.4.1) |
|
||||
| rocPRIM | [2.12.0](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.4.1) |
|
||||
| rocRAND | [2.10.16](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.4.1) |
|
||||
| rocSOLVER | [3.20.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.4.1) |
|
||||
| rocSPARSE | [2.4.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.4.1) |
|
||||
| rocThrust | [2.17.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.4.1) |
|
||||
| rocWMMA | [0.9](https://github.com/ROCmSoftwarePlatform/rocWMMA/releases/tag/rocm-5.4.1) |
|
||||
| Tensile | [4.35.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.4.1) |
|
||||
|
||||
#### rocFFT 1.0.20
|
||||
|
||||
rocFFT 1.0.20 for ROCm 5.4.1
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Fixed incorrect results on strided large 1D FFTs where batch size does not equal the stride.
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.4.0
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
<!-- markdownlint-disable no-duplicate-header -->
|
||||
### What's New in This Release
|
||||
|
||||
#### HIP Enhancements
|
||||
|
||||
The ROCm v5.4 release consists of the following HIP enhancements:
|
||||
|
||||
##### Support for Wall Clock64
|
||||
|
||||
A new timer function wall_clock64() is supported, which returns wall clock count at a constant frequency on the device.
|
||||
|
||||
```h
|
||||
long long int wall_clock64();
|
||||
```
|
||||
|
||||
It returns wall clock count at a constant frequency on the device, which can be queried via HIP API with the hipDeviceAttributeWallClockRate attribute of the device in the HIP application code.
|
||||
|
||||
Example:
|
||||
|
||||
```h
|
||||
int wallClkRate = 0; //in kilohertz
|
||||
+HIPCHECK(hipDeviceGetAttribute(&wallClkRate, hipDeviceAttributeWallClockRate, deviceId));
|
||||
```
|
||||
|
||||
Where hipDeviceAttributeWallClockRate is a device attribute.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> The wall clock frequency is a per-device attribute.
|
||||
|
||||
##### New Registry Added for GPU_MAX_HW_QUEUES
|
||||
|
||||
The GPU_MAX_HW_QUEUES registry defines the maximum number of independent hardware queues allocated per process per device.
|
||||
|
||||
The environment variable controls how many independent hardware queues HIP runtime can create per process, per device. If the application allocates more HIP streams than this number, then the HIP runtime reuses the same hardware queues for the new streams in a round-robin manner.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> This maximum number does not apply to hardware queues created for CU-masked HIP streams or cooperative queues for HIP Cooperative Groups (there is only one queue per device).
|
||||
|
||||
For more details, refer to the HIP Programming Guide.
|
||||
|
||||
#### New HIP APIs in This Release
|
||||
|
||||
The following new HIP APIs are available in the ROCm v5.4 release.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> This is a pre-official version (beta) release of the new APIs.
|
||||
|
||||
##### Error Handling
|
||||
|
||||
```h
|
||||
hipError_t hipDrvGetErrorName(hipError_t hipError, const char** errorString);
|
||||
```
|
||||
|
||||
This returns HIP errors in the text string format.
|
||||
|
||||
```h
|
||||
hipError_t hipDrvGetErrorString(hipError_t hipError, const char** errorString);
|
||||
```
|
||||
|
||||
This returns text string messages with more details about the error.
|
||||
|
||||
For more information, refer to the HIP API Guide.
|
||||
|
||||
##### HIP Tests Source Separation
|
||||
|
||||
With ROCm v5.4, a separate GitHub project is created at
|
||||
|
||||
<https://github.com/ROCm-Developer-Tools/hip-tests>
|
||||
|
||||
This contains HIP catch2 tests and samples, and new tests will continue to develop.
|
||||
|
||||
In future ROCm releases, catch2 tests and samples will be removed from the HIP project.
|
||||
|
||||
### OpenMP Enhancements
|
||||
|
||||
This release consists of the following OpenMP enhancements:
|
||||
|
||||
- Enable new device RTL in libomptarget as default.
|
||||
- New flag `-fopenmp-target-fast` to imply `-fopenmp-target-ignore-env-vars -fopenmp-assume-no-thread-state -fopenmp-assume-no-nested-parallelism`.
|
||||
- Support for the collapse clause and non-unit stride in cases where the No-Loop specialized kernel is generated.
|
||||
- Initial implementation of optimized cross-team sum reduction for float and double type scalars.
|
||||
- Pool-based optimization in the OpenMP runtime to reduce locking during data transfer.
|
||||
|
||||
### Deprecations and Warnings
|
||||
|
||||
#### HIP Perl Scripts Deprecation
|
||||
|
||||
The `hipcc` and `hipconfig` Perl scripts are deprecated. In a future release, compiled binaries will be available as `hipcc.bin` and `hipconfig.bin` as replacements for the Perl scripts.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> There will be a transition period where the Perl scripts and compiled binaries are available before the scripts are removed. There will be no functional difference between the Perl scripts and their compiled binary counterpart. No user action is required. Once these are available, users can optionally switch to `hipcc.bin` and `hipconfig.bin`. The `hipcc`/`hipconfig` soft link will be assimilated to point from `hipcc`/`hipconfig` to the respective compiled binaries as the default option.
|
||||
|
||||
##### Linux Filesystem Hierarchy Standard for ROCm
|
||||
|
||||
ROCm packages have adopted the Linux foundation filesystem hierarchy standard in this release to ensure ROCm components follow open source conventions for Linux-based distributions. While moving to a new filesystem hierarchy, ROCm ensures backward compatibility with its 5.1 version or older filesystem hierarchy. See below for a detailed explanation of the new filesystem hierarchy and backward compatibility.
|
||||
|
||||
##### New Filesystem Hierarchy
|
||||
|
||||
The following is the new filesystem hierarchy:
|
||||
|
||||
```text
|
||||
/opt/rocm-<ver>
|
||||
| --bin
|
||||
| --All externally exposed Binaries
|
||||
| --libexec
|
||||
| --<component>
|
||||
| -- Component specific private non-ISA executables (architecture independent)
|
||||
| --include
|
||||
| -- <component>
|
||||
| --<header files>
|
||||
| --lib
|
||||
| --lib<soname>.so -> lib<soname>.so.major -> lib<soname>.so.major.minor.patch
|
||||
(public libraries linked with application)
|
||||
| --<component> (component specific private library, executable data)
|
||||
| --<cmake>
|
||||
| --components
|
||||
| --<component>.config.cmake
|
||||
| --share
|
||||
| --html/<component>/*.html
|
||||
| --info/<component>/*.[pdf, md, txt]
|
||||
| --man
|
||||
| --doc
|
||||
| --<component>
|
||||
| --<licenses>
|
||||
| --<component>
|
||||
| --<misc files> (arch independent non-executable)
|
||||
| --samples
|
||||
|
||||
```
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> ROCm will not support backward compatibility with the v5.1(old) file system hierarchy in its next major release.
|
||||
|
||||
For more information, refer to <https://refspecs.linuxfoundation.org/fhs.shtml>.
|
||||
|
||||
##### Backward Compatibility with Older Filesystems
|
||||
|
||||
ROCm has moved header files and libraries to its new location as indicated in the above structure and included symbolic-link and wrapper header files in its old location for backward compatibility.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> ROCm will continue supporting backward compatibility until the next major release.
|
||||
|
||||
##### Wrapper header files
|
||||
|
||||
Wrapper header files are placed in the old location (`/opt/rocm-xxx/<component>/include`) with a warning message to include files from the new location (`/opt/rocm-xxx/include`) as shown in the example below:
|
||||
|
||||
```h
|
||||
// Code snippet from hip_runtime.h
|
||||
#pragma message “This file is deprecated. Use file from include path /opt/rocm-ver/include/ and prefix with hip”.
|
||||
#include "hip/hip_runtime.h"
|
||||
```
|
||||
|
||||
The wrapper header files’ backward compatibility deprecation is as follows:
|
||||
|
||||
- `#pragma` message announcing deprecation -- ROCm v5.2 release
|
||||
- `#pragma` message changed to `#warning` -- Future release
|
||||
- `#warning` changed to `#error` -- Future release
|
||||
- Backward compatibility wrappers removed -- Future release
|
||||
|
||||
##### Library files
|
||||
|
||||
Library files are available in the `/opt/rocm-xxx/lib` folder. For backward compatibility, the old library location (`/opt/rocm-xxx/<component>/lib`) has a soft link to the library at the new location.
|
||||
|
||||
Example:
|
||||
|
||||
```log
|
||||
$ ls -l /opt/rocm/hip/lib/
|
||||
total 4
|
||||
drwxr-xr-x 4 root root 4096 May 12 10:45 cmake
|
||||
lrwxrwxrwx 1 root root 24 May 10 23:32 libamdhip64.so -> ../../lib/libamdhip64.so
|
||||
```
|
||||
|
||||
##### CMake Config files
|
||||
|
||||
All CMake configuration files are available in the `/opt/rocm-xxx/lib/cmake/<component>` folder. For backward compatibility, the old CMake locations (`/opt/rocm-xxx/<component>/lib/cmake`) consist of a soft link to the new CMake config.
|
||||
|
||||
Example:
|
||||
|
||||
```log
|
||||
$ ls -l /opt/rocm/hip/lib/cmake/hip/
|
||||
total 0
|
||||
lrwxrwxrwx 1 root root 42 May 10 23:32 hip-config.cmake -> ../../../../lib/cmake/hip/hip-config.cmake
|
||||
```
|
||||
|
||||
### Fixed Defects
|
||||
|
||||
The following defects are fixed in this release.
|
||||
|
||||
These defects were identified and documented as known issues in previous ROCm releases and are fixed in this release.
|
||||
|
||||
#### Memory Allocated Using hipHostMalloc() with Flags Did Not Exhibit Fine-Grain Behavior
|
||||
|
||||
##### Issue
|
||||
|
||||
The test was incorrectly using the `hipDeviceAttributePageableMemoryAccess` device attribute to determine coherent support.
|
||||
|
||||
##### Fix
|
||||
|
||||
`hipHostMalloc()` allocates memory with fine-grained access by default when the environment variable `HIP_HOST_COHERENT=1` is used.
|
||||
|
||||
For more information, refer to the HIP Programming Guide at
|
||||
|
||||
<https://docs.amd.com/bundle/HIP-Programming-Guide-v5.4/page/Introduction_to_HIP_Programming_Guide.html>
|
||||
|
||||
#### SoftHang with `hipStreamWithCUMask` test on AMD Instinct™
|
||||
|
||||
##### Issue
|
||||
|
||||
On GFX10 GPUs, kernel execution hangs when it is launched on streams created using `hipStreamWithCUMask`.
|
||||
|
||||
##### Fix
|
||||
|
||||
On GFX10 GPUs, each workgroup processor encompasses two compute units, and the compute units must be enabled as a pair. The `hipStreamWithCUMask` API unit test cases are updated to set compute unit mask (cuMask) in pairs for GFX10 GPUs.
|
||||
|
||||
#### ROCm Tools GPU IDs
|
||||
|
||||
The HIP language device IDs are not the same as the GPU IDs reported by the tools. GPU IDs are globally unique and guaranteed to be consistent across APIs and processes.
|
||||
|
||||
GPU IDs reported by ROCTracer and ROCProfiler or ROCm Tools are HSA Driver Node ID of that GPU, as it is a unique ID for that device in that particular node.
|
||||
|
||||
### Library Changes in ROCM 5.4.0
|
||||
|
||||
| Library | Version |
|
||||
|---------|---------|
|
||||
| hipBLAS | 0.52.0 ⇒ [0.53.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.4.0) |
|
||||
| hipCUB | 2.12.0 ⇒ [2.13.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.4.0) |
|
||||
| hipFFT | 1.0.9 ⇒ [1.0.10](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.4.0) |
|
||||
| hipSOLVER | 1.5.0 ⇒ [1.6.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.4.0) |
|
||||
| hipSPARSE | 2.3.1 ⇒ [2.3.3](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.4.0) |
|
||||
| rccl | 2.12.10 ⇒ [2.13.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.4.0) |
|
||||
| rocALUTION | 2.1.0 ⇒ [2.1.3](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.4.0) |
|
||||
| rocBLAS | 2.45.0 ⇒ [2.46.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.4.0) |
|
||||
| rocFFT | 1.0.18 ⇒ [1.0.19](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.4.0) |
|
||||
| rocPRIM | 2.11.0 ⇒ [2.12.0](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.4.0) |
|
||||
| rocRAND | 2.10.15 ⇒ [2.10.16](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.4.0) |
|
||||
| rocSOLVER | 3.19.0 ⇒ [3.20.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.4.0) |
|
||||
| rocSPARSE | 2.2.0 ⇒ [2.4.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.4.0) |
|
||||
| rocThrust | 2.16.0 ⇒ [2.17.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.4.0) |
|
||||
| rocWMMA | 0.8 ⇒ [0.9](https://github.com/ROCmSoftwarePlatform/rocWMMA/releases/tag/rocm-5.4.0) |
|
||||
| Tensile | 4.34.0 ⇒ [4.35.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.4.0) |
|
||||
|
||||
#### hipBLAS 0.53.0
|
||||
|
||||
hipBLAS 0.53.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Allow for selection of int8 datatype
|
||||
- Added support for hipblasXgels and hipblasXgelsStridedBatched operations (with s,d,c,z precisions),
|
||||
only supported with rocBLAS backend
|
||||
- Added support for hipblasXgelsBatched operations (with s,d,c,z precisions)
|
||||
|
||||
#### hipCUB 2.13.0
|
||||
|
||||
hipCUB 2.13.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- CMake functionality to improve build parallelism of the test suite that splits compilation units by
|
||||
function or by parameters.
|
||||
- New overload for `BlockAdjacentDifference::SubtractLeftPartialTile` that takes a predecessor item.
|
||||
|
||||
##### Changed
|
||||
|
||||
- Improved build parallelism of the test suite by splitting up large compilation units for `DeviceRadixSort`,
|
||||
`DeviceSegmentedRadixSort` and `DeviceSegmentedSort`.
|
||||
- CUB backend references CUB and thrust version 1.17.1.
|
||||
|
||||
#### hipFFT 1.0.10
|
||||
|
||||
hipFFT 1.0.10 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added hipfftExtPlanScaleFactor API to efficiently multiply each output element of a FFT by a given scaling factor. Result scaling must be supported in the backend FFT library.
|
||||
|
||||
##### Changed
|
||||
|
||||
- When hipFFT is built against the rocFFT backend, rocFFT 1.0.19 or higher is now required.
|
||||
|
||||
#### hipSOLVER 1.6.0
|
||||
|
||||
hipSOLVER 1.6.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added compatibility-only functions
|
||||
- gesvdaStridedBatched
|
||||
- hipsolverDnSgesvdaStridedBatched_bufferSize, hipsolverDnDgesvdaStridedBatched_bufferSize, hipsolverDnCgesvdaStridedBatched_bufferSize, hipsolverDnZgesvdaStridedBatched_bufferSize
|
||||
- hipsolverDnSgesvdaStridedBatched, hipsolverDnDgesvdaStridedBatched, hipsolverDnCgesvdaStridedBatched, hipsolverDnZgesvdaStridedBatched
|
||||
|
||||
#### hipSPARSE 2.3.3
|
||||
|
||||
hipSPARSE 2.3.3 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added hipsparseCsr2cscEx2_bufferSize and hipsparseCsr2cscEx2 routines
|
||||
|
||||
##### Changed
|
||||
|
||||
- HIPSPARSE_ORDER_COLUMN has been renamed to HIPSPARSE_ORDER_COL to match cusparse
|
||||
|
||||
#### rccl 2.13.4
|
||||
|
||||
RCCL 2.13.4 for ROCm 5.4.0
|
||||
|
||||
##### Changed
|
||||
|
||||
- Compatibility with NCCL 2.13.4
|
||||
- Improvements to RCCL when running with hipGraphs
|
||||
- RCCL_ENABLE_HIPGRAPH environment variable is no longer necessary to enable hipGraph support
|
||||
- Minor latency improvements
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Resolved potential memory access error due to asynchronous memset
|
||||
|
||||
#### rocALUTION 2.1.3
|
||||
|
||||
rocALUTION 2.1.3 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added build support for Navi31 and Navi33
|
||||
- Added support for non-squared global matrices
|
||||
|
||||
##### Improved
|
||||
|
||||
- Fixed a memory leak in MatrixMult on HIP backend
|
||||
- Global structures can now be used with a single process
|
||||
|
||||
##### Changed
|
||||
|
||||
- Switched GTest death test style to 'threadsafe'
|
||||
- GlobalVector::GetGhostSize() is deprecated and will be removed
|
||||
- ParallelManager::GetGlobalSize(), ParallelManager::GetLocalSize(), ParallelManager::SetGlobalSize() and ParallelManager::SetLocalSize() are deprecated and will be removed
|
||||
- Vector::GetGhostSize() is deprecated and will be removed
|
||||
- Multigrid::SetOperatorFormat(unsigned int) is deprecated and will be removed, use Multigrid::SetOperatorFormat(unsigned int, int) instead
|
||||
- RugeStuebenAMG::SetCouplingStrength(ValueType) is deprecated and will be removed, use SetStrengthThreshold(float) instead
|
||||
|
||||
#### rocBLAS 2.46.0
|
||||
|
||||
rocBLAS 2.46.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- client smoke test dataset added for quick validation using command rocblas-test --yaml rocblas_smoke.yaml
|
||||
- Added stream order device memory allocation as a non-default beta option.
|
||||
|
||||
##### Optimized
|
||||
|
||||
- Improved trsm performance for small sizes by using a substitution method technique
|
||||
- Improved syr2k and her2k performance significantly by using a block-recursive algorithm
|
||||
|
||||
##### Changed
|
||||
|
||||
- Level 2, Level 1, and Extension functions: argument checking when the handle is set to rocblas_pointer_mode_host now returns the status of rocblas_status_invalid_pointer only for pointers that must be dereferenced based on the alpha and beta argument values. With handle mode rocblas_pointer_mode_device only pointers that are always dereferenced regardless of alpha and beta values are checked and so may lead to a return status of rocblas_status_invalid_pointer. This improves consistency with legacy BLAS behaviour.
|
||||
- Add variable to turn on/off ieee16/ieee32 tests for mixed precision gemm
|
||||
- Allow hipBLAS to select int8 datatype
|
||||
- Disallow B == C && ldb != ldc in rocblas_xtrmm_outofplace
|
||||
|
||||
##### Fixed
|
||||
|
||||
- FORTRAN interfaces generalized for FORTRAN compilers other than gfortran
|
||||
- fix for trsm_strided_batched rocblas-bench performance gathering
|
||||
- Fix for rocm-smi path in commandrunner.py script to match ROCm 5.2 and above
|
||||
|
||||
#### rocFFT 1.0.19
|
||||
|
||||
rocFFT 1.0.19 for ROCm 5.4.0
|
||||
|
||||
##### Optimizations
|
||||
|
||||
- Optimized some strided large 1D plans.
|
||||
|
||||
##### Added
|
||||
|
||||
- Added rocfft_plan_description_set_scale_factor API to efficiently multiply each output element of a FFT by a given scaling factor.
|
||||
- Created a rocfft_kernel_cache.db file next to the installed library. SBCC kernels are moved to this file when built with the library, and are runtime-compiled for new GPU architectures.
|
||||
- Added gfx1100 and gfx1102 to default AMDGPU_TARGETS.
|
||||
|
||||
##### Changed
|
||||
|
||||
- Moved runtime compilation cache to in-memory by default. A default on-disk cache can encounter contention problems
|
||||
on multi-node clusters with a shared filesystem. rocFFT can still be told to use an on-disk cache by setting the
|
||||
ROCFFT_RTC_CACHE_PATH environment variable.
|
||||
|
||||
#### rocPRIM 2.12.0
|
||||
|
||||
rocPRIM 2.12.0 for ROCm 5.4.0
|
||||
|
||||
##### Changed
|
||||
|
||||
- `device_partition`, `device_unique`, and `device_reduce_by_key` now support problem
|
||||
sizes larger than 2^32 items.
|
||||
|
||||
##### Removed
|
||||
|
||||
- `block_sort::sort()` overload for keys and values with a dynamic size. This overload was documented but the
|
||||
implementation is missing. To avoid further confusion the documentation is removed until a decision is made on
|
||||
implementing the function.
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Fixed the compilation failure in `device_merge` if the two key iterators don't match.
|
||||
|
||||
#### rocRAND 2.10.16
|
||||
|
||||
rocRAND 2.10.16 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- MRG31K3P pseudorandom number generator based on L'Ecuyer and Touzin, 2000, "Fast combined multiple recursive generators with multipliers of the form a = ±2q ±2r".
|
||||
- LFSR113 pseudorandom number generator based on L'Ecuyer, 1999, "Tables of maximally equidistributed combined LFSR generators".
|
||||
- SCRAMBLED_SOBOL32 and SCRAMBLED_SOBOL64 quasirandom number generators. The Scrambled Sobol sequences are generated by scrambling the output of a Sobol sequence.
|
||||
|
||||
##### Changed
|
||||
|
||||
- The `mrg_<distribution>_distribution` structures, which provided numbers based on MRG32K3A, are now replaced by `mrg_engine_<distribution>_distribution`, where `<distribution>` is `log_normal`, `normal`, `poisson`, or `uniform`. These structures provide numbers for MRG31K3P (with template type `rocrand_state_mrg31k3p`) and MRG32K3A (with template type `rocrand_state_mrg32k3a`).
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Sobol64 now returns 64 bits random numbers, instead of 32 bits random numbers. As a result, the performance of this generator has regressed.
|
||||
- Fixed a bug that prevented compiling code in C++ mode (with a host compiler) when it included the rocRAND headers on Windows.
|
||||
|
||||
#### rocSOLVER 3.20.0
|
||||
|
||||
rocSOLVER 3.20.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Partial SVD for bidiagonal matrices:
|
||||
- BDSVDX
|
||||
- Partial SVD for general matrices:
|
||||
- GESVDX (with batched and strided\_batched versions)
|
||||
|
||||
##### Changed
|
||||
|
||||
- Changed `ROCSOLVER_EMBED_FMT` default to `ON` for users building directly with CMake.
|
||||
This matches the existing default when building with install.sh or rmake.py.
|
||||
|
||||
#### rocSPARSE 2.4.0
|
||||
|
||||
rocSPARSE 2.4.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added rocsparse_spmv_ex routine
|
||||
- Added rocsparse_bsrmv_ex_analysis and rocsparse_bsrmv_ex routines
|
||||
- Added csritilu0 routine
|
||||
- Added build support for Navi31 and Navi 33
|
||||
|
||||
##### Improved
|
||||
|
||||
- Optimization to segmented algorithm for COO SpMV by performing analysis
|
||||
- Improve performance when generating random matrices.
|
||||
- Fixed bug in ellmv
|
||||
- Optimized bsr2csr routine
|
||||
- Fixed integer overflow bugs
|
||||
|
||||
#### rocThrust 2.17.0
|
||||
|
||||
rocThrust 2.17.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Updated to match upstream Thrust 1.17.0
|
||||
|
||||
#### rocWMMA 0.9
|
||||
|
||||
rocWMMA 0.9 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added gemm driver APIs for flow control builtins
|
||||
- Added benchmark logging systems
|
||||
- Restructured tests to follow naming convention. Added macros for test generation
|
||||
|
||||
##### Changed
|
||||
|
||||
- Changed CMake to accomodate the modified test infrastructure
|
||||
- Fine tuned the multi-block kernels with and without lds
|
||||
- Adjusted Maximum Vector Width to dWordx4 Width
|
||||
- Updated Efficiencies to display as whole number percentages
|
||||
- Updated throughput from GFlops/s to TFlops/s
|
||||
- Reset the ad-hoc tests to use smaller sizes
|
||||
- Modified the output validation to use CPU-based implementation against rocWMMA
|
||||
- Modified the extended vector test to return error codes for memory allocation failures
|
||||
|
||||
#### Tensile 4.35.0
|
||||
|
||||
Tensile 4.35.0 for ROCm 5.4.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Async DMA support for Transpose Data Layout (ThreadSeparateGlobalReadA/B)
|
||||
- Option to output library logic in dictionary format
|
||||
- No solution found error message for benchmarking client
|
||||
- Exact K check for StoreCInUnrollExact
|
||||
- Support for CGEMM + MIArchVgpr
|
||||
- client-path parameter for using prebuilt client
|
||||
- CleanUpBuildFiles global parameter
|
||||
- Debug flag for printing library logic index of winning solution
|
||||
- NumWarmups global parameter for benchmarking
|
||||
- Windows support for benchmarking client
|
||||
- DirectToVgpr support for CGEMM
|
||||
- TensileLibLogicToYaml for creating tuning configs from library logic solutions
|
||||
|
||||
##### Optimizations
|
||||
|
||||
- Put beta code and store separately if StoreCInUnroll = x4 store
|
||||
- Improved performance for StoreCInUnroll + b128 store
|
||||
|
||||
##### Changed
|
||||
|
||||
- Re-enable HardwareMonitor for gfx90a
|
||||
- Decision trees use MLFeatures instead of Properties
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Reject DirectToVgpr + MatrixInstBM/BN > 1
|
||||
- Fix benchmark timings when using warmups and/or validation
|
||||
- Fix mismatch issue with DirectToVgprB + VectorWidth > 1
|
||||
- Fix mismatch issue with DirectToLds + NumLoadsCoalesced > 1 + TailLoop
|
||||
- Fix incorrect reject condition for DirectToVgpr
|
||||
- Fix reject condition for DirectToVgpr + MIWaveTile < VectorWidth
|
||||
- Fix incorrect instruction generation with StoreCInUnroll
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.3.3
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
### Fixed Defects
|
||||
|
||||
#### Issue with rocTHRUST and rocPRIM Libraries
|
||||
|
||||
There was a known issue with rocTHRUST and rocPRIM libraries supporting iterator and types in ROCm v5.3.x releases.
|
||||
|
||||
- `thrust::merge` no longer correctly supports different iterator types for `keys_input1` and `keys_input2`.
|
||||
- `rocprim::device_merge` no longer correctly supports using different types for `keys_input1` and `keys_input2`.
|
||||
|
||||
This issue is resolved with the following fixes to compilation failures:
|
||||
|
||||
- rocPRIM: in device_merge if the two key iterators do not match.
|
||||
- rocTHRUST: in thrust::merge if the two key iterators do not match.
|
||||
|
||||
### Library Changes in ROCM 5.3.3
|
||||
|
||||
| Library | Version |
|
||||
|---------|---------|
|
||||
| hipBLAS | [0.52.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.3.3) |
|
||||
| hipCUB | [2.12.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.3.3) |
|
||||
| hipFFT | [1.0.9](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.3.3) |
|
||||
| hipSOLVER | [1.5.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.3.3) |
|
||||
| hipSPARSE | [2.3.1](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.3.3) |
|
||||
| rccl | [2.12.10](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.3.3) |
|
||||
| rocALUTION | [2.1.0](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.3.3) |
|
||||
| rocBLAS | [2.45.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.3.3) |
|
||||
| rocFFT | [1.0.18](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.3.3) |
|
||||
| rocPRIM | [2.11.0](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.3.3) |
|
||||
| rocRAND | [2.10.15](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.3.3) |
|
||||
| rocSOLVER | [3.19.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.3.3) |
|
||||
| rocSPARSE | [2.2.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.3.3) |
|
||||
| rocThrust | [2.16.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.3.3) |
|
||||
| rocWMMA | [0.8](https://github.com/ROCmSoftwarePlatform/rocWMMA/releases/tag/rocm-5.3.3) |
|
||||
| Tensile | [4.34.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.3.3) |
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.3.2
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
### Fixed Defects
|
||||
|
||||
The following known issues in ROCm v5.3.2 are fixed in this release.
|
||||
|
||||
#### Peer-to-Peer DMA Mapping Errors with SLES and RHEL
|
||||
|
||||
Peer-to-Peer Direct Memory Access (DMA) mapping errors on Dell systems (R7525 and R750XA) with SLES 15 SP3/SP4 and RHEL 9.0 are fixed in this release.
|
||||
|
||||
Previously, running rocminfo resulted in Peer-to-Peer DMA mapping errors.
|
||||
|
||||
#### RCCL Tuning Table
|
||||
|
||||
The RCCL tuning table is updated for supported platforms.
|
||||
|
||||
#### SGEMM (F32 GEMM) Routines in rocBLAS
|
||||
|
||||
Functional correctness failures in SGEMM (F32 GEMM) routines in rocBLAS for certain problem sizes and ranges are fixed in this release.
|
||||
|
||||
### Known Issues
|
||||
|
||||
This section consists of known issues in this release.
|
||||
|
||||
#### AMD Instinct™ MI200 SRIOV Virtualization Issue
|
||||
|
||||
There is a known issue in this ROCm v5.3 release with all AMD Instinct™ MI200 devices running within a virtual function (VF) under SRIOV virtualization. This issue will likely impact the functionality of SRIOV-based workloads but does not impact Discrete Device Assignment (DDA) or bare metal.
|
||||
|
||||
Until a fix is provided, users should rely on ROCm v5.2.3 to support their SRIOV workloads.
|
||||
|
||||
#### AMD Instinct™ MI200 Firmware Updates
|
||||
|
||||
Customers cannot update the Integrated Firmware Image (IFWI) for AMD Instinct™ MI200 accelerators.
|
||||
|
||||
An updated firmware maintenance bundle consisting of an installation tool and images specific to AMD Instinct™ MI200 accelerators is under planning and will be available soon.
|
||||
|
||||
#### Known Issue with rocThrust and rocPRIM Libraries
|
||||
|
||||
There is a known known issue with rocThrust and rocPRIM libraries supporting iterator and types in ROCm v5.3.x releases.
|
||||
|
||||
- thrust::merge no longer correctly supports different iterator types for `keys_input1` and `keys_input2`.
|
||||
|
||||
- rocprim::device_merge no longer correctly supports using different types for `keys_input1` and `keys_input2`.
|
||||
|
||||
This issue is currently under investigation and will be resolved in a future release.
|
||||
|
||||
### Library Changes in ROCM 5.3.2
|
||||
|
||||
| Library | Version |
|
||||
|---------|---------|
|
||||
| hipBLAS | [0.52.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.3.2) |
|
||||
| hipCUB | [2.12.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.3.2) |
|
||||
| hipFFT | [1.0.9](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.3.2) |
|
||||
| hipSOLVER | [1.5.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.3.2) |
|
||||
| hipSPARSE | [2.3.1](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.3.2) |
|
||||
| rccl | [2.12.10](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.3.2) |
|
||||
| rocALUTION | [2.1.0](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.3.2) |
|
||||
| rocBLAS | [2.45.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.3.2) |
|
||||
| rocFFT | [1.0.18](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.3.2) |
|
||||
| rocPRIM | [2.11.0](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.3.2) |
|
||||
| rocRAND | [2.10.15](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.3.2) |
|
||||
| rocSOLVER | [3.19.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.3.2) |
|
||||
| rocSPARSE | [2.2.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.3.2) |
|
||||
| rocThrust | [2.16.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.3.2) |
|
||||
| rocWMMA | [0.8](https://github.com/ROCmSoftwarePlatform/rocWMMA/releases/tag/rocm-5.3.2) |
|
||||
| Tensile | [4.34.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.3.2) |
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.3.0
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
### Deprecations and Warnings
|
||||
|
||||
58
RELEASE.md
58
RELEASE.md
@@ -15,7 +15,7 @@ The release notes for the ROCm platform.
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.3.0
|
||||
## ROCm 5.4.3
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
<!-- markdownlint-disable no-duplicate-header -->
|
||||
### Deprecations and Warnings
|
||||
@@ -28,13 +28,13 @@ The `hipcc` and `hipconfig` Perl scripts are deprecated. In a future release, co
|
||||
>
|
||||
> There will be a transition period where the Perl scripts and compiled binaries are available before the scripts are removed. There will be no functional difference between the Perl scripts and their compiled binary counterpart. No user action is required. Once these are available, users can optionally switch to `hipcc.bin` and `hipconfig.bin`. The `hipcc`/`hipconfig` soft link will be assimilated to point from `hipcc`/`hipconfig` to the respective compiled binaries as the default option.
|
||||
|
||||
#### Linux Filesystem Hierarchy Standard for ROCm
|
||||
##### Linux Filesystem Hierarchy Standard for ROCm
|
||||
|
||||
ROCm packages have adopted the Linux foundation filesystem hierarchy standard in this release to ensure ROCm components follow open source conventions for Linux-based distributions. While moving to a new filesystem hierarchy, ROCm ensures backward compatibility with its 5.1 version or older filesystem hierarchy. See below for a detailed explanation of the new filesystem hierarchy and backward compatibility.
|
||||
|
||||
##### New Filesystem Hierarchy
|
||||
|
||||
The following is the new filesystem hierarchy:
|
||||
The following is the new filesystem hierarchy:4
|
||||
|
||||
```text
|
||||
/opt/rocm-<ver>
|
||||
@@ -124,54 +124,18 @@ lrwxrwxrwx 1 root root 42 May 10 23:32 hip-config.cmake -> ../../../../lib/cmake
|
||||
|
||||
### Fixed Defects
|
||||
|
||||
The following defects are fixed in this release.
|
||||
#### Compiler Improvements
|
||||
|
||||
These defects were identified and documented as known issues in previous ROCm releases and are fixed in the ROCm v5.3 release.
|
||||
In ROCm v5.4.3, improvements to the compiler address errors with the following signatures:
|
||||
|
||||
#### Kernel produces incorrect results with ROCm 5.2
|
||||
|
||||
User code did not initialize certain data constructs, leading to a correctness issue. A strict reading of the C++ standard suggests that failing to initialize these data constructs is undefined behavior. However, a special case was added for a specific compiler builtin to handle the uninitialized data in a defined manner.
|
||||
|
||||
The compiler fix consists of the following patches:
|
||||
|
||||
- A new `noundef` attribute is added. This attribute denotes when a function call argument or return val may never contain uninitialized bits.
|
||||
For more information, see <https://reviews.llvm.org/D81678>
|
||||
- The application of this attribute was refined such that it was not added to a specific compiler builtin where the compiler knows that inactive lanes do not impact program execution.
|
||||
|
||||
For more information, see <https://github.com/RadeonOpenCompute/llvm-project/commit/accf36c58409268ca1f216cdf5ad812ba97ceccd>.
|
||||
- "error: unhandled SGPR spill to memory"
|
||||
- "cannot scavenge register without an emergency spill slot!"
|
||||
- "error: ran out of registers during register allocation"
|
||||
|
||||
### Known Issues
|
||||
|
||||
This section consists of known issues in this release.
|
||||
#### Compiler Option Error at Runtime
|
||||
|
||||
#### Issue with OpenMP-Extras Package Upgrade
|
||||
Some users may encounter a “Cannot find Symbol” error at runtime when using -save-temps. While most -save-temps use cases work correctly, this error may appear occasionally.
|
||||
|
||||
The `openmp-extras` package has been split into runtime (`openmp-extras-runtime`) and dev (`openmp-extras-devel`) packages. This change has broken the upgrade support for the `openmp-extras` package in RHEL/SLES.
|
||||
An available workaround in RHEL is to use the following command for upgrades:
|
||||
|
||||
```sh
|
||||
sudo yum upgrade rocm-language-runtime --allowerasing
|
||||
|
||||
```
|
||||
|
||||
An available workaround in SLES is to use the following command for upgrades:
|
||||
|
||||
```sh
|
||||
zypper update --force-resolution <meta-package>
|
||||
```
|
||||
|
||||
#### AMD Instinct™ MI200 SRIOV Virtualization Issue
|
||||
|
||||
There is a known issue in this ROCm v5.3 release with all AMD Instinct™ MI200 devices running within a virtual function (VF) under SRIOV virtualization. This issue will likely impact the functionality of SRIOV-based workloads, but does not impact Discrete Device Assignment (DDA) or Bare Metal.
|
||||
|
||||
Until a fix is provided, users should rely on ROCm v5.2.3 to support their SRIOV workloads.
|
||||
|
||||
#### System Crash when IMMOU is Enabled
|
||||
|
||||
If IOMMU is enabled in SBIOS and ROCm is installed, the system may report the following failure or errors when running workloads such as bandwidth test, clinfo, and HelloWord.cl and cause a system crash.
|
||||
|
||||
- IO PAGE FAULT
|
||||
- IRQ remapping does not support X2APIC mode
|
||||
- NMI error
|
||||
|
||||
Workaround: To avoid the system crash, add `amd_iommu=on iommu=pt` as the kernel bootparam, as indicated in the warning message.
|
||||
This issue is under investigation, and the known workaround is not to use -save-temps when the error appears.
|
||||
|
||||
@@ -18,8 +18,8 @@ shutil.copy2('../CHANGELOG.md','./CHANGELOG.md')
|
||||
project = "ROCm Documentation"
|
||||
author = "Advanced Micro Devices, Inc."
|
||||
copyright = "Copyright (c) 2023 Advanced Micro Devices, Inc. All rights reserved."
|
||||
version = "5.3.0"
|
||||
release = "5.3.0"
|
||||
version = "5.4.3"
|
||||
release = "5.4.3"
|
||||
|
||||
setting_all_article_info = True
|
||||
all_article_info_os = ["linux"]
|
||||
@@ -64,7 +64,7 @@ article_pages = [
|
||||
|
||||
external_toc_path = "./sphinx/_toc.yml"
|
||||
|
||||
docs_core = ROCmDocs("ROCm 5.3.0 Documentation Home")
|
||||
docs_core = ROCmDocs("ROCm 5.4.3 Documentation Home")
|
||||
docs_core.setup()
|
||||
|
||||
external_projects_current_project = "rocm"
|
||||
|
||||
@@ -18,8 +18,8 @@ following commands based on your distribution.
|
||||
|
||||
```shell
|
||||
sudo apt update
|
||||
wget https://repo.radeon.com/amdgpu-install/5.3/ubuntu/focal/amdgpu-install_5.3.50300-1_all.deb
|
||||
sudo apt install ./amdgpu-install_5.3.50300-1_all.deb
|
||||
wget https://repo.radeon.com/amdgpu-install/5.4.3/ubuntu/focal/amdgpu-install_5.4.50403-1_all.deb
|
||||
sudo apt install ./amdgpu-install_5.4.50403-1_all.deb
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -28,8 +28,8 @@ sudo apt install ./amdgpu-install_5.3.50300-1_all.deb
|
||||
|
||||
```shell
|
||||
sudo apt update
|
||||
wget https://repo.radeon.com/amdgpu-install/5.3/ubuntu/jammy/amdgpu-install_5.3.50300-1_all.deb
|
||||
sudo apt install ./amdgpu-install_5.3.50300-1_all.deb
|
||||
wget https://repo.radeon.com/amdgpu-install/5.4.3/ubuntu/jammy/amdgpu-install_5.4.50403-1_all.deb
|
||||
sudo apt install ./amdgpu-install_5.4.50403-1_all.deb
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -39,39 +39,30 @@ sudo apt install ./amdgpu-install_5.3.50300-1_all.deb
|
||||
:sync: RHEL
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} RHEL 7.9
|
||||
:sync: RHEL-7.9
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/5.3/rhel/7.9/amdgpu-install-5.3.50300-1.el7.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8.5
|
||||
:sync: RHEL-8.5
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/5.3/rhel/8.5/amdgpu-install-5.3.50300-1.el8.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8.6
|
||||
:sync: RHEL-8.6
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/5.3/rhel/8.6/amdgpu-install-5.3.50300-1.el8.noarch.rpm
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/5.4.3/rhel/8.6/amdgpu-install-5.4.50403-1.el8.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 9.0
|
||||
:sync: RHEL-9.0
|
||||
:::{tab-item} RHEL 8.7
|
||||
:sync: RHEL-8.7
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/5.4.3/rhel/8.7/amdgpu-install-5.4.50403-1.el8.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 9.1
|
||||
:sync: RHEL-9.1
|
||||
:sync: RHEL-9
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/5.3/rhel/9.0/amdgpu-install-5.3.50300-1.el9.noarch.rpm
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/5.4.3/rhel/9.1/amdgpu-install-5.4.50403-1.el9.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -85,15 +76,7 @@ sudo yum install https://repo.radeon.com/amdgpu-install/5.3/rhel/9.0/amdgpu-inst
|
||||
:sync: SLES15-SP4
|
||||
|
||||
```shell
|
||||
sudo zypper --no-gpg-checks install https://repo.radeon.com/amdgpu-install/5.3/sle/15.4/amdgpu-install-5.3.50300-1.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} Service Pack 3
|
||||
:sync: SLES15-SP3
|
||||
|
||||
```shell
|
||||
sudo zypper --no-gpg-checks install https://repo.radeon.com/amdgpu-install/5.3/sle/15.3/amdgpu-install-5.3.50300-1.noarch.rpm
|
||||
sudo zypper --no-gpg-checks install https://repo.radeon.com/amdgpu-install/5.4.3/sle/15.4/amdgpu-install-5.4.50403-1.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -172,9 +155,9 @@ the installer script will install packages in the single-version layout.
|
||||
For the multi-version ROCm installation you must use the installer script from
|
||||
the latest release of ROCm that you wish to install.
|
||||
|
||||
**Example:** If you want to install ROCm releases 5.2.1 and 5.3
|
||||
**Example:** If you want to install ROCm releases 5.3.3 and 5.4.3
|
||||
simultaneously, you are required to download the installer from the latest ROCm
|
||||
release v5.3.
|
||||
release v5.4.3.
|
||||
|
||||
### Add Required Repositories
|
||||
|
||||
@@ -193,7 +176,7 @@ Run the following commands based on your distribution to add the repositories:
|
||||
:sync: ubuntu-20.04
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/trusted.gpg.d/rocm-keyring.gpg] https://repo.radeon.com/rocm/apt/$ver focal main" | sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' | sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -205,7 +188,7 @@ sudo apt update
|
||||
:sync: ubuntu-22.04
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/trusted.gpg.d/rocm-keyring.gpg] https://repo.radeon.com/rocm/apt/$ver jammy main" | sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' | sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -219,30 +202,11 @@ sudo apt update
|
||||
:sync: RHEL
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} RHEL 7
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
baseurl=https://repo.radeon.com/rocm/yum/$ver/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
done
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -261,7 +225,7 @@ sudo yum clean all
|
||||
:sync: RHEL-9
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -281,29 +245,8 @@ sudo yum clean all
|
||||
:::::{tab-item} SUSE Linux Enterprise Server 15
|
||||
:sync: SLES15
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} Service Pack 3
|
||||
:sync: SLES15-SP3
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1; do
|
||||
sudo tee --append /etc/zypp/repos.d/rocm.repo <<EOF
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/$ver/sle/15.3/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
done
|
||||
sudo zypper ref
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} Service Pack 4
|
||||
:sync: SLES15-SP4
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
sudo tee --append /etc/zypp/repos.d/rocm.repo <<EOF
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/$ver/sle/15.4/main/x86_64
|
||||
@@ -315,9 +258,6 @@ done
|
||||
sudo zypper ref
|
||||
```
|
||||
|
||||
:::
|
||||
::::
|
||||
|
||||
:::::
|
||||
::::::
|
||||
|
||||
@@ -332,12 +272,12 @@ sudo amdgpu-install --usecase=rocm --rocmrelease=<release-number-3>
|
||||
```
|
||||
|
||||
Following are examples of ROCm multi-version installation. The kernel-mode
|
||||
driver, associated with the ROCm release v5.3, will be installed as its latest
|
||||
driver, associated with the ROCm release v5.4.3, will be installed as its latest
|
||||
release in the list.
|
||||
|
||||
```none
|
||||
sudo amdgpu-install --usecase=rocm --rocmrelease=5.2.1
|
||||
sudo amdgpu-install --usecase=rocm --rocmrelease=5.3.0
|
||||
sudo amdgpu-install --usecase=rocm --rocmrelease=5.3.3
|
||||
sudo amdgpu-install --usecase=rocm --rocmrelease=5.4.3
|
||||
```
|
||||
|
||||
## Additional options
|
||||
|
||||
@@ -53,7 +53,7 @@ To add the AMDGPU repository, follow these steps:
|
||||
|
||||
```shell
|
||||
# amdgpu repository for focal
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.3/ubuntu focal main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.4.3/ubuntu focal main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -64,7 +64,7 @@ sudo apt update
|
||||
|
||||
```shell
|
||||
# amdgpu repository for jammy
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.3/ubuntu jammy main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.4.3/ubuntu jammy main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -91,7 +91,7 @@ To add the ROCm repository, use the following steps:
|
||||
|
||||
```shell
|
||||
# ROCm repositories for focal
|
||||
for ver in 5.2.1 5.3; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/$ver focal main" \
|
||||
| sudo tee --append /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
@@ -106,7 +106,7 @@ sudo apt update
|
||||
|
||||
```shell
|
||||
# ROCm repositories for jammy
|
||||
for ver in 5.2.1 5.3; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/$ver jammy main" \
|
||||
| sudo tee --append /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
@@ -136,7 +136,7 @@ For a comprehensive list of meta-packages, refer to
|
||||
- Sample Multi-version installation
|
||||
|
||||
```shell
|
||||
sudo apt install rocm-hip-sdk5.3.0 rocm-hip-sdk5.2.1
|
||||
sudo apt install rocm-hip-sdk5.4.3 rocm-hip-sdk5.3.3
|
||||
```
|
||||
|
||||
:::::
|
||||
@@ -152,44 +152,6 @@ section.
|
||||
```
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} RHEL 7.9
|
||||
:sync: RHEL-7.9
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/7.9/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
|
||||
:::{tab-item} RHEL 8.5
|
||||
:sync: RHEL-8.5
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/8.5/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
|
||||
:::{tab-item} RHEL 8.6
|
||||
:sync: RHEL-8.6
|
||||
:sync: RHEL-8
|
||||
@@ -198,7 +160,7 @@ sudo yum clean all
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/8.6/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/8.6/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -209,15 +171,34 @@ sudo yum clean all
|
||||
|
||||
:::
|
||||
|
||||
:::{tab-item} RHEL 9.0
|
||||
:sync: RHEL-9.0
|
||||
:::{tab-item} RHEL 8.7
|
||||
:sync: RHEL-8.7
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/8.7/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
|
||||
:::{tab-item} RHEL 9.1
|
||||
:sync: RHEL-9.1
|
||||
:sync: RHEL-9
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/9.0/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/9.1/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -243,30 +224,11 @@ sudo reboot
|
||||
To add the ROCm repository, use the following steps, based on your distribution:
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} RHEL 7
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1 5.3; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
baseurl=https://repo.radeon.com/rocm/yum/$ver/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
done
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1 5.3; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -285,7 +247,7 @@ sudo yum clean all
|
||||
:sync: RHEL-9
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1 5.3; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -320,7 +282,7 @@ For a comprehensive list of meta-packages, refer to
|
||||
- Sample Multi-version installation
|
||||
|
||||
```shell
|
||||
sudo yum install rocm-hip-sdk5.3.0 rocm-hip-sdk5.2.1
|
||||
sudo yum install rocm-hip-sdk5.4.3 rocm-hip-sdk5.3.3
|
||||
```
|
||||
|
||||
:::::
|
||||
@@ -335,15 +297,11 @@ If you have a version of the kernel-mode driver installed, you may skip this
|
||||
section.
|
||||
```
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} Service Pack 3
|
||||
:sync: SLES15-SP3
|
||||
|
||||
```shell
|
||||
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/sle/15.3/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/sle/15.4/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
@@ -351,25 +309,6 @@ EOF
|
||||
sudo zypper ref
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} Service Pack 4
|
||||
:sync: SLES15-SP4
|
||||
|
||||
```shell
|
||||
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/sle/15.4/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo zypper ref
|
||||
```
|
||||
|
||||
:::
|
||||
::::
|
||||
|
||||
Install the kernel mode driver and reboot the system using the following
|
||||
commands:
|
||||
|
||||
@@ -384,7 +323,7 @@ sudo reboot
|
||||
To add the ROCm repository, use the following steps:
|
||||
|
||||
```shell
|
||||
for ver in 5.2.1 5.3; do
|
||||
for ver in 5.3.3 5.4.3; do
|
||||
sudo tee --append /etc/zypp/repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -416,7 +355,7 @@ For a comprehensive list of meta-packages, refer to
|
||||
- Sample Multi-version installation
|
||||
|
||||
```shell
|
||||
sudo zypper --gpg-auto-import-keys install rocm-hip-sdk5.3.0 rocm-hip-sdk5.2.1
|
||||
sudo zypper --gpg-auto-import-keys install rocm-hip-sdk5.4.3 rocm-hip-sdk5.3.3
|
||||
```
|
||||
|
||||
:::::
|
||||
@@ -453,7 +392,7 @@ but are generally useful. Verification of the install is advised.
|
||||
2. Add binary paths to the `PATH` environment variable.
|
||||
|
||||
```shell
|
||||
export PATH=$PATH:/opt/rocm-5.3.2/bin:/opt/rocm-5.2.1/opencl/bin
|
||||
export PATH=$PATH:/opt/rocm-5.4.3/bin:/opt/rocm-5.4.3/opencl/bin
|
||||
```
|
||||
|
||||
```{attention}
|
||||
|
||||
@@ -26,7 +26,7 @@ repository to the new release.
|
||||
|
||||
```shell
|
||||
# amdgpu repository for focal
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.3/ubuntu focal main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.4.3/ubuntu focal main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -37,7 +37,7 @@ sudo apt update
|
||||
|
||||
```shell
|
||||
# amdgpu repository for jammy
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.3/ubuntu jammy main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.4.3/ubuntu jammy main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -49,42 +49,6 @@ sudo apt update
|
||||
:sync: RHEL
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} RHEL 7.9
|
||||
:sync: RHEL-7.9
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/7.9/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8.5
|
||||
:sync: RHEL-8.5
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/8.5/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8.6
|
||||
:sync: RHEL-8.6
|
||||
:sync: RHEL-8
|
||||
@@ -93,7 +57,7 @@ sudo yum clean all
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/8.6/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/8.6/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -103,15 +67,33 @@ sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 9.0
|
||||
:sync: RHEL-9.0
|
||||
:::{tab-item} RHEL 8.7
|
||||
:sync: RHEL-8.7
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/8.7/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 9.1
|
||||
:sync: RHEL-9.1
|
||||
:sync: RHEL-9
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/9.0/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/9.1/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -126,15 +108,11 @@ sudo yum clean all
|
||||
:::::{tab-item} SUSE Linux Enterprise Server 15
|
||||
:sync: SLES15
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} Service Pack 3
|
||||
:sync: SLES15-SP3
|
||||
|
||||
```shell
|
||||
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3.3/sle/15.3/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/sle/15.4/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
@@ -142,25 +120,6 @@ EOF
|
||||
sudo zypper ref
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} Service Pack 4
|
||||
:sync: SLES15-SP4
|
||||
|
||||
```shell
|
||||
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/sle/15.4/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo zypper ref
|
||||
```
|
||||
|
||||
:::
|
||||
::::
|
||||
|
||||
:::::
|
||||
::::::
|
||||
|
||||
@@ -213,7 +172,7 @@ repository to the new release.
|
||||
:sync: ubuntu-20.04
|
||||
|
||||
```shell
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.3 focal main" \
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.4.3 focal main" \
|
||||
| sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' \
|
||||
| sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -225,7 +184,7 @@ sudo apt update
|
||||
:sync: ubuntu-22.04
|
||||
|
||||
```shell
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.3 jammy main" \
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.4.3 jammy main" \
|
||||
| sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' \
|
||||
| sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -239,31 +198,14 @@ sudo apt update
|
||||
:sync: RHEL
|
||||
|
||||
::::{tab-set}
|
||||
:::{tab-item} RHEL 7
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-5.3]
|
||||
name=ROCm5.3
|
||||
baseurl=https://repo.radeon.com/rocm/yum/5.3/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-5.3]
|
||||
name=ROCm5.3
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.3/main
|
||||
[ROCm-5.4.3]
|
||||
name=ROCm5.4.3
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.4.3/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -278,9 +220,9 @@ sudo yum clean all
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-5.3]
|
||||
name=ROCm5.3
|
||||
baseurl=https://repo.radeon.com/rocm/rhel9/5.3/main
|
||||
[ROCm-5.4.3]
|
||||
name=ROCm5.4.3
|
||||
baseurl=https://repo.radeon.com/rocm/rhel9/5.4.3/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -297,10 +239,10 @@ sudo yum clean all
|
||||
|
||||
```shell
|
||||
sudo tee /etc/zypp/repos.d/rocm.repo <<EOF
|
||||
[ROCm-5.3]
|
||||
name=ROCm5.3
|
||||
[ROCm-5.4.3]
|
||||
name=ROCm5.4.3
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/zyp/5.3/main
|
||||
baseurl=https://repo.radeon.com/rocm/zyp/5.4.3/main
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
|
||||
@@ -29,11 +29,11 @@ wget https://repo.radeon.com/rocm/rocm.gpg.key -O - | \
|
||||
```shell
|
||||
# Kernel driver repository for focal
|
||||
sudo tee /etc/apt/sources.list.d/amdgpu.list <<'EOF'
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.3/ubuntu focal main
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.4.3/ubuntu focal main
|
||||
EOF
|
||||
# ROCm repository for focal
|
||||
sudo tee /etc/apt/sources.list.d/rocm.list <<'EOF'
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.3 focal main
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.4.3 focal main
|
||||
EOF
|
||||
```
|
||||
|
||||
@@ -44,11 +44,11 @@ EOF
|
||||
```shell
|
||||
# Kernel driver repository for jammy
|
||||
sudo tee /etc/apt/sources.list.d/amdgpu.list <<'EOF'
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.3/ubuntu jammy main
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/5.4.3/ubuntu jammy main
|
||||
EOF
|
||||
# ROCm repository for jammy
|
||||
sudo tee /etc/apt/sources.list.d/rocm.list <<'EOF'
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.3 jammy main
|
||||
deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.4.3 jammy main
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' | sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
EOF
|
||||
```
|
||||
@@ -80,7 +80,7 @@ sudo apt update
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<'EOF'
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/8.6/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/8.6/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
@@ -89,7 +89,7 @@ EOF
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<'EOF'
|
||||
[rocm]
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.3/main
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.4.3/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -107,7 +107,7 @@ EOF
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<'EOF'
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/8.7/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/8.7/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
@@ -116,7 +116,7 @@ EOF
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<'EOF'
|
||||
[rocm]
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.3/main
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.4.3/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -134,7 +134,7 @@ EOF
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<'EOF'
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/rhel/9.1/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/rhel/9.1/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
@@ -143,7 +143,7 @@ EOF
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<'EOF'
|
||||
[rocm]
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/rhel9/5.3/main
|
||||
baseurl=https://repo.radeon.com/rocm/rhel9/5.4.3/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -179,7 +179,7 @@ sudo yum clean all
|
||||
sudo tee /etc/zypp/repos.d/amdgpu.repo <<'EOF'
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.3/sle/15.4/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/5.4.3/sle/15.4/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
|
||||
@@ -45,6 +45,15 @@ modern CPU and GPU platforms.
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} {doc}`rocWMMA <rocwmma:index>`
|
||||
`rocWMMA` provides an API to break down mixed precision matrix multiply-accumulate
|
||||
(MMA) problems into fragments and distributes these over GPU wavefronts.
|
||||
|
||||
- {doc}`Documentation <rocwmma:index>`
|
||||
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocWMMA/blob/develop/CHANGELOG.md)
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} {doc}`rocSOLVER <rocsolver:index>`
|
||||
`rocSOLVER` provides a subset of LAPACK (Linear Algebra Package) functionality on the ROCm platform.
|
||||
|
||||
|
||||
@@ -21,6 +21,7 @@ at compile-time of the hipLIB in question. For dynamic dispatch between vendor i
|
||||
- {doc}`hipBLAS <hipblas:index>`
|
||||
- {doc}`hipBLASLt <hipblaslt:index>`
|
||||
- {doc}`rocALUTION <rocalution:index>`
|
||||
- {doc}`rocWMMA <rocwmma:index>`
|
||||
- {doc}`rocSOLVER <rocsolver:index>`
|
||||
- {doc}`hipSOLVER <hipsolver:index>`
|
||||
- {doc}`rocSPARSE <rocsparse:index>`
|
||||
|
||||
@@ -44,6 +44,7 @@ contemporary CUDA / NVIDIA HPC SDK alternatives.
|
||||
| 5.1.3 | 1.15 | 22.1 |
|
||||
| 5.2.x | 1.15 | 22.2, 22.3 |
|
||||
| 5.3.x | 1.16 | 22.7 |
|
||||
| 5.4.x | 1.16 | 22.9 |
|
||||
|
||||
For the latest documentation of these libraries, refer to the
|
||||
[associated documentation](../reference/gpu_libraries/c%2B%2B_primitives.md).
|
||||
|
||||
@@ -8,15 +8,11 @@ AMD ROCm™ Platform supports the following Linux distributions.
|
||||
|
||||
| Distribution |Processor Architectures| Validated Kernel |
|
||||
|--------------------|-----------------------|--------------------|
|
||||
| RHEL 9.0 | x86-64 | 5.14 |
|
||||
| RHEL 8.6 to 8.5 | x86-64 | 4.18 |
|
||||
| RHEL 7.9 | x86-64 | 3.10 |
|
||||
| RHEL 9.1 | x86-64 | 5.14 |
|
||||
| RHEL 8.6 to 8.7 | x86-64 | 4.18 |
|
||||
| SLES 15 SP4 | x86-64 | 5.14.21 |
|
||||
| SLES 15 SP3 | x86-64 | 5.3.18 |
|
||||
| Ubuntu 20.04.5 LTS | x86-64 | 5.15 |
|
||||
| Ubuntu 20.04.4 LTS | x86-64 | 5.13 |
|
||||
| Ubuntu 22.04.1 LTS | x86-64 | 5.15, 5.17 OEM |
|
||||
| Ubuntu 22.04.0 LTS | x86-64 | 5.15 |
|
||||
|
||||
## Virtualization Support
|
||||
|
||||
|
||||
@@ -121,4 +121,4 @@ following location: `/opt/rocm/share/doc/<component-name>/`
|
||||
For example, you can fetch the licensing information of the `_amd_comgr_`
|
||||
component (Code Object Manager) from the `amd_comgr` folder. A file named
|
||||
`LICENSE.txt` contains the license details at:
|
||||
`/opt/rocm-5.3.0/share/doc/amd_comgr/LICENSE.txt`
|
||||
`/opt/rocm-5.4.3/share/doc/amd_comgr/LICENSE.txt`
|
||||
|
||||
@@ -10,5 +10,6 @@ the compatibility combinations that are currently supported.
|
||||
| 5.1.0 | 5.0.2 |
|
||||
| 5.1.3 | 5.2.0, 5.3.0 |
|
||||
| 5.2.0 | 5.0.2, 5.1.3 |
|
||||
| 5.2.3 | 5.3.0 |
|
||||
| 5.2.3 | 5.3.0, 5.4.0 |
|
||||
| 5.3.0 | 5.1.3, 5.2.3 |
|
||||
| 5.4.0 | 5.2.3, 5.3.3 |
|
||||
|
||||
@@ -99,6 +99,8 @@ subtrees:
|
||||
url: ${project:hipblaslt}
|
||||
- title: rocALUTION
|
||||
url: ${project:rocalution}
|
||||
- title: rocWMMA
|
||||
url: ${project:rocwmma}
|
||||
- title: rocSOLVER
|
||||
url: ${project:rocsolver}
|
||||
- title: hipSOLVER
|
||||
|
||||
Reference in New Issue
Block a user