Compare commits

..

52 Commits

Author SHA1 Message Date
Sam Wu
fdfd9187d7 Update documentation requirements 2024-09-16 10:12:18 -08:00
Sam Wu
1659e6e8a0 Update documentation requirements 2024-06-06 16:58:20 -06:00
Sam Wu
9a496d97d5 Fix RTD config 2024-05-02 08:53:40 -06:00
Sam Wu
7dd2b6f12c Update documentation requirements 2024-05-01 16:58:39 -06:00
Sam Wu
7e53ad4f9c Update documentation requirements 2024-05-01 16:50:38 -06:00
Sam Wu
a1f0050f6b add version to html title 2023-08-04 17:15:18 -06:00
Sam Wu
bc8686a20c rocm-docs-core v0.18.3 2023-06-30 09:34:05 -06:00
dependabot[bot]
12ad0c6c8b Bump rocm-docs-core from 0.18.0 to 0.18.1 in /docs/sphinx (#2280)
Bumps [rocm-docs-core](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.18.0 to 0.18.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.18.0...v0.18.1)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-06-27 19:34:34 -06:00
Sam Wu
eaf98b0001 rocm_smi_lib 2023-06-21 17:00:38 -06:00
Sam Wu
6dcbf4b594 edit instructions, logs, and notes for 5.1.3 2023-06-21 15:25:13 -06:00
Sam Wu
ca426a3821 edit instructions, logs, and notes for 5.1.1 2023-06-21 15:18:20 -06:00
Máté Ferenc Nagy-Egri
0e54cd2ec6 Downgrade license notice to 5.2.0 2023-06-15 13:04:48 +02:00
Máté Ferenc Nagy-Egri
e8ad3843bf Downgrade changelog to 5.2.0 2023-06-15 13:04:48 +02:00
Máté Ferenc Nagy-Egri
daab3058b0 Downgrade install instructions to 5.2.0 2023-06-15 13:04:48 +02:00
Máté Ferenc Nagy-Egri
16be156acc Downgrade release notes to 5.2.0 2023-06-15 13:04:48 +02:00
Máté Ferenc Nagy-Egri
279fa18f5a Downgrade support matrices to 5.2.3 2023-06-15 13:04:48 +02:00
Máté Ferenc Nagy-Egri
746cc7fe57 Downgrade license notice to 5.2.3 2023-06-15 13:04:47 +02:00
Máté Ferenc Nagy-Egri
9e5263ebca Downgrade changelog to 5.2.3 2023-06-15 13:04:47 +02:00
Máté Ferenc Nagy-Egri
95281a4570 Downgrade install instructions to 5.2.3 2023-06-15 13:04:29 +02:00
Máté Ferenc Nagy-Egri
f5e2c6640d Downgrade release notes to 5.2.3 2023-06-14 15:08:19 +02:00
Máté Ferenc Nagy-Egri
d9173e132e Downgrade license notice to 5.3.0 2023-06-14 15:08:19 +02:00
Máté Ferenc Nagy-Egri
cf8a084f47 Downgrade changelog to 5.3.0 2023-06-14 15:08:19 +02:00
Máté Ferenc Nagy-Egri
e69f4bd470 Downgrade install instructions to 5.3.0 2023-06-14 15:08:19 +02:00
Máté Ferenc Nagy-Egri
2044d16f32 Downgrade release notes to 5.3.0 2023-06-14 15:08:19 +02:00
Máté Ferenc Nagy-Egri
6029cf7fff Downgrade license notice to 5.3.2 2023-06-14 15:08:19 +02:00
Máté Ferenc Nagy-Egri
4cbce8a0eb Downgrade changelog to 5.3.2 2023-06-14 15:08:19 +02:00
Máté Ferenc Nagy-Egri
00a448f25d Downgrade install instructions to 5.3.2 2023-06-14 15:08:18 +02:00
Máté Ferenc Nagy-Egri
5f11e96b89 Downgrade release notes to 5.3.2 2023-06-14 15:08:18 +02:00
Máté Ferenc Nagy-Egri
0b60d6c54d Downgrade support matrices to 5.3.3 2023-06-14 15:06:35 +02:00
Máté Ferenc Nagy-Egri
aeb750208f Downgrade license notice to 5.3.3 2023-06-14 14:04:06 +02:00
Máté Ferenc Nagy-Egri
fba1e09deb Downgrade changelog to 5.3.3 2023-06-14 14:04:06 +02:00
Máté Ferenc Nagy-Egri
d6990e32f3 Downgrade install instructions to 5.3.3 2023-06-14 14:04:06 +02:00
Máté Ferenc Nagy-Egri
01d12821fe Downgrade release notes to 5.3.3 2023-06-14 10:59:36 +02:00
Máté Ferenc Nagy-Egri
070a7db8a2 Downgrade license notice to 5.4.0 2023-06-13 14:46:19 +02:00
Máté Ferenc Nagy-Egri
911f18c6c6 Downgrade changelog to 5.4.0 2023-06-13 14:46:10 +02:00
Máté Ferenc Nagy-Egri
9a5d323b01 Downgrade install instructions to 5.4.0 2023-06-13 14:46:02 +02:00
Máté Ferenc Nagy-Egri
ab0e1fd625 Downgrade release notes to 5.4.0 2023-06-13 14:45:50 +02:00
Máté Ferenc Nagy-Egri
a635018505 Downgrade license notice to 5.4.1 2023-06-13 14:24:12 +02:00
Máté Ferenc Nagy-Egri
d2070c1b4a Downgrade changelog to 5.4.1 2023-06-13 14:23:59 +02:00
Máté Ferenc Nagy-Egri
12c9158880 Downgrade install instructions to 5.4.1 2023-06-13 14:23:51 +02:00
Máté Ferenc Nagy-Egri
9a55d71cec Downgrade release notes to 5.4.1 2023-06-13 14:23:38 +02:00
Máté Ferenc Nagy-Egri
9a9df83a77 Downgrade license notice to 5.4.2 2023-06-13 14:21:39 +02:00
Máté Ferenc Nagy-Egri
395e607525 Downgrade changelog to 5.4.2 2023-06-13 14:21:39 +02:00
Máté Ferenc Nagy-Egri
561c304e10 Downgrade install instructions to 5.4.2 2023-06-13 14:21:39 +02:00
Máté Ferenc Nagy-Egri
778db160eb Downgrade release notes to 5.4.2 2023-06-13 14:16:46 +02:00
Máté Ferenc Nagy-Egri
60a3065399 Downgrade support matrices to 5.4.3 2023-06-12 14:38:40 +02:00
Máté Ferenc Nagy-Egri
3c360627bd Downgrade changelog to 5.4.3 2023-06-12 14:37:41 +02:00
Máté Ferenc Nagy-Egri
e1151d4dbb Downgrade install instructions to 5.4.3 2023-06-12 14:37:25 +02:00
Máté Ferenc Nagy-Egri
616a09c442 Downgrade release notes to 5.4.3 2023-06-12 14:37:06 +02:00
Máté Ferenc Nagy-Egri
082fbb9d44 Downgrade changelog to 5.5.0 2023-06-12 14:27:40 +02:00
Máté Ferenc Nagy-Egri
1971584024 Downgrade install instructions to 5.5.0 2023-06-12 14:27:40 +02:00
Máté Ferenc Nagy-Egri
71e52a1c84 Downgrade release notes to 5.5.0 2023-06-12 14:17:26 +02:00
21 changed files with 739 additions and 120 deletions

View File

@@ -15,6 +15,609 @@ The release notes for the ROCm platform.
-------------------
## ROCm 5.1.3
### Library Changes in ROCM 5.1.3
| Library | Version |
|---------|---------|
| hipBLAS | [0.50.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.1.3) |
| hipCUB | [2.11.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.1.3) |
| hipFFT | [1.0.7](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.1.3) |
| hipSOLVER | [1.3.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.1.3) |
| hipSPARSE | [2.1.0](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.1.3) |
| rccl | [2.11.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.1.3) |
| rocALUTION | [2.0.2](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.1.3) |
| rocBLAS | [2.43.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.1.3) |
| rocFFT | [1.0.16](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.1.3) |
| rocPRIM | [2.10.13](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.1.3) |
| rocRAND | [2.10.13](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.1.3) |
| rocSOLVER | [3.17.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.1.3) |
| rocSPARSE | [2.1.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.1.3) |
| rocThrust | [2.14.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.1.3) |
| Tensile | [4.32.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.1.3) |
-------------------
## ROCm 5.1.1
### Library Changes in ROCM 5.1.1
| Library | Version |
|---------|---------|
| hipBLAS | [0.50.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.1.1) |
| hipCUB | [2.11.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.1.1) |
| hipFFT | [1.0.7](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.1.1) |
| hipSOLVER | [1.3.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.1.1) |
| hipSPARSE | [2.1.0](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.1.1) |
| rccl | [2.11.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.1.1) |
| rocALUTION | [2.0.2](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.1.1) |
| rocBLAS | [2.43.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.1.1) |
| rocFFT | [1.0.16](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.1.1) |
| rocPRIM | [2.10.13](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.1.1) |
| rocRAND | [2.10.13](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.1.1) |
| rocSOLVER | [3.17.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.1.1) |
| rocSPARSE | [2.1.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.1.1) |
| rocThrust | [2.14.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.1.1) |
| Tensile | [4.32.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.1.1) |
-------------------
## ROCm 5.1.0
<!-- markdownlint-disable first-line-h1 -->
<!-- markdownlint-disable no-blanks-blockquote -->
### What's New in This Release
#### HIP Enhancements
The ROCm v5.1 release consists of the following HIP enhancements.
##### HIP Installation Guide Updates
The HIP Installation Guide is updated to include installation and building HIP from source on the AMD and NVIDIA platforms.
Refer to the HIP Installation Guide v5.1 for more details.
##### Support for HIP Graph
ROCm v5.1 extends support for HIP Graph.
##### Planned Changes for HIP in Future Releases
###### Separation of hiprtc (libhiprtc) library from hip runtime (amdhip64)
On ROCm/Linux, to maintain backward compatibility, the hipruntime library (amdhip64) will continue to include hiprtc symbols in future releases. The backward compatible support may be discontinued by removing hiprtc symbols from the hipruntime library (amdhip64) in the next major release.
###### hipDeviceProp_t Structure Enhancements
Changes to the hipDeviceProp_t structure in the next major release may result in backward incompatibility. More details on these changes will be provided in subsequent releases.
#### ROCDebugger Enhancements
##### Multi-language Source Level Debugger
The compiler now generates a source-level variable and function argument debug information.
The accuracy is guaranteed if the compiler options `-g -O0` are used and apply only to HIP.
This enhancement enables ROCDebugger users to interact with the HIP source-level variables and function arguments.
> **Note**
>
> The newly-suggested compiler -g option must be used instead of the previously-suggested `-ggdb` option. Although the effect of these two options is currently equivalent, this is not guaranteed for the future and might get changed by the upstream LLVM community.
##### Machine Interface Lanes Support
ROCDebugger Machine Interface (MI) extends support to lanes. The following enhancements are made:
- Added a new -lane-info command, listing the current thread's lanes.
- The -thread-select command now supports a lane switch to switch to a specific lane of a thread:
```sh
-thread-select -l LANE THREAD
```
- The =thread-selected notification gained a lane-id attribute. This enables the frontend to know which lane of the thread was selected.
- The *stopped asynchronous record gained lane-id and hit-lanes attributes. The former indicates which lane is selected, and the latter indicates which lanes explain the stop.
- MI commands now accept a global --lane option, similar to the global --thread and --frame options.
- MI varobjs are now lane-aware.
For more information, refer to the ROC Debugger User Guide at <https://docs.amd.com>.
##### Enhanced - clone-inferior Command
The clone-inferior command now ensures that the TTY, CMD, ARGS, and AMDGPU PRECISE-MEMORY settings are copied from the original inferior to the new one. All modifications to the environment variables done using the 'set environment' or 'unset environment' commands are also copied to the new inferior.
#### MIOpen Support for RDNA GPUs
This release includes support for AMD Radeon™ Pro W6800, in addition to other bug fixes and performance improvements as listed below:
- MIOpen now supports RDNA GPUs!! (via MIOpen PRs 973, 780, 764, 740, 739, 677, 660, 653, 493, 498)
- Fixed a correctness issue with ImplicitGemm algorithm
- Updated the performance data for new kernel versions
- Improved MIOpen build time by splitting large kernel header files
- Fixed an issue in reduction kernels for padded tensors
- Various other bug fixes and performance improvements
For more information, see <https://docs.amd.com/bundle/MIOpen_gh-pages/page/releasenotes.html>
#### Checkpoint Restore Support With CRIU
The new Checkpoint Restore in Userspace (CRIU) functionality is implemented to support AMD GPU and ROCm applications.
CRIU is a userspace tool to Checkpoint and Restore an application.
CRIU lacked the support for checkpoint restore applications that used device files such as a GPU. With this ROCm release, CRIU is enhanced with a new plugin to support AMD GPUs, which includes:
- Single and Multi GPU systems (Gfx9)
- Checkpoint / Restore on a different system
- Checkpoint / Restore inside a docker container
- PyTorch
- Tensorflow
- Using CRIU Image Streamer
For more information, refer to <https://github.com/checkpoint-restore/criu/tree/criu-dev/plugins/amdgpu>
> **Note**
>
> The CRIU plugin (amdgpu_plugin) is merged upstream with the CRIU repository. The KFD kernel patches are also available upstream with the amd-staging-drm-next branch (public) and the ROCm 5.1 release branch.
> **Note**
>
> This is a Beta release of the Checkpoint and Restore functionality, and some features are not available in this release.
For more information, refer to the following websites:
- <https://github.com/RadeonOpenCompute/criu/blob/amdgpu_plugin-03252022/Documentation/amdgpu_plugin.txt>
- <https://criu.org/Main_Page>
### Fixed Defects
The following defects are fixed in this release.
#### Driver Fails To Load after Installation
The issue with the driver failing to load after ROCm installation is now fixed.
The driver installs successfully, and the server reboots with working rocminfo and clinfo.
#### ROCDebugger Fixed Defects
##### Breakpoints in GPU kernel code Before Kernel Is Loaded
Previously, setting a breakpoint in device code by line number before the device code was loaded into the program resulted in ROCgdb incorrectly moving the breakpoint to the first following line that contains host code.
Now, the breakpoint is left pending. When the GPU kernel gets loaded, the breakpoint resolves to a location in the kernel.
##### Registers Invalidated After Write
Previously, the stale just-written value was presented as a current value.
ROCgdb now invalidates the cached values of registers whose content might differ after being written. For example, registers with read-only bits.
ROCgdb also invalidates all volatile registers when a volatile register is written. For example, writing VCC invalidates the content of STATUS as STATUS.VCCZ may change.
##### Scheduler-locking and GPU Wavefronts
When scheduler-locking is in effect, new wavefronts created by a resumed thread, CPU, or GPU wavefront, are held in the halt state. For example, the "set scheduler-locking" command.
##### ROCDebugger Fails Before Completion of Kernel Execution
It was possible (although erroneous) for a debugger to load GPU code in memory, send it to the device, start executing a kernel on the device, and dispose of the original code before the kernel had finished execution. If a breakpoint was hit after this point, the debugger failed with an internal error while trying to access the debug information.
This issue is now fixed by ensuring that the debugger keeps a local copy of the original code and debug information.
### Known Issues
#### Random Memory Access Fault Errors Observed While Running Math Libraries Unit Tests
**Issue:** Random memory access fault issues are observed while running Math libraries unit tests. This issue is encountered in ROCm v5.0, ROCm v5.0.1, and ROCm v5.0.2.
Note, the faults only occur in the SRIOV environment.
**Workaround:** Use SDMA to update the page table. The Guest set up steps are as follows:
```sh
sudo modprobe amdgpu vm_update_mode=0
```
To verify, use
**Guest:**
```sh
cat /sys/module/amdgpu/parameters/vm_update_mode 0
```
Where expectation is 0.
#### CU Masking Causes Application to Freeze
Using CU Masking results in an application freeze or runs exceptionally slowly. This issue is noticed only in the GFX10 suite of products. Note, this issue is observed only in GFX10 suite of products.
This issue is under active investigation at this time.
#### Failed Checkpoint in Docker Containers
A defect with Ubuntu images kernel-5.13-30-generic and kernel-5.13-35-generic with Overlay FS results in incorrect reporting of the mount ID.
This issue with Ubuntu causes CRIU checkpointing to fail in Docker containers.
As a workaround, use an older version of the kernel. For example, Ubuntu 5.11.0-46-generic.
#### Issue with Restoring Workloads Using Cooperative Groups Feature
Workloads that use the cooperative groups function to ensure all waves can be resident at the same time may fail to restore correctly.
This issue is under investigation and will be fixed in a future release.
#### Radeon Pro V620 and W6800 Workstation GPUs
##### No Support for ROCDebugger on SRIOV
ROCDebugger is not supported in the SRIOV environment on any GPU.
This is a known issue and will be fixed in a future release.
#### Random Error Messages in ROCm SMI for SR-IOV
Random error messages are generated by unsupported functions or commands.
This is a known issue and will be fixed in a future release.
### Library Changes in ROCM 5.1.0
| Library | Version |
|---------|---------|
| hipBLAS | 0.49.0 ⇒ [0.50.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.1.0) |
| hipCUB | 2.10.13 ⇒ [2.11.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.1.0) |
| hipFFT | 1.0.4 ⇒ [1.0.7](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.1.0) |
| hipSOLVER | 1.2.0 ⇒ [1.3.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.1.0) |
| hipSPARSE | 2.0.0 ⇒ [2.1.0](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.1.0) |
| rccl | 2.10.3 ⇒ [2.11.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.1.0) |
| rocALUTION | 2.0.1 ⇒ [2.0.2](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.1.0) |
| rocBLAS | 2.42.0 ⇒ [2.43.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.1.0) |
| rocFFT | 1.0.13 ⇒ [1.0.16](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.1.0) |
| rocPRIM | 2.10.12 ⇒ [2.10.13](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.1.0) |
| rocRAND | 2.10.12 ⇒ [2.10.13](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.1.0) |
| rocSOLVER | 3.16.0 ⇒ [3.17.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.1.0) |
| rocSPARSE | 2.0.0 ⇒ [2.1.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.1.0) |
| rocThrust | 2.13.0 ⇒ [2.14.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.1.0) |
| Tensile | 4.31.0 ⇒ [4.32.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.1.0) |
#### hipBLAS 0.50.0
hipBLAS 0.50.0 for ROCm 5.1.0
##### Added
- Added library version and device information to hipblas-test output
- Added --rocsolver-path command line option to choose path to pre-built rocSOLVER, as
absolute or relative path
- Added --cmake_install command line option to update cmake to minimum version if required
- Added cmake-arg parameter to pass in cmake arguments while building
- Added infrastructure to support readthedocs hipBLAS documentation.
##### Fixed
- Added hipblasVersionMinor define. hipblaseVersionMinor remains defined
for backwards compatibility.
- Doxygen warnings in hipblas.h header file.
##### Changed
- rocblas-path command line option can be specified as either absolute or relative path
- Help message improvements in install.sh and rmake.py
- Updated googletest dependency from 1.10.0 to 1.11.0
#### hipCUB 2.11.0
hipCUB 2.11.0 for ROCm 5.1.0
##### Added
- Device segmented sort
- Warp merge sort, WarpMask and thread sort from cub 1.15.0 supported in hipCUB
- Device three way partition
##### Changed
- Device_scan and device_segmented_scan: inclusive_scan now uses the input-type as accumulator-type, exclusive_scan uses initial-value-type.
- This particularly changes behaviour of small-size input types with large-size output types (e.g. short input, int output).
- And low-res input with high-res output (e.g. float input, double output)
- Block merge sort no longer supports non power of two blocksizes
#### hipFFT 1.0.7
hipFFT 1.0.7 for ROCm 5.1.0
##### Changed
- Use fft_params struct for accuracy and benchmark clients.
#### hipSOLVER 1.3.0
hipSOLVER 1.3.0 for ROCm 5.1.0
##### Added
- Added functions
- gels
- hipsolverSSgels_bufferSize, hipsolverDDgels_bufferSize, hipsolverCCgels_bufferSize, hipsolverZZgels_bufferSize
- hipsolverSSgels, hipsolverDDgels, hipsolverCCgels, hipsolverZZgels
- Added library version and device information to hipsolver-test output.
- Added compatibility API with hipsolverDn prefix.
- Added compatibility-only functions
- gesvdj
- hipsolverDnSgesvdj_bufferSize, hipsolverDnDgesvdj_bufferSize, hipsolverDnCgesvdj_bufferSize, hipsolverDnZgesvdj_bufferSize
- hipsolverDnSgesvdj, hipsolverDnDgesvdj, hipsolverDnCgesvdj, hipsolverDnZgesvdj
- gesvdjBatched
- hipsolverDnSgesvdjBatched_bufferSize, hipsolverDnDgesvdjBatched_bufferSize, hipsolverDnCgesvdjBatched_bufferSize, hipsolverDnZgesvdjBatched_bufferSize
- hipsolverDnSgesvdjBatched, hipsolverDnDgesvdjBatched, hipsolverDnCgesvdjBatched, hipsolverDnZgesvdjBatched
- syevj
- hipsolverDnSsyevj_bufferSize, hipsolverDnDsyevj_bufferSize, hipsolverDnCheevj_bufferSize, hipsolverDnZheevj_bufferSize
- hipsolverDnSsyevj, hipsolverDnDsyevj, hipsolverDnCheevj, hipsolverDnZheevj
- syevjBatched
- hipsolverDnSsyevjBatched_bufferSize, hipsolverDnDsyevjBatched_bufferSize, hipsolverDnCheevjBatched_bufferSize, hipsolverDnZheevjBatched_bufferSize
- hipsolverDnSsyevjBatched, hipsolverDnDsyevjBatched, hipsolverDnCheevjBatched, hipsolverDnZheevjBatched
- sygvj
- hipsolverDnSsygvj_bufferSize, hipsolverDnDsygvj_bufferSize, hipsolverDnChegvj_bufferSize, hipsolverDnZhegvj_bufferSize
- hipsolverDnSsygvj, hipsolverDnDsygvj, hipsolverDnChegvj, hipsolverDnZhegvj
##### Changed
- The rocSOLVER backend now allows hipsolverXXgels and hipsolverXXgesv to be called in-place when B == X.
- The rocSOLVER backend now allows rwork to be passed as a null pointer to hipsolverXgesvd.
##### Fixed
- bufferSize functions will now return HIPSOLVER_STATUS_NOT_INITIALIZED instead of HIPSOLVER_STATUS_INVALID_VALUE when both handle and lwork are null.
- Fixed rare memory allocation failure in syevd/heevd and sygvd/hegvd caused by improper workspace array allocation outside of rocSOLVER.
#### hipSPARSE 2.1.0
hipSPARSE 2.1.0 for ROCm 5.1.0
##### Added
- Added gtsv_interleaved_batch and gpsv_interleaved_batch routines
- Add SpGEMM_reuse
##### Changed
- Changed BUILD_CUDA with USE_CUDA in install script and cmake files
- Update googletest to 11.1
##### Improved
- Fixed a bug in SpMM Alg versioning
##### Known Issues
- none
#### rccl 2.11.4
RCCL 2.11.4 for ROCm 5.1.0
##### Added
- Compatibility with NCCL 2.11.4
##### Known Issues
- Managed memory is not currently supported for clique-based kernels
#### rocALUTION 2.0.2
rocALUTION 2.0.2 for ROCm 5.1.0
##### Added
- Added out-of-place matrix transpose functionality
- Added LocalVector&lt;bool&gt;
#### rocBLAS 2.43.0
rocBLAS 2.43.0 for ROCm 5.1.0
##### Added
- Option to install script for number of jobs to use for rocBLAS and Tensile compilation (-j, --jobs)
- Option to install script to build clients without using any Fortran (--clients_no_fortran)
- rocblas_client_initialize function, to perform rocBLAS initialize for clients(benchmark/test) and report the execution time.
- Added tests for output of reduction functions when given bad input
- Added user specified initialization (rand_int/trig_float/hpl) for initializing matrices and vectors in rocblas-bench
##### Optimizations
- Improved performance of trsm with side == left and n == 1
- Improved perforamnce of trsm with side == left and m &lt;= 32 along with side == right and n &lt;= 32
##### Changed
- For syrkx and trmm internal API use rocblas_stride datatype for offset
- For non-batched and batched gemm_ex functions if the C matrix pointer equals the D matrix pointer (aliased) their respective type and leading dimension arguments must now match
- Test client dependencies updated to GTest 1.11
- non-global false positives reported by cppcheck from file based suppression to inline suppression. File based suppression will only be used for global false positives.
- Help menu messages in install.sh
- For ger function, typecast the &#39;lda&#39;(offset) datatype to size_t during offset calculation to avoid overflow and remove duplicate template functions.
- Modified default initialization from rand_int to hpl for initializing matrices and vectors in rocblas-bench
##### Fixed
- For function trmv (non-transposed cases) avoid overflow in offset calculation
- Fixed cppcheck errors/warnings
- Fixed doxygen warnings
#### rocFFT 1.0.16
rocFFT 1.0.16 for ROCm 5.1.0
##### Changed
- Supported unaligned tile dimension for SBRC_2D kernels.
- Improved (more RAII) test and benchmark infrastructure.
- Enabled runtime compilation of length-2304 FFT kernel during plan creation.
##### Optimizations
- Optimized more large 1D cases by using L1D_CC plan.
- Optimized 3D 200^3 C2R case.
- Optimized 1D 2^30 double precision on MI200.
##### Fixed
- Fixed correctness of some R2C transforms with unusual strides.
##### Removed
- The hipFFT API (header) has been removed from after a long deprecation period. Please use the [hipFFT](https://github.com/ROCmSoftwarePlatform/hipFFT) package/repository to obtain the hipFFT API.
#### rocPRIM 2.10.13
rocPRIM 2.10.13 for ROCm 5.1.0
##### Fixed
- Fixed radix sort int64_t bug introduced in [2.10.11]
##### Added
- Future value
- Added device partition_three_way to partition input to three output iterators based on two predicates
##### Changed
- The reduce/scan algorithm precision issues in the tests has been resolved for half types.
##### Known Issues
- device_segmented_radix_sort unit test failing for HIP on Windows
#### rocRAND 2.10.13
rocRAND 2.10.13 for ROCm 5.1.0
##### Added
- Generating a random sequence different sizes now produces the same sequence without gaps
indepent of how many values are generated per call.
- Only in the case of XORWOW, MRG32K3A, PHILOX4X32_10, SOBOL32 and SOBOL64
- This only holds true if the size in each call is a divisor of the distributions
`output_width` due to performance
- Similarly the output pointer has to be aligned to `output_width * sizeof(output_type)`
##### Changed
- [hipRAND](https://github.com/ROCmSoftwarePlatform/hipRAND.git) split into a separate package
- Header file installation location changed to match other libraries.
- Using the `rocrand.h` header file should now use `#include &lt;rocrand/rocrand.h&gt;`, rather than `#include &lt;rocrand/rocrand.h&gt;`
- rocRAND still includes hipRAND using a submodule
- The rocRAND package also sets the provides field with hipRAND, so projects which require hipRAND can begin to specify it.
##### Fixed
- Fix offset behaviour for XORWOW, MRG32K3A and PHILOX4X32_10 generator, setting offset now
correctly generates the same sequence starting from the offset.
- Only uniform int and float will work as these can be generated with a single call to the generator
##### Known Issues
- kernel_xorwow unit test is failing for certain GPU architectures.
#### rocSOLVER 3.17.0
rocSOLVER 3.17.0 for ROCm 5.1.0
##### Optimized
- Optimized non-pivoting and batch cases of the LU factorization
##### Fixed
- Fixed missing synchronization in SYTRF with `rocblas_fill_lower` that could potentially
result in incorrect pivot values.
- Fixed multi-level logging output to file with the `ROCSOLVER_LOG_PATH`,
`ROCSOLVER_LOG_TRACE_PATH`, `ROCSOLVER_LOG_BENCH_PATH` and `ROCSOLVER_LOG_PROFILE_PATH`
environment variables.
- Fixed performance regression in the batched LU factorization of tiny matrices
#### rocSPARSE 2.1.0
rocSPARSE 2.1.0 for ROCm 5.1.0
##### Added
- gtsv_interleaved_batch
- gpsv_interleaved_batch
- SpGEMM_reuse
- Allow copying of mat info struct
##### Improved
- Optimization for SDDMM
- Allow unsorted matrices in csrgemm multipass algorithm
##### Known Issues
- none
#### rocThrust 2.14.0
rocThrust 2.14.0 for ROCm 5.1.0
##### Added
- Updated to match upstream Thrust 1.15.0
##### Known Issues
- async_copy, partition, and stable_sort_by_key unit tests are failing on HIP on Windows.
#### Tensile 4.32.0
Tensile 4.32.0 for ROCm 5.1.0
##### Added
- Better control of parallelism to control memory usage
- Support for multiprocessing on Windows for TensileCreateLibrary
- New JSD metric and metric selection functionality
- Initial changes to support two-tier solution selection
##### Optimized
- Optimized runtime of TensileCreateLibraries by reducing max RAM usage
- StoreCInUnroll additional optimizations plus adaptive K support
- DGEMM NN optimizations with PrefetchGlobalRead(PGR)=2 support
##### Changed
- Update Googletest to 1.11.0
##### Removed
- Remove no longer supported benchmarking steps
-------------------
## ROCm 5.0.2
<!-- markdownlint-disable first-line-h1 -->
### Fixed Defects

View File

@@ -15,23 +15,25 @@ The release notes for the ROCm platform.
-------------------
## ROCm 5.0.2
<!-- markdownlint-disable first-line-h1 -->
<!-- markdownlint-disable no-duplicate-header -->
### Fixed Defects
## ROCm 5.1.3
The following defects are fixed in the ROCm v5.0.2 release.
#### Issue with hostcall Facility in HIP Runtime
### Library Changes in ROCM 5.1.3
In ROCm v5.0, when using the “assert()” call in a HIP kernel, the compiler may sometimes fail to emit kernel metadata related to the hostcall facility, which results in incomplete initialization of the hostcall facility in the HIP runtime. This can cause the HIP kernel to crash when it attempts to execute the “assert()” call.
The root cause was an incorrect check in the compiler to determine whether the hostcall facility is required by the kernel. This is fixed in the ROCm v5.0.2 release.
The resolution includes a compiler change, which emits the required metadata by default, unless the compiler can prove that the hostcall facility is not required by the kernel. This ensures that the “assert()” call never fails.
Note:
This fix may lead to breakage in some OpenMP offload use cases, which use print inside a target region and result in an abort in device code. The issue will be fixed in a future release.
Compatibility Matrix Updates to ROCm Deep Learning Guide
The compatibility matrix in the AMD Deep Learning Guide is updated for ROCm v5.0.2.
| Library | Version |
|---------|---------|
| hipBLAS | [0.50.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.1.3) |
| hipCUB | [2.11.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.1.3) |
| hipFFT | [1.0.7](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.1.3) |
| hipSOLVER | [1.3.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.1.3) |
| hipSPARSE | [2.1.0](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.1.3) |
| rccl | [2.11.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.1.3) |
| rocALUTION | [2.0.2](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.1.3) |
| rocBLAS | [2.43.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.1.3) |
| rocFFT | [1.0.16](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.1.3) |
| rocPRIM | [2.10.13](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.1.3) |
| rocRAND | [2.10.13](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.1.3) |
| rocSOLVER | [3.17.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.1.3) |
| rocSPARSE | [2.1.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.1.3) |
| rocThrust | [2.14.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.1.3) |
| Tensile | [4.32.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.1.3) |

View File

@@ -18,8 +18,8 @@ shutil.copy2('../CHANGELOG.md','./CHANGELOG.md')
project = "ROCm Documentation"
author = "Advanced Micro Devices, Inc."
copyright = "Copyright (c) 2023 Advanced Micro Devices, Inc. All rights reserved."
version = "5.0.2"
release = "5.0.2"
version = "5.1.3"
release = "5.1.3"
setting_all_article_info = True
all_article_info_os = ["linux"]
@@ -64,7 +64,7 @@ article_pages = [
external_toc_path = "./sphinx/_toc.yml"
docs_core = ROCmDocs("ROCm 5.0.2 Documentation Home")
docs_core = ROCmDocs("ROCm 5.1.3 Documentation Home")
docs_core.setup()
external_projects_current_project = "rocm"

View File

@@ -18,8 +18,8 @@ following commands based on your distribution.
```shell
sudo apt update
wget https://repo.radeon.com/amdgpu-install/21.50.2/ubuntu/bionic/amdgpu-install_21.50.2.50002-1_all.deb
sudo apt install ./amdgpu-install_21.50.2.50002-1_all.deb
wget https://repo.radeon.com/amdgpu-install/22.10.3/ubuntu/bionic/amdgpu-install_22.10.3.50103-1_all.deb
sudo apt install ./amdgpu-install_22.10.3.50103-1_all.deb
```
:::
@@ -28,8 +28,8 @@ sudo apt install ./amdgpu-install_21.50.2.50002-1_all.deb
```shell
sudo apt update
wget https://repo.radeon.com/amdgpu-install/21.50.2/ubuntu/focal/amdgpu-install_21.50.2.50002-1_all.deb
sudo apt install ./amdgpu-install_21.50.2.50002-1_all.deb
wget https://repo.radeon.com/amdgpu-install/22.10.3/ubuntu/focal/amdgpu-install_22.10.3.50103-1_all.deb
sudo apt install ./amdgpu-install_22.10.3.50103-1_all.deb
```
:::
@@ -44,7 +44,7 @@ sudo apt install ./amdgpu-install_21.50.2.50002-1_all.deb
:sync: RHEL-7
```shell
sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/7.9/amdgpu-install-21.50.2.50002-1.el7.noarch.rpm
sudo yum install https://repo.radeon.com/amdgpu-install/22.10.3/rhel/7.9/amdgpu-install-22.10.3.50103-1.el7.noarch.rpm
```
:::
@@ -53,7 +53,7 @@ sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/7.9/amdgpu-
:sync: RHEL-8
```shell
sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/8.4/amdgpu-install-21.50.2.50002-1.el7.noarch.rpm
sudo yum install https://repo.radeon.com/amdgpu-install/22.10.3/rhel/8.4/amdgpu-install-22.10.3.50103-1.el8.noarch.rpm
```
:::
@@ -62,7 +62,7 @@ sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/8.4/amdgpu-
:sync: RHEL-8
```shell
sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/8.5/amdgpu-install-21.50.2.50002-1.el7.noarch.rpm
sudo yum install https://repo.radeon.com/amdgpu-install/22.10.3/rhel/8.5/amdgpu-install-22.10.3.50103-1.el8.noarch.rpm
```
:::
@@ -72,11 +72,11 @@ sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/8.5/amdgpu-
:sync: SLES15
::::{tab-set}
:::{tab-item} Service Pack 3
:sync: SLES15-SP3
:::{tab-item} Service Pack 4
:sync: SLES15-SP4
```shell
sudo zypper --no-gpg-checks install https://repo.radeon.com/amdgpu-install/21.50.2/sle/15/amdgpu-install-21.50.2.50002-1.noarch.rpm
sudo zypper --no-gpg-checks install https://repo.radeon.com/amdgpu-install/22.10.3/sle/15/amdgpu-install-22.10.3.50103-1.noarch.rpm
```
:::
@@ -155,9 +155,9 @@ the installer script will install packages in the single-version layout.
For the multi-version ROCm installation you must use the installer script from
the latest release of ROCm that you wish to install.
**Example:** If you want to install ROCm releases 5.0.0 and 5.0.2
**Example:** If you want to install ROCm releases 5.0.2 and 5.1.3
simultaneously, you are required to download the installer from the latest ROCm
release v5.0.2.
release v5.1.3.
### Add Required Repositories
@@ -176,7 +176,7 @@ Run the following commands based on your distribution to add the repositories:
:sync: ubuntu-18.04
```shell
for ver in 5.0; do
for ver in 5.0.2 5.1.3; do
echo "deb [arch=amd64 signed-by=/etc/apt/trusted.gpg.d/rocm-keyring.gpg] https://repo.radeon.com/rocm/apt/$ver bionic main" | sudo tee /etc/apt/sources.list.d/rocm.list
done
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' | sudo tee /etc/apt/preferences.d/rocm-pin-600
@@ -188,7 +188,7 @@ sudo apt update
:sync: ubuntu-20.04
```shell
for ver in 5.0; do
for ver in 5.0.2 5.1.3; do
echo "deb [arch=amd64 signed-by=/etc/apt/trusted.gpg.d/rocm-keyring.gpg] https://repo.radeon.com/rocm/apt/$ver focal main" | sudo tee /etc/apt/sources.list.d/rocm.list
done
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' | sudo tee /etc/apt/preferences.d/rocm-pin-600
@@ -206,7 +206,7 @@ sudo apt update
:sync: RHEL-7
```shell
for ver in 5.0; do
for ver in 5.0.2 5.1.3; do
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
[ROCm-$ver]
name=ROCm$ver
@@ -225,11 +225,11 @@ sudo yum clean all
:sync: RHEL-8
```shell
for ver in 5.0;
for ver in 5.0.2 5.1.3; do
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
[ROCm-$ver]
name=ROCm$ver
baseurl=https://repo.radeon.com/rocm/rhel8/$ver/main
baseurl=https://repo.radeon.com/rocm/yum/$ver/main
enabled=1
priority=50
gpgcheck=1
@@ -245,15 +245,11 @@ sudo yum clean all
:::::{tab-item} SUSE Linux Enterprise Server 15
:sync: SLES15
::::{tab-set}
:::{tab-item} Service Pack 3
:sync: SLES15-SP3
```shell
for ver in 5.0; do
for ver in 5.0.2 5.1.3; do
sudo tee --append /etc/zypp/repos.d/rocm.repo <<EOF
name=rocm
baseurl=https://repo.radeon.com/rocm/$ver/sle/15/main/x86_64
baseurl=https://repo.radeon.com/rocm/zyp/$ver/main
enabled=1
gpgcheck=1
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
@@ -262,9 +258,6 @@ done
sudo zypper ref
```
:::
::::
:::::
::::::
@@ -279,12 +272,12 @@ sudo amdgpu-install --usecase=rocm --rocmrelease=<release-number-3>
```
Following are examples of ROCm multi-version installation. The kernel-mode
driver, associated with the ROCm release v5.3, will be installed as its latest
driver, associated with the ROCm release v5.1.3, will be installed as its latest
release in the list.
```none
sudo amdgpu-install --usecase=rocm --rocmrelease=5.0.0
sudo amdgpu-install --usecase=rocm --rocmrelease=5.0.2
sudo amdgpu-install --usecase=rocm --rocmrelease=5.1.3
```
## Additional options

View File

@@ -53,7 +53,7 @@ To add the AMDGPU repository, follow these steps:
```shell
# amdgpu repository for bionic
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu bionic main' \
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10.3/ubuntu bionic main' \
| sudo tee /etc/apt/sources.list.d/amdgpu.list
sudo apt update
```
@@ -64,7 +64,7 @@ sudo apt update
```shell
# amdgpu repository for focal
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu focal main' \
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10.3/ubuntu focal main' \
| sudo tee /etc/apt/sources.list.d/amdgpu.list
sudo apt update
```
@@ -91,7 +91,7 @@ To add the ROCm repository, use the following steps:
```shell
# ROCm repositories for bionic
for ver in 5.0 5.0.2; do
for ver in 5.0.2 5.1.3; do
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/$ver bionic main" \
| sudo tee --append /etc/apt/sources.list.d/rocm.list
done
@@ -106,7 +106,7 @@ sudo apt update
```shell
# ROCm repositories for focal
for ver in 5.0 5.0.2; do
for ver in 5.0.2 5.1.3; do
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/$ver focal main" \
| sudo tee --append /etc/apt/sources.list.d/rocm.list
done
@@ -136,7 +136,7 @@ For a comprehensive list of meta-packages, refer to
- Sample Multi-version installation
```shell
sudo apt install rocm-hip-sdk5.0.2
sudo apt install rocm-hip-sdk5.0.2 rocm-hip-sdk5.1.3
```
:::::
@@ -160,7 +160,7 @@ section.
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/7.9/main/x86_64/
baseurl=https://repo.radeon.com/amdgpu/22.10.3/rhel/7.9/main/x86_64/
enabled=1
priority=50
gpgcheck=1
@@ -179,7 +179,7 @@ sudo yum clean all
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.4/main/x86_64/
baseurl=https://repo.radeon.com/amdgpu/22.10.3/rhel/8.4/main/x86_64/
enabled=1
priority=50
gpgcheck=1
@@ -198,7 +198,7 @@ sudo yum clean all
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.5/main/x86_64/
baseurl=https://repo.radeon.com/amdgpu/22.10.3/rhel/8.5/main/x86_64/
enabled=1
priority=50
gpgcheck=1
@@ -224,11 +224,11 @@ sudo reboot
To add the ROCm repository, use the following steps, based on your distribution:
::::{tab-set}
:::{tab-item} RHEL 7
:sync: RHEL-7
:::{tab-item} RHEL 8
:sync: RHEL-8
```shell
for ver in 5.0 5.0.2; do
for ver in 5.0.2 5.1.3; do
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
[ROCm-$ver]
name=ROCm$ver
@@ -247,11 +247,11 @@ sudo yum clean all
:sync: RHEL-8
```shell
for ver in 5.0 5.0.2; do
for ver in 5.0.2 5.1.3; do
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
[ROCm-$ver]
name=ROCm$ver
baseurl=https://repo.radeon.com/rocm/rhel8/$ver/main
baseurl=https://repo.radeon.com/rocm/yum/$ver/main
enabled=1
priority=50
gpgcheck=1
@@ -282,7 +282,7 @@ For a comprehensive list of meta-packages, refer to
- Sample Multi-version installation
```shell
sudo yum install rocm-hip-sdk5.0.2
sudo yum install rocm-hip-sdk5.0.2 rocm-hip-sdk5.1.3
```
:::::
@@ -297,15 +297,11 @@ If you have a version of the kernel-mode driver installed, you may skip this
section.
```
::::{tab-set}
:::{tab-item} Service Pack 3
:sync: SLES15-SP3
```shell
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/sle/15.3/main/x86_64
baseurl=https://repo.radeon.com/amdgpu/22.10.3/sle/15/main/x86_64
enabled=1
gpgcheck=1
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
@@ -313,9 +309,6 @@ EOF
sudo zypper ref
```
:::
::::
Install the kernel mode driver and reboot the system using the following
commands:
@@ -330,7 +323,7 @@ sudo reboot
To add the ROCm repository, use the following steps:
```shell
for ver in 5.0 5.0.2; do
for ver in 5.0.2 5.1.3; do
sudo tee --append /etc/zypp/repos.d/rocm.repo <<EOF
[ROCm-$ver]
name=ROCm$ver
@@ -362,7 +355,7 @@ For a comprehensive list of meta-packages, refer to
- Sample Multi-version installation
```shell
sudo zypper --gpg-auto-import-keys install rocm-hip-sdk5.0.2
sudo zypper --gpg-auto-import-keys install rocm-hip-sdk5.0.2 rocm-hip-sdk5.1.3
```
:::::
@@ -399,7 +392,7 @@ but are generally useful. Verification of the install is advised.
2. Add binary paths to the `PATH` environment variable.
```shell
export PATH=$PATH:/opt/rocm-5.0.2/bin:/opt/rocm-5.0.2/opencl/bin
export PATH=$PATH:/opt/rocm-5.1.3/bin:/opt/rocm-5.1.3/opencl/bin
```
```{attention}

View File

@@ -26,7 +26,7 @@ repository to the new release.
```shell
# amdgpu repository for bionic
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu bionic main' \
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10.3/ubuntu bionic main' \
| sudo tee /etc/apt/sources.list.d/amdgpu.list
sudo apt update
```
@@ -37,7 +37,7 @@ sudo apt update
```shell
# amdgpu repository for focal
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu focal main' \
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10.3/ubuntu focal main' \
| sudo tee /etc/apt/sources.list.d/amdgpu.list
sudo apt update
```
@@ -57,7 +57,7 @@ sudo apt update
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/7.9/main/x86_64/
baseurl=https://repo.radeon.com/amdgpu/22.10.3/rhel/7.9/main/x86_64/
enabled=1
priority=50
gpgcheck=1
@@ -75,7 +75,7 @@ sudo yum clean all
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.4/main/x86_64/
baseurl=https://repo.radeon.com/amdgpu/22.10.3/rhel/8.4/main/x86_64/
enabled=1
priority=50
gpgcheck=1
@@ -93,7 +93,7 @@ sudo yum clean all
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.5/main/x86_64/
baseurl=https://repo.radeon.com/amdgpu/22.10.3/rhel/8.5/main/x86_64/
enabled=1
priority=50
gpgcheck=1
@@ -104,20 +104,15 @@ sudo yum clean all
:::
::::
:::::
:::::{tab-item} SUSE Linux Enterprise Server 15
:sync: SLES15
::::{tab-set}
:::{tab-item} Service Pack 3
:sync: SLES15-SP3
```shell
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
[amdgpu]
name=amdgpu
baseurl=https://repo.radeon.com/amdgpu/21.50.2/sle/15.3/main/x86_64
baseurl=https://repo.radeon.com/amdgpu/22.10.3/sle/15/main/x86_64
enabled=1
gpgcheck=1
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
@@ -125,9 +120,6 @@ EOF
sudo zypper ref
```
:::
::::
:::::
::::::
@@ -180,7 +172,7 @@ repository to the new release.
:sync: ubuntu-18.04
```shell
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.0.5 bionic main" \
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.1.3 bionic main" \
| sudo tee /etc/apt/sources.list.d/rocm.list
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' \
| sudo tee /etc/apt/preferences.d/rocm-pin-600
@@ -192,7 +184,7 @@ sudo apt update
:sync: ubuntu-20.04
```shell
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.0.2 focal main" \
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.1.3 focal main" \
| sudo tee /etc/apt/sources.list.d/rocm.list
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' \
| sudo tee /etc/apt/preferences.d/rocm-pin-600
@@ -211,9 +203,9 @@ sudo apt update
```shell
sudo tee /etc/yum.repos.d/rocm.repo <<EOF
[ROCm-5.0.22]
name=ROCm5.0.2
baseurl=https://repo.radeon.com/rocm/yum/5.0.2/main
[ROCm-5.1.3]
name=ROCm5.1.3
baseurl=https://repo.radeon.com/rocm/yum/5.1.3/main
enabled=1
priority=50
gpgcheck=1
@@ -228,9 +220,9 @@ sudo yum clean all
```shell
sudo tee /etc/yum.repos.d/rocm.repo <<EOF
[ROCm-5.0.2]
name=ROCm5.0.2
baseurl=https://repo.radeon.com/rocm/rhel8/5.0.2/main
[ROCm-5.1.3]
name=ROCm5.1.3
baseurl=https://repo.radeon.com/rocm/yum/5.1.3/main
enabled=1
priority=50
gpgcheck=1
@@ -247,10 +239,10 @@ sudo yum clean all
```shell
sudo tee /etc/zypp/repos.d/rocm.repo <<EOF
[ROCm-5.0.2]
name=ROCm5.0.2
[ROCm-5.1.3]
name=ROCm5.1.3
name=rocm
baseurl=https://repo.radeon.com/rocm/zyp/5.0.2/main
baseurl=https://repo.radeon.com/rocm/zyp/5.1.3/main
enabled=1
gpgcheck=1
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key

View File

@@ -18,6 +18,7 @@ Detailed walkthroughs of specific use-cases driven by frameworks using ROCm
acceleration.
- [Implementing Inception V3 on ROCm with PyTorch](machine_learning/pytorch_inception.md)
- [Optimizing Inference with MIGraphX](machine_learning/migraphx_optimization.md)
:::

View File

@@ -10,4 +10,11 @@ A collection of detailed and guided examples for working with Inception V3 with
:::
:::{grid-item-card} Optimizing Inference with MIGraphX
:link: migraphx_optimization
:link-type: doc
Walkthroughs of optimizing inference using MIGraphX.
:::
:::::

View File

@@ -83,6 +83,10 @@ TensorFlow, \[Online image\]. [https://www.tensorflow.org/extras/tensorflow_bran
MAGMA, \[Online image\]. [https://bitbucket.org/icl/magma/src/master/docs/](https://bitbucket.org/icl/magma/src/master/docs/)
Advanced Micro Devices, Inc., \[Online\]. Available: [https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/](https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/)
Advanced Micro Devices, Inc., \[Online\]. Available: [https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki)
Docker, \[Online\]. [https://docs.docker.com/get-started/overview/](https://docs.docker.com/get-started/overview/)
Torchvision, \[Online\]. Available [https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision](https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision)

View File

@@ -425,6 +425,10 @@ TensorFlow, \[Online image\]. [https://www.tensorflow.org/extras/tensorflow_bran
MAGMA, \[Online image\]. [https://bitbucket.org/icl/magma/src/master/docs/](https://bitbucket.org/icl/magma/src/master/docs/)
Advanced Micro Devices, Inc., \[Online\]. Available: [https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/](https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/)
Advanced Micro Devices, Inc., \[Online\]. Available: [https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki)
Docker, \[Online\]. [https://docs.docker.com/get-started/overview/](https://docs.docker.com/get-started/overview/)
Torchvision, \[Online\]. Available [https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision](https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision)

View File

@@ -197,6 +197,10 @@ TensorFlow, \[Online image\]. [https://www.tensorflow.org/extras/tensorflow_bran
MAGMA, \[Online image\]. [https://bitbucket.org/icl/magma/src/master/docs/](https://bitbucket.org/icl/magma/src/master/docs/)
Advanced Micro Devices, Inc., \[Online\]. Available: [https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/](https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/)
Advanced Micro Devices, Inc., \[Online\]. Available: [https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki)
Docker, \[Online\]. [https://docs.docker.com/get-started/overview/](https://docs.docker.com/get-started/overview/)
Torchvision, \[Online\]. Available [https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision](https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision)

View File

@@ -93,6 +93,7 @@ agile, flexible, rapid and secure manner. [more...](rocm)
- [Examples](https://github.com/amd/rocm-examples)
- [ML, DL, and AI](examples/machine_learning/all)
- [](examples/machine_learning/pytorch_inception)
- [](examples/machine_learning/migraphx_optimization)
:::
::::

View File

@@ -17,4 +17,11 @@ Composable Kernel: Performance Portable Programming Model for Machine Learning T
:::
:::{grid-item-card} {doc}`MIGraphX <amdmigraphx:index>`
AMD MIGraphX is AMD's graph inference engine that accelerates machine learning model inference.
- {doc}`Documentation <amdmigraphx:index>`
:::
:::::

View File

@@ -8,7 +8,7 @@
:::{grid-item-card} [HIP](./hip)
HIP is both AMD's GPU programming language extension and the GPU runtime.
- {doc}`hip:doxygen/html/index`
- {doc}`hip:.doxygen/docBin/html/index`
- [Examples](https://github.com/amd/rocm-examples/tree/develop/HIP-Basic)
:::
@@ -43,6 +43,7 @@ Libraries related to AI.
- {doc}`MIOpen <miopen:index>`
- {doc}`Composable Kernel <composable_kernel:index>`
- {doc}`MIGraphX <amdmigraphx:index>`
:::

View File

@@ -45,6 +45,15 @@ modern CPU and GPU platforms.
:::
:::{grid-item-card} {doc}`rocWMMA <rocwmma:index>`
`rocWMMA` provides an API to break down mixed precision matrix multiply-accumulate
(MMA) problems into fragments and distributes these over GPU wavefronts.
- {doc}`Documentation <rocwmma:index>`
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocWMMA/blob/develop/CHANGELOG.md)
:::
:::{grid-item-card} {doc}`rocSOLVER <rocsolver:index>`
`rocSOLVER` provides a subset of LAPACK (Linear Algebra Package) functionality on the ROCm platform.

View File

@@ -21,6 +21,7 @@ at compile-time of the hipLIB in question. For dynamic dispatch between vendor i
- {doc}`hipBLAS <hipblas:index>`
- {doc}`hipBLASLt <hipblaslt:index>`
- {doc}`rocALUTION <rocalution:index>`
- {doc}`rocWMMA <rocwmma:index>`
- {doc}`rocSOLVER <rocsolver:index>`
- {doc}`hipSOLVER <hipsolver:index>`
- {doc}`rocSPARSE <rocsparse:index>`

View File

@@ -12,7 +12,7 @@ page introduces the HIP runtime and other HIP libraries and tools.
The HIP Runtime is used to enable GPU acceleration for all HIP language based
products.
- {doc}`hip:doxygen/html/index`
- {doc}`hip:.doxygen/docBin/html/index`
- [Examples](https://github.com/amd/rocm-examples/tree/develop/HIP-Basic)
:::

View File

@@ -8,13 +8,11 @@ AMD ROCm™ Platform supports the following Linux distributions.
| Distribution |Processor Architectures| Validated Kernel |
|--------------------|-----------------------|--------------------|
| CentOS 8.3 | x86-64 | 4.18 |
| CentOS 7.9 | x86-64 | 3.10 |
| RHEL 8.5, 8.4 | x86-64 | 4.18 |
| RHEL 7.9 | x86-64 | 3.10 |
| SLES 15 SP3 | x86-64 | 5.3.18 |
| Ubuntu 20.04.3 LTS | x86-64 | 5.8 |
| Ubuntu 18.04.5 LTS | x86-64 | 5.4.0 |
| RHEL 9.1 | x86-64 | 5.14 |
| RHEL 8.6 to 8.7 | x86-64 | 4.18 |
| SLES 15 SP4 | x86-64 | |
| Ubuntu 20.04.5 LTS | x86-64 | 5.15 |
| Ubuntu 22.04.1 LTS | x86-64 | 5.15, OEM 5.17 |
## Virtualization Support
@@ -54,24 +52,13 @@ Use Driver Shipped with ROCm
[Use Radeon Pro Driver](https://www.amd.com/en/support/linux-drivers)
| Name | Architecture |[LLVM Target](https://www.llvm.org/docs/AMDGPUUsage.html#processors) | Support|
|:----:|:------------:|:-------------------------------------------------------------------:|:-------:|
|:----:|:------------:|:--------------------------------------------------------------------:|:-------:|
| AMD Radeon™ Pro W6800 | RDNA2 | gfx1030 | ✅ |
| AMD Radeon™ Pro V620 | RDNA2 | gfx1030 | ✅ |
| AMD Radeon™ Pro VII | GCN5.1 | gfx906 | ✅ |
:::
:::{tab-item} Radeon™
:sync: radeon
[Use Radeon Pro Driver](https://www.amd.com/en/support/linux-drivers)
| Name | Architecture |[LLVM Target](https://www.llvm.org/docs/AMDGPUUsage.html#processors) | Support|
|:----:|:------------:|:-------------------------------------------------------------------:|:-------:|
| AMD Radeon™ VII | GCN5.1 | gfx906 | ✅ |
:::
::::
### Support Status

View File

@@ -58,6 +58,7 @@ The table is ordered to follow ROCm's manifest file.
| [rocPRIM](https://github.com/ROCmSoftwarePlatform/rocPRIM/) | [MIT](https://github.com/ROCmSoftwarePlatform/rocPRIM/blob/develop/LICENSE.txt) |
| [rocWMMA](https://github.com/ROCmSoftwarePlatform/rocWMMA/) | [MIT](https://github.com/ROCmSoftwarePlatform/rocWMMA/blob/develop/LICENSE.md) |
| [hipfort](https://github.com/ROCmSoftwarePlatform/hipfort/) | [MIT](https://github.com/ROCmSoftwarePlatform/hipfort/blob/master/LICENSE) |
| [AMDMIGraphX](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/) | [MIT](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/blob/develop/LICENSE) |
| [ROCmValidationSuite](https://github.com/ROCm-Developer-Tools/ROCmValidationSuite/) | [MIT](https://github.com/ROCm-Developer-Tools/ROCmValidationSuite/blob/master/LICENSE) |
| [aomp](https://github.com/ROCm-Developer-Tools/aomp/) | [Apache 2.0](https://github.com/ROCm-Developer-Tools/aomp/blob/aomp-dev/LICENSE) |
| [aomp-extras](https://github.com/ROCm-Developer-Tools/aomp-extras/) | [MIT](https://github.com/ROCm-Developer-Tools/aomp-extras/blob/aomp-dev/LICENSE) |
@@ -120,4 +121,4 @@ following location: `/opt/rocm/share/doc/<component-name>/`
For example, you can fetch the licensing information of the `_amd_comgr_`
component (Code Object Manager) from the `amd_comgr` folder. A file named
`LICENSE.txt` contains the license details at:
`/opt/rocm-5.0.2/share/doc/amd_comgr/LICENSE.txt`
`/opt/rocm-5.1.3/share/doc/amd_comgr/LICENSE.txt`

View File

@@ -99,6 +99,8 @@ subtrees:
url: ${project:hipblaslt}
- title: rocALUTION
url: ${project:rocalution}
- title: rocWMMA
url: ${project:rocwmma}
- title: rocSOLVER
url: ${project:rocsolver}
- title: hipSOLVER
@@ -147,6 +149,8 @@ subtrees:
url: ${project:miopen}
- title: Composable Kernel
url: ${project:composable_kernel}
- title: MIGraphX - Graph Optimization
url: ${project:amdmigraphx}
- file: reference/computer_vision
subtrees:
- entries:
@@ -221,6 +225,7 @@ subtrees:
subtrees:
- entries:
- file: examples/machine_learning/pytorch_inception
- file: examples/machine_learning/migraphx_optimization
- caption: About
entries:

View File

@@ -224,6 +224,10 @@ ROCm CMake Packages
+-----------+----------+--------------------------------------------------------+
| MIOpen | miopen | ``MIOpen`` |
+-----------+----------+--------------------------------------------------------+
| MIGraphX | migraphx | ``migraphx::migraphx``, ``migraphx::migraphx_c``, |
| | | ``migraphx::migraphx_cpu``, ``migraphx::migraphx_gpu``,|
| | | ``migraphx::migraphx_onnx``, ``migraphx::migraphx_tf`` |
+-----------+----------+--------------------------------------------------------+
Using CMake Presets
===================