Compare commits

...

124 Commits

Author SHA1 Message Date
Rmalavally
0e53cfa829 Update for ROCm v3.0 (#975) 2019-12-19 16:35:29 -08:00
Aditya Lad
fef7342549 Update default.xml for ROCm3.0 (#974)
* ROCm Release 3.0

* rocm_smi_lib changes

* Adding AOMP projects

Co-authored-by: adilad7 <46574164+adilad7@users.noreply.github.com>
2019-12-19 16:29:44 -08:00
zhang2amd
f6565e7f3f Merge pull request #956 from chauthai/Update-README-Ubuntu-Install
Update README.md
2019-12-19 09:30:15 -08:00
Rmalavally
c56404399b Merge branch 'master' into master 2019-12-18 17:00:40 -08:00
Rmalavally
664311f4fe Add files via upload 2019-12-18 16:32:05 -08:00
Rmalavally
f45bfbfeed Update README.md 2019-12-18 16:28:37 -08:00
Konstantin Zhuravlyov
f00e4fdf7b Merge pull request #971 from RadeonOpenCompute/kzhuravl-monorepo-switch
Switch LLVM and OCL components to monorepo
2019-12-18 13:44:16 -05:00
Konstantin Zhuravlyov
ae5f7d7be9 LLVM monorepo switch #2
- Remove nested compiler projects under ROCm-OpenCL-Runtime
2019-12-17 23:54:01 -05:00
Konstantin Zhuravlyov
720760ad63 LLVM monorepo switch #1
- Switch AMDGPU LLVM compiler to monorepo
  - Update LLVM and ROCm-Device-Libs to roc-ocl-3.0.0
2019-12-17 23:49:51 -05:00
zhang2amd
b8a0a90287 Merge pull request #921 from RadeonOpenCompute/zhang2amd-patch-package
Update package structure.
2019-12-05 16:32:02 -08:00
Chau Thai
73ed1122fa Fixed typos 2019-12-03 18:50:37 +01:00
Chau Thai
d50a208edd Fixed sudo sudo issue 2019-12-02 23:18:59 +01:00
Chau Thai
720c7cddc7 Update README.md
Fixed wget command to add `rocm.gpg.key` to sources.list.
2019-12-01 19:11:09 +01:00
Rmalavally
586a57823a Update README.md 2019-11-28 11:54:52 -08:00
Rmalavally
c546c5fe3c Update README.md 2019-11-28 11:45:11 -08:00
Rmalavally
a3ce396048 Add files via upload 2019-11-27 11:03:55 -08:00
Rmalavally
51a22a97f9 Add files via upload 2019-11-27 10:53:29 -08:00
Rmalavally
72ddeef85b Update README.md 2019-11-27 10:51:59 -08:00
Aakash Sudhanwa
6cafd3b8fd Updated release notes for ROCm 2.10 2019-11-27 10:47:55 -08:00
Rmalavally
111f8e8409 Add files via upload 2019-11-27 10:40:24 -08:00
Rmalavally
27bc1477f1 Update README.md 2019-11-27 10:38:38 -08:00
Rmalavally
edc179783c Update README.md 2019-11-26 11:18:01 -08:00
Rmalavally
8fac5002c4 Update README.md 2019-11-25 20:04:06 -08:00
Rmalavally
77764b4887 Update README.md 2019-11-25 20:01:18 -08:00
Rmalavally
b952ebd666 Update README.md 2019-11-25 19:57:46 -08:00
Rmalavally
db6dbdcc7a Merge branch 'master' into master 2019-11-25 17:34:45 -08:00
Rmalavally
924f08a6af Update README.md 2019-11-25 17:30:59 -08:00
Rmalavally
9406a51332 Update README.md 2019-11-25 17:29:37 -08:00
Rmalavally
c76270cb72 Update README.md 2019-11-25 17:28:21 -08:00
Rmalavally
5a82caca04 Update README.md 2019-11-25 17:25:48 -08:00
Rmalavally
6a4ae7c703 Merge branch 'master' into master 2019-11-25 17:20:51 -08:00
Rmalavally
f49275a805 Update README.md 2019-11-25 17:14:52 -08:00
Rmalavally
93858f653b Links 2019-11-25 17:12:28 -08:00
Rmalavally
2a1c123ad8 Update README.md 2019-11-25 17:08:20 -08:00
Rmalavally
7676103619 Update README.md 2019-11-25 17:07:43 -08:00
Rmalavally
202562d9fe Update README.md 2019-11-25 17:07:02 -08:00
Rmalavally
c10e1fcfe2 Update README.md 2019-11-25 17:06:16 -08:00
Rmalavally
a298b06a72 Update README.md 2019-11-25 17:05:46 -08:00
Rmalavally
2f9e5b68e7 Update README.md 2019-11-25 17:03:56 -08:00
Rmalavally
ccfe2f1a34 Update README.md 2019-11-25 16:47:24 -08:00
Rmalavally
09196ba787 Update README.md 2019-11-25 16:45:54 -08:00
Rmalavally
606cc11ebe Update README.md 2019-11-25 16:43:42 -08:00
Rmalavally
caead32d16 Update README.md 2019-11-25 16:42:52 -08:00
Rmalavally
75d01b3a23 Update README.md 2019-11-25 16:39:41 -08:00
Rmalavally
6e3be7651a Update README.md 2019-11-25 16:39:01 -08:00
Rmalavally
a7d27a43dc Update README.md 2019-11-25 16:38:07 -08:00
Rmalavally
af5b8e1b93 Update README.md 2019-11-25 16:34:41 -08:00
Rmalavally
b7e9232861 Update README.md 2019-11-25 16:32:13 -08:00
Rmalavally
dd86238927 Update README.md 2019-11-25 16:29:29 -08:00
Rmalavally
f452920919 Update README.md 2019-11-25 16:28:00 -08:00
Rmalavally
150d8a89da Update README.md 2019-11-25 16:20:47 -08:00
Rmalavally
7af0f9144b Update README.md 2019-11-25 16:19:25 -08:00
Rmalavally
ca1f7bc166 Update README.md 2019-11-25 16:18:47 -08:00
Rmalavally
a0773a297a Update README.md 2019-11-25 16:14:36 -08:00
Rmalavally
b421b06456 Update README.md 2019-11-25 16:02:01 -08:00
Rmalavally
754ebcec86 Updated version for components and libraries 2019-11-25 15:45:09 -08:00
Rmalavally
27daab068e Update README.md 2019-11-25 15:39:52 -08:00
Rmalavally
6c70298860 Update README.md 2019-11-25 15:38:00 -08:00
Rmalavally
f5cae8347d Update README.md 2019-11-25 15:04:04 -08:00
Rmalavally
2759642432 Update README.md 2019-11-25 14:27:10 -08:00
Rmalavally
8879e01c90 Update README.md 2019-11-25 14:24:33 -08:00
Rmalavally
5d05e3fdb6 Update README.md 2019-11-25 14:16:24 -08:00
Rmalavally
5c8047181e Update README.md 2019-11-25 14:15:59 -08:00
Rmalavally
eba4c1a25c Update README.md 2019-11-25 14:11:09 -08:00
Rmalavally
01d806e5c1 Update README.md 2019-11-25 14:10:04 -08:00
Rmalavally
d3ede323b6 Update README.md 2019-11-25 14:07:51 -08:00
Rmalavally
f5ab073219 Update README.md 2019-11-25 14:07:17 -08:00
Rmalavally
56e083faac Update README.md 2019-11-25 14:06:00 -08:00
Rmalavally
fd95c06730 Update README.md 2019-11-25 14:05:32 -08:00
Rmalavally
0852725b63 Update README.md 2019-11-25 14:02:27 -08:00
Rmalavally
0bcf42d721 Update README.md 2019-11-25 13:56:39 -08:00
Rmalavally
789ab1a6db Update README.md 2019-11-25 13:53:52 -08:00
Rmalavally
65beddddb8 Update README.md 2019-11-25 13:44:27 -08:00
Rmalavally
917a03430f Update README.md 2019-11-25 13:42:42 -08:00
Rmalavally
fefcb606fa Update README.md 2019-11-25 13:38:58 -08:00
Rmalavally
8f3d0331e0 Update README.md 2019-11-25 13:30:58 -08:00
Rmalavally
9f39dd9458 Update README.md 2019-11-25 13:28:58 -08:00
Rmalavally
7b52d73349 Update README.md 2019-11-25 13:25:03 -08:00
Rmalavally
b22c12266a Update README.md 2019-11-25 13:24:08 -08:00
Rmalavally
c02e11b650 Update README.md 2019-11-25 13:21:31 -08:00
Rmalavally
9ff3311a58 Update README.md 2019-11-25 13:19:19 -08:00
Rmalavally
1ca2e8eb87 Update README.md 2019-11-25 13:13:19 -08:00
Rmalavally
7370793bc7 Update README.md 2019-11-25 12:59:53 -08:00
Rmalavally
20cd1c8e53 Update README.md 2019-11-25 12:52:41 -08:00
Rmalavally
a65935a02e Update README.md 2019-11-25 12:49:36 -08:00
Rmalavally
03270a5bd5 Update README.md 2019-11-25 12:31:03 -08:00
Rmalavally
1d24ed9f9c Update README.md 2019-11-25 12:28:40 -08:00
Rmalavally
c0876b7ae5 Update README.md 2019-11-25 12:27:40 -08:00
Rmalavally
6b5707bfd9 Update README.md 2019-11-25 12:24:52 -08:00
Rmalavally
e6499f1563 Update README.md 2019-11-25 12:23:25 -08:00
Rmalavally
8ac3b0dcd9 Update README.md 2019-11-16 17:40:48 -08:00
Rmalavally
7f986f30d1 Update README.md 2019-11-16 17:39:34 -08:00
Rmalavally
87aa7362a3 Update README.md 2019-11-16 17:37:32 -08:00
Rmalavally
dde6ae15db Update README.md 2019-11-16 17:36:43 -08:00
Rmalavally
63eea0e9e7 Update README.md 2019-11-16 17:35:56 -08:00
Rmalavally
0ffc79f9b8 Update README.md 2019-11-16 17:35:06 -08:00
Rmalavally
32d48a74b9 Update README.md 2019-11-16 17:31:27 -08:00
Rmalavally
f6aa4d149e Update README.md 2019-11-16 17:30:43 -08:00
Rmalavally
6ef9776d50 Update README.md 2019-11-16 17:30:06 -08:00
Rmalavally
929f927ada Update README.md 2019-11-16 17:26:35 -08:00
Rmalavally
e0043fa0a1 Delete LegalGPUformatted.png 2019-11-16 17:24:45 -08:00
Rmalavally
354d8b5615 Delete AMDLogo.png 2019-11-16 17:24:33 -08:00
Rmalavally
aa17cc4dd0 Delete MLSE_Release_Notes_v2.10_v2.pdf 2019-11-16 17:24:13 -08:00
Rmalavally
c73d4777b9 Add files via upload 2019-11-16 17:22:13 -08:00
Rmalavally
cd0a756584 Delete MLSE_Release_Notes_v2.10_v2.pdf 2019-11-16 17:21:51 -08:00
Rmalavally
3d6de36a21 Add files via upload 2019-11-16 17:20:52 -08:00
Rmalavally
19efdc7d7b Add files via upload 2019-11-16 17:18:50 -08:00
Rmalavally
ea1b9f85f5 Add files via upload 2019-11-16 17:18:03 -08:00
Rmalavally
d948b1c844 Add files via upload 2019-11-15 14:01:09 -08:00
Rmalavally
046a4c35a6 Add files via upload 2019-11-15 14:00:39 -08:00
Rmalavally
916bc37963 Update README.md 2019-11-15 12:13:07 -08:00
Rmalavally
caf1f2e5aa Update README.md 2019-11-15 12:11:39 -08:00
Rmalavally
e5bbf892a9 Update README.md 2019-11-15 12:09:48 -08:00
Rmalavally
36e7bc054b Update README.md 2019-11-15 12:08:27 -08:00
Rmalavally
d8fb001470 Update README.md 2019-11-15 12:07:57 -08:00
Rmalavally
c67e8b1e2f Delete MLSE_Release_Notes_v2.10_v2.pdf 2019-11-15 12:06:36 -08:00
Cole Nelson
3ec6301df1 Create release notes for rocm 2.10 and archive folder.
Signed-off-by: Cole Nelson <cole.nelson@amd.com>
2019-11-15 11:38:33 -08:00
Rmalavally
dffe4ff850 Add files via upload 2019-11-15 11:09:21 -08:00
Rmalavally
77d20a4b1d Delete MLSE_Release_Notes_v2.10_v2.pdf 2019-11-15 10:45:22 -08:00
Rmalavally
7c52f68bd4 Add files via upload 2019-11-15 10:43:55 -08:00
Rmalavally
c6bb911570 Update README.md 2019-11-15 10:36:13 -08:00
Rmalavally
e8c598d156 Update README.md 2019-11-14 22:22:38 -08:00
zhang2amd
e9490d3a0b Make the package list alphabetical 2019-11-06 11:22:54 -08:00
zhang2amd
67a3649dce Update package structure. 2019-10-25 11:35:13 -07:00
5 changed files with 262 additions and 250 deletions

Binary file not shown.

Binary file not shown.

435
README.md
View File

@@ -1,29 +1,31 @@
# AMD ROCm Release Notes v2.10
# AMD ROCm Release Notes v3.0
This page describes the features, fixed issues, and information about downloading and installing the ROCm software.
It also covers known issues and deprecated features in the ROCm v2.10 release.
It also covers known issues and deprecated features in the ROCm v3.0 release.
- [What Is ROCm?](#What-Is-ROCm)
* [ROCm Components](#ROCm-Components)
* [Supported Operating Systems](#Supported-Operating-Systems)
* [Important ROCm Links](#Important-ROCm-Links)
- [Whats New in This Release](#Whats-New-in-This-Release)
* [rocBLAS Support for Complex GEMM](#rocBLAS-Support-for-Complex-GEMM)
* [Support for SLES 15 SP1](#Support-for-SLES-15-SP1)
* [Code Marker Support for rocProfiler and rocTracer Libraries](#Code-Marker-Support-for-rocProfiler-and-rocTracer-Libraries)
- [What\'s New in This Release](#Whats-New-in-This-Release)
* [Support for CentOS RHEL v7.7](#centos-anchor)
* [Initial distribution of AOMP 0.7-5 in ROCm v3.0](#aomp-anchor)
* [Fast Fourier Transform Updates](#Fast-Fourier-Transform-Updates)
* [MemCopy Enhancement for rocProf](#MemCopy-Enhancement-for-rocProf)
- [Fixed Issues](#Fixed-Issues)
* [Running TensorFlow and PyTorch Frameworks Consecutively Results in the Memory Access Fault error ](#Running-TensorFlow-and-PyTorch-Frameworks-Consecutively-Results-in-the-Memory-Access-Fault-error)
* [Issue with the Docker Container Environment Variable Setting](#Issue-with-the-Docker-Container-Environment-Variable-Setting)
* [Printf Functionality in ROCm Re-Enabled](#Printf-functionality-in-ROCm-Re-Enabled)
* [MIGraph v05 Graph Optimizer](#MIGraph-v05-Graph-Optimizer)
- [Known Issues](#Known-Issues)
* [Memory Access Fault Error While Running RCCL in Docker Container](#Memory-Access-Fault-Error-While-Running-RCCL-in-Docker-Container)
* [Workaround for Tracer Library Fails to Load on RHEL](#Workaround-for-Tracer-Library-Fails-to-Load-on-RHEL)
* [Installation Issue with Red Hat Enterprise Linux v7.7](#Installation-Issue-with-Red-Hat-Enterprise-Linux-v77)
* [Error While Running rocProfiler on SLES](#Error-While-Running-rocProfiler-on-SLES)
* [gpuOwl Fails with Memory Access Fault Error](#gpuOwl-Fails-with-Memory-Access-Fault-Error)
- [Deprecated Features](#Deprecated-Features)
* [ROCm OpenCL Driver](#ROCm-OpenCL-Driver)
* [Peer-to-Peer Bridge Driver for PeerDirect](#Peer-to-Peer-Bridge-Driver-for-PeerDirect)
[MIOpen](#MIOpen)
* [SCGEMM Convolution Algorithm](#SCGEMM-Convolution-Algorithm)
* [Text-Based Performance Database](#Text-Based-Performance-Database)
- [Deploying ROCm](#Deploying-ROCm)
* [Ubuntu](#Ubuntu)
@@ -44,7 +46,8 @@ Note: You can also clone the source code for individual ROCm components from the
### ROCm Components
The following components for the ROCm platform are released and available for the v2.10 release:
The following components for the ROCm platform are released and available for the v3.0
release:
• Drivers
@@ -58,20 +61,23 @@ You can access the latest supported version of drivers, tools, libraries, and so
https://github.com/RadeonOpenCompute/ROCm
### Supported Operating Systems
The ROCm v2.10.x platform is designed to support the following operating systems:
The ROCm v3.0.x platform is designed to support the following operating systems:
• SLES 15 SP1
• Ubuntu 16.04.6(Kernel 4.15) and 18.04.3(Kernel 5.0)
• CentOS 7.6 (Using devtoolset-7 runtime support)
• CentOS v7.7 (Using devtoolset-7 runtime support)
• RHEL 7.6 (Using devtoolset-7 runtime support)
• RHEL v7.7 (Using devtoolset-7 runtime support)
For details about deploying the ROCm v2.10.x on these operating systems, see the Deploying ROCm section later in the document.
For details about deploying the ROCm v3.0.x on these operating systems, see the Deploying ROCm section later in the document.
### Important ROCm Links
Access the following links for more information on:
• ROCm documentation, see
https://rocm-documentation.readthedocs.io/en/latest/index.html
@@ -83,139 +89,133 @@ https://rocm.github.io/install_issues.html
• Instructions to install PyTorch after ROCm is installed https://rocm-documentation.readthedocs.io/en/latest/Deep_learning/Deep-learning.html#pytorch
Note: These instructions reference the rocm/pytorch:rocm2.9_ubuntu16.04_py2.7_pytorch image. However, you can substitute the Ubuntu 18.04 image listed at https://hub.docker.com/r/rocm/pytorch/tags
Note: These instructions reference the rocm/pytorch:rocm3.0_ubuntu16.04_py2.7_pytorch image. However, you can substitute the Ubuntu 18.04 image listed at https://hub.docker.com/r/rocm/pytorch/tags
## Whats New in This Release
## What\'s New in This Release
### rocBLAS Support for Complex GEMM
The rocBLAS library is a gpu-accelerated implementation of the standard Basic Linear Algebra Subroutines (BLAS). rocBLAS is designed to enable you to develop algorithms, including high performance computing, image analysis, and machine learning.
### Support for CentOS RHEL v7.7 <a id="centos-anchor"></a>
Support is extended for CentOS/RHEL v7.7 in the ROCm v3.0 release. For more information about the CentOS/RHEL v7.7 release, see:
In the AMD ROCm release v2.10, support is extended to the General Matrix Multiply (GEMM) routine for multiple small matrices processed simultaneously for rocBLAS in AMD Radeon Instinct MI50. Both single and double precision, CGEMM and ZGEMM, are now supported in rocBLAS.
### Support for SLES 15 SP1
In the AMD ROCm v2.10 release, support is added for SUSE Linux® Enterprise Server (SLES) 15 SP1. SLES is a modular operating system for both multimodal and traditional IT.
Note: The SUSE Linux® Enterprise Server is a licensed platform. Ensure you have registered and have a license key prior to installation. Use the following SUSE command line to apply your license:
SUSEConnect -r < Key>
#### SLES 15 SP1
The following section tells you how to perform an install and uninstall ROCm on SLES 15 SP 1.
Run the following commands once for a fresh install on the operating system:
sudo usermod -a -G video $LOGNAME
sudo usermod -a -G sudo $LOGNAME
sudo reboot
Installation
1. Install the "dkms" package.
sudo SUSEConnect --product PackageHub/15.1/x86_64
sudo zypper install dkms
2. Add the ROCm repo.
sudo zypper clean --all
sudo zypper addrepo --no-gpgcheck http://repo.radeon.com/rocm/zyp/zypper/ rocm
sudo zypper ref
zypper install rocm-dkms
sudo zypper install rocm-dkms
sudo reboot
#Run the following command once
cat <<EOF | sudo tee /etc/modprobe.d/10-unsupported-modules.conf
allow_unsupported_modules 1
EOF
sudo modprobe amdgpu
3. Verify the ROCm installation.
Run /opt/rocm/bin/rocminfo and /opt/rocm/opencl/bin/x86_64/clinfo commands to list the GPUs and verify that the ROCm installation is successful.
Uninstallation
To uninstall, use the following command:
sudo zypper remove rocm-dkms rock-dkms
#Ensure all other installed packages/components are removed
Note: Ensure all the content in the /opt/rocm directory is completely removed.
[CentOS/RHEL](https://centos.org/forums/viewtopic.php?t=71657)
### Code Marker Support for rocProfiler and rocTracer Libraries
Code markers provide the external correlation ID for the calling thread. This function indicates that the calling thread is entering and leaving an external API region.
### Initial distribution of AOMP 0.7-5 in ROCm v3.0 <a id="aomp-anchor"></a>
The code base for this release of AOMP is the Clang/LLVM 9.0 sources as of October 8th, 2019. The LLVM-project branch used to build this release is AOMP-191008. It is now locked. With this release, an artifact tarball of the entire source tree is created. This tree includes a Makefile in the root directory used to build AOMP from the release tarball. You can use Spack to build AOMP from this source tarball or build manually without Spack.
• The rocProfiler library enables you to profile performance counters and derived metrics. This library supports GFX8/GFX9 and provides a hardware-specific low-level performance analysis interface for profiling of GPU compute applications. The profiling includes hardware performance counters with complex performance metrics.
• The rocTracer library provides a specific runtime profiler to trace API and asynchronous activity. The API provides functionality for registering the runtimes API callbacks and the asynchronous activity records pool support.
• rocTX provides a C API for code markup for performance profiling and supports annotation of code ranges and ASCII markers.
For more information about AOMP 0.7-5, see: [AOMP](https://github.com/ROCm-Developer-Tools/aomp/tree/roc-3.0.0)
## Fixed Issues
Fixed Issues in the v2.10 Release
### Running TensorFlow and PyTorch Frameworks Consecutively Results in the Memory Access Fault error
Issue: Running the TensorFlow and PyTorch frameworks in quick succession results in a Memory Access Fault error.
### Fast Fourier Transform Updates
The Fast Fourier Transform (FFT) is an efficient algorithm for computing the Discrete Fourier Transform. Fast Fourier transforms are used in signal processing, image processing, and many other areas. The following real FFT performance change is made in the ROCm v3.0 release:
Resolution: This issue is resolved, and the error no longer appears.
• Implement efficient real/complex 2D transforms for even lengths.
### Issue with the Docker Container Environment Variable Setting
Issue: Applications fail when the docker container is launched on the NUMA system without the security-opt seccomp=unconfined setting.
Other improvements:
Resolution: Configure the “security-opt seccomp=unconfined” variable setting to avoid this issue.
• More 2D test coverage sizes.
### Printf Functionality in ROCm Re-Enabled
Known issues with hc:printf have been addressed in ROCm v2.10. The hc:printf functionality has now been re-enabled on all supported distros.
• Fix buffer allocation error for large 1D transforms.
## Known Issues
### Memory Access Fault Error While Running RCCL in Docker Container
Issue: The Memory Access Fault error appears while running ROCm Communication Collectives Library (RCCL) tests in the Docker container.
• C++ compatibility improvements.
Resolution: While launching the Docker container to run tests related to RCCL, including PyTorch, increase the size limit for the shared memory (SHM) directory to 1 GB. To increase the size limit of the shared memory directory, enter:
### MemCopy Enhancement for rocProf
In the v3.0 release, the rocProf tool is enhanced with an additional capability to dump asynchronous GPU memcopy information into a .csv file. You can use the '-hsa-trace' option to create the results_mcopy.csv file.
Future enhancements will include column labels.
“--shm-size = 1G”
## Fixed Issues in This Release
### MIGraph v0.5 Graph Optimizer
The ROCm v3.0 release consists of performance updates and minor bug fixes for the MIGraphX graph optimizer.
For more information, see
By default, Docker uses only16 MB of shared memory. Running a Docker container for RCCL requires you to resize the limit to 1 GB.
https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki/Getting-started:-using-the-new-features-of-MIGraphX-0.5
### Workaround for Tracer Library Fails to Load on RHEL
Issue: When running /opt/rocm/bin/rocprof --hip-trace <output filename>, a warning message is printed to console: "Tool lib "/opt/rocm/roctracer/tool/libtracer_tool.so" failed to load", and no output file is generated, on systems with RHEL distro.
Resolution: You can use either of the following workarounds to fix the issue:
## Known Issues in This Release
### Installation Issue with Red Hat Enterprise Linux v7.7
<b>Issue</b>: ROCm installation fails on Red Hat Enterprise Linux (RHEL) v7.7.
• Run Idconfig
<b>Resolution</b>: Ensure the following repo is installed and available prior to installing ROCm on RHEL v7.7:
'SUDO LDCONFIG'
<b>Note</b>:
or
• Configure LD_LIBRARY_PATH
For workstations, use
'EXPORT LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:/OPT/ROCM/ROCTRACER/LIB'
<i>rhel-7-workstation-optional-rpms</i>
For servers, use
<i>rhel-7-server-optional-rpms</i>
<b>To install </b>
<i>$sudo subscription-manager repos --enable=rhel-7-workstation-optional-rpms</i>
|| You will see the following message:
Repository 'rhel-7-workstation-optional-rpms' is enabled for this system.
|| If the following error message appears,
<i>Error: 'rhel-7-workstation-optional-rpms' does not match a valid repository ID. Use "subscription-manager repos --list" to see valid repositories.</i>
|| Use
<i>$sudo subscription-manager repos --enable=rhel-7-server-optional-rpms</i>
|| You will see the following message:
Repository 'rhel-7-server-optional-rpms' is enabled for this system.
### Error While Running rocProfiler on SLES
<b>Issue</b>: Running rocprofiler: hip/hsa trace results in the following error. Note, this issue is noticed only on SLES.
<i>ImportError: No module named sqlite3 </i>
<b>Resolution</b>: The following workarounds are recommended:
<b>Workaround 1</b>
1. Run the following command
<i>sudo vi /opt/rocm/bin/rocprof </i>
2. Change Python to Python3.6.
3. Save and run the test again.
<b>Workaround 2:</b>
• Run the following command:
<i>alias python=python3.6</i>
### gpuOwl Fails with Memory Access Fault Error
<b>Issue</b>: gpuOwL is an OpenCL-based program for testing Mersenne numbers for primality. Currently, running gpuOwl for higher probable prime (PRP) values results in a Memory Access Fault error.
Note, the issue is noticed only when using higher PRP values.
<b>Resolution</b>: As a workaround, you may use lower PRP values.
## Deprecated Features
The following features are deprecated in the AMD ROCm v2.10 release.
The following features are deprecated in the AMD ROCm v3.0 release.
### ROCm OpenCL Driver
The AMD ROCm-OpenCL-Driver is now deprecated. Users should migrate to ROCm-CompilerSupport, which provides more comprehensive functionality. The compiler support repository provides various lightning compiler-related services. It currently contains a single library, the Code Object Manager (Comgr) at lib/comgr.
### MIOpen
ROCm-OpenCL-Driver will no longer be actively maintained after the v2.10 release. If your application was developed with the ROCm-OpenCL-Driver, we would encourage you to switch to the ROCm-CompilerSupport repository.
#### SCGEMM Convolution Algorithm
The SCGEMM convolution algorithm is now disabled by default. This algorithm is deprecated and will be removed in future releases.
### Peer-to-Peer Bridge Driver for PeerDirect
The Peer-to-Peer bridge driver for the PeerDirect feature still works in the current release, however, it is now included as part of the ROCk kernel driver. ROCmRDMA allows third-party kernel drivers to utilize DMA access to the GPU memory. It allows a direct path for data exchange (peer-to-peer) using the standard features of PCI Express.
#### Text-Based Performance Database
An SQLite database has been added to replace the text-based performance database. While the text file still exists, by default, SQLite is used over the text-based performance database. The text-based performance database support is deprecated and will be removed in a future release.
Currently, ROCmRDMA provides the following benefits:
• Direct access to ROCm memory for 3rd party PCIe devices
• Support for PeerDirect(c) interface to offloads the CPU when dealing with ROCm memory for RDMA network stacks
## Deploying ROCm
AMD hosts both Debian and RPM repositories for the ROCm v2.10x packages.
AMD hosts both Debian and RPM repositories for the ROCm v3.0.x packages.
The following directions show how to install ROCm on supported Debian-based systems such as Ubuntu 18.04.
The following directions show how to install ROCm on supported Debian-based systems such as Ubuntu 18.04.x.
Note: These directions may not work as written on unsupported Debian-based distributions. For example, newer versions of Ubuntu may not be compatible with the rock-dkms kernel driver. In this case, you can exclude the rocm-dkms and rock-dkms packages.
@@ -242,13 +242,10 @@ To install from a Debian Repository:
For Debian-based systems like Ubuntu, configure the Debian ROCm repository as follows:
wget -q0
http://repo.radeon.com/rocm/apt/debian/rocm.gpg.key |
wget -qO - http://repo.radeon.com/rocm/apt/debian/rocm.gpg.key | sudo apt-key add -
echo 'deb [arch=amd64] http://repo.radeon.com/rocm/apt/debian/ xenial main' | sudo tee /etc/apt/sources.list.d/rocm.list
sudo apt-key add -echo 'deb [arch=amd64]
http://repo.radeon.com/rocm/apt/debian/ xenial main' |
sudo tee /etc/apt/sources.list.d/rocm.list
The gpg key may change; ensure it is updated when installing a new release. If the key signature verification fails while updating, re-add the key from the ROCm apt repository.
@@ -277,9 +274,10 @@ To install from a Debian Repository:
6. By default, add any future users to the video group. Run the following command to add users to the video group:
echo 'ADD_EXTRA_GROUPS=1' | sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video' | sudo tee -a /etc/adduser.conf
echo 'ADD_EXTRA_GROUPS=1'
sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video'
sudo tee -a /etc/adduser.conf
7. Restart the system.
@@ -301,15 +299,15 @@ https://rocm.github.io/install_issues.html
### Uninstalling ROCm Packages from Ubuntu
To uninstall the ROCm packages from Ubuntu 1v6.04 or Ubuntu v18.04, run the following command:
To uninstall the ROCm packages from Ubuntu 1v6.04 or Ubuntu v18.04.x, run the following command:
sudo apt autoremove rocm-dkms rocm-dev rocm-utils
### Installing Development Packages for Cross Compilation
It is recommended that you develop and test development packages on different systems. For example, some development or build systems may not have an AMD GPU installed. In this scenario, you must avoid installing the ROCk kernel driver on the development system.
### Installing Development Applications for Cross Compilation
It is recommended that you develop and test applications on different systems. For example, some development or build systems may not have an AMD GPU installed. In this scenario, you must avoid installing the ROCk kernel driver on the development system.
Instead, install the following development subset of packages:
Instead, install the following development subset of applications:
sudo apt update
sudo apt install rocm-dev
@@ -322,22 +320,19 @@ You can install the ROCm user-level software without installing the AMD's custom
sudo apt update
sudo apt install rocm-dev
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"' | sudo tee /etc/udev/rules.d/70-kfd.rules
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"'
sudo tee /etc/udev/rules.d/70-kfd.rules
## CentOS RHEL
This section describes how to install ROCm on supported RPM-based systems such as CentOS v7.6.
Note: The following instructions may not work on unsupported RPM-based distributions. For example, Fedora may not be compatible with the rock-dkms kernel driver. You can exclude the rocm-dkms and rock-dkms packages and use the upstream kernel driver instead.
Note: Although support for CentOS/RHEL v7 was added in ROCm v1.8, ROCm requires a special runtime environment provided by the RHEL Software Collections and additional dkms support packages to install and run correctly.
This section describes how to install ROCm on supported RPM-based systems such as CentOS v7.7.
For more details, refer:
https://github.com/RadeonOpenCompute/ROCm/blob/master/README.md#rocm-binary-package-structure
### Preparing RHEL v7 (7.6) for Installation
### Preparing RHEL v7 (7.7) for Installation
RHEL is a subscription-based operating system. You must enable the external repositories to install on the devtoolset-7 environment and the dkms support files.
Note: The following steps do not apply to the CentOS installation.
@@ -346,13 +341,15 @@ Note: The following steps do not apply to the CentOS installation.
2. Enable the following repositories:
sudo subscription-manager repos --enable rhel-server-rhscl-7-rpms
sudo subscription-manager repos --enable rhel-7-server-optional-rpms
sudo subscription-manager repos --enable rhel-7-server-extras-rpms
sudo subscription-manager repos --enable rhel-server-rhscl-7-rpms
sudo subscription-manager repos --enable rhel-7-server-optional-rpms
sudo subscription-manager repos --enable rhel-7-server-extras-rpms
3. Enable additional repositories by downloading and installing the epel-release-latest-7 repository RPM:
sudo rpm -ivh
sudo rpm -ivh
For more details, see
https://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm
@@ -364,9 +361,9 @@ https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/
Note: devtoolset-7 is a software collections package and is not supported by AMD.
### Installing CentOS/RHEL (v7.6) for DKMS
### Installing CentOS/RHEL (v7.7) for DKMS
Use the dkms tool to install the kernel drivers on CentOS/RHEL v7.6:
Use the dkms tool to install the kernel drivers on CentOS/RHEL v7.7:
sudo yum install -y epel-release
sudo yum install -y dkms kernel-headers-`uname -r` kernel-devel-`uname -r`
@@ -380,17 +377,17 @@ To install ROCm on your system, follow the instructions below:
1. Delete the previous versions of ROCm before installing the latest version.
2. Create a /etc/yum.repos.d/rocm.repo file with the following contents:
[ROCm]
name=ROCm
baseurl=http://repo.radeon.com/rocm/yum/rpm
enabled=1
gpgcheck=0
[ROCm]
name=ROCm
baseurl=http://repo.radeon.com/rocm/yum/rpm
enabled=1
gpgcheck=0
Note: The URL of the repository must point to the location of the repositories repodata database.
3. Install ROCm components using the following command:
sudo yum install rocm-dkms
sudo yum install rocm-dkms
4.Restart the system.
The rock-dkms component is installed and the /dev/kfd device is now available.
@@ -400,18 +397,18 @@ To configure permissions, following the instructions below:
1. Ensure that your user account is a member of the "video" or "wheel" group prior to using the ROCm driver. You can find which groups you are a member of with the following command:
groups
groups
2. Add your user to the video (or wheel) group you will need the sudo password and can use the following command:
sudo usermod -a -G video $LOGNAME
sudo usermod -a -G video $LOGNAME
Note: All future users must be added to the "video" group by default. To add the users to the group, run the following commands
echo 'ADD_EXTRA_GROUPS=1' | sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video' | sudo tee -a /etc/adduser.conf
Note: The current release supports CentOS/RHEL v7.6. Before updating to the latest version of the operating system, delete the ROCm packages to avoid DKMS-related issues.
Note: The current release supports CentOS/RHEL v7.7. Before updating to the latest version of the operating system, delete the ROCm packages to avoid DKMS-related issues.
3. Restart the system.
@@ -423,7 +420,8 @@ After restarting the system, run the following commands to verify that the ROCm
Note: Add the ROCm binaries in your PATH for easy implementation of the ROCm programs.
echo 'export PATH=$PATH:/opt/rocm/bin:/opt/rocm/profiler/bin:/opt/rocm/opencl/bin/x86_64' | sudo tee -a /etc/profile.d/rocm.sh
echo 'export PATH=$PATH:/opt/rocm/bin:/opt/rocm/profiler/bin:/opt/rocm/opencl/bin/x86_64' |
sudo tee -a /etc/profile.d/rocm.sh
For more information about installation issues, see:
https://rocm.github.io/install_issues.html
@@ -439,7 +437,7 @@ To compile applications or samples, run the following command to use gcc-7.2 pro
scl enable devtoolset-7 bash
### Uninstalling ROCm from CentOS/RHEL v7.6
### Uninstalling ROCm from CentOS/RHEL v7.7
To uninstall the ROCm packages, run the following command:
sudo yum autoremove rocm-dkms rock-dkms
@@ -455,18 +453,13 @@ Note: To execute ROCm-enabled applications, you will require a system installed
You can install ROCm user-level software without installing AMD's custom ROCk kernel driver. To use the upstream kernel drivers, run the following commands
sudo yum install rocm-dev
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"' | sudo tee /etc/udev/rules.d/70-kfd.rules
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"'
sudo tee /etc/udev/rules.d/70-kfd.rules
Note: You can use this command instead of installing rocm-dkms.
Note: You can use these commands instead of installing rocm-dkms.
### ROCm Installation - Known Issues and Workarounds
#### Docker container environment variable setting
Issue: Applications fail when a Docker container is launched on a NUMA system without --security-opt seccomp=unconfined.
Resolution: Set "--security-opt seccomp=unconfined" to fix this issue.
#### Closed source components
The ROCm platform relies on some closed source components to provide functionalities like HSA image support. These components are only available through the ROCm repositories, and they may be deprecated or become open source components in the future. These components are made available in the following packages:
@@ -490,7 +483,7 @@ The following example shows how to use the repo binary to download the ROCm sour
mkdir -p ~/ROCm/
cd ~/ROCm/
~/bin/repo init -u https://github.com/RadeonOpenCompute/ROCm.git -b roc-2.10.0
~/bin/repo init -u https://github.com/RadeonOpenCompute/ROCm.git -b roc-3.0.0
repo sync
Note: Using this sample code will cause the repo to download the open source code associated with this ROCm release. Ensure that you have ssh-keys configured on your machine for your GitHub ID prior to the download.
@@ -499,7 +492,6 @@ Note: Using this sample code will cause the repo to download the open source cod
Each ROCm component repository contains directions for building that component. You can access the desired component for instructions to build the repository.
### Hardware and Software Support
ROCm is focused on using AMD GPUs to accelerate computational tasks such as machine learning, engineering workloads, and scientific computing.
In order to focus our development efforts on these domains of interest, ROCm supports a targeted set of hardware configurations which are detailed further in this section.
@@ -595,11 +587,11 @@ from the list provided above for compatibility purposes.
### Supported Operating Systems - New operating systems available
The ROCm 2.9.x platform supports the following operating systems:
The ROCm 3.0.x platform supports the following operating systems:
* Ubuntu 16.04.5(Kernel 4.15) and 18.04.3(Kernel 4.15 and Kernel 4.18)
* CentOS 7.6 (Using devtoolset-7 runtime support)
* RHEL 7.6 (Using devtoolset-7 runtime support)
* CentOS 7.7 (Using devtoolset-7 runtime support)
* RHEL 7.7 (Using devtoolset-7 runtime support)
#### ROCm support in upstream Linux kernels
@@ -640,14 +632,11 @@ The releases of the upstream Linux kernel support the following GPUs in ROCm:
• Fiji, Polaris 10, Polaris 11, Vega10
• Fiji, Polaris 10, Polaris 11, Vega10, Vega 7nm
#### Supported Products
• CUDA v8
## Machine Learning and High Performance Computing Software Stack for AMD GPU
ROCm Version 2.10
ROCm Version 3.0
### ROCm Binary Package Structure
@@ -708,33 +697,37 @@ The chain of software installed by these meta-packages is illustrated below
rocm-dkms
|--rock-dkms
\--rocm-dev
|--hsa-rocr-dev
|--hsa-ext-rocr-dev
|--hsakmt-roct
|--hsakmt-roct-dev
|--rocm-cmake
|--rocm-device-libs
|--comgr
|--hcc
|--hip_base
|--hip_doc
|--hip_hcc
|--hip_samples
|--rocm-smi
|--hsakmt-roct
|--hsakmt-roct-dev
|--hsa-amd-aqlprofile
|--comgr
|--hsa-ext-rocr-dev
|--hsa-rocr-dev
|--rocm-cmake
|--rocm-device-libs
|--rocm-smi
|--rocprofiler-dev
|--rocr_debug_agent
\--rocm-utils
|--rocminfo
\--rocm-clang-ocl # This will cause OpenCL to be installed
rocm-libs
|--rocalution
|--hipblas
|--hipcub
|--hipsparse
|--rocalution
|--rocblas
|--rocfft
|--rocprim
|--rocrand
|--hipsparse
\--rocsparse
|--rocsparse
\--rocthrust
```
These meta-packages are not required but may be useful to make it easier to install ROCm on most systems.
@@ -751,57 +744,55 @@ Drivers, ToolChains, Libraries, and Source Code
The latest supported version of the drivers, tools, libraries and source code for the ROCm platform have been released and are available from the following GitHub repositories:
#### ROCm Core Components
- [ROCk Kernel Driver](https://github.com/RadeonOpenCompute/ROCK-Kernel-Driver/tree/roc-2.10.0)
- [ROCr Runtime](https://github.com/RadeonOpenCompute/ROCR-Runtime/tree/roc-2.10.0)
- [ROCt Thunk Interface](https://github.com/RadeonOpenCompute/ROCT-Thunk-Interface/tree/roc-2.10.0)
- [ROCk Kernel Driver](https://github.com/RadeonOpenCompute/ROCK-Kernel-Driver/tree/roc-3.0.0)
- [ROCr Runtime](https://github.com/RadeonOpenCompute/ROCR-Runtime/tree/roc-3.0.0)
- [ROCt Thunk Interface](https://github.com/RadeonOpenCompute/ROCT-Thunk-Interface/tree/roc-3.0.0)
#### ROCm Support Software
- [ROCm SMI](https://github.com/RadeonOpenCompute/ROC-smi/tree/roc-2.10.0)
- [ROCm cmake](https://github.com/RadeonOpenCompute/rocm-cmake/tree/roc-2.10.0)
- [rocminfo](https://github.com/RadeonOpenCompute/rocminfo/tree/roc-2.10.0)
- [ROCm Bandwidth Test](https://github.com/RadeonOpenCompute/rocm_bandwidth_test/tree/roc-2.10.0)
- [ROCm SMI](https://github.com/RadeonOpenCompute/ROC-smi/tree/roc-3.0.0)
- [ROCm cmake](https://github.com/RadeonOpenCompute/rocm-cmake/tree/roc-3.0.0)
- [rocminfo](https://github.com/RadeonOpenCompute/rocminfo/tree/roc-3.0.0)
- [ROCm Bandwidth Test](https://github.com/RadeonOpenCompute/rocm_bandwidth_test/tree/roc-3.0.0)
#### ROCm Development ToolChains
- [HCC compiler](https://github.com/RadeonOpenCompute/hcc/tree/roc-hcc-2.10.0)
- [HIP](https://github.com/ROCm-Developer-Tools/HIP/tree/roc-2.10.0)
- [ROCm Device Libraries](https://github.com/RadeonOpenCompute/ROCm-Device-Libs/tree/roc-hcc-2.10.0)
- [HCC compiler](https://github.com/RadeonOpenCompute/hcc/tree/roc-hcc-3.0.0)
- [HIP](https://github.com/ROCm-Developer-Tools/HIP/tree/roc-3.0.0)
- [ROCm Device Libraries](https://github.com/RadeonOpenCompute/ROCm-Device-Libs/tree/roc-hcc-3.0.0)
- ROCm OpenCL, which is created from the following components:
- [ROCm OpenCL Runtime](http://github.com/RadeonOpenCompute/ROCm-OpenCL-Runtime/tree/roc-2.10.0)
- [ROCm OpenCL Driver](http://github.com/RadeonOpenCompute/ROCm-OpenCL-Driver/tree/roc-2.10.0)
- The ROCm OpenCL compiler, which is created from the following components:
- [ROCm LLVM OCL](http://github.com/RadeonOpenCompute/llvm/tree/roc-ocl-2.10.0)
- [ROCm LLVM HCC](http://github.com/RadeonOpenCompute/llvm/tree/roc-hcc-2.10.0)
- [ROCm Clang](http://github.com/RadeonOpenCompute/clang/tree/roc-2.10.0)
- [ROCm lld OCL](http://github.com/RadeonOpenCompute/lld/tree/roc-ocl-2.10.0)
- [ROCm lld HCC](http://github.com/RadeonOpenCompute/lld/tree/roc-hcc-2.10.0)
- [ROCm Device Libraries](https://github.com/RadeonOpenCompute/ROCm-Device-Libs/tree/roc-ocl-2.10.0)
- [ROCM Clang-OCL Kernel Compiler](https://github.com/RadeonOpenCompute/clang-ocl/tree/roc-2.10.0)
- [Asynchronous Task and Memory Interface (ATMI)](https://github.com/RadeonOpenCompute/atmi/tree/rocm_2.10.0)
- [ROCr Debug Agent](https://github.com/ROCm-Developer-Tools/rocr_debug_agent/tree/roc-2.10.0)
- [ROCm Code Object Manager](https://github.com/RadeonOpenCompute/ROCm-CompilerSupport/tree/roc-2.10.0)
- [ROC Profiler](https://github.com/ROCm-Developer-Tools/rocprofiler/tree/roc-2.10.0)
- [ROC Tracer](https://github.com/ROCm-Developer-Tools/roctracer/tree/roc-2.10.x)
- [ROCm OpenCL Runtime](http://github.com/RadeonOpenCompute/ROCm-OpenCL-Runtime/tree/roc-3.0.0)
- The ROCm OpenCL compiler, which is created from the following components:
- [ROCm LLVM OCL](http://github.com/RadeonOpenCompute/llvm-project/tree/roc-ocl-3.0.0)
- [ROCm Device Libraries](https://github.com/RadeonOpenCompute/ROCm-Device-Libs/tree/roc-ocl-3.0.0)
- [ROCM Clang-OCL Kernel Compiler](https://github.com/RadeonOpenCompute/clang-ocl/tree/roc-3.0.0)
- [Asynchronous Task and Memory Interface (ATMI)](https://github.com/RadeonOpenCompute/atmi/tree/rocm_3.0.0)
- [ROCr Debug Agent](https://github.com/ROCm-Developer-Tools/rocr_debug_agent/tree/roc-3.0.0)
- [ROCm Code Object Manager](https://github.com/RadeonOpenCompute/ROCm-CompilerSupport/tree/roc-3.0.0)
- [ROC Profiler](https://github.com/ROCm-Developer-Tools/rocprofiler/tree/roc-3.0.0)
- [ROC Tracer](https://github.com/ROCm-Developer-Tools/roctracer/tree/roc-3.0.x)
- [AOMP](https://github.com/ROCm-Developer-Tools/aomp/tree/roc-3.0.0)
- [Radeon Compute Profiler](https://github.com/GPUOpen-Tools/RCP/tree/3a49405)
- [ROCmValidationSuite](https://github.com/ROCm-Developer-Tools/ROCmValidationSuite/tree/roc-3.0.0)
- Example Applications:
- [HCC Examples](https://github.com/ROCm-Developer-Tools/HCC-Example-Application/tree/ffd65333)
- [HIP Examples](https://github.com/ROCm-Developer-Tools/HIP-Examples/tree/roc-2.10.0)
- [HIP Examples](https://github.com/ROCm-Developer-Tools/HIP-Examples/tree/roc-3.0.0)
#### ROCm Libraries
- [rocBLAS](https://github.com/ROCmSoftwarePlatform/rocBLAS/tree/rocm-2.10)
- [hipBLAS](https://github.com/ROCmSoftwarePlatform/hipBLAS/tree/rocm-2.10)
- [rocFFT](https://github.com/ROCmSoftwarePlatform/rocFFT/tree/rocm-2.10)
- [rocRAND](https://github.com/ROCmSoftwarePlatform/rocRAND/tree/2.10.0)
- [rocSPARSE](https://github.com/ROCmSoftwarePlatform/rocSPARSE/tree/rocm-2.10)
- [hipSPARSE](https://github.com/ROCmSoftwarePlatform/hipSPARSE/tree/rocm-2.10)
- [rocALUTION](https://github.com/ROCmSoftwarePlatform/rocALUTION/tree/rocm-2.10)
- [rocBLAS](https://github.com/ROCmSoftwarePlatform/rocBLAS/tree/rocm-3.0)
- [hipBLAS](https://github.com/ROCmSoftwarePlatform/hipBLAS/tree/rocm-3.0)
- [rocFFT](https://github.com/ROCmSoftwarePlatform/rocFFT/tree/rocm-3.0)
- [rocRAND](https://github.com/ROCmSoftwarePlatform/rocRAND/tree/3.0.0)
- [rocSPARSE](https://github.com/ROCmSoftwarePlatform/rocSPARSE/tree/rocm-3.0)
- [hipSPARSE](https://github.com/ROCmSoftwarePlatform/hipSPARSE/tree/rocm-3.0)
- [rocALUTION](https://github.com/ROCmSoftwarePlatform/rocALUTION/tree/rocm-3.0)
- [MIOpenGEMM](https://github.com/ROCmSoftwarePlatform/MIOpenGEMM/tree/6275a879)
- [MIOpen](https://github.com/ROCmSoftwarePlatform/MIOpen/tree/roc-2.10.0)
- [rocThrust](https://github.com/ROCmSoftwarePlatform/rocThrust/tree/2.10.0)
- [ROCm SMI Lib](https://github.com/RadeonOpenCompute/rocm_smi_lib/tree/roc-2.10.0)
- [RCCL](https://github.com/ROCmSoftwarePlatform/rccl/tree/2.10.0)
- [MIOpen](https://github.com/ROCmSoftwarePlatform/MIOpen/tree/roc-3.0.0)
- [rocThrust](https://github.com/ROCmSoftwarePlatform/rocThrust/tree/3.0.0)
- [ROCm SMI Lib](https://github.com/RadeonOpenCompute/rocm_smi_lib/tree/roc.3.0.0)
- [RCCL](https://github.com/ROCmSoftwarePlatform/rccl/tree/3.0.0)
- [MIVisionX](https://github.com/GPUOpen-ProfessionalCompute-Libraries/MIVisionX/tree/1.5)
- [hipCUB](https://github.com/ROCmSoftwarePlatform/hipCUB/tree/2.10.0)
- [hipCUB](https://github.com/ROCmSoftwarePlatform/hipCUB/tree/3.0.0)
- [AMDMIGraphX](https://github.com/ROCmSoftwarePlatform/AMDMIGraphx/tree/0.5-hip-hcc)
Features and enhancements introduced in previous versions of ROCm can be found in [version_history.md](version_history.md)

View File

@@ -12,7 +12,7 @@
<remote name="gpuopen-tools"
fetch="https://github.com/GPUOpen-Tools/" />
<default revision="refs/tags/roc-2.10.0"
<default revision="refs/tags/roc-3.0.0"
remote="roc-github"
sync-c="true"
sync-j="4" />
@@ -24,53 +24,63 @@
<project name="rocm-cmake" />
<project name="rocminfo" />
<project name="rocprofiler" remote="rocm-devtools" revision="refs/tags/roc-2.10.0" />
<project name="roctracer" remote="rocm-devtools" revision="roc-2.10.x" />
<project name="rocprofiler" remote="rocm-devtools" revision="refs/tags/roc-3.0.0" />
<project name="roctracer" remote="rocm-devtools" revision="roc-3.0.x" />
<!-- If you want to get the full OpenCL runtime, there is a separate repo
manifest that is more authoritative than the copy in this file. It can
be found at the following URL:
https://github.com/RadeonOpenCompute/ROCm-OpenCL-Runtime/blob/roc-2.0.0/opencl.xml -->
<remote name="KhronosGroup" fetch="https://github.com/KhronosGroup/" />
<project name="ROCm-OpenCL-Runtime" />
<project path="ROCm-OpenCL-Runtime/compiler/driver" name="ROCm-OpenCL-Driver"/>
<project path="ROCm-OpenCL-Runtime/compiler/llvm" name="llvm" revision="refs/tags/roc-ocl-2.10.0" />
<project path="ROCm-OpenCL-Runtime/compiler/llvm/tools/clang" name="clang" />
<project path="ROCm-OpenCL-Runtime/compiler/llvm/tools/lld" name="lld" revision="refs/tags/roc-ocl-2.10.0" />
<project path="ROCm-OpenCL-Runtime/library/amdgcn" name="ROCm-Device-Libs" revision="refs/tags/roc-ocl-2.10.0" />
<project path="ROCm-OpenCL-Runtime/api/opencl/khronos/icd" name="OpenCL-ICD-Loader" remote="KhronosGroup" revision="6c03f8b58fafd9dd693eaac826749a5cfad515f8" />
<project name="clang-ocl" />
<!-- HCC needs to be recursively synced to get it submodules -->
<project name="hcc" sync-s="true" revision="refs/tags/roc-hcc-2.10.0" />
<project name="hcc" sync-s="true" revision="refs/tags/roc-hcc-3.0.0" />
<project name="HCC-Example-Application" remote="rocm-devtools" revision="ffd6533305e79eed667badd3c4cdb7879a1281b8" />
<project name="HIP" remote="rocm-devtools" revision="refs/tags/roc-2.10.0" />
<project name="HIP" remote="rocm-devtools" revision="refs/tags/roc-3.0.0" />
<project name="HIP-Examples" remote="rocm-devtools" />
<!-- The following projects are all associated with the AMDGPU LLVM compiler -->
<project name="llvm" path="llvm_amd-common" revision="refs/tags/roc-ocl-2.10.0" />
<project name="lld" path="llvm_amd-common/lld" revision="refs/tags/roc-ocl-2.10.0" />
<project name="clang" path="llvm_amd-common/clang" />
<project name="ROCm-Device-Libs" revision="refs/tags/roc-ocl-2.10.0" />
<project name="atmi" revision="refs/tags/rocm_2.10.0" />
<project name="llvm-project" path="llvm_amd-stg-open" revision="refs/tags/roc-ocl-3.0.0"/>
<project name="ROCm-Device-Libs" revision="refs/tags/roc-ocl-3.0.0"/>
<project name="atmi" revision="refs/tags/rocm_3.0.0" />
<project name="ROCm-CompilerSupport" />
<project name="rocr_debug_agent" remote="rocm-devtools" />
<project name="rocm_bandwidth_test" />
<project name="RCP" remote="gpuopen-tools" revision="3a49405a1500067c49d181844ec90aea606055bb" />
<!-- ROCm Libraries -->
<project name="rocBLAS" remote="rocm-swplat" revision="refs/tags/rocm-2.10" />
<project name="hipBLAS" remote="rocm-swplat" revision="refs/tags/rocm-2.10" />
<project name="rocFFT" remote="rocm-swplat" revision="refs/tags/rocm-2.10" />
<project name="rocRAND" remote="rocm-swplat" revision="refs/tags/2.10.0" />
<project name="rocSPARSE" remote="rocm-swplat" revision="refs/tags/rocm-2.10" />
<project name="hipSPARSE" remote="rocm-swplat" revision="refs/tags/rocm-2.10" />
<project name="rocALUTION" remote="rocm-swplat" revision="refs/tags/rocm-2.10" />
<project name="rocBLAS" remote="rocm-swplat" revision="refs/tags/rocm-3.0" />
<project name="hipBLAS" remote="rocm-swplat" revision="refs/tags/rocm-3.0" />
<project name="rocFFT" remote="rocm-swplat" revision="refs/tags/rocm-3.0" />
<project name="rocRAND" remote="rocm-swplat" revision="refs/tags/3.0.0" />
<project name="rocSPARSE" remote="rocm-swplat" revision="refs/tags/rocm-3.0" />
<project name="hipSPARSE" remote="rocm-swplat" revision="refs/tags/rocm-3.0" />
<project name="rocALUTION" remote="rocm-swplat" revision="refs/tags/rocm-3.0" />
<project name="MIOpenGEMM" remote="rocm-swplat" revision="6275a879995b58a6e0b8cca7b1ad8421a5e02ade" />
<project name="MIOpen" remote="rocm-swplat" revision="refs/tags/roc-2.10.0" />
<project name="rocm_smi_lib" />
<project name="rccl" remote="rocm-swplat" revision="refs/tags/2.10.0" />
<project name="MIOpen" remote="rocm-swplat" revision="refs/tags/roc-3.0.0" />
<project name="rocm_smi_lib" remote="roc-github" revision="refs/tags/roc.3.0.0" />
<project name="rccl" remote="rocm-swplat" revision="refs/tags/3.0.0" />
<project name="MIVisionX" remote="gpuopen-libs" revision="refs/tags/1.5" />
<project name="rocThrust" remote="rocm-swplat" revision="refs/tags/2.10.0" />
<project name="hipCUB" remote="rocm-swplat" revision="refs/tags/2.10.0" />
<project name="rocPRIM" remote="rocm-swplat" revision="refs/tags/2.10.0" />
<project name="rocThrust" remote="rocm-swplat" revision="refs/tags/3.0.0" />
<project name="hipCUB" remote="rocm-swplat" revision="refs/tags/3.0.0" />
<project name="rocPRIM" remote="rocm-swplat" revision="refs/tags/3.0.0" />
<project name="AMDMIGraphX" remote="rocm-swplat" revision="0.5-hip-hcc" />
<project name="ROCmValidationSuite" remote="rocm-devtools" revision="refs/tags/roc-3.0.0" />
<!-- Projects for AOMP -->
<project name="ROCT-Thunk-Interface" path="aomp/roct-thunk-interface" remote="roc-github" revision="refs/tags/roc-3.0.0" />
<project name="ROCR-Runtime" path="aomp/rocr-runtime" remote="roc-github" revision="refs/tags/roc-3.0.0" />
<project name="hcc" path="aomp/hcc" remote="roc-github" revision="refs/tags/roc-hcc-3.0.0" />
<project name="ROCm-Device-Libs" path="aomp/rocm-device-libs" remote="roc-github" revision="refs/tags/roc-ocl-2.9.0" />
<project name="ROCm-CompilerSupport" path="aomp/rocm-compilersupport" remote="roc-github" revision="refs/tags/roc-2.9.0" />
<project name="rocminfo" path="aomp/rocminfo" remote="roc-github" revision="a578b432b1d13ef0fd7803a5d3afa0e7895629d1" />
<project name="atmi" path="aomp/atmi" remote="roc-github" revision="refs/tags/rocm_3.0.0" />
<project name="HIP" path="aomp/hip" remote="rocm-devtools" revision="refs/tags/roc-2.9.0" />
<project name="aomp" path="aomp/aomp" remote="rocm-devtools" revision="refs/tags/roc-3.0.0" />
<project name="aomp-extras" path="aomp/aomp-extras" remote="rocm-devtools" revision="refs/tags/roc-3.0.0" />
<project name="flang" path="aomp/flang" remote="rocm-devtools" revision="refs/tags/roc-3.0.0" />
<project name="llvm-project" path="aomp/llvm-project" remote="rocm-devtools" revision="refs/tags/roc-aomp-3.0.0" />
</manifest>

View File

@@ -2,6 +2,7 @@
This file contains archived version history information for the [ROCm project](https://github.com/RadeonOpenCompute/ROCm)
### Current ROCm Version: 2.10
- [New features and enhancements in ROCm v2.10](#new-features-and-enhancements-in-rocm-v210)
- [New features and enhancements in ROCm 2.9](#new-features-and-enhancements-in-rocm-29)
- [New features and enhancements in ROCm 2.8](#new-features-and-enhancements-in-rocm-28)
- [New features and enhancements in ROCm 2.7.2](#new-features-and-enhancements-in-rocm-272)
@@ -23,7 +24,17 @@ This file contains archived version history information for the [ROCm project](h
- [New Features as of ROCm 1.5](#new-features-as-of-rocm-15)
### New features and enhancements in ROCm v2.10
#### rocBLAS Support for Complex GEMM
The rocBLAS library is a gpu-accelerated implementation of the standard Basic Linear Algebra Subroutines (BLAS). rocBLAS is designed to enable you to develop algorithms, including high performance computing, image analysis, and machine learning.
In the AMD ROCm release v2.10, support is extended to the General Matrix Multiply (GEMM) routine for multiple small matrices processed simultaneously for rocBLAS in AMD Radeon Instinct MI50. Both single and double precision, CGEMM and ZGEMM, are now supported in rocBLAS.
#### Support for SLES 15 SP1
In the AMD ROCm v2.10 release, support is added for SUSE Linux® Enterprise Server (SLES) 15 SP1. SLES is a modular operating system for both multimodal and traditional IT.
#### Code Marker Support for rocProfiler and rocTracer Libraries
Code markers provide the external correlation ID for the calling thread. This function indicates that the calling thread is entering and leaving an external API region.
### New features and enhancements in ROCm 2.9