mirror of
https://github.com/ROCm/ROCm.git
synced 2026-01-10 15:18:11 -05:00
Compare commits
15 Commits
docs/5.1.0
...
docs/5.0.2
| Author | SHA1 | Date | |
|---|---|---|---|
|
|
91b4c3a773 | ||
|
|
61027c5819 | ||
|
|
30b20be20b | ||
|
|
14388ff349 | ||
|
|
8b75fce399 | ||
|
|
693042f6da | ||
|
|
6c72380d7d | ||
|
|
8464b5820e | ||
|
|
653568a662 | ||
|
|
24db72b8a8 | ||
|
|
4668632fa2 | ||
|
|
270bc73661 | ||
|
|
a7ce874940 | ||
|
|
e78e6a9a23 | ||
|
|
4bf9dc9560 |
@@ -3,12 +3,19 @@
|
||||
|
||||
version: 2
|
||||
|
||||
build:
|
||||
os: ubuntu-22.04
|
||||
tools:
|
||||
python: "3.10"
|
||||
apt_packages:
|
||||
- "doxygen"
|
||||
- "graphviz" # For dot graphs in doxygen
|
||||
|
||||
python:
|
||||
install:
|
||||
- requirements: docs/sphinx/requirements.txt
|
||||
|
||||
sphinx:
|
||||
configuration: docs/conf.py
|
||||
|
||||
formats: [htmlzip, pdf, epub]
|
||||
|
||||
python:
|
||||
version: "3.8"
|
||||
install:
|
||||
- requirements: docs/sphinx/requirements.txt
|
||||
formats: []
|
||||
|
||||
553
CHANGELOG.md
553
CHANGELOG.md
@@ -15,559 +15,6 @@ The release notes for the ROCm platform.
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.1.0
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
<!-- markdownlint-disable no-blanks-blockquote -->
|
||||
### What's New in This Release
|
||||
|
||||
#### HIP Enhancements
|
||||
|
||||
The ROCm v5.1 release consists of the following HIP enhancements.
|
||||
|
||||
##### HIP Installation Guide Updates
|
||||
|
||||
The HIP Installation Guide is updated to include installation and building HIP from source on the AMD and NVIDIA platforms.
|
||||
|
||||
Refer to the HIP Installation Guide v5.1 for more details.
|
||||
|
||||
##### Support for HIP Graph
|
||||
|
||||
ROCm v5.1 extends support for HIP Graph.
|
||||
|
||||
##### Planned Changes for HIP in Future Releases
|
||||
|
||||
###### Separation of hiprtc (libhiprtc) library from hip runtime (amdhip64)
|
||||
|
||||
On ROCm/Linux, to maintain backward compatibility, the hipruntime library (amdhip64) will continue to include hiprtc symbols in future releases. The backward compatible support may be discontinued by removing hiprtc symbols from the hipruntime library (amdhip64) in the next major release.
|
||||
|
||||
###### hipDeviceProp_t Structure Enhancements
|
||||
|
||||
Changes to the hipDeviceProp_t structure in the next major release may result in backward incompatibility. More details on these changes will be provided in subsequent releases.
|
||||
|
||||
#### ROCDebugger Enhancements
|
||||
|
||||
##### Multi-language Source Level Debugger
|
||||
|
||||
The compiler now generates a source-level variable and function argument debug information.
|
||||
|
||||
The accuracy is guaranteed if the compiler options `-g -O0` are used and apply only to HIP.
|
||||
|
||||
This enhancement enables ROCDebugger users to interact with the HIP source-level variables and function arguments.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> The newly-suggested compiler -g option must be used instead of the previously-suggested `-ggdb` option. Although the effect of these two options is currently equivalent, this is not guaranteed for the future and might get changed by the upstream LLVM community.
|
||||
|
||||
##### Machine Interface Lanes Support
|
||||
|
||||
ROCDebugger Machine Interface (MI) extends support to lanes. The following enhancements are made:
|
||||
|
||||
- Added a new -lane-info command, listing the current thread's lanes.
|
||||
|
||||
- The -thread-select command now supports a lane switch to switch to a specific lane of a thread:
|
||||
|
||||
```sh
|
||||
-thread-select -l LANE THREAD
|
||||
```
|
||||
|
||||
- The =thread-selected notification gained a lane-id attribute. This enables the frontend to know which lane of the thread was selected.
|
||||
|
||||
- The *stopped asynchronous record gained lane-id and hit-lanes attributes. The former indicates which lane is selected, and the latter indicates which lanes explain the stop.
|
||||
|
||||
- MI commands now accept a global --lane option, similar to the global --thread and --frame options.
|
||||
|
||||
- MI varobjs are now lane-aware.
|
||||
|
||||
For more information, refer to the ROC Debugger User Guide at <https://docs.amd.com>.
|
||||
|
||||
##### Enhanced - clone-inferior Command
|
||||
|
||||
The clone-inferior command now ensures that the TTY, CMD, ARGS, and AMDGPU PRECISE-MEMORY settings are copied from the original inferior to the new one. All modifications to the environment variables done using the 'set environment' or 'unset environment' commands are also copied to the new inferior.
|
||||
|
||||
#### MIOpen Support for RDNA GPUs
|
||||
|
||||
This release includes support for AMD Radeon™ Pro W6800, in addition to other bug fixes and performance improvements as listed below:
|
||||
|
||||
- MIOpen now supports RDNA GPUs!! (via MIOpen PRs 973, 780, 764, 740, 739, 677, 660, 653, 493, 498)
|
||||
|
||||
- Fixed a correctness issue with ImplicitGemm algorithm
|
||||
|
||||
- Updated the performance data for new kernel versions
|
||||
|
||||
- Improved MIOpen build time by splitting large kernel header files
|
||||
|
||||
- Fixed an issue in reduction kernels for padded tensors
|
||||
|
||||
- Various other bug fixes and performance improvements
|
||||
|
||||
For more information, see <https://docs.amd.com/bundle/MIOpen_gh-pages/page/releasenotes.html>
|
||||
|
||||
#### Checkpoint Restore Support With CRIU
|
||||
|
||||
The new Checkpoint Restore in Userspace (CRIU) functionality is implemented to support AMD GPU and ROCm applications.
|
||||
|
||||
CRIU is a userspace tool to Checkpoint and Restore an application.
|
||||
|
||||
CRIU lacked the support for checkpoint restore applications that used device files such as a GPU. With this ROCm release, CRIU is enhanced with a new plugin to support AMD GPUs, which includes:
|
||||
|
||||
- Single and Multi GPU systems (Gfx9)
|
||||
|
||||
- Checkpoint / Restore on a different system
|
||||
|
||||
- Checkpoint / Restore inside a docker container
|
||||
|
||||
- PyTorch
|
||||
|
||||
- Tensorflow
|
||||
|
||||
- Using CRIU Image Streamer
|
||||
|
||||
For more information, refer to <https://github.com/checkpoint-restore/criu/tree/criu-dev/plugins/amdgpu>
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> The CRIU plugin (amdgpu_plugin) is merged upstream with the CRIU repository. The KFD kernel patches are also available upstream with the amd-staging-drm-next branch (public) and the ROCm 5.1 release branch.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> This is a Beta release of the Checkpoint and Restore functionality, and some features are not available in this release.
|
||||
|
||||
For more information, refer to the following websites:
|
||||
|
||||
- <https://github.com/RadeonOpenCompute/criu/blob/amdgpu_plugin-03252022/Documentation/amdgpu_plugin.txt>
|
||||
|
||||
- <https://criu.org/Main_Page>
|
||||
|
||||
### Fixed Defects
|
||||
|
||||
The following defects are fixed in this release.
|
||||
|
||||
#### Driver Fails To Load after Installation
|
||||
|
||||
The issue with the driver failing to load after ROCm installation is now fixed.
|
||||
|
||||
The driver installs successfully, and the server reboots with working rocminfo and clinfo.
|
||||
|
||||
#### ROCDebugger Fixed Defects
|
||||
|
||||
##### Breakpoints in GPU kernel code Before Kernel Is Loaded
|
||||
|
||||
Previously, setting a breakpoint in device code by line number before the device code was loaded into the program resulted in ROCgdb incorrectly moving the breakpoint to the first following line that contains host code.
|
||||
|
||||
Now, the breakpoint is left pending. When the GPU kernel gets loaded, the breakpoint resolves to a location in the kernel.
|
||||
|
||||
##### Registers Invalidated After Write
|
||||
|
||||
Previously, the stale just-written value was presented as a current value.
|
||||
|
||||
ROCgdb now invalidates the cached values of registers whose content might differ after being written. For example, registers with read-only bits.
|
||||
|
||||
ROCgdb also invalidates all volatile registers when a volatile register is written. For example, writing VCC invalidates the content of STATUS as STATUS.VCCZ may change.
|
||||
|
||||
##### Scheduler-locking and GPU Wavefronts
|
||||
|
||||
When scheduler-locking is in effect, new wavefronts created by a resumed thread, CPU, or GPU wavefront, are held in the halt state. For example, the "set scheduler-locking" command.
|
||||
|
||||
##### ROCDebugger Fails Before Completion of Kernel Execution
|
||||
|
||||
It was possible (although erroneous) for a debugger to load GPU code in memory, send it to the device, start executing a kernel on the device, and dispose of the original code before the kernel had finished execution. If a breakpoint was hit after this point, the debugger failed with an internal error while trying to access the debug information.
|
||||
|
||||
This issue is now fixed by ensuring that the debugger keeps a local copy of the original code and debug information.
|
||||
|
||||
### Known Issues
|
||||
|
||||
#### Random Memory Access Fault Errors Observed While Running Math Libraries Unit Tests
|
||||
|
||||
**Issue:** Random memory access fault issues are observed while running Math libraries unit tests. This issue is encountered in ROCm v5.0, ROCm v5.0.1, and ROCm v5.0.2.
|
||||
|
||||
Note, the faults only occur in the SRIOV environment.
|
||||
|
||||
**Workaround:** Use SDMA to update the page table. The Guest set up steps are as follows:
|
||||
|
||||
```sh
|
||||
sudo modprobe amdgpu vm_update_mode=0
|
||||
```
|
||||
|
||||
To verify, use
|
||||
|
||||
**Guest:**
|
||||
|
||||
```sh
|
||||
cat /sys/module/amdgpu/parameters/vm_update_mode 0
|
||||
```
|
||||
|
||||
Where expectation is 0.
|
||||
|
||||
#### CU Masking Causes Application to Freeze
|
||||
|
||||
Using CU Masking results in an application freeze or runs exceptionally slowly. This issue is noticed only in the GFX10 suite of products. Note, this issue is observed only in GFX10 suite of products.
|
||||
|
||||
This issue is under active investigation at this time.
|
||||
|
||||
#### Failed Checkpoint in Docker Containers
|
||||
|
||||
A defect with Ubuntu images kernel-5.13-30-generic and kernel-5.13-35-generic with Overlay FS results in incorrect reporting of the mount ID.
|
||||
|
||||
This issue with Ubuntu causes CRIU checkpointing to fail in Docker containers.
|
||||
|
||||
As a workaround, use an older version of the kernel. For example, Ubuntu 5.11.0-46-generic.
|
||||
|
||||
#### Issue with Restoring Workloads Using Cooperative Groups Feature
|
||||
|
||||
Workloads that use the cooperative groups function to ensure all waves can be resident at the same time may fail to restore correctly.
|
||||
This issue is under investigation and will be fixed in a future release.
|
||||
|
||||
#### Radeon Pro V620 and W6800 Workstation GPUs
|
||||
|
||||
##### No Support for ROCDebugger on SRIOV
|
||||
|
||||
ROCDebugger is not supported in the SRIOV environment on any GPU.
|
||||
|
||||
This is a known issue and will be fixed in a future release.
|
||||
|
||||
#### Random Error Messages in ROCm SMI for SR-IOV
|
||||
|
||||
Random error messages are generated by unsupported functions or commands.
|
||||
|
||||
This is a known issue and will be fixed in a future release.
|
||||
|
||||
### Library Changes in ROCM 5.1.0
|
||||
|
||||
| Library | Version |
|
||||
|---------|---------|
|
||||
| hipBLAS | 0.49.0 ⇒ [0.50.0](https://github.com/ROCmSoftwarePlatform/hipBLAS/releases/tag/rocm-5.1.0) |
|
||||
| hipCUB | 2.10.13 ⇒ [2.11.0](https://github.com/ROCmSoftwarePlatform/hipCUB/releases/tag/rocm-5.1.0) |
|
||||
| hipFFT | 1.0.4 ⇒ [1.0.7](https://github.com/ROCmSoftwarePlatform/hipFFT/releases/tag/rocm-5.1.0) |
|
||||
| hipSOLVER | 1.2.0 ⇒ [1.3.0](https://github.com/ROCmSoftwarePlatform/hipSOLVER/releases/tag/rocm-5.1.0) |
|
||||
| hipSPARSE | 2.0.0 ⇒ [2.1.0](https://github.com/ROCmSoftwarePlatform/hipSPARSE/releases/tag/rocm-5.1.0) |
|
||||
| rccl | 2.10.3 ⇒ [2.11.4](https://github.com/ROCmSoftwarePlatform/rccl/releases/tag/rocm-5.1.0) |
|
||||
| rocALUTION | 2.0.1 ⇒ [2.0.2](https://github.com/ROCmSoftwarePlatform/rocALUTION/releases/tag/rocm-5.1.0) |
|
||||
| rocBLAS | 2.42.0 ⇒ [2.43.0](https://github.com/ROCmSoftwarePlatform/rocBLAS/releases/tag/rocm-5.1.0) |
|
||||
| rocFFT | 1.0.13 ⇒ [1.0.16](https://github.com/ROCmSoftwarePlatform/rocFFT/releases/tag/rocm-5.1.0) |
|
||||
| rocPRIM | 2.10.12 ⇒ [2.10.13](https://github.com/ROCmSoftwarePlatform/rocPRIM/releases/tag/rocm-5.1.0) |
|
||||
| rocRAND | 2.10.12 ⇒ [2.10.13](https://github.com/ROCmSoftwarePlatform/rocRAND/releases/tag/rocm-5.1.0) |
|
||||
| rocSOLVER | 3.16.0 ⇒ [3.17.0](https://github.com/ROCmSoftwarePlatform/rocSOLVER/releases/tag/rocm-5.1.0) |
|
||||
| rocSPARSE | 2.0.0 ⇒ [2.1.0](https://github.com/ROCmSoftwarePlatform/rocSPARSE/releases/tag/rocm-5.1.0) |
|
||||
| rocThrust | 2.13.0 ⇒ [2.14.0](https://github.com/ROCmSoftwarePlatform/rocThrust/releases/tag/rocm-5.1.0) |
|
||||
| Tensile | 4.31.0 ⇒ [4.32.0](https://github.com/ROCmSoftwarePlatform/Tensile/releases/tag/rocm-5.1.0) |
|
||||
|
||||
#### hipBLAS 0.50.0
|
||||
|
||||
hipBLAS 0.50.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added library version and device information to hipblas-test output
|
||||
- Added --rocsolver-path command line option to choose path to pre-built rocSOLVER, as
|
||||
absolute or relative path
|
||||
- Added --cmake_install command line option to update cmake to minimum version if required
|
||||
- Added cmake-arg parameter to pass in cmake arguments while building
|
||||
- Added infrastructure to support readthedocs hipBLAS documentation.
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Added hipblasVersionMinor define. hipblaseVersionMinor remains defined
|
||||
for backwards compatibility.
|
||||
- Doxygen warnings in hipblas.h header file.
|
||||
|
||||
##### Changed
|
||||
|
||||
- rocblas-path command line option can be specified as either absolute or relative path
|
||||
- Help message improvements in install.sh and rmake.py
|
||||
- Updated googletest dependency from 1.10.0 to 1.11.0
|
||||
|
||||
#### hipCUB 2.11.0
|
||||
|
||||
hipCUB 2.11.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Device segmented sort
|
||||
- Warp merge sort, WarpMask and thread sort from cub 1.15.0 supported in hipCUB
|
||||
- Device three way partition
|
||||
|
||||
##### Changed
|
||||
|
||||
- Device_scan and device_segmented_scan: inclusive_scan now uses the input-type as accumulator-type, exclusive_scan uses initial-value-type.
|
||||
- This particularly changes behaviour of small-size input types with large-size output types (e.g. short input, int output).
|
||||
- And low-res input with high-res output (e.g. float input, double output)
|
||||
- Block merge sort no longer supports non power of two blocksizes
|
||||
|
||||
#### hipFFT 1.0.7
|
||||
|
||||
hipFFT 1.0.7 for ROCm 5.1.0
|
||||
|
||||
##### Changed
|
||||
|
||||
- Use fft_params struct for accuracy and benchmark clients.
|
||||
|
||||
#### hipSOLVER 1.3.0
|
||||
|
||||
hipSOLVER 1.3.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added functions
|
||||
- gels
|
||||
- hipsolverSSgels_bufferSize, hipsolverDDgels_bufferSize, hipsolverCCgels_bufferSize, hipsolverZZgels_bufferSize
|
||||
- hipsolverSSgels, hipsolverDDgels, hipsolverCCgels, hipsolverZZgels
|
||||
- Added library version and device information to hipsolver-test output.
|
||||
- Added compatibility API with hipsolverDn prefix.
|
||||
- Added compatibility-only functions
|
||||
- gesvdj
|
||||
- hipsolverDnSgesvdj_bufferSize, hipsolverDnDgesvdj_bufferSize, hipsolverDnCgesvdj_bufferSize, hipsolverDnZgesvdj_bufferSize
|
||||
- hipsolverDnSgesvdj, hipsolverDnDgesvdj, hipsolverDnCgesvdj, hipsolverDnZgesvdj
|
||||
- gesvdjBatched
|
||||
- hipsolverDnSgesvdjBatched_bufferSize, hipsolverDnDgesvdjBatched_bufferSize, hipsolverDnCgesvdjBatched_bufferSize, hipsolverDnZgesvdjBatched_bufferSize
|
||||
- hipsolverDnSgesvdjBatched, hipsolverDnDgesvdjBatched, hipsolverDnCgesvdjBatched, hipsolverDnZgesvdjBatched
|
||||
- syevj
|
||||
- hipsolverDnSsyevj_bufferSize, hipsolverDnDsyevj_bufferSize, hipsolverDnCheevj_bufferSize, hipsolverDnZheevj_bufferSize
|
||||
- hipsolverDnSsyevj, hipsolverDnDsyevj, hipsolverDnCheevj, hipsolverDnZheevj
|
||||
- syevjBatched
|
||||
- hipsolverDnSsyevjBatched_bufferSize, hipsolverDnDsyevjBatched_bufferSize, hipsolverDnCheevjBatched_bufferSize, hipsolverDnZheevjBatched_bufferSize
|
||||
- hipsolverDnSsyevjBatched, hipsolverDnDsyevjBatched, hipsolverDnCheevjBatched, hipsolverDnZheevjBatched
|
||||
- sygvj
|
||||
- hipsolverDnSsygvj_bufferSize, hipsolverDnDsygvj_bufferSize, hipsolverDnChegvj_bufferSize, hipsolverDnZhegvj_bufferSize
|
||||
- hipsolverDnSsygvj, hipsolverDnDsygvj, hipsolverDnChegvj, hipsolverDnZhegvj
|
||||
|
||||
##### Changed
|
||||
|
||||
- The rocSOLVER backend now allows hipsolverXXgels and hipsolverXXgesv to be called in-place when B == X.
|
||||
- The rocSOLVER backend now allows rwork to be passed as a null pointer to hipsolverXgesvd.
|
||||
|
||||
##### Fixed
|
||||
|
||||
- bufferSize functions will now return HIPSOLVER_STATUS_NOT_INITIALIZED instead of HIPSOLVER_STATUS_INVALID_VALUE when both handle and lwork are null.
|
||||
- Fixed rare memory allocation failure in syevd/heevd and sygvd/hegvd caused by improper workspace array allocation outside of rocSOLVER.
|
||||
|
||||
#### hipSPARSE 2.1.0
|
||||
|
||||
hipSPARSE 2.1.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added gtsv_interleaved_batch and gpsv_interleaved_batch routines
|
||||
- Add SpGEMM_reuse
|
||||
|
||||
##### Changed
|
||||
|
||||
- Changed BUILD_CUDA with USE_CUDA in install script and cmake files
|
||||
- Update googletest to 11.1
|
||||
|
||||
##### Improved
|
||||
|
||||
- Fixed a bug in SpMM Alg versioning
|
||||
|
||||
##### Known Issues
|
||||
|
||||
- none
|
||||
|
||||
#### rccl 2.11.4
|
||||
|
||||
RCCL 2.11.4 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Compatibility with NCCL 2.11.4
|
||||
|
||||
##### Known Issues
|
||||
|
||||
- Managed memory is not currently supported for clique-based kernels
|
||||
|
||||
#### rocALUTION 2.0.2
|
||||
|
||||
rocALUTION 2.0.2 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Added out-of-place matrix transpose functionality
|
||||
- Added LocalVector<bool>
|
||||
|
||||
#### rocBLAS 2.43.0
|
||||
|
||||
rocBLAS 2.43.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Option to install script for number of jobs to use for rocBLAS and Tensile compilation (-j, --jobs)
|
||||
- Option to install script to build clients without using any Fortran (--clients_no_fortran)
|
||||
- rocblas_client_initialize function, to perform rocBLAS initialize for clients(benchmark/test) and report the execution time.
|
||||
- Added tests for output of reduction functions when given bad input
|
||||
- Added user specified initialization (rand_int/trig_float/hpl) for initializing matrices and vectors in rocblas-bench
|
||||
|
||||
##### Optimizations
|
||||
|
||||
- Improved performance of trsm with side == left and n == 1
|
||||
- Improved perforamnce of trsm with side == left and m <= 32 along with side == right and n <= 32
|
||||
|
||||
##### Changed
|
||||
|
||||
- For syrkx and trmm internal API use rocblas_stride datatype for offset
|
||||
- For non-batched and batched gemm_ex functions if the C matrix pointer equals the D matrix pointer (aliased) their respective type and leading dimension arguments must now match
|
||||
- Test client dependencies updated to GTest 1.11
|
||||
- non-global false positives reported by cppcheck from file based suppression to inline suppression. File based suppression will only be used for global false positives.
|
||||
- Help menu messages in install.sh
|
||||
- For ger function, typecast the 'lda'(offset) datatype to size_t during offset calculation to avoid overflow and remove duplicate template functions.
|
||||
- Modified default initialization from rand_int to hpl for initializing matrices and vectors in rocblas-bench
|
||||
|
||||
##### Fixed
|
||||
|
||||
- For function trmv (non-transposed cases) avoid overflow in offset calculation
|
||||
- Fixed cppcheck errors/warnings
|
||||
- Fixed doxygen warnings
|
||||
|
||||
#### rocFFT 1.0.16
|
||||
|
||||
rocFFT 1.0.16 for ROCm 5.1.0
|
||||
|
||||
##### Changed
|
||||
|
||||
- Supported unaligned tile dimension for SBRC_2D kernels.
|
||||
- Improved (more RAII) test and benchmark infrastructure.
|
||||
- Enabled runtime compilation of length-2304 FFT kernel during plan creation.
|
||||
|
||||
##### Optimizations
|
||||
|
||||
- Optimized more large 1D cases by using L1D_CC plan.
|
||||
- Optimized 3D 200^3 C2R case.
|
||||
- Optimized 1D 2^30 double precision on MI200.
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Fixed correctness of some R2C transforms with unusual strides.
|
||||
|
||||
##### Removed
|
||||
|
||||
- The hipFFT API (header) has been removed from after a long deprecation period. Please use the [hipFFT](https://github.com/ROCmSoftwarePlatform/hipFFT) package/repository to obtain the hipFFT API.
|
||||
|
||||
#### rocPRIM 2.10.13
|
||||
|
||||
rocPRIM 2.10.13 for ROCm 5.1.0
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Fixed radix sort int64_t bug introduced in [2.10.11]
|
||||
|
||||
##### Added
|
||||
|
||||
- Future value
|
||||
- Added device partition_three_way to partition input to three output iterators based on two predicates
|
||||
|
||||
##### Changed
|
||||
|
||||
- The reduce/scan algorithm precision issues in the tests has been resolved for half types.
|
||||
|
||||
##### Known Issues
|
||||
|
||||
- device_segmented_radix_sort unit test failing for HIP on Windows
|
||||
|
||||
#### rocRAND 2.10.13
|
||||
|
||||
rocRAND 2.10.13 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Generating a random sequence different sizes now produces the same sequence without gaps
|
||||
indepent of how many values are generated per call.
|
||||
- Only in the case of XORWOW, MRG32K3A, PHILOX4X32_10, SOBOL32 and SOBOL64
|
||||
- This only holds true if the size in each call is a divisor of the distributions
|
||||
`output_width` due to performance
|
||||
- Similarly the output pointer has to be aligned to `output_width * sizeof(output_type)`
|
||||
|
||||
##### Changed
|
||||
|
||||
- [hipRAND](https://github.com/ROCmSoftwarePlatform/hipRAND.git) split into a separate package
|
||||
- Header file installation location changed to match other libraries.
|
||||
- Using the `rocrand.h` header file should now use `#include <rocrand/rocrand.h>`, rather than `#include <rocrand/rocrand.h>`
|
||||
- rocRAND still includes hipRAND using a submodule
|
||||
- The rocRAND package also sets the provides field with hipRAND, so projects which require hipRAND can begin to specify it.
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Fix offset behaviour for XORWOW, MRG32K3A and PHILOX4X32_10 generator, setting offset now
|
||||
correctly generates the same sequence starting from the offset.
|
||||
- Only uniform int and float will work as these can be generated with a single call to the generator
|
||||
|
||||
##### Known Issues
|
||||
|
||||
- kernel_xorwow unit test is failing for certain GPU architectures.
|
||||
|
||||
#### rocSOLVER 3.17.0
|
||||
|
||||
rocSOLVER 3.17.0 for ROCm 5.1.0
|
||||
|
||||
##### Optimized
|
||||
|
||||
- Optimized non-pivoting and batch cases of the LU factorization
|
||||
|
||||
##### Fixed
|
||||
|
||||
- Fixed missing synchronization in SYTRF with `rocblas_fill_lower` that could potentially
|
||||
result in incorrect pivot values.
|
||||
- Fixed multi-level logging output to file with the `ROCSOLVER_LOG_PATH`,
|
||||
`ROCSOLVER_LOG_TRACE_PATH`, `ROCSOLVER_LOG_BENCH_PATH` and `ROCSOLVER_LOG_PROFILE_PATH`
|
||||
environment variables.
|
||||
- Fixed performance regression in the batched LU factorization of tiny matrices
|
||||
|
||||
#### rocSPARSE 2.1.0
|
||||
|
||||
rocSPARSE 2.1.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- gtsv_interleaved_batch
|
||||
- gpsv_interleaved_batch
|
||||
- SpGEMM_reuse
|
||||
- Allow copying of mat info struct
|
||||
|
||||
##### Improved
|
||||
|
||||
- Optimization for SDDMM
|
||||
- Allow unsorted matrices in csrgemm multipass algorithm
|
||||
|
||||
##### Known Issues
|
||||
|
||||
- none
|
||||
|
||||
#### rocThrust 2.14.0
|
||||
|
||||
rocThrust 2.14.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Updated to match upstream Thrust 1.15.0
|
||||
|
||||
##### Known Issues
|
||||
|
||||
- async_copy, partition, and stable_sort_by_key unit tests are failing on HIP on Windows.
|
||||
|
||||
#### Tensile 4.32.0
|
||||
|
||||
Tensile 4.32.0 for ROCm 5.1.0
|
||||
|
||||
##### Added
|
||||
|
||||
- Better control of parallelism to control memory usage
|
||||
- Support for multiprocessing on Windows for TensileCreateLibrary
|
||||
- New JSD metric and metric selection functionality
|
||||
- Initial changes to support two-tier solution selection
|
||||
|
||||
##### Optimized
|
||||
|
||||
- Optimized runtime of TensileCreateLibraries by reducing max RAM usage
|
||||
- StoreCInUnroll additional optimizations plus adaptive K support
|
||||
- DGEMM NN optimizations with PrefetchGlobalRead(PGR)=2 support
|
||||
|
||||
##### Changed
|
||||
|
||||
- Update Googletest to 1.11.0
|
||||
|
||||
##### Removed
|
||||
|
||||
- Remove no longer supported benchmarking steps
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.0.2
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
### Fixed Defects
|
||||
|
||||
216
RELEASE.md
216
RELEASE.md
@@ -15,219 +15,23 @@ The release notes for the ROCm platform.
|
||||
|
||||
-------------------
|
||||
|
||||
## ROCm 5.1.0
|
||||
## ROCm 5.0.2
|
||||
<!-- markdownlint-disable first-line-h1 -->
|
||||
<!-- markdownlint-disable no-duplicate-header -->
|
||||
### What's New in This Release
|
||||
|
||||
#### HIP Enhancements
|
||||
|
||||
The ROCm v5.1 release consists of the following HIP enhancements.
|
||||
|
||||
##### HIP Installation Guide Updates
|
||||
|
||||
The HIP Installation Guide is updated to include installation and building HIP from source on the AMD and NVIDIA platforms.
|
||||
|
||||
Refer to the HIP Installation Guide v5.1 for more details.
|
||||
|
||||
##### Support for HIP Graph
|
||||
|
||||
ROCm v5.1 extends support for HIP Graph.
|
||||
|
||||
##### Planned Changes for HIP in Future Releases
|
||||
|
||||
###### Separation of hiprtc (libhiprtc) library from hip runtime (amdhip64)
|
||||
|
||||
On ROCm/Linux, to maintain backward compatibility, the hipruntime library (amdhip64) will continue to include hiprtc symbols in future releases. The backward compatible support may be discontinued by removing hiprtc symbols from the hipruntime library (amdhip64) in the next major release.
|
||||
|
||||
###### hipDeviceProp_t Structure Enhancements
|
||||
|
||||
Changes to the hipDeviceProp_t structure in the next major release may result in backward incompatibility. More details on these changes will be provided in subsequent releases.
|
||||
|
||||
#### ROCDebugger Enhancements
|
||||
|
||||
##### Multi-language Source Level Debugger
|
||||
|
||||
The compiler now generates a source-level variable and function argument debug information.
|
||||
|
||||
The accuracy is guaranteed if the compiler options `-g -O0` are used and apply only to HIP.
|
||||
|
||||
This enhancement enables ROCDebugger users to interact with the HIP source-level variables and function arguments.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> The newly-suggested compiler -g option must be used instead of the previously-suggested `-ggdb` option. Although the effect of these two options is currently equivalent, this is not guaranteed for the future and might get changed by the upstream LLVM community.
|
||||
|
||||
##### Machine Interface Lanes Support
|
||||
|
||||
ROCDebugger Machine Interface (MI) extends support to lanes. The following enhancements are made:
|
||||
|
||||
- Added a new -lane-info command, listing the current thread's lanes.
|
||||
|
||||
- The -thread-select command now supports a lane switch to switch to a specific lane of a thread:
|
||||
|
||||
```sh
|
||||
-thread-select -l LANE THREAD
|
||||
```
|
||||
|
||||
- The =thread-selected notification gained a lane-id attribute. This enables the frontend to know which lane of the thread was selected.
|
||||
|
||||
- The *stopped asynchronous record gained lane-id and hit-lanes attributes. The former indicates which lane is selected, and the latter indicates which lanes explain the stop.
|
||||
|
||||
- MI commands now accept a global --lane option, similar to the global --thread and --frame options.
|
||||
|
||||
- MI varobjs are now lane-aware.
|
||||
|
||||
For more information, refer to the ROC Debugger User Guide at
|
||||
{doc}`ROCgdb <rocgdb:index>`.
|
||||
|
||||
##### Enhanced - clone-inferior Command
|
||||
|
||||
The clone-inferior command now ensures that the TTY, CMD, ARGS, and AMDGPU PRECISE-MEMORY settings are copied from the original inferior to the new one. All modifications to the environment variables done using the 'set environment' or 'unset environment' commands are also copied to the new inferior.
|
||||
|
||||
#### MIOpen Support for RDNA GPUs
|
||||
|
||||
This release includes support for AMD Radeon™ Pro W6800, in addition to other bug fixes and performance improvements as listed below:
|
||||
|
||||
- MIOpen now supports RDNA GPUs!! (via MIOpen PRs 973, 780, 764, 740, 739, 677, 660, 653, 493, 498)
|
||||
|
||||
- Fixed a correctness issue with ImplicitGemm algorithm
|
||||
|
||||
- Updated the performance data for new kernel versions
|
||||
|
||||
- Improved MIOpen build time by splitting large kernel header files
|
||||
|
||||
- Fixed an issue in reduction kernels for padded tensors
|
||||
|
||||
- Various other bug fixes and performance improvements
|
||||
|
||||
For more information, see {doc}`Documentation <miopen:index>`.
|
||||
|
||||
#### Checkpoint Restore Support With CRIU
|
||||
|
||||
The new Checkpoint Restore in Userspace (CRIU) functionality is implemented to support AMD GPU and ROCm applications.
|
||||
|
||||
CRIU is a userspace tool to Checkpoint and Restore an application.
|
||||
|
||||
CRIU lacked the support for checkpoint restore applications that used device files such as a GPU. With this ROCm release, CRIU is enhanced with a new plugin to support AMD GPUs, which includes:
|
||||
|
||||
- Single and Multi GPU systems (Gfx9)
|
||||
|
||||
- Checkpoint / Restore on a different system
|
||||
|
||||
- Checkpoint / Restore inside a docker container
|
||||
|
||||
- PyTorch
|
||||
|
||||
- Tensorflow
|
||||
|
||||
- Using CRIU Image Streamer
|
||||
|
||||
For more information, refer to <https://github.com/checkpoint-restore/criu/tree/criu-dev/plugins/amdgpu>
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> The CRIU plugin (amdgpu_plugin) is merged upstream with the CRIU repository. The KFD kernel patches are also available upstream with the amd-staging-drm-next branch (public) and the ROCm 5.1 release branch.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> This is a Beta release of the Checkpoint and Restore functionality, and some features are not available in this release.
|
||||
|
||||
For more information, refer to the following websites:
|
||||
|
||||
- <https://github.com/RadeonOpenCompute/criu/blob/amdgpu_plugin-03252022/Documentation/amdgpu_plugin.txt>
|
||||
|
||||
- <https://criu.org/Main_Page>
|
||||
|
||||
### Fixed Defects
|
||||
|
||||
The following defects are fixed in this release.
|
||||
The following defects are fixed in the ROCm v5.0.2 release.
|
||||
|
||||
#### Driver Fails To Load after Installation
|
||||
#### Issue with hostcall Facility in HIP Runtime
|
||||
|
||||
The issue with the driver failing to load after ROCm installation is now fixed.
|
||||
In ROCm v5.0, when using the “assert()” call in a HIP kernel, the compiler may sometimes fail to emit kernel metadata related to the hostcall facility, which results in incomplete initialization of the hostcall facility in the HIP runtime. This can cause the HIP kernel to crash when it attempts to execute the “assert()” call.
|
||||
|
||||
The driver installs successfully, and the server reboots with working rocminfo and clinfo.
|
||||
The root cause was an incorrect check in the compiler to determine whether the hostcall facility is required by the kernel. This is fixed in the ROCm v5.0.2 release.
|
||||
|
||||
#### ROCDebugger Fixed Defects
|
||||
The resolution includes a compiler change, which emits the required metadata by default, unless the compiler can prove that the hostcall facility is not required by the kernel. This ensures that the “assert()” call never fails.
|
||||
|
||||
##### Breakpoints in GPU kernel code Before Kernel Is Loaded
|
||||
Note:
|
||||
This fix may lead to breakage in some OpenMP offload use cases, which use print inside a target region and result in an abort in device code. The issue will be fixed in a future release.
|
||||
Compatibility Matrix Updates to ROCm Deep Learning Guide
|
||||
|
||||
Previously, setting a breakpoint in device code by line number before the device code was loaded into the program resulted in ROCgdb incorrectly moving the breakpoint to the first following line that contains host code.
|
||||
|
||||
Now, the breakpoint is left pending. When the GPU kernel gets loaded, the breakpoint resolves to a location in the kernel.
|
||||
|
||||
##### Registers Invalidated After Write
|
||||
|
||||
Previously, the stale just-written value was presented as a current value.
|
||||
|
||||
ROCgdb now invalidates the cached values of registers whose content might differ after being written. For example, registers with read-only bits.
|
||||
|
||||
ROCgdb also invalidates all volatile registers when a volatile register is written. For example, writing VCC invalidates the content of STATUS as STATUS.VCCZ may change.
|
||||
|
||||
##### Scheduler-locking and GPU Wavefronts
|
||||
|
||||
When scheduler-locking is in effect, new wavefronts created by a resumed thread, CPU, or GPU wavefront, are held in the halt state. For example, the "set scheduler-locking" command.
|
||||
|
||||
##### ROCDebugger Fails Before Completion of Kernel Execution
|
||||
|
||||
It was possible (although erroneous) for a debugger to load GPU code in memory, send it to the device, start executing a kernel on the device, and dispose of the original code before the kernel had finished execution. If a breakpoint was hit after this point, the debugger failed with an internal error while trying to access the debug information.
|
||||
|
||||
This issue is now fixed by ensuring that the debugger keeps a local copy of the original code and debug information.
|
||||
|
||||
### Known Issues
|
||||
|
||||
#### Random Memory Access Fault Errors Observed While Running Math Libraries Unit Tests
|
||||
|
||||
**Issue:** Random memory access fault issues are observed while running Math libraries unit tests. This issue is encountered in ROCm v5.0, ROCm v5.0.1, and ROCm v5.0.2.
|
||||
|
||||
Note, the faults only occur in the SRIOV environment.
|
||||
|
||||
**Workaround:** Use SDMA to update the page table. The Guest set up steps are as follows:
|
||||
|
||||
```sh
|
||||
sudo modprobe amdgpu vm_update_mode=0
|
||||
```
|
||||
|
||||
To verify, use
|
||||
|
||||
**Guest:**
|
||||
|
||||
```sh
|
||||
cat /sys/module/amdgpu/parameters/vm_update_mode 0
|
||||
```
|
||||
|
||||
Where expectation is 0.
|
||||
|
||||
#### CU Masking Causes Application to Freeze
|
||||
|
||||
Using CU Masking results in an application freeze or runs exceptionally slowly. This issue is noticed only in the GFX10 suite of products. Note, this issue is observed only in GFX10 suite of products.
|
||||
|
||||
This issue is under active investigation at this time.
|
||||
|
||||
#### Failed Checkpoint in Docker Containers
|
||||
|
||||
A defect with Ubuntu images kernel-5.13-30-generic and kernel-5.13-35-generic with Overlay FS results in incorrect reporting of the mount ID.
|
||||
|
||||
This issue with Ubuntu causes CRIU checkpointing to fail in Docker containers.
|
||||
|
||||
As a workaround, use an older version of the kernel. For example, Ubuntu 5.11.0-46-generic.
|
||||
|
||||
#### Issue with Restoring Workloads Using Cooperative Groups Feature
|
||||
|
||||
Workloads that use the cooperative groups function to ensure all waves can be resident at the same time may fail to restore correctly.
|
||||
This issue is under investigation and will be fixed in a future release.
|
||||
|
||||
#### Radeon Pro V620 and W6800 Workstation GPUs
|
||||
|
||||
##### No Support for ROCDebugger on SRIOV
|
||||
|
||||
ROCDebugger is not supported in the SRIOV environment on any GPU.
|
||||
|
||||
This is a known issue and will be fixed in a future release.
|
||||
|
||||
#### Random Error Messages in ROCm SMI for SR-IOV
|
||||
|
||||
Random error messages are generated by unsupported functions or commands.
|
||||
|
||||
This is a known issue and will be fixed in a future release.
|
||||
The compatibility matrix in the AMD Deep Learning Guide is updated for ROCm v5.0.2.
|
||||
@@ -14,6 +14,13 @@ shutil.copy2('../RELEASE.md','./release.md')
|
||||
# Keep capitalization due to similar linking on GitHub's markdown preview.
|
||||
shutil.copy2('../CHANGELOG.md','./CHANGELOG.md')
|
||||
|
||||
# configurations for PDF output by Read the Docs
|
||||
project = "ROCm Documentation"
|
||||
author = "Advanced Micro Devices, Inc."
|
||||
copyright = "Copyright (c) 2023 Advanced Micro Devices, Inc. All rights reserved."
|
||||
version = "5.0.2"
|
||||
release = "5.0.2"
|
||||
|
||||
setting_all_article_info = True
|
||||
all_article_info_os = ["linux"]
|
||||
all_article_info_author = ""
|
||||
@@ -57,7 +64,7 @@ article_pages = [
|
||||
|
||||
external_toc_path = "./sphinx/_toc.yml"
|
||||
|
||||
docs_core = ROCmDocs("ROCm Documentation Home")
|
||||
docs_core = ROCmDocs("ROCm 5.0.2 Documentation Home")
|
||||
docs_core.setup()
|
||||
|
||||
external_projects_current_project = "rocm"
|
||||
|
||||
@@ -18,8 +18,8 @@ following commands based on your distribution.
|
||||
|
||||
```shell
|
||||
sudo apt update
|
||||
wget https://repo.radeon.com/amdgpu-install/22.10/ubuntu/bionic/amdgpu-install_22.10.50100-1_all.deb
|
||||
sudo apt install ./amdgpu-install_22.10.50100-1_all.deb
|
||||
wget https://repo.radeon.com/amdgpu-install/21.50.2/ubuntu/bionic/amdgpu-install_21.50.2.50002-1_all.deb
|
||||
sudo apt install ./amdgpu-install_21.50.2.50002-1_all.deb
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -28,8 +28,8 @@ sudo apt install ./amdgpu-install_22.10.50100-1_all.deb
|
||||
|
||||
```shell
|
||||
sudo apt update
|
||||
wget https://repo.radeon.com/amdgpu-install/22.10/ubuntu/focal/amdgpu-install_22.10.50100-1_all.deb
|
||||
sudo apt install ./amdgpu-install_22.10.50100-1_all.deb
|
||||
wget https://repo.radeon.com/amdgpu-install/21.50.2/ubuntu/focal/amdgpu-install_21.50.2.50002-1_all.deb
|
||||
sudo apt install ./amdgpu-install_21.50.2.50002-1_all.deb
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -44,7 +44,16 @@ sudo apt install ./amdgpu-install_22.10.50100-1_all.deb
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/22.20/rhel/7.9/amdgpu-install-22.20.50200-1.el7.noarch.rpm
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/7.9/amdgpu-install-21.50.2.50002-1.el7.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8.4
|
||||
:sync: RHEL-8.4
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/8.4/amdgpu-install-21.50.2.50002-1.el7.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -53,7 +62,7 @@ sudo yum install https://repo.radeon.com/amdgpu-install/22.20/rhel/7.9/amdgpu-in
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/22.20/rhel/8.5/amdgpu-install-22.20.50200-1.el8.noarch.rpm
|
||||
sudo yum install https://repo.radeon.com/amdgpu-install/21.50.2/rhel/8.5/amdgpu-install-21.50.2.50002-1.el7.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -67,7 +76,7 @@ sudo yum install https://repo.radeon.com/amdgpu-install/22.20/rhel/8.5/amdgpu-in
|
||||
:sync: SLES15-SP3
|
||||
|
||||
```shell
|
||||
sudo zypper --no-gpg-checks install https://repo.radeon.com/amdgpu-install/22.10/sle/15/amdgpu-install-22.10.50100-1.noarch.rpm
|
||||
sudo zypper --no-gpg-checks install https://repo.radeon.com/amdgpu-install/21.50.2/sle/15/amdgpu-install-21.50.2.50002-1.noarch.rpm
|
||||
```
|
||||
|
||||
:::
|
||||
@@ -146,9 +155,9 @@ the installer script will install packages in the single-version layout.
|
||||
For the multi-version ROCm installation you must use the installer script from
|
||||
the latest release of ROCm that you wish to install.
|
||||
|
||||
**Example:** If you want to install ROCm releases 5.0.2 and 5.1
|
||||
**Example:** If you want to install ROCm releases 5.0.0 and 5.0.2
|
||||
simultaneously, you are required to download the installer from the latest ROCm
|
||||
release v5.1.
|
||||
release v5.0.2.
|
||||
|
||||
### Add Required Repositories
|
||||
|
||||
@@ -167,7 +176,7 @@ Run the following commands based on your distribution to add the repositories:
|
||||
:sync: ubuntu-18.04
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2; do
|
||||
for ver in 5.0; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/trusted.gpg.d/rocm-keyring.gpg] https://repo.radeon.com/rocm/apt/$ver bionic main" | sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' | sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -179,7 +188,7 @@ sudo apt update
|
||||
:sync: ubuntu-20.04
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2; do
|
||||
for ver in 5.0; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/trusted.gpg.d/rocm-keyring.gpg] https://repo.radeon.com/rocm/apt/$ver focal main" | sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' | sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -197,7 +206,7 @@ sudo apt update
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2; do
|
||||
for ver in 5.0; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -216,7 +225,7 @@ sudo yum clean all
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2; do
|
||||
for ver in 5.0;
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -241,7 +250,7 @@ sudo yum clean all
|
||||
:sync: SLES15-SP3
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2; do
|
||||
for ver in 5.0; do
|
||||
sudo tee --append /etc/zypp/repos.d/rocm.repo <<EOF
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/$ver/sle/15/main/x86_64
|
||||
@@ -274,8 +283,8 @@ driver, associated with the ROCm release v5.3, will be installed as its latest
|
||||
release in the list.
|
||||
|
||||
```none
|
||||
sudo amdgpu-install --usecase=rocm --rocmrelease=5.0.0
|
||||
sudo amdgpu-install --usecase=rocm --rocmrelease=5.0.2
|
||||
sudo amdgpu-install --usecase=rocm --rocmrelease=5.1.0
|
||||
```
|
||||
|
||||
## Additional options
|
||||
|
||||
@@ -53,18 +53,7 @@ To add the AMDGPU repository, follow these steps:
|
||||
|
||||
```shell
|
||||
# amdgpu repository for bionic
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.20/ubuntu bionic main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} Ubuntu 20.04
|
||||
:sync: ubuntu-20.04
|
||||
|
||||
```shell
|
||||
# amdgpu repository for bionic
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10/ubuntu bionic main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu bionic main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -75,7 +64,7 @@ sudo apt update
|
||||
|
||||
```shell
|
||||
# amdgpu repository for focal
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10/ubuntu focal main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu focal main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -102,7 +91,7 @@ To add the ROCm repository, use the following steps:
|
||||
|
||||
```shell
|
||||
# ROCm repositories for bionic
|
||||
for ver in 5.0.2 5.1; do
|
||||
for ver in 5.0 5.0.2; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/$ver bionic main" \
|
||||
| sudo tee --append /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
@@ -117,7 +106,7 @@ sudo apt update
|
||||
|
||||
```shell
|
||||
# ROCm repositories for focal
|
||||
for ver in 5.0.2 5.1; do
|
||||
for ver in 5.0 5.0.2; do
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/$ver focal main" \
|
||||
| sudo tee --append /etc/apt/sources.list.d/rocm.list
|
||||
done
|
||||
@@ -147,7 +136,7 @@ For a comprehensive list of meta-packages, refer to
|
||||
- Sample Multi-version installation
|
||||
|
||||
```shell
|
||||
sudo apt install rocm-hip-sdk5.1.0 rocm-hip-sdk5.0.2
|
||||
sudo apt install rocm-hip-sdk5.0.2
|
||||
```
|
||||
|
||||
:::::
|
||||
@@ -171,7 +160,26 @@ section.
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/22.10/rhel/7.9/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/7.9/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
|
||||
:::{tab-item} RHEL 8.4
|
||||
:sync: RHEL-8.4
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.4/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -190,7 +198,7 @@ sudo yum clean all
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/22.10/rhel/8.5/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.5/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -220,7 +228,7 @@ To add the ROCm repository, use the following steps, based on your distribution:
|
||||
:sync: RHEL-7
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2 5.1; do
|
||||
for ver in 5.0 5.0.2; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -239,7 +247,7 @@ sudo yum clean all
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2 5.1; do
|
||||
for ver in 5.0 5.0.2; do
|
||||
sudo tee --append /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -274,7 +282,7 @@ For a comprehensive list of meta-packages, refer to
|
||||
- Sample Multi-version installation
|
||||
|
||||
```shell
|
||||
sudo yum install rocm-hip-sdk5.1.0 rocm-hip-sdk5.0.2
|
||||
sudo yum install rocm-hip-sdk5.0.2
|
||||
```
|
||||
|
||||
:::::
|
||||
@@ -297,7 +305,7 @@ section.
|
||||
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/22.10/sle/15.3/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/sle/15.3/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
@@ -322,7 +330,7 @@ sudo reboot
|
||||
To add the ROCm repository, use the following steps:
|
||||
|
||||
```shell
|
||||
for ver in 5.0.2 5.1; do
|
||||
for ver in 5.0 5.0.2; do
|
||||
sudo tee --append /etc/zypp/repos.d/rocm.repo <<EOF
|
||||
[ROCm-$ver]
|
||||
name=ROCm$ver
|
||||
@@ -354,7 +362,7 @@ For a comprehensive list of meta-packages, refer to
|
||||
- Sample Multi-version installation
|
||||
|
||||
```shell
|
||||
sudo zypper --gpg-auto-import-keys install rocm-hip-sdk5.1.0 rocm-hip-sdk5.0.2
|
||||
sudo zypper --gpg-auto-import-keys install rocm-hip-sdk5.0.2
|
||||
```
|
||||
|
||||
:::::
|
||||
@@ -391,7 +399,7 @@ but are generally useful. Verification of the install is advised.
|
||||
2. Add binary paths to the `PATH` environment variable.
|
||||
|
||||
```shell
|
||||
export PATH=$PATH:/opt/rocm-5.1.0/bin:/opt/rocm-5.1.0/opencl/bin
|
||||
export PATH=$PATH:/opt/rocm-5.0.2/bin:/opt/rocm-5.0.2/opencl/bin
|
||||
```
|
||||
|
||||
```{attention}
|
||||
|
||||
@@ -26,7 +26,7 @@ repository to the new release.
|
||||
|
||||
```shell
|
||||
# amdgpu repository for bionic
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10/ubuntu bionic main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu bionic main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -37,7 +37,7 @@ sudo apt update
|
||||
|
||||
```shell
|
||||
# amdgpu repository for focal
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/22.10/ubuntu focal main' \
|
||||
echo 'deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/amdgpu/21.50.2/ubuntu focal main' \
|
||||
| sudo tee /etc/apt/sources.list.d/amdgpu.list
|
||||
sudo apt update
|
||||
```
|
||||
@@ -57,7 +57,25 @@ sudo apt update
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/22.10/rhel/7.9/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/7.9/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
EOF
|
||||
sudo yum clean all
|
||||
```
|
||||
|
||||
:::
|
||||
:::{tab-item} RHEL 8.4
|
||||
:sync: RHEL-8.4
|
||||
:sync: RHEL-8
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.4/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -75,7 +93,7 @@ sudo yum clean all
|
||||
sudo tee /etc/yum.repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/22.10/rhel/8.5/main/x86_64/
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/rhel/8.5/main/x86_64/
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -99,7 +117,7 @@ sudo yum clean all
|
||||
sudo tee /etc/zypp/repos.d/amdgpu.repo <<EOF
|
||||
[amdgpu]
|
||||
name=amdgpu
|
||||
baseurl=https://repo.radeon.com/amdgpu/22.10/sle/15.3/main/x86_64
|
||||
baseurl=https://repo.radeon.com/amdgpu/21.50.2/sle/15.3/main/x86_64
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
@@ -162,7 +180,7 @@ repository to the new release.
|
||||
:sync: ubuntu-18.04
|
||||
|
||||
```shell
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.1 bionic main" \
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.0.5 bionic main" \
|
||||
| sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' \
|
||||
| sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -174,7 +192,7 @@ sudo apt update
|
||||
:sync: ubuntu-20.04
|
||||
|
||||
```shell
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.1 focal main" \
|
||||
echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/5.0.2 focal main" \
|
||||
| sudo tee /etc/apt/sources.list.d/rocm.list
|
||||
echo -e 'Package: *\nPin: release o=repo.radeon.com\nPin-Priority: 600' \
|
||||
| sudo tee /etc/apt/preferences.d/rocm-pin-600
|
||||
@@ -193,9 +211,9 @@ sudo apt update
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-5.1]
|
||||
name=ROCm5.1
|
||||
baseurl=https://repo.radeon.com/rocm/yum/5.1/main
|
||||
[ROCm-5.0.22]
|
||||
name=ROCm5.0.2
|
||||
baseurl=https://repo.radeon.com/rocm/yum/5.0.2/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -210,9 +228,9 @@ sudo yum clean all
|
||||
|
||||
```shell
|
||||
sudo tee /etc/yum.repos.d/rocm.repo <<EOF
|
||||
[ROCm-5.1]
|
||||
name=ROCm5.1
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.1/main
|
||||
[ROCm-5.0.2]
|
||||
name=ROCm5.0.2
|
||||
baseurl=https://repo.radeon.com/rocm/rhel8/5.0.2/main
|
||||
enabled=1
|
||||
priority=50
|
||||
gpgcheck=1
|
||||
@@ -229,10 +247,10 @@ sudo yum clean all
|
||||
|
||||
```shell
|
||||
sudo tee /etc/zypp/repos.d/rocm.repo <<EOF
|
||||
[ROCm-5.1]
|
||||
name=ROCm5.1
|
||||
[ROCm-5.0.2]
|
||||
name=ROCm5.0.2
|
||||
name=rocm
|
||||
baseurl=https://repo.radeon.com/rocm/zyp/5.1/main
|
||||
baseurl=https://repo.radeon.com/rocm/zyp/5.0.2/main
|
||||
enabled=1
|
||||
gpgcheck=1
|
||||
gpgkey=https://repo.radeon.com/rocm/rocm.gpg.key
|
||||
|
||||
@@ -18,7 +18,6 @@ Detailed walkthroughs of specific use-cases driven by frameworks using ROCm
|
||||
acceleration.
|
||||
|
||||
- [Implementing Inception V3 on ROCm with PyTorch](machine_learning/pytorch_inception.md)
|
||||
- [Optimizing Inference with MIGraphX](machine_learning/migraphx_optimization.md)
|
||||
|
||||
:::
|
||||
|
||||
|
||||
@@ -10,11 +10,4 @@ A collection of detailed and guided examples for working with Inception V3 with
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} Optimizing Inference with MIGraphX
|
||||
:link: migraphx_optimization
|
||||
:link-type: doc
|
||||
Walkthroughs of optimizing inference using MIGraphX.
|
||||
|
||||
:::
|
||||
|
||||
:::::
|
||||
|
||||
@@ -83,10 +83,6 @@ TensorFlow, \[Online image\]. [https://www.tensorflow.org/extras/tensorflow_bran
|
||||
|
||||
MAGMA, \[Online image\]. [https://bitbucket.org/icl/magma/src/master/docs/](https://bitbucket.org/icl/magma/src/master/docs/)
|
||||
|
||||
Advanced Micro Devices, Inc., \[Online\]. Available: [https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/](https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/)
|
||||
|
||||
Advanced Micro Devices, Inc., \[Online\]. Available: [https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki)
|
||||
|
||||
Docker, \[Online\]. [https://docs.docker.com/get-started/overview/](https://docs.docker.com/get-started/overview/)
|
||||
|
||||
Torchvision, \[Online\]. Available [https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision](https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision)
|
||||
|
||||
@@ -425,10 +425,6 @@ TensorFlow, \[Online image\]. [https://www.tensorflow.org/extras/tensorflow_bran
|
||||
|
||||
MAGMA, \[Online image\]. [https://bitbucket.org/icl/magma/src/master/docs/](https://bitbucket.org/icl/magma/src/master/docs/)
|
||||
|
||||
Advanced Micro Devices, Inc., \[Online\]. Available: [https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/](https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/)
|
||||
|
||||
Advanced Micro Devices, Inc., \[Online\]. Available: [https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki)
|
||||
|
||||
Docker, \[Online\]. [https://docs.docker.com/get-started/overview/](https://docs.docker.com/get-started/overview/)
|
||||
|
||||
Torchvision, \[Online\]. Available [https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision](https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision)
|
||||
|
||||
@@ -197,10 +197,6 @@ TensorFlow, \[Online image\]. [https://www.tensorflow.org/extras/tensorflow_bran
|
||||
|
||||
MAGMA, \[Online image\]. [https://bitbucket.org/icl/magma/src/master/docs/](https://bitbucket.org/icl/magma/src/master/docs/)
|
||||
|
||||
Advanced Micro Devices, Inc., \[Online\]. Available: [https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/](https://rocmsoftwareplatform.github.io/AMDMIGraphX/doc/html/)
|
||||
|
||||
Advanced Micro Devices, Inc., \[Online\]. Available: [https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki)
|
||||
|
||||
Docker, \[Online\]. [https://docs.docker.com/get-started/overview/](https://docs.docker.com/get-started/overview/)
|
||||
|
||||
Torchvision, \[Online\]. Available [https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision](https://pytorch.org/vision/master/index.html?highlight=torchvision#module-torchvision)
|
||||
|
||||
@@ -93,7 +93,6 @@ agile, flexible, rapid and secure manner. [more...](rocm)
|
||||
- [Examples](https://github.com/amd/rocm-examples)
|
||||
- [ML, DL, and AI](examples/machine_learning/all)
|
||||
- [](examples/machine_learning/pytorch_inception)
|
||||
- [](examples/machine_learning/migraphx_optimization)
|
||||
|
||||
:::
|
||||
::::
|
||||
|
||||
@@ -10,17 +10,10 @@ AMD's library for high performance machine learning primitives.
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} {doc}`Composable Kernel <composable-kernel:index>`
|
||||
:::{grid-item-card} {doc}`Composable Kernel <composable_kernel:index>`
|
||||
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
|
||||
|
||||
- {doc}`Documentation <composable-kernel:index>`
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} {doc}`MIGraphX <migraphx:index>`
|
||||
AMD MIGraphX is AMD's graph inference engine that accelerates machine learning model inference.
|
||||
|
||||
- {doc}`Documentation <migraphx:index>`
|
||||
- {doc}`Documentation <composable_kernel:index>`
|
||||
|
||||
:::
|
||||
|
||||
|
||||
@@ -42,8 +42,7 @@ Inter and intra-node communication is supported by the following projects:
|
||||
Libraries related to AI.
|
||||
|
||||
- {doc}`MIOpen <miopen:index>`
|
||||
- {doc}`Composable Kernel <composable-kernel:index>`
|
||||
- {doc}`MIGraphX <migraphx:index>`
|
||||
- {doc}`Composable Kernel <composable_kernel:index>`
|
||||
|
||||
:::
|
||||
|
||||
@@ -80,7 +79,7 @@ Computer vision related projects.
|
||||
|
||||
:::{grid-item-card} [Validation Tools](validation_tools)
|
||||
|
||||
- {doc}`ROCm Validation Suite <rocm-validation-suite:index>`
|
||||
- {doc}`ROCm Validation Suite <rocmvalidationsuite:index>`
|
||||
- {doc}`TransferBench <transferbench:index>`
|
||||
|
||||
:::
|
||||
|
||||
@@ -3,10 +3,10 @@
|
||||
:::::{grid} 1 1 2 2
|
||||
:gutter: 1
|
||||
|
||||
:::{grid-item-card} {doc}`RVS <rocm-validation-suite:index>`
|
||||
:::{grid-item-card} {doc}`RVS <rocmvalidationsuite:index>`
|
||||
The ROCm Validation Suite is a system administrator’s and cluster manager's tool for detecting and troubleshooting common problems affecting AMD GPU(s) running in a high-performance computing environment, enabled using the ROCm software stack on a compatible platform.
|
||||
|
||||
- {doc}`Documentation <rocm-validation-suite:index>`
|
||||
- {doc}`Documentation <rocmvalidationsuite:index>`
|
||||
|
||||
:::
|
||||
|
||||
|
||||
@@ -8,12 +8,12 @@ AMD ROCm™ Platform supports the following Linux distributions.
|
||||
|
||||
| Distribution |Processor Architectures| Validated Kernel |
|
||||
|--------------------|-----------------------|--------------------|
|
||||
| CentOS 8.4 | x86-64 | 4.18 |
|
||||
| CentOS 8.3 | x86-64 | 4.18 |
|
||||
| CentOS 7.9 | x86-64 | 3.10 |
|
||||
| RHEL 8.5 | x86-64 | 4.18 |
|
||||
| RHEL 8.5, 8.4 | x86-64 | 4.18 |
|
||||
| RHEL 7.9 | x86-64 | 3.10 |
|
||||
| SLES 15 SP3 | x86-64 | 5.3.18 |
|
||||
| Ubuntu 20.04.4 LTS | x86-64 | 5.13 |
|
||||
| Ubuntu 20.04.3 LTS | x86-64 | 5.8 |
|
||||
| Ubuntu 18.04.5 LTS | x86-64 | 5.4.0 |
|
||||
|
||||
## Virtualization Support
|
||||
|
||||
@@ -58,7 +58,6 @@ The table is ordered to follow ROCm's manifest file.
|
||||
| [rocPRIM](https://github.com/ROCmSoftwarePlatform/rocPRIM/) | [MIT](https://github.com/ROCmSoftwarePlatform/rocPRIM/blob/develop/LICENSE.txt) |
|
||||
| [rocWMMA](https://github.com/ROCmSoftwarePlatform/rocWMMA/) | [MIT](https://github.com/ROCmSoftwarePlatform/rocWMMA/blob/develop/LICENSE.md) |
|
||||
| [hipfort](https://github.com/ROCmSoftwarePlatform/hipfort/) | [MIT](https://github.com/ROCmSoftwarePlatform/hipfort/blob/master/LICENSE) |
|
||||
| [AMDMIGraphX](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/) | [MIT](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/blob/develop/LICENSE) |
|
||||
| [ROCmValidationSuite](https://github.com/ROCm-Developer-Tools/ROCmValidationSuite/) | [MIT](https://github.com/ROCm-Developer-Tools/ROCmValidationSuite/blob/master/LICENSE) |
|
||||
| [aomp](https://github.com/ROCm-Developer-Tools/aomp/) | [Apache 2.0](https://github.com/ROCm-Developer-Tools/aomp/blob/aomp-dev/LICENSE) |
|
||||
| [aomp-extras](https://github.com/ROCm-Developer-Tools/aomp-extras/) | [MIT](https://github.com/ROCm-Developer-Tools/aomp-extras/blob/aomp-dev/LICENSE) |
|
||||
@@ -121,4 +120,4 @@ following location: `/opt/rocm/share/doc/<component-name>/`
|
||||
For example, you can fetch the licensing information of the `_amd_comgr_`
|
||||
component (Code Object Manager) from the `amd_comgr` folder. A file named
|
||||
`LICENSE.txt` contains the license details at:
|
||||
`/opt/rocm-5.1.0/share/doc/amd_comgr/LICENSE.txt`
|
||||
`/opt/rocm-5.0.2/share/doc/amd_comgr/LICENSE.txt`
|
||||
|
||||
@@ -146,9 +146,7 @@ subtrees:
|
||||
- title: MIOpen - Machine Intelligence
|
||||
url: ${project:miopen}
|
||||
- title: Composable Kernel
|
||||
url: ${project:composable-kernel}
|
||||
- title: MIGraphX - Graph Optimization
|
||||
url: ${project:migraphx}
|
||||
url: ${project:composable_kernel}
|
||||
- file: reference/computer_vision
|
||||
subtrees:
|
||||
- entries:
|
||||
@@ -171,7 +169,7 @@ subtrees:
|
||||
title: Validation Tools
|
||||
subtrees:
|
||||
- entries:
|
||||
- url: ${project:rocm-validation-suite}
|
||||
- url: ${project:rocmvalidationsuite}
|
||||
title: RVS
|
||||
- url: ${project:transferbench}
|
||||
title: TransferBench
|
||||
@@ -223,7 +221,6 @@ subtrees:
|
||||
subtrees:
|
||||
- entries:
|
||||
- file: examples/machine_learning/pytorch_inception
|
||||
- file: examples/machine_learning/migraphx_optimization
|
||||
|
||||
- caption: About
|
||||
entries:
|
||||
|
||||
@@ -1 +1,2 @@
|
||||
rocm-docs-core==0.16.0
|
||||
rocm-docs-core==1.8.0
|
||||
sphinx-reredirects
|
||||
|
||||
@@ -1,114 +1,106 @@
|
||||
#
|
||||
# This file is autogenerated by pip-compile with Python 3.11
|
||||
# This file is autogenerated by pip-compile with Python 3.10
|
||||
# by the following command:
|
||||
#
|
||||
# pip-compile docs/sphinx/requirements.in
|
||||
# pip-compile requirements.in
|
||||
#
|
||||
accessible-pygments==0.0.3
|
||||
accessible-pygments==0.0.5
|
||||
# via pydata-sphinx-theme
|
||||
alabaster==0.7.13
|
||||
alabaster==1.0.0
|
||||
# via sphinx
|
||||
babel==2.11.0
|
||||
babel==2.16.0
|
||||
# via
|
||||
# pydata-sphinx-theme
|
||||
# sphinx
|
||||
beautifulsoup4==4.11.2
|
||||
beautifulsoup4==4.12.3
|
||||
# via pydata-sphinx-theme
|
||||
breathe==4.34.0
|
||||
breathe==4.35.0
|
||||
# via rocm-docs-core
|
||||
certifi==2022.12.7
|
||||
certifi==2024.8.30
|
||||
# via requests
|
||||
cffi==1.15.1
|
||||
cffi==1.17.1
|
||||
# via
|
||||
# cryptography
|
||||
# pynacl
|
||||
charset-normalizer==2.1.1
|
||||
charset-normalizer==3.3.2
|
||||
# via requests
|
||||
click==8.1.3
|
||||
click==8.1.7
|
||||
# via sphinx-external-toc
|
||||
colorama==0.4.6
|
||||
# via
|
||||
# click
|
||||
# sphinx
|
||||
cryptography==40.0.2
|
||||
cryptography==43.0.1
|
||||
# via pyjwt
|
||||
deprecated==1.2.13
|
||||
deprecated==1.2.14
|
||||
# via pygithub
|
||||
docutils==0.19
|
||||
docutils==0.21.2
|
||||
# via
|
||||
# breathe
|
||||
# myst-parser
|
||||
# pydata-sphinx-theme
|
||||
# sphinx
|
||||
fastjsonschema==2.16.3
|
||||
fastjsonschema==2.20.0
|
||||
# via rocm-docs-core
|
||||
gitdb==4.0.10
|
||||
gitdb==4.0.11
|
||||
# via gitpython
|
||||
gitpython==3.1.30
|
||||
gitpython==3.1.43
|
||||
# via rocm-docs-core
|
||||
idna==3.4
|
||||
idna==3.10
|
||||
# via requests
|
||||
imagesize==1.4.1
|
||||
# via sphinx
|
||||
jinja2==3.1.2
|
||||
jinja2==3.1.4
|
||||
# via
|
||||
# myst-parser
|
||||
# sphinx
|
||||
linkify-it-py==1.0.3
|
||||
# via myst-parser
|
||||
markdown-it-py==2.2.0
|
||||
markdown-it-py==3.0.0
|
||||
# via
|
||||
# mdit-py-plugins
|
||||
# myst-parser
|
||||
markupsafe==2.1.2
|
||||
markupsafe==2.1.5
|
||||
# via jinja2
|
||||
mdit-py-plugins==0.3.4
|
||||
mdit-py-plugins==0.4.2
|
||||
# via myst-parser
|
||||
mdurl==0.1.2
|
||||
# via markdown-it-py
|
||||
myst-parser[linkify]==1.0.0
|
||||
myst-parser==4.0.0
|
||||
# via rocm-docs-core
|
||||
packaging==23.0
|
||||
packaging==24.1
|
||||
# via
|
||||
# pydata-sphinx-theme
|
||||
# sphinx
|
||||
pycparser==2.21
|
||||
pycparser==2.22
|
||||
# via cffi
|
||||
pydata-sphinx-theme==0.13.3
|
||||
pydata-sphinx-theme==0.15.4
|
||||
# via
|
||||
# rocm-docs-core
|
||||
# sphinx-book-theme
|
||||
pygithub==1.58.1
|
||||
pygithub==2.4.0
|
||||
# via rocm-docs-core
|
||||
pygments==2.14.0
|
||||
pygments==2.18.0
|
||||
# via
|
||||
# accessible-pygments
|
||||
# pydata-sphinx-theme
|
||||
# sphinx
|
||||
pyjwt[crypto]==2.6.0
|
||||
pyjwt[crypto]==2.9.0
|
||||
# via pygithub
|
||||
pynacl==1.5.0
|
||||
# via pygithub
|
||||
pytz==2022.7.1
|
||||
# via babel
|
||||
pyyaml==6.0
|
||||
pyyaml==6.0.2
|
||||
# via
|
||||
# myst-parser
|
||||
# rocm-docs-core
|
||||
# sphinx-external-toc
|
||||
requests==2.28.1
|
||||
requests==2.32.3
|
||||
# via
|
||||
# pygithub
|
||||
# sphinx
|
||||
rocm-docs-core==0.16.0
|
||||
# via -r docs/sphinx/requirements.in
|
||||
smmap==5.0.0
|
||||
rocm-docs-core==1.8.0
|
||||
# via -r requirements.in
|
||||
smmap==5.0.1
|
||||
# via gitdb
|
||||
snowballstemmer==2.2.0
|
||||
# via sphinx
|
||||
soupsieve==2.4
|
||||
soupsieve==2.6
|
||||
# via beautifulsoup4
|
||||
sphinx==5.3.0
|
||||
sphinx==8.0.2
|
||||
# via
|
||||
# breathe
|
||||
# myst-parser
|
||||
@@ -119,33 +111,40 @@ sphinx==5.3.0
|
||||
# sphinx-design
|
||||
# sphinx-external-toc
|
||||
# sphinx-notfound-page
|
||||
sphinx-book-theme==1.0.1
|
||||
# sphinx-reredirects
|
||||
sphinx-book-theme==1.1.3
|
||||
# via rocm-docs-core
|
||||
sphinx-copybutton==0.5.1
|
||||
sphinx-copybutton==0.5.2
|
||||
# via rocm-docs-core
|
||||
sphinx-design==0.4.1
|
||||
sphinx-design==0.6.1
|
||||
# via rocm-docs-core
|
||||
sphinx-external-toc==0.3.1
|
||||
sphinx-external-toc==1.0.1
|
||||
# via rocm-docs-core
|
||||
sphinx-notfound-page==0.8.3
|
||||
sphinx-notfound-page==1.0.4
|
||||
# via rocm-docs-core
|
||||
sphinxcontrib-applehelp==1.0.4
|
||||
sphinx-reredirects==0.1.5
|
||||
# via -r requirements.in
|
||||
sphinxcontrib-applehelp==2.0.0
|
||||
# via sphinx
|
||||
sphinxcontrib-devhelp==1.0.2
|
||||
sphinxcontrib-devhelp==2.0.0
|
||||
# via sphinx
|
||||
sphinxcontrib-htmlhelp==2.0.1
|
||||
sphinxcontrib-htmlhelp==2.1.0
|
||||
# via sphinx
|
||||
sphinxcontrib-jsmath==1.0.1
|
||||
# via sphinx
|
||||
sphinxcontrib-qthelp==1.0.3
|
||||
sphinxcontrib-qthelp==2.0.0
|
||||
# via sphinx
|
||||
sphinxcontrib-serializinghtml==1.1.5
|
||||
sphinxcontrib-serializinghtml==2.0.0
|
||||
# via sphinx
|
||||
typing-extensions==4.5.0
|
||||
# via pydata-sphinx-theme
|
||||
uc-micro-py==1.0.1
|
||||
# via linkify-it-py
|
||||
urllib3==1.26.13
|
||||
# via requests
|
||||
wrapt==1.14.1
|
||||
tomli==2.0.1
|
||||
# via sphinx
|
||||
typing-extensions==4.12.2
|
||||
# via
|
||||
# pydata-sphinx-theme
|
||||
# pygithub
|
||||
urllib3==2.2.3
|
||||
# via
|
||||
# pygithub
|
||||
# requests
|
||||
wrapt==1.16.0
|
||||
# via deprecated
|
||||
|
||||
@@ -224,10 +224,6 @@ ROCm CMake Packages
|
||||
+-----------+----------+--------------------------------------------------------+
|
||||
| MIOpen | miopen | ``MIOpen`` |
|
||||
+-----------+----------+--------------------------------------------------------+
|
||||
| MIGraphX | migraphx | ``migraphx::migraphx``, ``migraphx::migraphx_c``, |
|
||||
| | | ``migraphx::migraphx_cpu``, ``migraphx::migraphx_gpu``,|
|
||||
| | | ``migraphx::migraphx_onnx``, ``migraphx::migraphx_tf`` |
|
||||
+-----------+----------+--------------------------------------------------------+
|
||||
|
||||
Using CMake Presets
|
||||
===================
|
||||
|
||||
Reference in New Issue
Block a user