Fix changelog and release notes

Add \n to top of 6.0.2 template

Update RELEASE.md and 6.1.1.md

Regenerate changelog

Add minor wording changes in RELEASE.md

Incorporate Leo's feedback

Reformat RELEASE.md to fix build issue

Fixes an issue preventing Changelog from appearing in the TOC.

Update AMDSMI link & change 'release highlights' to 'release notes'

Change AMD SMI link from develop to docs/6.1.1
Peter Jun Park
2024-04-25 13:54:22 -04:00
committed by Sam Wu
parent f3d20e5215
commit a5a1e39e61
5 changed files with 81 additions and 99 deletions

@@ -19,14 +19,10 @@ This page contains the changelog for AMD ROCm Software.
ROCm™ 6.1.1 introduces minor fixes and improvements to some tools and libraries.
In addition, ROCm 6.1.1 has been tested against a pre-release version of Ubuntu 22.04.5 (kernel 6.8).
#### AMD SMI
AMD SMI for ROCm 6.1.1
See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/develop/CHANGELOG.md) with code samples for more information.
##### Additions
* Added deferred error correctable counts to `amd-smi metric -ecc -ecc-blocks`.
@@ -52,6 +48,10 @@ See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/develop/CHANGEL
* `amd-smi bad-pages` can result in a `ValueError: Null pointer access` error when using certain PMU firmware versions.
```{note}
See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/docs/6.1.1/CHANGELOG.md) with code samples for more information.
```
#### HIPCC
HIPCC for ROCm 6.1.1
@@ -70,38 +70,15 @@ HIPIFY for ROCm 6.1.1
* Added support for LLVM 18.1.2.
* Added support for cuDNN 9.0.0.
* Added new options:
* Added new option:
* `--clang-resource-directory` to specify the clang resource path (the path to the parent folder for the `include` folder that contains `__clang_cuda_runtime_wrapper.h` and other header files used during the hipification process).
#### hipSOLVER 2.1.1
hipSOLVER 2.1.1 for ROCm 6.1.1
##### Changes
* `BUILD_WITH_SPARSE` now defaults to OFF on Windows.
##### Fixes
* Fixed benchmark client build when `BUILD_WITH_SPARSE` is OFF.
#### rocFFT 1.0.27
rocFFT 1.0.27 for ROCm 6.1.1
##### Additions
* Enabled multi-GPU testing on systems without direct GPU-interconnects.
##### Fixes
* Fixed kernel launch failure when executing very large odd-length real-complex transforms.
### Library changes in ROCm 6.1.1
| Library | Version |
|---------|---------|
| AMDMIGraphX | [2.9](https://github.com/ROCm/AMDMIGraphX/releases/tag/rocm-6.1.1) |
| composable_kernel | [0.2.0](https://github.com/ROCm/composable_kernel/releases/tag/rocm-6.1.1) |
| hipBLAS | [2.1.0](https://github.com/ROCm/hipBLAS/releases/tag/rocm-6.1.1) |
| hipCUB | [3.1.0](https://github.com/ROCm/hipCUB/releases/tag/rocm-6.1.1) |
| hipFFT | [1.0.14](https://github.com/ROCm/hipFFT/releases/tag/rocm-6.1.1) |
@@ -163,7 +140,7 @@ performance of AMD Instinct™ MI300 GPU applications. Notably, we've added:
YUV frames in video memory. With decoded frames in video memory, you can run video
post-processing using ROCm HIP, avoiding unnecessary data copies via the PCIe bus.
To learn more, refer to the rocDecode
To learn more, refer to the rocDecode
[documentation](https://rocm.docs.amd.com/projects/rocDecode/en/latest/).
### OS and GPU support changes
@@ -454,11 +431,11 @@ ROCProfiler for ROCm 6.1.0
[On GitHub](https://github.com/ROCm/ROCm/issues/3027)
#### MI200 SR-IOV
#### MI200 SR-IOV
##### Known issue
* Multimedia applications may encounter compilation errors in the MI200 Single Root Input/Output Virtualization (SR-IOV) environment. This is because MI200 SR-IOV does not currently support multimedia applications.
* Multimedia applications may encounter compilation errors in the MI200 Single Root Input/Output Virtualization (SR-IOV) environment. This is because MI200 SR-IOV does not currently support multimedia applications.
[On GitHub](https://github.com/ROCm/ROCm/issues/3028)
@@ -1006,7 +983,8 @@ Tensile 4.40.0 for ROCm 6.1.0
-------------------
## ROCm 6.0.2
The ROCm 6.0.2 point release consists of minor bug fixes to improve the stability of MI300 GPU applications. This release introduces several new driver features for system qualification on our partner server offerings.
The ROCm 6.0.2 point release consists of minor bug fixes to improve the stability of MI300 GPU applications. This release introduces several new driver features for system qualification on our partner server offerings.
#### hipFFT 1.0.13
@@ -1057,7 +1035,7 @@ would copy internal references to the generators and would lead to double free o
These types should be moved instead of copied, and move constructors and operators are now defined
for them.
##### Optimized
##### Optimized
- Improved MT19937 initialization and generation performance.
@@ -2087,8 +2065,8 @@ hipFFT 1.0.13 for ROCm 6.0.0
##### Changed
- hipfft-rider has been renamed to hipfft-bench, controlled by the BUILD_CLIENTS_BENCH CMake option. A link for the
old file name is installed, and the old BUILD_CLIENTS_RIDER CMake option is accepted for compatibility but both
- hipfft-rider has been renamed to hipfft-bench, controlled by the BUILD_CLIENTS_BENCH CMake option. A link for the
old file name is installed, and the old BUILD_CLIENTS_RIDER CMake option is accepted for compatibility but both
will be removed in a future release.
- Binaries in debug builds no longer have a "-d" suffix.
- The minimum rocFFT required version has been updated to 1.0.21.
@@ -2882,7 +2860,7 @@ MIGraphX 2.7 for ROCm 5.7.0
- Improved compile times by only building for the GPU on the system
- Improve performance of pointwise/reduction kernels when using NHWC layouts
- Load specific version of the migraphx_py library
- Annotate functions with the block size so the compiler can do a better job of optimizing
- Annotate functions with the block size so the compiler can do a better job of optimizing
- Enable reshape on nonstandard shapes
- Use half HIP APIs to compute max and min
- Added support for broadcasted scalars to unsqueeze operator
@@ -3794,7 +3772,7 @@ rocSPARSE 2.5.2 for ROCm 5.6.0
rocThrust 2.18.0 for ROCm 5.6.0
##### Fixed
##### Fixed
- `lower_bound`, `upper_bound`, and `binary_search` failed to compile for certain types.
@@ -4297,9 +4275,9 @@ MIGraphX 2.5 for ROCm 5.5.0
##### Added
- Y-Model feature to store tuning information with the optimized model
- Added Python 3.10 bindings
- Added Python 3.10 bindings
- Accuracy checker tool based on ONNX Runtime
- ONNX Operators parse_split, and Trilu
- ONNX Operators parse_split, and Trilu
- Build support for ROCm MLIR
- Added migraphx-driver flag to print optimizations in python (--python)
- Added JIT implementation of the Gather and Pad operator which results in better handling of larger tensor sizes.
@@ -4313,7 +4291,7 @@ MIGraphX 2.5 for ROCm 5.5.0
##### Fixed
- Improved parsing Tensorflow Protobuf files
- Improved parsing Tensorflow Protobuf files
- Resolved various accuracy issues with some onnx models
- Resolved a gcc-12 issue with mivisionx
- Improved support for larger sized models and batches
@@ -5415,7 +5393,7 @@ function or by parameters.
##### Changed
- Improved build parallelism of the test suite by splitting up large compilation units for `DeviceRadixSort`,
- Improved build parallelism of the test suite by splitting up large compilation units for `DeviceRadixSort`,
`DeviceSegmentedRadixSort` and `DeviceSegmentedSort`.
- CUB backend references CUB and thrust version 1.17.1.
@@ -5535,8 +5513,8 @@ rocFFT 1.0.19 for ROCm 5.4.0
##### Changed
- Moved runtime compilation cache to in-memory by default. A default on-disk cache can encounter contention problems
on multi-node clusters with a shared filesystem. rocFFT can still be told to use an on-disk cache by setting the
- Moved runtime compilation cache to in-memory by default. A default on-disk cache can encounter contention problems
on multi-node clusters with a shared filesystem. rocFFT can still be told to use an on-disk cache by setting the
ROCFFT_RTC_CACHE_PATH environment variable.
#### rocPRIM 2.12.0
@@ -5545,7 +5523,7 @@ rocPRIM 2.12.0 for ROCm 5.4.0
##### Changed
- `device_partition`, `device_unique`, and `device_reduce_by_key` now support problem
- `device_partition`, `device_unique`, and `device_reduce_by_key` now support problem
sizes larger than 2^32 items.
##### Removed
@@ -7269,7 +7247,7 @@ rocSPARSE 2.2.0 for ROCm 5.2.0
##### Added
- batched SpMM for CSR, COO and Blocked ELL formats.
- batched SpMM for CSR, COO and Blocked ELL formats.
- Packages for test and benchmark executables on all supported OSes using CPack.
- Clients file importers and exporters.
@@ -8006,7 +7984,7 @@ rocSPARSE 2.1.0 for ROCm 5.1.0
##### Added
- gtsv_interleaved_batch
- gtsv_interleaved_batch
- gpsv_interleaved_batch
- SpGEMM_reuse
- Allow copying of mat info struct

@@ -1,4 +1,4 @@
# ROCm 6.1.1 release highlights
# ROCm 6.1.1 release notes
<!-- Disable lints since this is an auto-generated file. -->
<!-- markdownlint-disable blanks-around-headers -->
@@ -11,14 +11,10 @@
ROCm™ 6.1.1 introduces minor fixes and improvements to some tools and libraries.
In addition, ROCm 6.1.1 has been tested against a pre-release version of Ubuntu 22.04.5 (kernel 6.8).
## AMD SMI
AMD SMI for ROCm 6.1.1
See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/develop/CHANGELOG.md) with code samples for more information.
### Additions
- Added deferred error correctable counts to `amd-smi metric -ecc -ecc-blocks`.
@@ -31,8 +27,8 @@ See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/develop/CHANGEL
### Fixes
- Fixed `amd-smi metric --clock`'s clock lock status and deep sleep status.
- Fixed an issue that would cause an error when attempting to reset non-AMD GPUs.
- Fixed `amd-smi metric --clock`'s clock lock and deep sleep status.
- Fixed an issue that would cause an error when resetting non-AMD GPUs.
- Fixed `amd-smi metric --pcie` and `amdsmi_get_pcie_info()` when using Navi 32 and Navi 31 hardware to prevent "UNKNOWN" reports.
- Fixed the output results of `amd-smi process` when getting processes running on a device.
@@ -42,7 +38,11 @@ See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/develop/CHANGEL
### Known issues
- `amd-smi bad-pages` can result in a `ValueError: Null pointer access` error when using certain PMU firmware versions.
- `amd-smi bad-pages` can result in a `ValueError: Null pointer access` error when using some PMU firmware versions.
```{note}
See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/docs/6.1.1/CHANGELOG.md) with code samples for more information.
```
## HIPCC
@@ -50,7 +50,7 @@ HIPCC for ROCm 6.1.1
### Changes
- **Upcoming:** ROCm 6.2 will enable use of compiled binaries `hipcc.bin` and `hipconfig.bin` by default. No action is needed by users; you may continue calling high-level Perl scripts `hipcc` and `hipconfig`. `hipcc.bin` and `hipconfig.bin` will be invoked by the high-level Perl scripts. To revert to the previous behavior and invoke `hipcc.pl` and `hipconfig.pl`, set the `HIP_USE_PERL_SCRIPTS` environment variable to `1`.
- **Upcoming:** ROCm 6.2 will enable use of compiled binaries `hipcc.bin` and `hipconfig.bin` by default. No action is needed by users. You can continue calling high-level Perl scripts `hipcc` and `hipconfig`. `hipcc.bin` and `hipconfig.bin` will be invoked by the high-level Perl scripts. To revert to the previous behavior and invoke `hipcc.pl` and `hipconfig.pl`, set the `HIP_USE_PERL_SCRIPTS` environment variable to `1`.
- **Upcoming:** ROCm 6.3 will remove high-level Perl scripts `hipcc` and `hipconfig`. This release will remove the `HIP_USE_PERL_SCRIPTS` environment variable. It will rename `hipcc.bin` and `hipconfig.bin` to `hipcc` and `hipconfig` respectively. No action is needed by the users. To revert to the previous behavior, invoke `hipcc.pl` and `hipconfig.pl` explicitly.
- **Upcoming:** ROCm 7.0 will remove `hipcc.pl` and `hipconfig.pl`.
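
As a rough illustration of the `HIP_USE_PERL_SCRIPTS` opt-out described in the first item above, here is a minimal Python sketch, assuming the ROCm 6.2 behavior stated in that note. The helper names `perl_script_env` and `run_hipcc` are hypothetical and are not part of HIPCC. The sketch only sets the environment variable and calls `hipcc` if it is found on `PATH`.

```python
import os
import shutil
import subprocess


def perl_script_env(base_env=None):
    """Return a copy of the environment with HIP_USE_PERL_SCRIPTS=1 set.

    Hypothetical helper: the variable name and value come from the
    release note; everything else here is illustrative.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["HIP_USE_PERL_SCRIPTS"] = "1"
    return env


def run_hipcc(args, use_perl_scripts=False):
    """Run hipcc with the given arguments if it is installed.

    Returns the CompletedProcess, or None when hipcc is not on PATH.
    """
    hipcc = shutil.which("hipcc")
    if hipcc is None:
        return None  # ROCm is not installed on this machine
    env = perl_script_env() if use_perl_scripts else dict(os.environ)
    return subprocess.run(
        [hipcc, *args], env=env, capture_output=True, text=True, check=False
    )
```

For example, `run_hipcc(["--version"], use_perl_scripts=True)` would request the Perl-script path on a system where the variable is still honored. According to the second item above, ROCm 6.3 removes the variable, so from that release on the sketch would no longer change which implementation runs.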
@@ -60,31 +60,58 @@ HIPIFY for ROCm 6.1.1
### Additions
* Added support for LLVM 18.1.2.
* Added support for cuDNN 9.0.0.
* Added new options:
* `--clang-resource-directory` to specify the clang resource path (the path to the parent folder for the `include` folder that contains `__clang_cuda_runtime_wrapper.h` and other header files used during the hipification process).
- Added support for LLVM 18.1.2.
- Added support for cuDNN 9.0.0.
- Added a new option: `--clang-resource-directory` to specify the clang resource path (the path to the parent folder for the `include` folder that contains `__clang_cuda_runtime_wrapper.h` and other header files used during the hipification process).
## hipSOLVER 2.1.1
## Library changes in ROCm 6.1.1
| Library | Version |
|---------|---------|
| AMDMIGraphX | [2.9](https://github.com/ROCm/AMDMIGraphX/releases/tag/rocm-6.1.1) |
| hipBLAS | [2.1.0](https://github.com/ROCm/hipBLAS/releases/tag/rocm-6.1.1) |
| hipCUB | [3.1.0](https://github.com/ROCm/hipCUB/releases/tag/rocm-6.1.1) |
| hipFFT | [1.0.14](https://github.com/ROCm/hipFFT/releases/tag/rocm-6.1.1) |
| hipRAND | [2.10.17](https://github.com/ROCm/hipRAND/releases/tag/rocm-6.1.1) |
| hipSOLVER | 2.1.0 ⇒ [2.1.1](https://github.com/ROCm/hipSOLVER/releases/tag/rocm-6.1.1) |
| hipSPARSE | [3.0.1](https://github.com/ROCm/hipSPARSE/releases/tag/rocm-6.1.1) |
| hipTensor | [1.2.0](https://github.com/ROCm/hipTensor/releases/tag/rocm-6.1.1) |
| MIOpen | [3.1.0](https://github.com/ROCm/MIOpen/releases/tag/rocm-6.1.1) |
| MIVisionX | [2.5.0](https://github.com/ROCm/MIVisionX/releases/tag/rocm-6.1.1) |
| rccl | [2.18.6](https://github.com/ROCm/rccl/releases/tag/rocm-6.1.1) |
| rocALUTION | [3.1.1](https://github.com/ROCm/rocALUTION/releases/tag/rocm-6.1.1) |
| rocBLAS | [4.1.0](https://github.com/ROCm/rocBLAS/releases/tag/rocm-6.1.1) |
| rocFFT | 1.0.26 ⇒ [1.0.27](https://github.com/ROCm/rocFFT/releases/tag/rocm-6.1.1) |
| rocm-cmake | [0.12.0](https://github.com/ROCm/rocm-cmake/releases/tag/rocm-6.1.1) |
| rocPRIM | [3.1.0](https://github.com/ROCm/rocPRIM/releases/tag/rocm-6.1.1) |
| rocRAND | [3.0.1](https://github.com/ROCm/rocRAND/releases/tag/rocm-6.1.1) |
| rocSOLVER | [3.25.0](https://github.com/ROCm/rocSOLVER/releases/tag/rocm-6.1.1) |
| rocSPARSE | [3.1.2](https://github.com/ROCm/rocSPARSE/releases/tag/rocm-6.1.1) |
| rocThrust | [3.0.1](https://github.com/ROCm/rocThrust/releases/tag/rocm-6.1.1) |
| rocWMMA | [1.4.0](https://github.com/ROCm/rocWMMA/releases/tag/rocm-6.1.1) |
| rpp | [1.5.0](https://github.com/ROCm/rpp/releases/tag/rocm-6.1.1) |
| Tensile | [4.40.0](https://github.com/ROCm/Tensile/releases/tag/rocm-6.1.1) |
### hipSOLVER 2.1.1
hipSOLVER 2.1.1 for ROCm 6.1.1
### Changes
#### Changes
- `BUILD_WITH_SPARSE` now defaults to OFF on Windows.
- By default, `BUILD_WITH_SPARSE` is now set to OFF on Microsoft Windows.
### Fixes
#### Fixes
- Fixed benchmark client build when `BUILD_WITH_SPARSE` is OFF.
## rocFFT 1.0.27
### rocFFT 1.0.27
rocFFT 1.0.27 for ROCm 6.1.1
### Additions
#### Additions
- Enabled multi-GPU testing on systems without direct GPU-interconnects.
- Enable multi-gpu testing on systems without direct GPU-interconnects
### Fixes
#### Fixes
- Fixed kernel launch failure when executing very large odd-length real-complex transforms.
- Fixed kernel launch failure on execute of very large odd-length real-complex transforms.

@@ -8,7 +8,7 @@ subtrees:
- entries:
- file: what-is-rocm.rst
- file: about/release-notes.md
title: Release highlights
title: Release notes
subtrees:
- entries:
- file: about/CHANGELOG.md

@@ -1,4 +1,5 @@
The ROCm 6.0.2 point release consists of minor bug fixes to improve the stability of MI300 GPU applications. This release introduces several new driver features for system qualification on our partner server offerings.
The ROCm 6.0.2 point release consists of minor bug fixes to improve the stability of MI300 GPU applications. This release introduces several new driver features for system qualification on our partner server offerings.
#### hipFFT 1.0.13

@@ -1,14 +1,10 @@
ROCm™ 6.1.1 introduces minor fixes and improvements to some tools and libraries.
In addition, ROCm 6.1.1 has been tested against a pre-release version of Ubuntu 22.04.5 (kernel 6.8).
#### AMD SMI
AMD SMI for ROCm 6.1.1
See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/develop/CHANGELOG.md) with code samples for more information.
##### Additions
* Added deferred error correctable counts to `amd-smi metric -ecc -ecc-blocks`.
@@ -34,6 +30,10 @@ See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/develop/CHANGEL
* `amd-smi bad-pages` can result in a `ValueError: Null pointer access` error when using certain PMU firmware versions.
```{note}
See the [detailed changelog](https://github.com/ROCm/amdsmi/docs/6.1.1/CHANGELOG.md) with code samples for more information.
```
#### HIPCC
HIPCC for ROCm 6.1.1
@@ -54,27 +54,3 @@ HIPIFY for ROCm 6.1.1
* Added support for cuDNN 9.0.0.
* Added new option:
* `--clang-resource-directory` to specify the clang resource path (the path to the parent folder for the `include` folder that contains `__clang_cuda_runtime_wrapper.h` and other header files used during the hipification process).
#### hipSOLVER 2.1.1
hipSOLVER 2.1.1 for ROCm 6.1.1
##### Changes
* `BUILD_WITH_SPARSE` now defaults to OFF on Windows.
##### Fixes
* Fixed benchmark client build when `BUILD_WITH_SPARSE` is OFF.
#### rocFFT 1.0.27
rocFFT 1.0.27 for ROCm 6.1.1
##### Additions
* Enabled multi-GPU testing on systems without direct GPU-interconnects.
##### Fixes
* Fixed kernel launch failure when executing very large odd-length real-complex transforms.