mirror of
https://github.com/ROCm/ROCm.git
synced 2026-01-10 15:18:11 -05:00
* Regenerate changelog * Add component changelogs and known issue Fix RELEASE.md headings Update pub datestamp for 6.1.2 Add AMDSMI and ROCm SMI to 6.1.2 template Add rccl and rocBLAS Update intro blurb and headings Add ROCm SMI fix Add missed heading to AMDSMI Update datestamp and release version number Update version and release number Add known issue re: MI300X error detection Words Add issue link Rm GitHub issue link Move known issue down Update ki wording Remove "this issue has been investigated ... " from known issue Fix changelog h1
6.8 KiB
6.8 KiB
ROCm 6.1.2 release notes
ROCm 6.1.2 includes enhancements to SMI tools and improvements to some libraries.
AMD SMI
AMD SMI for ROCm 6.1.2
Additions
- Added process isolation and clean shader APIs and CLI commands.
amdsmi_get_gpu_process_isolation()amdsmi_set_gpu_process_isolation()amdsmi_set_gpu_clear_sram_data()
- Added the
MIN_POWERmetric to output provided byamd-smi static --limit.
Optimizations
- Updated the
amd-smi monitor --pcieoutput to prevent delays with themonitorcommand.
Changes
- Updated
amismi_get_power_cap_infoto return values in uW instead of W. - Updated Python library return types for
amdsmi_get_gpu_memory_reserved_pagesandamdsmi_get_gpu_bad_page_info. - Updated the output of
amd-smi metric --ecc-blocksto show counters available from blocks.
Fixes
amdsmi_get_gpu_board_info()no longer returns junk character strings.amd-smi metric --powernow correctly details power output for RDNA3, RDNA2, and MI1x devices.- Fixed the
amdsmitstReadWrite.TestPowerCapReadWritetest for RDNA3, RDNA2, and MI100 devices. - Fixed an issue with the
amdsmi_get_gpu_memory_reserved_pagesandamdsmi_get_gpu_bad_page_infoPython interface calls.
Removals
- Removed the
amdsmi_get_gpu_process_infoAPI from the Python library. It was removed from the C library in an earlier release.
See the AMD SMI [detailed changelog](https://github.com/ROCm/amdsmi/blob/rocm-6.1.x/CHANGELOG.md) with code samples for more information.
HIPCC
HIPCC for ROCm 6.1.2
Changes
- Upcoming: a future release will enable use of compiled binaries
hipcc.binandhipconfig.binby default. No action is needed by users; you may continue calling high-level Perl scriptshipccandhipconfig.hipcc.binandhipconfig.binwill be invoked by the high-level Perl scripts. To revert to the previous behavior and invokehipcc.plandhipconfig.pl, set theHIP_USE_PERL_SCRIPTSenvironment variable to1. - Upcoming: a subsequent release will remove high-level Perl scripts
hipccandhipconfig. This release will remove theHIP_USE_PERL_SCRIPTSenvironment variable. It will renamehipcc.binandhipconfig.bintohipccandhipconfigrespectively. No action is needed by the users. To revert to the previous behavior, invokehipcc.plandhipconfig.plexplicitly. - Upcoming: a subsequent release will remove
hipcc.plandhipconfig.pl.
ROCm SMI
ROCm SMI for ROCm 6.1.2
Additions
- Added the ring hang event to the
amdsmi_evt_notification_type_tenum.
Fixes
- Fixed an issue causing ROCm SMI to incorrectly report GPU utilization for RDNA3 GPUs.
- Fixed the parsing of
pp_od_clk_voltageinget_od_clk_volt_infoto work better with MI-series hardware.
Known issue with error detection on MI300X
During poison consumption testing, the injection of uncorrectable errors will not generate an interrupt to the driver, resulting in undetected errors. This can result in reliability and recovery issues on MI300X accelerator-based setups.
Library changes in ROCm 6.1.2
| Library | Version |
|---|---|
| AMDMIGraphX | 2.9 |
| composable_kernel | 0.2.0 |
| hipBLAS | 2.1.0 |
| hipBLASLt | 0.7.0 |
| hipCUB | 3.1.0 |
| hipFFT | 1.0.14 |
| hipRAND | 2.10.17 |
| hipSOLVER | 2.1.1 |
| hipSPARSE | 3.0.1 |
| hipSPARSELt | 0.2.0 |
| hipTensor | 1.2.0 |
| MIOpen | 3.1.0 |
| MIVisionX | 2.5.0 |
| rccl | 2.18.6 |
| rocALUTION | 3.1.1 |
| rocBLAS | 4.1.0 ⇒ 4.1.2 |
| rocDecode | 0.5.0 ⇒ 0.6.0 |
| rocFFT | 1.0.27 |
| rocm-cmake | 0.12.0 |
| rocPRIM | 3.1.0 |
| rocRAND | 3.0.1 |
| rocSOLVER | 3.25.0 |
| rocSPARSE | 3.1.2 |
| rocThrust | 3.0.1 |
| rocWMMA | 1.4.0 |
| rpp | 1.5.0 |
| Tensile | 4.40.0 |
RCCL
RCCL 2.18.6 for ROCm 6.1.2
Changes
- Reduced
NCCL_TOPO_MAX_NODESto limit stack usage and avoid stack overflow.
rocBLAS
rocBLAS 4.1.2 for ROCm 6.1.2
Optimizations
- Tuned BBS TN and TT operations on the CDNA3 architecture.
Fixes
- Fixed an issue related to obtaining solutions for BF16 TT operations.
rocDecode
rocDecode 0.6.0 for ROCm 6.1.2
Additions
- Added support for FFmpeg v5.x.
Optimizations
- Updated error checking in the
rocDecode-setup.pyscript.
Changes
- Updated core dependencies.
- Updated to support the use of public LibVA headers.
Fixes
- Fixed some package dependencies.
Tested configurations
-
Linux
- Ubuntu 20.04 and 22.04
- RHEL 8 and 9
-
ROCm
- rocm-core 6.1.0.60100-64
- amdgpu-core 1:6.1.60100-1741643
-
libya-dev 2.7.0-2 and 2.14.0-1
-
mesa-amdgpu-va-drivers 1:24.1.0
-
FFmpeg 4.2.7 and 4.4.2-0
-
rocDecode setup script v1.8.0