Files
ROCm/RELEASE.md
Peter Park 9679a84a8b Add components, known issues, and fixed issues to 6.1.2 RN / CL (#87)
* Regenerate changelog

* Add component changelogs and known issue

Fix RELEASE.md headings

Update pub datestamp for 6.1.2

Add AMDSMI and ROCm SMI to 6.1.2 template

Add rccl and rocBLAS

Update intro blurb and headings

Add ROCm SMI fix

Add missed heading to AMDSMI

Update datestamp and release version number

Update version and release number

Add known issue re: MI300X error detection

Words

Add issue link

Rm GitHub issue link

Move known issue down

Update ki wording

Remove "this issue has been investigated ... " from known issue

Fix changelog h1
2024-06-03 08:51:38 -04:00

6.8 KiB

ROCm 6.1.2 release notes

ROCm 6.1.2 includes enhancements to SMI tools and improvements to some libraries.

AMD SMI

AMD SMI for ROCm 6.1.2

Additions

  • Added process isolation and clean shader APIs and CLI commands.
    • amdsmi_get_gpu_process_isolation()
    • amdsmi_set_gpu_process_isolation()
    • amdsmi_set_gpu_clear_sram_data()
  • Added the MIN_POWER metric to output provided by amd-smi static --limit.

Optimizations

  • Updated the amd-smi monitor --pcie output to prevent delays with the monitor command.

Changes

  • Updated amismi_get_power_cap_info to return values in uW instead of W.
  • Updated Python library return types for amdsmi_get_gpu_memory_reserved_pages and amdsmi_get_gpu_bad_page_info.
  • Updated the output of amd-smi metric --ecc-blocks to show counters available from blocks.

Fixes

  • amdsmi_get_gpu_board_info() no longer returns junk character strings.
  • amd-smi metric --power now correctly details power output for RDNA3, RDNA2, and MI1x devices.
  • Fixed the amdsmitstReadWrite.TestPowerCapReadWrite test for RDNA3, RDNA2, and MI100 devices.
  • Fixed an issue with the amdsmi_get_gpu_memory_reserved_pages and amdsmi_get_gpu_bad_page_info Python interface calls.

Removals

  • Removed the amdsmi_get_gpu_process_info API from the Python library. It was removed from the C library in an earlier release.
See the AMD SMI [detailed changelog](https://github.com/ROCm/amdsmi/blob/rocm-6.1.x/CHANGELOG.md) with code samples for more information.

HIPCC

HIPCC for ROCm 6.1.2

Changes

  • Upcoming: a future release will enable use of compiled binaries hipcc.bin and hipconfig.bin by default. No action is needed by users; you may continue calling high-level Perl scripts hipcc and hipconfig. hipcc.bin and hipconfig.bin will be invoked by the high-level Perl scripts. To revert to the previous behavior and invoke hipcc.pl and hipconfig.pl, set the HIP_USE_PERL_SCRIPTS environment variable to 1.
  • Upcoming: a subsequent release will remove high-level Perl scripts hipcc and hipconfig. This release will remove the HIP_USE_PERL_SCRIPTS environment variable. It will rename hipcc.bin and hipconfig.bin to hipcc and hipconfig respectively. No action is needed by the users. To revert to the previous behavior, invoke hipcc.pl and hipconfig.pl explicitly.
  • Upcoming: a subsequent release will remove hipcc.pl and hipconfig.pl.

ROCm SMI

ROCm SMI for ROCm 6.1.2

Additions

  • Added the ring hang event to the amdsmi_evt_notification_type_t enum.

Fixes

  • Fixed an issue causing ROCm SMI to incorrectly report GPU utilization for RDNA3 GPUs.
  • Fixed the parsing of pp_od_clk_voltage in get_od_clk_volt_info to work better with MI-series hardware.

Known issue with error detection on MI300X

During poison consumption testing, the injection of uncorrectable errors will not generate an interrupt to the driver, resulting in undetected errors. This can result in reliability and recovery issues on MI300X accelerator-based setups.

Library changes in ROCm 6.1.2

Library Version
AMDMIGraphX 2.9
composable_kernel 0.2.0
hipBLAS 2.1.0
hipBLASLt 0.7.0
hipCUB 3.1.0
hipFFT 1.0.14
hipRAND 2.10.17
hipSOLVER 2.1.1
hipSPARSE 3.0.1
hipSPARSELt 0.2.0
hipTensor 1.2.0
MIOpen 3.1.0
MIVisionX 2.5.0
rccl 2.18.6
rocALUTION 3.1.1
rocBLAS 4.1.0 ⇒ 4.1.2
rocDecode 0.5.0 ⇒ 0.6.0
rocFFT 1.0.27
rocm-cmake 0.12.0
rocPRIM 3.1.0
rocRAND 3.0.1
rocSOLVER 3.25.0
rocSPARSE 3.1.2
rocThrust 3.0.1
rocWMMA 1.4.0
rpp 1.5.0
Tensile 4.40.0

RCCL

RCCL 2.18.6 for ROCm 6.1.2

Changes

  • Reduced NCCL_TOPO_MAX_NODES to limit stack usage and avoid stack overflow.

rocBLAS

rocBLAS 4.1.2 for ROCm 6.1.2

Optimizations

  • Tuned BBS TN and TT operations on the CDNA3 architecture.

Fixes

  • Fixed an issue related to obtaining solutions for BF16 TT operations.

rocDecode

rocDecode 0.6.0 for ROCm 6.1.2

Additions

  • Added support for FFmpeg v5.x.

Optimizations

  • Updated error checking in the rocDecode-setup.py script.

Changes

  • Updated core dependencies.
  • Updated to support the use of public LibVA headers.

Fixes

  • Fixed some package dependencies.

Tested configurations

  • Linux

    • Ubuntu 20.04 and 22.04
    • RHEL 8 and 9
  • ROCm

    • rocm-core 6.1.0.60100-64
    • amdgpu-core 1:6.1.60100-1741643
  • libya-dev 2.7.0-2 and 2.14.0-1

  • mesa-amdgpu-va-drivers 1:24.1.0

  • FFmpeg 4.2.7 and 4.4.2-0

  • rocDecode setup script v1.8.0