diff --git a/.wordlist.txt b/.wordlist.txt
index c4d8fee75..a9b3464a9 100644
--- a/.wordlist.txt
+++ b/.wordlist.txt
@@ -288,6 +288,7 @@ PCC
 PCI
 PCIe
 PEFT
+PEQT
 PIL
 PILImage
 POR
@@ -341,6 +342,7 @@ RelWithDebInfo
 Req
 Rickle
 RoCE
+Runfile
 Ryzen
 SALU
 SBIOS
diff --git a/RELEASE.md b/RELEASE.md
index f7ca8853c..002bd58f8 100644
--- a/RELEASE.md
+++ b/RELEASE.md
@@ -74,12 +74,12 @@ ROCm documentation continues to be updated to provide clearer and more comprehen
 
 ## Operating system and hardware support changes
 
-ROCm 6.3.1 adds support for Debian 12 (kernel: 6.1). Debian is supported only on AMD Instinct accelerators. See the installation instructions at [Debian native installation](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/install/native-install/debian.html).
+ROCm 6.3.1 adds support for Debian 12 (kernel: 6.1). Debian is supported only on AMD Instinct accelerators. See the installation instructions at [Debian native installation](https://rocm.docs.amd.com/projects/install-on-linux/en/docs-6.3.1/install/native-install/debian.html).
 
 ROCm 6.3.1 enables support for AMD Instinct MI325X accelerator. For more information, see [AMD Instinct™ MI325X Accelerators](https://www.amd.com/en/products/accelerators/instinct/mi300/mi325x.html).
 
 See the [Compatibility
-matrix](https://rocm-stg.amd.com/en/latest/compatibility/compatibility-matrix.html)
+matrix](https://rocm.docs.amd.com/en/docs-6.3.1/compatibility/compatibility-matrix.html)
 for more information about operating system and hardware compatibility.
 
 ## ROCm components
@@ -107,47 +107,47 @@ Click {fab}`github` to go to the component's source code on GitHub.
Libraries Machine learning and computer vision - Composable Kernel + Composable Kernel 1.1.0 - MIGraphX + MIGraphX 2.11.0 - MIOpen + MIOpen 3.3.0 - MIVisionX + MIVisionX 3.1.0 ⇒ 3.1.0 - rocAL + rocAL 2.1.0 - rocDecode + rocDecode 0.8.0 - rocJPEG + rocJPEG 0.6.0 - rocPyDecode + rocPyDecode 0.2.0 - RPP + RPP 1.9.1 @@ -156,7 +156,7 @@ Click {fab}`github` to go to the component's source code on GitHub. Communication - RCCL + RCCL 2.21.5 ⇒ 2.21.5 @@ -165,82 +165,82 @@ Click {fab}`github` to go to the component's source code on GitHub. Math - hipBLAS + hipBLAS 2.3.0 - hipBLASLt + hipBLASLt 0.10.0 - hipFFT + hipFFT 1.0.17 - hipfort + hipfort 0.5.0 - hipRAND + hipRAND 2.11.1 - hipSOLVER + hipSOLVER 2.3.0 - hipSPARSE + hipSPARSE 3.1.2 - hipSPARSELt + hipSPARSELt 0.2.2 - rocALUTION + rocALUTION 3.2.1 - rocBLAS + rocBLAS 4.3.0 - rocFFT + rocFFT 1.0.31 - rocRAND + rocRAND 3.2.0 - rocSOLVER + rocSOLVER 3.27.0 - rocSPARSE + rocSPARSE 3.3.0 - rocWMMA + rocWMMA 1.6.0 - Tensile + Tensile 4.42.0 @@ -249,22 +249,22 @@ Click {fab}`github` to go to the component's source code on GitHub. Primitives - hipCUB + hipCUB 3.3.0 - hipTensor + hipTensor 1.4.0 - rocPRIM + rocPRIM 3.3.0 - rocThrust + rocThrust 3.3.0 @@ -273,27 +273,27 @@ Click {fab}`github` to go to the component's source code on GitHub. Tools System management - AMD SMI + AMD SMI 24.7.1 ⇒ 24.7.1 - ROCm Data Center Tool + ROCm Data Center Tool 0.3.0 - rocminfo + rocminfo 1.0.0 - ROCm SMI + ROCm SMI 7.4.0 - ROCmValidationSuite + ROCmValidationSuite 1.1.0 @@ -302,38 +302,38 @@ Click {fab}`github` to go to the component's source code on GitHub. Performance - ROCm Bandwidth + ROCm Bandwidth Test 1.4.0 - ROCm Compute Profiler + ROCm Compute Profiler 3.0.0 ⇒ 3.0.0 - ROCm Systems Profiler + ROCm Systems Profiler 0.1.0 ⇒ 0.1.0 - ROCProfiler + ROCProfiler 2.0.0 - ROCprofiler-SDK + ROCprofiler-SDK 0.5.0 - ROCTracer + ROCTracer 4.1.0 @@ -343,32 +343,32 @@ Click {fab}`github` to go to the component's source code on GitHub. 
Development - HIPIFY + HIPIFY 18.0.0 ⇒ 18.0.0 - ROCdbgapi + ROCdbgapi 0.77.0 - ROCm CMake + ROCm CMake 0.14.0 - ROCm Debugger (ROCgdb) + ROCm Debugger (ROCgdb) 15.2 - ROCr Debug Agent + ROCr Debug Agent 2.0.3 Compilers - HIPCC + HIPCC 1.1.1 - llvm-project + llvm-project 18.0.0 @@ -393,12 +393,12 @@ Click {fab}`github` to go to the component's source code on GitHub. Runtimes - HIP + HIP 6.3.0 ⇒ 6.3.1 - ROCr Runtime + ROCr Runtime 1.14.0 @@ -512,7 +512,7 @@ issues related to individual components, review the [Detailed component changes] ### PCI Express Qualification Tool failure on Debian 12 -The PCI Express Qualification Tool (PEQT) module present in the ROCm Validation Suite (RVS) might fail due to the segmentation issue in Debian 12 (bookworm). This will result in failure to determine the characteristics of the PCIe interconnect between the host platform and the GPU like support for Gen 3 atomic completers, DMA transfer statistics, link speed, and link width. The standard PCIe command `lspci` can be used as an alternative to view the characteristics of the PCIe bus interconnect with the GPU. This issue is under investigation and will be addressed in a future release. +The PCI Express Qualification Tool (PEQT) module present in the ROCm Validation Suite (RVS) might fail due to the segmentation issue in Debian 12 (bookworm). This will result in failure to determine the characteristics of the PCIe interconnect between the host platform and the GPU like support for Gen 3 atomic completers, DMA transfer statistics, link speed, and link width. The standard PCIe command `lspci` can be used as an alternative to view the characteristics of the PCIe bus interconnect with the GPU. This issue is under investigation and will be addressed in a future release. See [GitHub issue #4175](https://github.com/ROCm/ROCm/issues/4175). 
 ## ROCm resolved issues
diff --git a/tools/autotag/templates/highlights/6.3.1.md b/tools/autotag/templates/highlights/6.3.1.md
new file mode 100644
index 000000000..2f8ecc330
--- /dev/null
+++ b/tools/autotag/templates/highlights/6.3.1.md
@@ -0,0 +1,61 @@
+# ROCm 6.3.1 release notes
+
+The release notes provide a summary of notable changes since the previous ROCm release.
+
+- [Release highlights](#release-highlights)
+
+- [Operating system and hardware support changes](#operating-system-and-hardware-support-changes)
+
+- [ROCm components versioning](#rocm-components)
+
+- [Detailed component changes](#detailed-component-changes)
+
+- [ROCm known issues](#rocm-known-issues)
+
+- [ROCm resolved issues](#rocm-resolved-issues)
+
+- [ROCm upcoming changes](#rocm-upcoming-changes)
+
+```{note}
+If you’re using Radeon™ PRO or Radeon GPUs in a workstation setting with a
+display connected, continue to use ROCm 6.2.3. See the [Use ROCm on Radeon GPUs](https://rocm.docs.amd.com/projects/radeon/en/latest/index.html)
+documentation to verify compatibility and system requirements.
+```
+## Release highlights
+
+The following are notable new features and improvements in ROCm 6.3.1. For changes to individual components, see
+[Detailed component changes](#detailed-component-changes).
+
+### Per queue resiliency for Instinct MI300 accelerators
+
+The AMDGPU driver now includes enhanced resiliency for misbehaving applications on AMD Instinct MI300 accelerators. This helps isolate the impact of misbehaving applications, ensuring other workloads running on the same accelerator are unaffected.
+
+### ROCm Runfile Installer
+
+ROCm 6.3.1 introduces the ROCm Runfile Installer, with initial support for Ubuntu 22.04. The ROCm Runfile Installer facilitates ROCm installation without using a native Linux package management system, with or without network or internet access. For more information, see the [ROCm Runfile Installer documentation](https://rocm.docs.amd.com/projects/install-on-linux/en/docs-6.3.1/install/rocm-runfile-installer.html).
+
+### ROCm documentation updates
+
+ROCm documentation continues to be updated to provide clearer and more comprehensive guidance for a wider variety of user needs and use cases.
+
+* Added documentation on training a model with ROCm Megatron-LM. AMD offers a Docker image for MI300X accelerators
+  containing essential components to get started, including ROCm libraries, PyTorch, and Megatron-LM utilities. See
+  [Training a model using ROCm Megatron-LM](https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/train-a-model.html)
+  to get started.
+
+  The new ROCm Megatron-LM training Docker accompanies the [ROCm vLLM inference
+  Docker](https://rocm.docs.amd.com/en/latest/how-to/performance-validation/mi300x/vllm-benchmark.html)
+  as a set of ready-to-use containerized solutions to get started with using ROCm
+  for AI.
+
+* Updated the [Instinct MI300X workload tuning
+  guide](https://rocm.docs.amd.com/en/latest/how-to/tuning-guides/mi300x/workload.html) with more current optimization
+  strategies. The updated sections include guidance on vLLM optimization, PyTorch TunableOp, and hipBLASLt tuning.
+
+* HIP graph-safe libraries operate safely in HIP execution graphs. [HIP graphs](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hip_runtime_api/hipgraph.html#how-to-hip-graph) are an alternative way of executing tasks on a GPU that can provide performance benefits over launching kernels using the standard method via streams. A topic that shows whether a [ROCm library is graph-safe](https://advanced-micro-devices-demo--3953.com.readthedocs.build/en/3953/reference/graph-safe-support.html) has been added.
+
+* The [Device memory](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hip_runtime_api/memory_management/device_memory.html) topic in the HIP memory management section has been updated.
+
+* The HIP documentation has expanded with new resources for developers:
+  * [Multi device management](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hip_runtime_api/multi_device.html)
+  * [OpenGL interoperability](https://rocm.docs.amd.com/projects/HIP/en/latest/how-to/hip_runtime_api/opengl_interop.html)
diff --git a/tools/autotag/templates/known_issues/6.3.1.md b/tools/autotag/templates/known_issues/6.3.1.md
new file mode 100644
index 000000000..10b93c2b9
--- /dev/null
+++ b/tools/autotag/templates/known_issues/6.3.1.md
@@ -0,0 +1,8 @@
+## ROCm known issues
+
+ROCm known issues are noted on {fab}`github` [GitHub](https://github.com/ROCm/ROCm/labels/Verified%20Issue). For known
+issues related to individual components, review the [Detailed component changes](#detailed-component-changes).
+
+### PCI Express Qualification Tool failure on Debian 12
+
+The PCI Express Qualification Tool (PEQT) module present in the ROCm Validation Suite (RVS) might fail due to the segmentation issue in Debian 12 (bookworm). This will result in failure to determine the characteristics of the PCIe interconnect between the host platform and the GPU like support for Gen 3 atomic completers, DMA transfer statistics, link speed, and link width. The standard PCIe command `lspci` can be used as an alternative to view the characteristics of the PCIe bus interconnect with the GPU. This issue is under investigation and will be addressed in a future release. See [GitHub issue #4175](https://github.com/ROCm/ROCm/issues/4175).
\ No newline at end of file
diff --git a/tools/autotag/templates/resolved_issues/6.3.1.md b/tools/autotag/templates/resolved_issues/6.3.1.md
new file mode 100644
index 000000000..c67d597a8
--- /dev/null
+++ b/tools/autotag/templates/resolved_issues/6.3.1.md
@@ -0,0 +1,20 @@
+## ROCm resolved issues
+
+The following are previously known issues resolved in this release. For resolved issues related to
+individual components, review the [Detailed component changes](#detailed-component-changes).
+
+### Instinct MI300 series: backward weights convolution performance issue
+
+Fixed a performance issue affecting certain tensor shapes during backward weights convolution when using FP16 or FP32 data types on Instinct MI300 series accelerators. See [GitHub issue #4080](https://github.com/ROCm/ROCm/issues/4080).
+
+### ROCm Compute Profiler and ROCm Systems Profiler post-upgrade issues
+
+Packaging metadata for ROCm Compute Profiler (`rocprofiler-compute`) and ROCm Systems Profiler
+(`rocprofiler-systems`) has been updated to handle the renaming from Omniperf and Omnitrace,
+respectively. This fixes minor issues when upgrading from ROCm 6.2 to 6.3. For more information, see the GitHub issues
+[#4082](https://github.com/ROCm/ROCm/issues/4082) and
+[#4083](https://github.com/ROCm/ROCm/issues/4083).
+
+### Stale file due to OpenCL ICD loader deprecation
+
+When upgrading from ROCm 6.2.x to ROCm 6.3.0, the issue of [removal of the `rocm-icd-loader` package](#opencl-icd-loader-separated-from-rocm) leaving a stale file in the old `rocm-6.2.x` directory has been resolved. The stale files left during the upgrade from ROCm 6.2.x to ROCm 6.3.0 will be removed when upgrading to ROCm 6.3.1. For more information, see [GitHub issue #4084](https://github.com/ROCm/ROCm/issues/4084).
\ No newline at end of file
diff --git a/tools/autotag/templates/support/6.3.1.md b/tools/autotag/templates/support/6.3.1.md
new file mode 100644
index 000000000..ea458ca45
--- /dev/null
+++ b/tools/autotag/templates/support/6.3.1.md
@@ -0,0 +1,9 @@
+## Operating system and hardware support changes
+
+ROCm 6.3.1 adds support for Debian 12 (kernel: 6.1). Debian is supported only on AMD Instinct accelerators. See the installation instructions at [Debian native installation](https://rocm.docs.amd.com/projects/install-on-linux/en/docs-6.3.1/install/native-install/debian.html).
+
+ROCm 6.3.1 enables support for AMD Instinct MI325X accelerator. For more information, see [AMD Instinct™ MI325X Accelerators](https://www.amd.com/en/products/accelerators/instinct/mi300/mi325x.html).
+
+See the [Compatibility
+matrix](https://rocm.docs.amd.com/en/docs-6.3.1/compatibility/compatibility-matrix.html)
+for more information about operating system and hardware compatibility.
\ No newline at end of file
diff --git a/tools/autotag/templates/upcoming_changes/6.3.1.md b/tools/autotag/templates/upcoming_changes/6.3.1.md
new file mode 100644
index 000000000..2a85d9fd1
--- /dev/null
+++ b/tools/autotag/templates/upcoming_changes/6.3.1.md
@@ -0,0 +1,13 @@
+## ROCm upcoming changes
+
+The following changes to the ROCm software stack are anticipated for future releases.
+
+### AMDGPU wavefront size compiler macro deprecation
+
+The `__AMDGCN_WAVEFRONT_SIZE__` macro will be deprecated in an upcoming
+release. It is recommended to remove any use of this macro. For more information, see [AMDGPU
+support](https://rocm.docs.amd.com/projects/llvm-project/en/docs-6.3.1/LLVM/clang/html/AMDGPUSupport.html).
+
+### HIPCC Perl scripts deprecation
+
+The HIPCC Perl scripts (`hipcc.pl` and `hipconfig.pl`) will be removed in an upcoming release.
\ No newline at end of file
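
The upcoming changes in this diff ask users to stop using `__AMDGCN_WAVEFRONT_SIZE__` and warn that `hipcc.pl` and `hipconfig.pl` will be removed. One way a project might audit itself ahead of those releases is a small scan for the three names. The following is only a sketch and is not part of ROCm or of this PR: the function name `find_deprecated_uses` and the set of scanned file suffixes are assumptions chosen for illustration, and the script reports matches without suggesting replacements.

```python
import re
from pathlib import Path

# Names taken from the "ROCm upcoming changes" notes in this diff.
DEPRECATED_PATTERNS = {
    "__AMDGCN_WAVEFRONT_SIZE__": re.compile(r"\b__AMDGCN_WAVEFRONT_SIZE__\b"),
    "hipcc.pl": re.compile(r"\bhipcc\.pl\b"),
    "hipconfig.pl": re.compile(r"\bhipconfig\.pl\b"),
}

# Assumption: a hypothetical choice of suffixes to scan; adjust for your project.
SCANNED_SUFFIXES = {
    ".c", ".cpp", ".h", ".hpp", ".hip", ".cu", ".cmake", ".txt", ".sh", ".mk",
}


def find_deprecated_uses(root):
    """Return sorted (path, line_number, name) tuples for each match under root."""
    hits = []
    for path in Path(root).rglob("*"):
        if not path.is_file() or path.suffix not in SCANNED_SUFFIXES:
            continue
        # Replace undecodable bytes so one odd file does not abort the scan.
        text = path.read_text(encoding="utf-8", errors="replace")
        for line_number, line in enumerate(text.splitlines(), start=1):
            for name, pattern in DEPRECATED_PATTERNS.items():
                if pattern.search(line):
                    hits.append((str(path), line_number, name))
    return sorted(hits)


if __name__ == "__main__":
    # Scan the current directory and print one line per match.
    for file_path, line_number, name in find_deprecated_uses("."):
        print(f"{file_path}:{line_number}: uses {name}")
```

Run from the root of a source tree, the script prints each file and line that still references one of the three names. Files without a listed suffix (for example, a bare `Makefile`) are skipped under the assumed suffix set, so extend it as needed.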