# ROCm 6.4.1 release notes The release notes provide a summary of notable changes since the previous ROCm release. - [Release highlights](#release-highlights) - [Operating system and hardware support changes](#operating-system-and-hardware-support-changes) - [ROCm components versioning](#rocm-components) - [Detailed component changes](#detailed-component-changes) - [ROCm known issues](#rocm-known-issues) - [ROCm upcoming changes](#rocm-upcoming-changes) ```{note} If you’re using Radeon™ PRO or Radeon GPUs in a workstation setting with a display connected, see the [Use ROCm on Radeon GPUs](https://rocm.docs.amd.com/projects/radeon/en/latest/docs/compatibility/native_linux/native_linux_compatibility.html) documentation to verify compatibility and system requirements. ``` ## Release highlights The following are notable new features and improvements in ROCm 6.4.1. For changes to individual components, see [Detailed component changes](#detailed-component-changes). ### Addition of DPX partition mode under NPS2 memory mode AMD Instinct MI300X now supports DPX partition mode under NPS2 memory mode. For more partitioning information, see the [Deep dive into the MI300 compute and memory partition modes](https://rocm.blogs.amd.com/software-tools-optimization/compute-memory-modes/README.html) blog and [AMD Instinct MI300X system optimization](https://instinct.docs.amd.com/projects/amdgpu-docs/en/latest/system-optimization/mi300x.html#change-gpu-partition-modes). ### Introducing the ROCm Data Science toolkit The ROCm Data Science toolkit (or ROCm-DS) is an open-source software collection for high-performance data science applications built on the core ROCm platform. You can leverage ROCm-DS to accelerate both new and existing data science workloads, allowing you to execute intensive applications with larger datasets at lightning speed. ROCm-DS is in an early access state. Running production workloads is not recommended. For more information, see [AMD ROCm-DS Documentation](https://rocm.docs.amd.com/projects/rocm-ds/en/latest/index.html). ### ROCm Offline Installer Creator updates The ROCm Offline Installer Creator 6.4.1 now allows you to use the SPACEBAR or ENTER keys for menu item selection in the GUI. It also adds support for Debian 12 and fixes an issue for “full” mode RHEL offline installer creation, where GDM packages were uninstalled during offline installation. See [ROCm Offline Installer Creator](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/install/rocm-offline-installer.html) for more information. ### ROCm Runfile Installer updates The ROCm Runfile Installer 6.4.1 adds the following improvements: - Relaxed version checks for installation on different distributions. Provided the dependencies are not installed by the Runfile Installer, you can target installation for a different path from the host system running the installer. For example, the installer can run on a system using Ubuntu 22.04 and install to a partition/system that is using Ubuntu 24.04. - Performance improvements for detecting a previous ROCm install. - Removal of the extra `opt` directory created for the target during the ROCm installation. For example, installing to `target=/home/amd` now installs ROCm to `/home/amd/rocm-6.4.1` and not `/home/amd/opt/rocm-6.4.1`. For installs using `target=/`, the installation will continue to use `/opt/`. - The Runfile Installer can be used to uninstall any Runfile-based installation of the driver. - In the CLI interface, the `postrocm` argument can now be run separately from the `rocm` argument. In cases where `postrocm` was missed from the initial ROCm install, `postrocm` can now be run on the same target folder. For example, if you installed ROCm 6.4.1 using `install.run target=/myrocm rocm`, you can run the post-installation separately using the command `install.run target=/myrocm/rocm-6.4.1 postrocm`. For more information, see [ROCm Runfile Installer](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/install/rocm-runfile-installer.html). ### ROCm documentation updates ROCm documentation continues to be updated to provide clearer and more comprehensive guidance for a wider variety of user needs and use cases. * [Tutorials for AI developers](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/) have been expanded with five new tutorials. These tutorials are Jupyter notebook-based, easy-to-follow documents. They are ideal for AI developers who want to learn about specific topics, including inference, fine-tuning, and training. For more information about the changes, see [Changelog for the AI Developer Hub](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/changelog.html). * The [Training a model with LLM Foundry](https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/training/benchmark-docker/mpt-llm-foundry.html) performance testing guide has been added. This guide describes how to use the preconfigured [ROCm/pytorch-training](https://hub.docker.com/layers/rocm/pytorch-training/v25.5/images/sha256-d47850a9b25b4a7151f796a8d24d55ea17bba545573f0d50d54d3852f96ecde5) training environment and [https://github.com/ROCm/MAD](https://github.com/ROCm/MAD) to test the training performance of the LLM Foundry framework on AMD Instinct MI325X and MI300X accelerators using the [MPT-30B](https://huggingface.co/mosaicml/mpt-30b) model. * The [Training a model with PyTorch](https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/training/benchmark-docker/pytorch-training.html) performance testing guide has been updated to feature the latest [ROCm/pytorch-training](https://hub.docker.com/layers/rocm/pytorch-training/v25.5/images/sha256-d47850a9b25b4a7151f796a8d24d55ea17bba545573f0d50d54d3852f96ecde5) Docker image (a preconfigured training environment with ROCm and PyTorch). Support for [Llama 3.3 70B](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) has been added. * The [Training a model with JAX MaxText](https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/training/benchmark-docker/jax-maxtext.html) performance testing guide has been updated to feature the latest [ROCm/jax-training](https://hub.docker.com/layers/rocm/jax-training/maxtext-v25.5/images/sha256-4e0516358a227cae8f552fb866ec07e2edcf244756f02e7b40212abfbab5217b) Docker image (a preconfigured training environment with ROCm, JAX, and [MaxText](https://github.com/AI-Hypercomputer/maxtext)). Support for [Llama 3.3 70B](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) has been added. * The [vLLM inference performance testing](https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/inference/vllm-benchmark.html?model=pyt_vllm_qwq-32b) guide has been updated to feature the latest [ROCm/vLLM](https://hub.docker.com/layers/rocm/vllm/latest/images/sha256-5c8b4436dd0464119d9df2b44c745fadf81512f18ffb2f4b5dc235c71ebe26b4) Docker image (a preconfigured environment for inference with ROCm and [vLLM](https://docs.vllm.ai/en/latest/)). Support for the [QwQ-32B](https://huggingface.co/Qwen/QwQ-32B) model has been added. * The [PyTorch inference performance testing](https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/inference/pytorch-inference-benchmark.html?model=pyt_clip_inference) guide has been added, featuring the [ROCm/PyTorch](https://hub.docker.com/layers/rocm/pytorch/latest/images/sha256-ab1d350b818b90123cfda31363019d11c0d41a8f12a19e3cb2cb40cf0261137d) Docker image (a preconfigured inference environment with ROCm and PyTorch) with initial support for the [CLIP](https://huggingface.co/laion/CLIP-ViT-B-32-laion2B-s34B-b79K) and [Chai-1](https://huggingface.co/chaidiscovery/chai-1) models. ## Operating system and hardware support changes ROCm 6.4.1 introduces support for the RDNA4 architecture-based [Radeon AI PRO R9700](https://www.amd.com/en/products/graphics/workstations/radeon-ai-pro/ai-9000-series/amd-radeon-ai-pro-r9700.html), [Radeon RX 9070 XT](https://www.amd.com/en/products/graphics/desktops/radeon/9000-series/amd-radeon-rx-9070xt.html), and [Radeon RX 9060 XT](https://www.amd.com/en/products/graphics/desktops/radeon/9000-series/amd-radeon-rx-9060xt.html) GPUs for compute workloads. Currently, these GPUs are only supported on Ubuntu 24.04.2, Ubuntu 22.04.5, RHEL 9.5, and RHEL 9.4. For details, see the full list of [Supported GPUs (Linux)](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/system-requirements.html#supported-gpus). Operating system support remains unchanged in this release. See the [Compatibility matrix](../../docs/compatibility/compatibility-matrix.rst) for more information about operating system and hardware compatibility. ## ROCm components The following table lists the versions of ROCm components for ROCm 6.4.1, including any version changes from 6.4.0 to 6.4.1. Click the component's updated version to go to a list of its changes. Click {fab}`github` to go to the component's source code on GitHub.
| Category | Group | Name | Version | |
|---|---|---|---|---|
| Libraries | Machine learning and computer vision | Composable Kernel | 1.1.0 | |
| MIGraphX | 2.12.0 | |||
| MIOpen | 3.4.0 | |||
| MIVisionX | 3.2.0 | |||
| rocAL | 2.2.0 | |||
| rocDecode | 0.10.0 | |||
| rocJPEG | 0.8.0 | |||
| rocPyDecode | 0.3.1 | |||
| RPP | 1.9.10 | |||
| Communication | RCCL | 2.22.3 ⇒ 2.22.3 | ||
| rocSHMEM | 2.0.0 | |||
| Math | hipBLAS | 2.4.0 | ||
| hipBLASLt | 0.12.0 ⇒ 0.12.1 | |||
| hipFFT | 1.0.18 | |||
| hipfort | 0.6.0 | |||
| hipRAND | 2.12.0 | |||
| hipSOLVER | 2.4.0 | |||
| hipSPARSE | 3.2.0 | |||
| hipSPARSELt | 0.2.3 | |||
| rocALUTION | 3.2.2 ⇒ 3.2.3 | |||
| rocBLAS | 4.4.0 | |||
| rocFFT | 1.0.32 | |||
| rocRAND | 3.3.0 | |||
| rocSOLVER | 3.28.0 | |||
| rocSPARSE | 3.4.0 | |||
| rocWMMA | 1.7.0 | |||
| Tensile | 4.43.0 | |||
| Primitives | hipCUB | 3.4.0 | ||
| hipTensor | 1.5.0 | |||
| rocPRIM | 3.4.0 | |||
| rocThrust | 3.3.0 | |||
| Tools | System management | AMD SMI | 25.3.0 ⇒ 25.4.2 | |
| ROCm Data Center Tool | 0.3.0 ⇒ 0.3.0 | |||
| rocminfo | 1.0.0 | |||
| ROCm SMI | 7.5.0 ⇒ 7.5.0 | |||
| ROCmValidationSuite | 1.1.0 | |||
| Performance | ROCm Bandwidth Test | 1.4.0 | ||
| ROCm Compute Profiler | 3.1.0 | |||
| ROCm Systems Profiler | 1.0.0 ⇒ 1.0.1 | |||
| ROCProfiler | 2.0.0 | |||
| ROCprofiler-SDK | 0.6.0 | |||
| ROCTracer | 4.1.0 | |||
| Development | HIPIFY | 19.0.0 | ||
| ROCdbgapi | 0.77.2 | |||
| ROCm CMake | 0.14.0 | |||
| ROCm Debugger (ROCgdb) | 15.2 | |||
| ROCr Debug Agent | 2.0.4 | |||
| Compilers | HIPCC | 1.1.1 | ||
| llvm-project | 19.0.0 | |||
| Runtimes | HIP | 6.4.0 ⇒ 6.4.1 | ||
| ROCr Runtime | 1.15.0 ⇒ 1.15.0 | |||