mirror of
https://github.com/ROCm/ROCm.git
synced 2026-01-09 14:48:06 -05:00
499 lines
28 KiB
Markdown
499 lines
28 KiB
Markdown
<!-- Do not edit this file! -->
|
||
<!-- This file is autogenerated with -->
|
||
<!-- tools/autotag/tag_script.py -->
|
||
<!-- Disable lints since this is an auto-generated file. -->
|
||
<!-- markdownlint-disable blanks-around-headers -->
|
||
<!-- markdownlint-disable no-duplicate-header -->
|
||
<!-- markdownlint-disable no-blanks-blockquote -->
|
||
<!-- markdownlint-disable ul-indent -->
|
||
<!-- markdownlint-disable no-trailing-spaces -->
|
||
<!-- markdownlint-disable reference-links-images -->
|
||
<!-- markdownlint-disable no-missing-space-atx -->
|
||
<!-- spellcheck-disable -->
|
||
# ROCm 6.4.3 release notes
|
||
|
||
The release notes provide a summary of notable changes since the previous ROCm release.
|
||
|
||
- [Release highlights](#release-highlights)
|
||
|
||
- [Operating system and hardware support changes](#operating-system-and-hardware-support-changes)
|
||
|
||
- [ROCm components versioning](#rocm-components)
|
||
|
||
- [Detailed component changes](#detailed-component-changes)
|
||
|
||
- [ROCm known issues](#rocm-known-issues)
|
||
|
||
- [ROCm upcoming changes](#rocm-upcoming-changes)
|
||
|
||
```{note}
|
||
If you’re using AMD Radeon™ PRO or Radeon GPUs in a workstation setting with a display connected, see the [Use ROCm on Radeon GPUs](https://rocm.docs.amd.com/projects/radeon/en/latest/docs/compatibility/native_linux/native_linux_compatibility.html)
|
||
documentation to verify compatibility and system requirements.
|
||
```
|
||
|
||
## Release highlights
|
||
|
||
ROCm 6.4.3 is a quality release that resolves the following issues. For changes to individual components, see [Detailed component changes](#detailed-component-changes).
|
||
|
||
### AMDGPU driver updates
|
||
|
||
* Resolved an issue causing performance degradation in communication operations, caused by increased latency in certain RCCL applications. The fix prevents unnecessary queue eviction during the fork process.
|
||
* Fixed an issue in the AMDGPU driver’s scheduler constraints that could cause queue preemption to fail during workload execution.
|
||
|
||
### ROCm SMI update
|
||
* Fixed the failure to load GPU data like System Clock (SCLK) by adjusting the logic for retrieving GPU board voltage.
|
||
|
||
### ROCm documentation updates
|
||
|
||
ROCm documentation continues to be updated to provide clearer and more comprehensive guidance for a wider variety of user needs and use cases.
|
||
|
||
* [Tutorials for AI developers](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/) have been expanded with the following five new tutorials:
|
||
* Inference tutorials
|
||
* [ChatQnA vLLM deployment and performance evaluation](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/notebooks/inference/opea_deployment_and_evaluation.html)
|
||
* [Text-to-video generation with ComfyUI](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/notebooks/inference/t2v_comfyui_radeon.html)
|
||
* [DeepSeek Janus Pro on CPU or GPU](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/notebooks/inference/deepseek_janus_cpu_gpu.html)
|
||
* [DeepSeek-R1 with vLLM V1](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/notebooks/inference/vllm_v1_DSR1.html)
|
||
* GPU development and optimization tutorial: [MLA decoding kernel of AITER library](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/notebooks/gpu_dev_optimize/aiter_mla_decode_kernel.html)
|
||
|
||
For more information about the changes, see [Changelog for the AI Developer Hub](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/changelog.html).
|
||
|
||
* ROCm provides a comprehensive ecosystem for deep learning development. For more details, see [Deep learning frameworks for ROCm](https://rocm.docs.amd.com/en/docs-6.4.3/how-to/deep-learning-rocm.html). AMD ROCm adds support for the following deep learning framework:
|
||
|
||
* Megablocks is a light-weight library for mixture-of-experts (MoE) training. The core of the system is efficient "dropless-MoE" and standard MoE layers. Megablocks is integrated with Megatron-LM, where data and pipeline parallel training of MoEs is supported. It is currently supported on ROCm 6.3.0. For more information, see [Megablocks compatibility](https://rocm.docs.amd.com/en/docs-6.4.3/compatibility/ml-compatibility/megablocks-compatibility.html).
|
||
|
||
* The [Data types and precision support](https://rocm.docs.amd.com/en/latest/reference/precision-support.html) topic now includes new hardware and library support information.
|
||
|
||
## Operating system and hardware support changes
|
||
|
||
Operating system and hardware support remain unchanged in this release.
|
||
|
||
See the [Compatibility
|
||
matrix](../../docs/compatibility/compatibility-matrix.rst)
|
||
for more information about operating system and hardware compatibility.
|
||
|
||
## ROCm components
|
||
|
||
The following table lists the versions of ROCm components for ROCm 6.4.3.
|
||
Click {fab}`github` to go to the component's source code on GitHub.
|
||
|
||
<div class="pst-scrollable-table-container">
|
||
<table id="rocm-rn-components" class="table">
|
||
<thead>
|
||
<tr>
|
||
<th>Category</th>
|
||
<th>Group</th>
|
||
<th>Name</th>
|
||
<th>Version</th>
|
||
<th></th>
|
||
</tr>
|
||
</thead>
|
||
<colgroup>
|
||
<col span="1">
|
||
<col span="1">
|
||
</colgroup>
|
||
<tbody class="rocm-components-libs rocm-components-ml">
|
||
<tr>
|
||
<th rowspan="9">Libraries</th>
|
||
<th rowspan="9">Machine learning and computer vision</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/composable_kernel/en/docs-6.4.3/index.html">Composable Kernel</a></td>
|
||
<td>1.1.0</td>
|
||
<td><a href="https://github.com/ROCm/composable_kernel"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/AMDMIGraphX/en/docs-6.4.3/index.html">MIGraphX</a></td>
|
||
<td>2.12.0</td>
|
||
<td><a href="https://github.com/ROCm/AMDMIGraphX"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/MIOpen/en/docs-6.4.3/index.html">MIOpen</a></td>
|
||
<td>3.4.0</td>
|
||
<td><a href="https://github.com/ROCm/MIOpen"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/MIVisionX/en/docs-6.4.3/index.html">MIVisionX</a></td>
|
||
<td>3.2.0</td>
|
||
<td><a href="https://github.com/ROCm/MIVisionX"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocAL/en/docs-6.4.3/index.html">rocAL</a></td>
|
||
<td>2.2.0</td>
|
||
<td><a href="https://github.com/ROCm/rocAL"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocDecode/en/docs-6.4.3/index.html">rocDecode</a></td>
|
||
<td>0.10.0</td>
|
||
<td><a href="https://github.com/ROCm/rocDecode"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocJPEG/en/docs-6.4.3/index.html">rocJPEG</a></td>
|
||
<td>0.8.0</td>
|
||
<td><a href="https://github.com/ROCm/rocJPEG"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocPyDecode/en/docs-6.4.3/index.html">rocPyDecode</a></td>
|
||
<td>0.3.1</td>
|
||
<td><a href="https://github.com/ROCm/rocPyDecode"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rpp/en/docs-6.4.3/index.html">RPP</a></td>
|
||
<td>1.9.10</td>
|
||
<td><a href="https://github.com/ROCm/rpp"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-libs rocm-components-communication tbody-reverse-zebra">
|
||
<tr>
|
||
<th rowspan="2"></th>
|
||
<th rowspan="2">Communication</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rccl/en/docs-6.4.3/index.html">RCCL</a></td>
|
||
<td>2.22.3</td>
|
||
<td><a href="https://github.com/ROCm/rccl"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocSHMEM/en/docs-6.4.3/index.html">rocSHMEM</a></td>
|
||
<td>2.0.1</td>
|
||
<td><a href="https://github.com/ROCm/rocSHMEM"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-libs rocm-components-math tbody-reverse-zebra">
|
||
<tr>
|
||
<th rowspan="16"></th>
|
||
<th rowspan="16">Math</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipBLAS/en/docs-6.4.3/index.html">hipBLAS</a></td>
|
||
<td>2.4.0</td>
|
||
<td><a href="https://github.com/ROCm/hipBLAS"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipBLASLt/en/docs-6.4.3/index.html">hipBLASLt</a></td>
|
||
<td>0.12.1</td>
|
||
<td><a href="https://github.com/ROCm/hipBLASLt"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipFFT/en/docs-6.4.3/index.html">hipFFT</a></td>
|
||
<td>1.0.18</td>
|
||
<td><a href="https://github.com/ROCm/hipFFT"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipfort/en/docs-6.4.3/index.html">hipfort</a></td>
|
||
<td>0.6.0</td>
|
||
<td><a href="https://github.com/ROCm/hipfort"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipRAND/en/docs-6.4.3/index.html">hipRAND</a></td>
|
||
<td>2.12.0</td>
|
||
<td><a href="https://github.com/ROCm/hipRAND"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipSOLVER/en/docs-6.4.3/index.html">hipSOLVER</a></td>
|
||
<td>2.4.0</td>
|
||
<td><a href="https://github.com/ROCm/hipSOLVER"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipSPARSE/en/docs-6.4.3/index.html">hipSPARSE</a></td>
|
||
<td>3.2.0</td>
|
||
<td><a href="https://github.com/ROCm/hipSPARSE"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipSPARSELt/en/docs-6.4.3/index.html">hipSPARSELt</a></td>
|
||
<td>0.2.3</td>
|
||
<td><a href="https://github.com/ROCm/hipSPARSELt"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocALUTION/en/docs-6.4.3/index.html">rocALUTION</a></td>
|
||
<td>3.2.3</td>
|
||
<td><a href="https://github.com/ROCm/rocALUTION"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocBLAS/en/docs-6.4.3/index.html">rocBLAS</a></td>
|
||
<td>4.4.1</td></td>
|
||
<td><a href="https://github.com/ROCm/rocBLAS"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocFFT/en/docs-6.4.3/index.html">rocFFT</a></td>
|
||
<td>1.0.32</td>
|
||
<td><a href="https://github.com/ROCm/rocFFT"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocRAND/en/docs-6.4.3/index.html">rocRAND</a></td>
|
||
<td>3.3.0</td>
|
||
<td><a href="https://github.com/ROCm/rocRAND"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocSOLVER/en/docs-6.4.3/index.html">rocSOLVER</a></td>
|
||
<td>3.28.2</td>
|
||
<td><a href="https://github.com/ROCm/rocSOLVER"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocSPARSE/en/docs-6.4.3/index.html">rocSPARSE</a></td>
|
||
<td>3.4.0</td>
|
||
<td><a href="https://github.com/ROCm/rocSPARSE"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocWMMA/en/docs-6.4.3/index.html">rocWMMA</a></td>
|
||
<td>1.7.0</td>
|
||
<td><a href="https://github.com/ROCm/rocWMMA"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/Tensile/en/docs-6.4.3/src/index.html">Tensile</a></td>
|
||
<td>4.43.0</td>
|
||
<td><a href="https://github.com/ROCm/Tensile"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-libs rocm-components-primitives tbody-reverse-zebra">
|
||
<tr>
|
||
<th rowspan="4"></th>
|
||
<th rowspan="4">Primitives</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipCUB/en/docs-6.4.3/index.html">hipCUB</a></td>
|
||
<td>3.4.0</td>
|
||
<td><a href="https://github.com/ROCm/hipCUB"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/hipTensor/en/docs-6.4.3/index.html">hipTensor</a></td>
|
||
<td>1.5.0</td>
|
||
<td><a href="https://github.com/ROCm/hipTensor"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocPRIM/en/docs-6.4.3/index.html">rocPRIM</a></td>
|
||
<td>3.4.1</td>
|
||
<td><a href="https://github.com/ROCm/rocPRIM"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocThrust/en/docs-6.4.3/index.html">rocThrust</a></td>
|
||
<td>3.3.0</td>
|
||
<td><a href="https://github.com/ROCm/rocThrust"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-tools rocm-components-system tbody-reverse-zebra">
|
||
<tr>
|
||
<th rowspan="7">Tools</th>
|
||
<th rowspan="7">System management</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/amdsmi/en/docs-6.4.3/index.html">AMD SMI</a></td>
|
||
<td>25.5.1</a></td>
|
||
<td><a href="https://github.com/ROCm/amdsmi"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rdc/en/docs-6.4.3/index.html">ROCm Data Center Tool</a></td>
|
||
<td>0.3.0</td>
|
||
<td><a href="https://github.com/ROCm/rdc"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocminfo/en/docs-6.4.3/index.html">rocminfo</a></td>
|
||
<td>1.0.0</td>
|
||
<td><a href="https://github.com/ROCm/rocminfo"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocm_smi_lib/en/docs-6.4.3/index.html">ROCm SMI</a></td>
|
||
<td>7.5.0 ⇒ <a href="#rocm-smi-7-7-0">7.7.0</td>
|
||
<td><a href="https://github.com/ROCm/rocm_smi_lib"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/ROCmValidationSuite/en/docs-6.4.3/index.html">ROCm Validation Suite</a></td>
|
||
<td>1.1.0</td>
|
||
<td><a href="https://github.com/ROCm/ROCmValidationSuite"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-tools rocm-components-perf">
|
||
<tr>
|
||
<th rowspan="6"></th>
|
||
<th rowspan="6">Performance</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocm_bandwidth_test/en/docs-6.4.3/index.html">ROCm Bandwidth
|
||
Test</a></td>
|
||
<td>1.4.0</td>
|
||
<td><a href="https://github.com/ROCm/rocm_bandwidth_test/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocprofiler-compute/en/docs-6.4.3/index.html">ROCm Compute Profiler</a></td>
|
||
<td>3.1.1</td>
|
||
<td><a href="https://github.com/ROCm/rocprofiler-compute"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocprofiler-systems/en/docs-6.4.3/index.html">ROCm Systems Profiler</a></td>
|
||
<td>1.0.2</td>
|
||
<td><a href="https://github.com/ROCm/rocprofiler-systems"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocprofiler/en/docs-6.4.3/index.html">ROCProfiler</a></td>
|
||
<td>2.0.0</td>
|
||
<td><a href="https://github.com/ROCm/ROCProfiler/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocprofiler-sdk/en/docs-6.4.3/index.html">ROCprofiler-SDK</a></td>
|
||
<td>0.6.0</td>
|
||
<td><a href="https://github.com/ROCm/rocprofiler-sdk/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr >
|
||
<td><a href="https://rocm.docs.amd.com/projects/roctracer/en/docs-6.4.3/index.html">ROCTracer</a></td>
|
||
<td>4.1.0</td>
|
||
<td><a href="https://github.com/ROCm/ROCTracer/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-tools rocm-components-dev">
|
||
<tr>
|
||
<th rowspan="5"></th>
|
||
<th rowspan="5">Development</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/HIPIFY/en/docs-6.4.3/index.html">HIPIFY</a></td>
|
||
<td>19.0.0</td>
|
||
<td><a href="https://github.com/ROCm/HIPIFY/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/ROCdbgapi/en/docs-6.4.3/index.html">ROCdbgapi</a></td>
|
||
<td>0.77.2</td>
|
||
<td><a href="https://github.com/ROCm/ROCdbgapi/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/ROCmCMakeBuildTools/en/docs-6.4.3/index.html">ROCm CMake</a></td>
|
||
<td>0.14.0</td>
|
||
<td><a href="https://github.com/ROCm/rocm-cmake/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/ROCgdb/en/docs-6.4.3/index.html">ROCm Debugger (ROCgdb)</a>
|
||
</td>
|
||
<td>15.2</td>
|
||
<td><a href="https://github.com/ROCm/ROCgdb/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/rocr_debug_agent/en/docs-6.4.3/index.html">ROCr Debug Agent</a>
|
||
</td>
|
||
<td>2.0.4</td>
|
||
<td><a href="https://github.com/ROCm/rocr_debug_agent/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-compilers tbody-reverse-zebra">
|
||
<tr>
|
||
<th rowspan="2" colspan="2">Compilers</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/HIPCC/en/docs-6.4.3/index.html">HIPCC</a></td>
|
||
<td>1.1.1</td>
|
||
<td><a href="https://github.com/ROCm/llvm-project/tree/amd-staging/amd/hipcc"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/llvm-project/en/docs-6.4.3/index.html">llvm-project</a></td>
|
||
<td>19.0.0</td>
|
||
<td><a href="https://github.com/ROCm/llvm-project/"><i
|
||
class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
<tbody class="rocm-components-runtimes tbody-reverse-zebra">
|
||
<tr>
|
||
<th rowspan="2" colspan="2">Runtimes</th>
|
||
<td><a href="https://rocm.docs.amd.com/projects/HIP/en/docs-6.4.3/index.html">HIP</a></td>
|
||
<td>6.4.3</td>
|
||
<td><a href="https://github.com/ROCm/HIP/"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
<tr>
|
||
<td><a href="https://rocm.docs.amd.com/projects/ROCR-Runtime/en/docs-6.4.3/index.html">ROCr Runtime</a></td>
|
||
<td>1.15.0</td>
|
||
<td><a href="https://github.com/ROCm/ROCR-Runtime/"><i class="fab fa-github fa-lg"></i></a></td>
|
||
</tr>
|
||
</tbody>
|
||
</table>
|
||
</div>
|
||
|
||
## Detailed component changes
|
||
|
||
The following sections describe key changes to ROCm components.
|
||
|
||
```{note}
|
||
For a historical overview of ROCm component updates, see the {doc}`ROCm consolidated changelog </release/changelog>`.
|
||
```
|
||
|
||
### **ROCm SMI** (7.7.0)
|
||
|
||
#### Added
|
||
|
||
- Support for getting the GPU Board voltage.
|
||
|
||
```{note}
|
||
See the full [ROCm SMI changelog](https://github.com/ROCm/rocm_smi_lib/blob/release/rocm-rel-6.4/CHANGELOG.md) for details, examples, and in-depth descriptions.
|
||
```
|
||
|
||
## ROCm known issues
|
||
|
||
ROCm known issues are noted on {fab}`github` [GitHub](https://github.com/ROCm/ROCm/labels/Verified%20Issue). For known
|
||
issues related to individual components, review the [Detailed component changes](#detailed-component-changes).
|
||
|
||
## ROCm upcoming changes
|
||
|
||
The following changes to the ROCm software stack are anticipated for future releases.
|
||
|
||
### AMD SMI migration to AMDGPU driver repository
|
||
|
||
In a future release, [AMD SMI](https://github.com/ROCm/amdsmi) will be relocated from the ROCm organization repository to a new AMDTools repository to better align with its system-level functionality. `amd-smi-lib` will no longer be included in the `rocm-developer-tools` meta-package included with your standard ROCm installation. Instead, it will be packaged with the AMDGPU driver installation.
|
||
|
||
### ROCm SMI deprecation
|
||
|
||
[ROCm SMI](https://github.com/ROCm/rocm_smi_lib) will be phased out in an
|
||
upcoming ROCm release and will enter maintenance mode. After this transition,
|
||
only critical bug fixes will be addressed and no further feature development
|
||
will take place.
|
||
|
||
It's strongly recommended to transition your projects to [AMD
|
||
SMI](https://github.com/ROCm/amdsmi), the successor to ROCm SMI. AMD SMI
|
||
includes all the features of the ROCm SMI and will continue to receive regular
|
||
updates, new functionality, and ongoing support. For more information on AMD
|
||
SMI, see the [AMD SMI documentation](https://rocm.docs.amd.com/projects/amdsmi/en/latest/).
|
||
|
||
### ROCTracer, ROCProfiler, rocprof, and rocprofv2 deprecation
|
||
|
||
Development and support for ROCTracer, ROCProfiler, `rocprof`, and `rocprofv2` are being phased out in favor of ROCprofiler-SDK in upcoming ROCm releases. Starting with ROCm 6.4, only critical defect fixes will be addressed for older versions of the profiling tools and libraries. All users are encouraged to upgrade to the latest version of the ROCprofiler-SDK library and the (`rocprofv3`) tool to ensure continued support and access to new features. ROCprofiler-SDK is still in beta today and will be production-ready in a future ROCm release.
|
||
|
||
It's anticipated that ROCTracer, ROCProfiler, `rocprof`, and `rocprofv2` will reach end-of-life by future releases, aligning with Q1 of 2026.
|
||
|
||
### AMDGPU wavefront size compiler macro deprecation
|
||
|
||
Access to the wavefront size as a compile-time constant via the `__AMDGCN_WAVEFRONT_SIZE`
|
||
and `__AMDGCN_WAVEFRONT_SIZE__` macros or the `constexpr warpSize` variable is deprecated
|
||
and will be disabled in a future release.
|
||
|
||
* The `__AMDGCN_WAVEFRONT_SIZE__` macro and `__AMDGCN_WAVEFRONT_SIZE` alias will be removed in an upcoming release.
|
||
It is recommended to remove any use of this macro. For more information, see
|
||
[AMDGPU support](https://rocm.docs.amd.com/projects/llvm-project/en/docs-6.4.3/LLVM/clang/html/AMDGPUSupport.html).
|
||
* `warpSize` will only be available as a non-`constexpr` variable. Where required,
|
||
the wavefront size should be queried via the `warpSize` variable in device code,
|
||
or via `hipGetDeviceProperties` in host code. Neither of these will result in a compile-time constant. For more information, see [warpSize](https://rocm.docs.amd.com/projects/HIP/en/docs-6.4.3/how-to/hip_cpp_language_extensions.html#warpsize).
|
||
* For cases where compile-time evaluation of the wavefront size cannot be avoided,
|
||
uses of `__AMDGCN_WAVEFRONT_SIZE`, `__AMDGCN_WAVEFRONT_SIZE__`, or `warpSize`
|
||
can be replaced with a user-defined macro or `constexpr` variable with the wavefront
|
||
size(s) for the target hardware. For example:
|
||
|
||
```
|
||
#if defined(__GFX9__)
|
||
#define MY_MACRO_FOR_WAVEFRONT_SIZE 64
|
||
#else
|
||
#define MY_MACRO_FOR_WAVEFRONT_SIZE 32
|
||
#endif
|
||
```
|
||
|
||
### HIPCC Perl scripts deprecation
|
||
|
||
The HIPCC Perl scripts (`hipcc.pl` and `hipconfig.pl`) will be removed in an upcoming release.
|
||
|
||
### Changes to ROCm Object Tooling
|
||
|
||
ROCm Object Tooling tools ``roc-obj-ls``, ``roc-obj-extract``, and ``roc-obj`` are
|
||
deprecated in ROCm 6.4, and will be removed in a future release. Functionality
|
||
has been added to the ``llvm-objdump --offloading`` tool option to extract all
|
||
clang-offload-bundles into individual code objects found within the objects
|
||
or executables passed as input. The ``llvm-objdump --offloading`` tool option also
|
||
supports the ``--arch-name`` option, and only extracts code objects found with
|
||
the specified target architecture. See [llvm-objdump](https://llvm.org/docs/CommandGuide/llvm-objdump.html)
|
||
for more information.
|
||
|
||
### HIP runtime API changes
|
||
|
||
There are a number of upcoming changes planned for HIP runtime API in an upcoming major release
|
||
that are not backward compatible with prior releases. Most of these changes increase
|
||
alignment between HIP and CUDA APIs or behavior. Some of the upcoming changes are to
|
||
clean up header files, remove namespace collision, and have a clear separation between
|
||
`hipRTC` and HIP runtime. For more information, see [HIP 7.0 Is Coming: What You Need to Know to Stay Ahead](https://rocm.blogs.amd.com/ecosystems-and-partners/transition-to-hip-7.0-blog/README.html).
|