diff --git a/CHANGELOG.md b/CHANGELOG.md index cab6aca80..09d290c16 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -764,7 +764,7 @@ HIP runtime has the following functional improvements which improves runtime per #### Changed -* VX_RPP extension : Version 3.1.0 release. +* VX_RPP extension: Version 3.1.0 release. * Update the parameters and kernel API of Blur, Fog, Jitter, LensCorrection, Rain, Pixelate, Vignette and ResizeCrop wrt tensor kernels replacing the legacy BatchPD API calls in VX_RPP extensions. #### Known issues @@ -1144,7 +1144,7 @@ Review the [README](https://github.com/ROCm/rocm_bandwidth_test/blob/amd-mainlin #### Changed * Completed migration from legacy [ROCProfiler](https://rocm.docs.amd.com/projects/rocprofiler/en/latest/) to [ROCprofiler-SDK](https://rocm.docs.amd.com/projects/rocprofiler-sdk/en/latest/). -* Reorganized the configuration files internally and improved [README/installation](https://github.com/ROCm/rdc/blob/amd-staging/README.md) instructions. +* Reorganized the configuration files internally and improved [README/installation](https://github.com/ROCm/rdc/blob/release/rocm-rel-7.0/README.md) instructions. * Updated metrics and monitoring support for the latest AMD data center GPUs. #### Optimized @@ -1346,7 +1346,7 @@ The previous default accumulator types could lead to situations in which unexpec - ROCprof Trace Decoder as experimental API: - Requires [ROCprof Trace Decoder plugin](https://github.com/rocm/rocprof-trace-decoder). - Thread trace option in the `rocprofv3` tool under the `--att` parameters: - - See [using thread trace with rocprofv3](https://rocm.docs.amd.com/projects/rocprofiler-sdk/en/amd-mainline/how-to/using-thread-trace.html) + - See [using thread trace with rocprofv3](https://rocm.docs.amd.com/projects/rocprofiler-sdk/en/docs-7.0.0/how-to/using-thread-trace.html) - Requires [ROCprof Trace Decoder plugin](https://github.com/rocm/rocprof-trace-decoder). - `rocpd` output format documentation: - Requires [ROCprof Trace Decoder plugin](https://github.com/rocm/rocprof-trace-decoder). diff --git a/RELEASE.md b/RELEASE.md index fa8ec0845..403c2ed01 100644 --- a/RELEASE.md +++ b/RELEASE.md @@ -42,7 +42,7 @@ The following are notable new features and improvements in ROCm 7.0.0. For chang ### Operating system, hardware, and virtualization support changes -ROCm 7.0.0 adds support for [AMD Instinct MI355X](https://www.amd.com/en/products/accelerators/instinct/mi350/mi355x.html) and [MI350X](https://www.amd.com/en/products/accelerators/instinct/mi350/mi350x.html). For details, see the full list of [Supported GPUs (Linux)](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/reference/system-requirements.html#supported-gpus). +ROCm 7.0.0 adds support for [AMD Instinct MI355X](https://www.amd.com/en/products/accelerators/instinct/mi350/mi355x.html) and [MI350X](https://www.amd.com/en/products/accelerators/instinct/mi350/mi350x.html). For details, see the full list of [Supported GPUs (Linux)](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/system-requirements.html#supported-gpus). ROCm 7.0.0 adds support for the following operating systems and kernel versions: @@ -51,7 +51,7 @@ ROCm 7.0.0 adds support for the following operating systems and kernel versions: ROCm 7.0.0 marks the end of support (EoS) for Ubuntu 24.04.2 (kernel: 6.8 [GA], 6.11 [HWE]) and SLES 15 SP6. -For more information about supported operating systems, see [Supported operating systems](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/reference/system-requirements.html#supported-operating-systems) and [install instructions](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/). +For more information about supported operating systems, see [Supported operating systems](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/system-requirements.html#supported-operating-systems) and [install instructions](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/). See the [Compatibility matrix](../../docs/compatibility/compatibility-matrix.rst) @@ -61,7 +61,7 @@ for more information about operating system and hardware compatibility. ROCm 7.0.0 introduces support for KVM Passthrough for AMD Instinct MI350X and MI355X GPUs. -All KVM-based SR-IOV supported configurations require the GIM SR-IOV driver version 8.4.0.K. Refer to [GIM Release note](https://github.com/amd/MxGPU-Virtualization/releases) for more details. In addition, support for VMware ESXi 8 has been introduced for AMD Instinct MI300X GPUs. For more information, see [Virtualization Support](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/reference/system-requirements.html#virtualization-support). +All KVM-based SR-IOV supported configurations require the GIM SR-IOV driver version 8.4.0.K. Refer to [GIM Release note](https://github.com/amd/MxGPU-Virtualization/releases) for more details. In addition, support for VMware ESXi 8 has been introduced for AMD Instinct MI300X GPUs. For more information, see [Virtualization Support](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/system-requirements.html#virtualization-support). ### Deep learning and AI framework updates @@ -117,14 +117,14 @@ ROCm 7.0.0 enables support for Triton 3.3.0. AMD ROCm has officially added support for the following Deep learning and AI frameworks: -* Ray is a unified framework for scaling AI and Python applications from your laptop to a full cluster, without changing your code. Ray consists of a core distributed runtime and a set of AI libraries for simplifying machine learning computations. It is currently supported on ROCm 6.4.1. For more information, see [Ray compatibility](https://advanced-micro-devices-rocm-internal--500.com.readthedocs.build/en/500/compatibility/ml-compatibility/ray-compatibility.html). +* Ray is a unified framework for scaling AI and Python applications from your laptop to a full cluster, without changing your code. Ray consists of a core distributed runtime and a set of AI libraries for simplifying machine learning computations. It is currently supported on ROCm 6.4.1. For more information, see [Ray compatibility](https://rocm.docs.amd.com/en/latest/compatibility/ml-compatibility/ray-compatibility.html). -* llama.cpp is an open-source framework for Large Language Model (LLM) inference that runs on both central processing units (CPUs) and graphics processing units (GPUs). It is written in plain C/C++, providing a simple, dependency-free setup. It is currently supported on ROCm 6.4.0. For more information, see [llama.cpp compatibility](https://advanced-micro-devices-rocm-internal--500.com.readthedocs.build/en/500/compatibility/ml-compatibility/llama-cpp-compatibility.html). +* llama.cpp is an open-source framework for Large Language Model (LLM) inference that runs on both central processing units (CPUs) and graphics processing units (GPUs). It is written in plain C/C++, providing a simple, dependency-free setup. It is currently supported on ROCm 6.4.0. For more information, see [llama.cpp compatibility](https://rocm.docs.amd.com/en/latest/compatibility/ml-compatibility/llama-cpp-compatibility.html). ### AMD GPU Driver/ROCm packaging separation -The AMD GPU Driver (amdgpu) is now distributed separately from the ROCm software stack and is stored under in its own location ``/amdgpu/`` in the package repository at [repo.radeon.com](https://repo.radeon.com/amdgpu/). The first release is designated as AMD GPU Driver (amdgpu) version 30.10. See the [User and kernel-space support matrix](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/reference/user-kernel-space-compat-matrix.html) for more information. +The AMD GPU Driver (amdgpu) is now distributed separately from the ROCm software stack and is stored under in its own location ``/amdgpu/`` in the package repository at [repo.radeon.com](https://repo.radeon.com/amdgpu/). The first release is designated as AMD GPU Driver (amdgpu) version 30.10. See the [User and kernel-space support matrix](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/user-kernel-space-compat-matrix.html) for more information. [AMD SMI](https://github.com/ROCm/amdsmi) continues to stay with the ROCm software stack under the ROCm organization repository. @@ -139,7 +139,7 @@ Other ROCm libraries are also in the process of migration along with ROCm tools ### HIP API compatibility improvements -To improve code portability between AMD ROCm and other programming models, HIP API has been updated in ROCm 7.0.0 to simplify cross-platform programming. These changes are incompatible with prior ROCm releases and might require recompiling existing HIP applications for use with ROCm 7.0.0. For more information, see the [HIP API 7.0.0 changes](https://rocm.docs.amd.com/projects/HIP/en/docs-develop/hip-7-changes.html) and the [HIP changelog](#hip-7-0-0) below. +To improve code portability between AMD ROCm and other programming models, HIP API has been updated in ROCm 7.0.0 to simplify cross-platform programming. These changes are incompatible with prior ROCm releases and might require recompiling existing HIP applications for use with ROCm 7.0.0. For more information, see the [HIP API 7.0.0 changes](https://rocm.docs.amd.com/projects/HIP/en/docs-7.0.0/hip-7-changes.html) and the [HIP changelog](#hip-7-0-0) below. ### HIP runtime updates @@ -247,12 +247,12 @@ See the [ROCgdb changelog](#rocgdb-16-3) for more details. ROCm Compute Profiler includes the following key changes: -* Interactive command line with a Textual User Interface (TUI) has been added to analyze mode. For more details, see [TUI analysis](https://rocm.docs.amd.com/projects/rocprofiler-compute/en/amd-staging/how-to/analyze/tui.html). +* Interactive command line with a Textual User Interface (TUI) has been added to analyze mode. For more details, see [TUI analysis](https://rocm.docs.amd.com/projects/rocprofiler-compute/en/docs-7.0.0/how-to/analyze/tui.html). * Support added for advanced data types: `FP4` and `FP6` * Support for AMD Instinct MI355X and MI350X with addition of performance counters: CPC, SPI, SQ, TA/TD/TCP, and TCC. * Roofline enhancement added for AMD Instinct MI350 Series. * Improved support for Selective Kernel profiling. -* Program Counter (PC) sampling (Software-based) feature has been enabled for AMD Instinct MI200, MI300X, MI350X, and MI355X GPUs. This feature helps in GPU profiling to understand code execution patterns and hotspots during GPU kernel execution. For more details, see [Using PC sampling in ROCm Compute Profiler](https://rocm.docs.amd.com/projects/rocprofiler-compute/en/amd-staging/how-to/pc_sampling.html). +* Program Counter (PC) sampling (Software-based) feature has been enabled for AMD Instinct MI200, MI300X, MI350X, and MI355X GPUs. This feature helps in GPU profiling to understand code execution patterns and hotspots during GPU kernel execution. For more details, see [Using PC sampling in ROCm Compute Profiler](https://rocm.docs.amd.com/projects/rocprofiler-compute/en/docs-7.0.0/how-to/pc_sampling.html). * Program Counter (PC) sampling (Hardware-based, Stochastic) feature has been enabled for AMD Instinct MI300X, MI350, and MI355X GPUs. * Docker files has been added to package the application and dependencies into a single portable and executable standalone binary file. @@ -356,7 +356,7 @@ ROCm documentation continues to be updated to provide clearer and more comprehen In addition, the [AI agent with MCPs using vLLM and PydanticAI](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/notebooks/inference/build_airbnb_agent_mcp.html) tutorial has been updated. For more information about the changes, see [Changelog for the AI Developer Hub](https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/changelog.html). -* Documentation for [rocCV](https://advanced-micro-devices-roccv--28.com.readthedocs.build/en/28/), an efficient GPU-accelerated library for image pre- and post-processing, has been added. rocCV is in an early access state, and using it on production workloads is not recommended. +* Documentation for [rocCV](https://rocm.docs.amd.com/projects/rocCV/en/latest/index.html), an efficient GPU-accelerated library for image pre- and post-processing, has been added. rocCV is in an early access state, and using it on production workloads is not recommended. * ROCm Math libraries support a wide range of data types, enabling optimized performance across various precision requirements. The following Math libraries are now updated with new precision content. For more information, click the Math library’s link: @@ -530,7 +530,7 @@ For more information on these features, see [AMD SMI changelog](https://github.c #### KVM SR-IOV virtualization changes dependent on open source AMD GPU Virtualization Driver (GIM) -KVM SR-IOV support for all Instinct GPUs require the open source AMD GPU Virtualization Driver (GIM) 8.4.0.K. For detailed support information, see [virtualization support](https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/reference/system-requirements.html#virtualization-support) and [GIM Release Note](https://github.com/amd/MxGPU-Virtualization/releases). +KVM SR-IOV support for all Instinct GPUs require the open source AMD GPU Virtualization Driver (GIM) 8.4.0.K. For detailed support information, see [virtualization support](https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/system-requirements.html#virtualization-support) and [GIM Release Note](https://github.com/amd/MxGPU-Virtualization/releases). #### GPU partitioning support for AMD Instinct MI355X and MI350X GPUs @@ -562,47 +562,47 @@ Click {fab}`github` to go to the component's source code on GitHub. Libraries Machine learning and computer vision - Composable Kernel + Composable Kernel 1.1.0 ⇒ 1.1.0 - MIGraphX + MIGraphX 2.12.0 ⇒ 2.13.0 - MIOpen + MIOpen 3.4.0 ⇒ 3.5.0 - MIVisionX + MIVisionX 3.2.0 ⇒ 3.3.0 - rocAL + rocAL 2.2.0 ⇒ 2.3.0 - rocDecode + rocDecode 0.10.0 ⇒ 1.0.0 - rocJPEG + rocJPEG 0.8.0 ⇒ 1.1.0 - rocPyDecode + rocPyDecode 0.3.1 ⇒ 0.6.0 - RPP + RPP 1.9.10 ⇒ 2.0.0 @@ -611,12 +611,12 @@ Click {fab}`github` to go to the component's source code on GitHub. Communication - RCCL + RCCL 2.22.3 ⇒ 2.26.6 - rocSHMEM + rocSHMEM 2.0.1 ⇒ 3.0.0 @@ -625,82 +625,82 @@ Click {fab}`github` to go to the component's source code on GitHub. Math - hipBLAS + hipBLAS 2.4.0 ⇒ 3.0.0 - hipBLASLt + hipBLASLt 0.12.1 ⇒ 1.0.0 - hipFFT + hipFFT 1.0.18 ⇒ 1.0.20 - hipfort + hipfort 0.6.0 ⇒ 0.7.0 - hipRAND + hipRAND 2.12.0 ⇒ 3.0.0 - hipSOLVER + hipSOLVER 2.4.0 ⇒ 3.0.0 - hipSPARSE + hipSPARSE 3.2.0 ⇒ 4.0.1 - hipSPARSELt + hipSPARSELt 0.2.3 ⇒ 0.2.4 - rocALUTION + rocALUTION 3.2.3 ⇒ 4.0.0 - rocBLAS + rocBLAS 4.4.1 ⇒ 5.0.0 - rocFFT + rocFFT 1.0.32 ⇒ 1.0.34 - rocRAND + rocRAND 3.3.0 ⇒ 4.0.0 - rocSOLVER + rocSOLVER 3.28.2 ⇒ 3.30.0 - rocSPARSE + rocSPARSE 3.4.0 ⇒ 4.0.2 - rocWMMA + rocWMMA 1.7.0 ⇒ 2.0.0 - Tensile + Tensile 4.43.0 ⇒ 4.44.0 @@ -709,22 +709,22 @@ Click {fab}`github` to go to the component's source code on GitHub. Primitives - hipCUB + hipCUB 3.4.0 ⇒ 4.0.0 - hipTensor + hipTensor 1.5.0 ⇒ 2.0.0 - rocPRIM + rocPRIM 3.4.1 ⇒ 4.0.0 - rocThrust + rocThrust 3.3.0 ⇒ 4.0.0 @@ -733,27 +733,27 @@ Click {fab}`github` to go to the component's source code on GitHub. Tools System management - AMD SMI + AMD SMI 25.5.1 ⇒ 26.0.0 - ROCm Data Center Tool + ROCm Data Center Tool 0.3.0 ⇒ 1.1.0 - rocminfo + rocminfo 1.0.0 - ROCm SMI + ROCm SMI 7.7.0 ⇒ 7.8.0 - ROCm Validation Suite + ROCm Validation Suite 1.1.0 ⇒ 1.2.0 @@ -762,38 +762,38 @@ Click {fab}`github` to go to the component's source code on GitHub. Performance - ROCm Bandwidth + ROCm Bandwidth Test 1.4.0 ⇒ 2.6.0 - ROCm Compute Profiler + ROCm Compute Profiler 3.1.1 ⇒ 3.2.3 - ROCm Systems Profiler + ROCm Systems Profiler 1.0.2 ⇒ 1.1.0 - ROCProfiler + ROCProfiler 2.0.0 - ROCprofiler-SDK + ROCprofiler-SDK 0.6.0 ⇒ 1.0.0 - ROCTracer + ROCTracer 4.1.0 @@ -803,32 +803,32 @@ Click {fab}`github` to go to the component's source code on GitHub. Development - HIPIFY + HIPIFY 19.0.0 ⇒ 20.0.0 - ROCdbgapi + ROCdbgapi 0.77.2 ⇒ 0.77.3 - ROCm CMake + ROCm CMake 0.14.0 - ROCm Debugger (ROCgdb) + ROCm Debugger (ROCgdb) 15.2 ⇒ 16.3 - ROCr Debug Agent + ROCr Debug Agent 2.0.4 ⇒ 2.1.0 Compilers - HIPCC + HIPCC 1.1.1 - llvm-project + llvm-project 19.0.0 ⇒ 20.0.0 @@ -853,12 +853,12 @@ Click {fab}`github` to go to the component's source code on GitHub. Runtimes - HIP + HIP 6.4.3 ⇒ 7.0.0 - ROCr Runtime + ROCr Runtime 1.15.0 ⇒ 1.18.0 @@ -1629,7 +1629,7 @@ HIP runtime has the following functional improvements which improves runtime per #### Changed -* VX_RPP extension : Version 3.1.0 release. +* VX_RPP extension: Version 3.1.0 release. * Update the parameters and kernel API of Blur, Fog, Jitter, LensCorrection, Rain, Pixelate, Vignette and ResizeCrop wrt tensor kernels replacing the legacy BatchPD API calls in VX_RPP extensions. #### Known issues @@ -2009,7 +2009,7 @@ Review the [README](https://github.com/ROCm/rocm_bandwidth_test/blob/amd-mainlin #### Changed * Completed migration from legacy [ROCProfiler](https://rocm.docs.amd.com/projects/rocprofiler/en/latest/) to [ROCprofiler-SDK](https://rocm.docs.amd.com/projects/rocprofiler-sdk/en/latest/). -* Reorganized the configuration files internally and improved [README/installation](https://github.com/ROCm/rdc/blob/amd-staging/README.md) instructions. +* Reorganized the configuration files internally and improved [README/installation](https://github.com/ROCm/rdc/blob/release/rocm-rel-7.0/README.md) instructions. * Updated metrics and monitoring support for the latest AMD data center GPUs. #### Optimized @@ -2211,7 +2211,7 @@ The previous default accumulator types could lead to situations in which unexpec - ROCprof Trace Decoder as experimental API: - Requires [ROCprof Trace Decoder plugin](https://github.com/rocm/rocprof-trace-decoder). - Thread trace option in the `rocprofv3` tool under the `--att` parameters: - - See [using thread trace with rocprofv3](https://rocm.docs.amd.com/projects/rocprofiler-sdk/en/amd-mainline/how-to/using-thread-trace.html) + - See [using thread trace with rocprofv3](https://rocm.docs.amd.com/projects/rocprofiler-sdk/en/docs-7.0.0/how-to/using-thread-trace.html) - Requires [ROCprof Trace Decoder plugin](https://github.com/rocm/rocprof-trace-decoder). - `rocpd` output format documentation: - Requires [ROCprof Trace Decoder plugin](https://github.com/rocm/rocprof-trace-decoder). @@ -2615,10 +2615,10 @@ and `__AMDGCN_WAVEFRONT_SIZE__` macros are deprecated and will be disabled in a * The `__AMDGCN_WAVEFRONT_SIZE__` macro and `__AMDGCN_WAVEFRONT_SIZE` alias will be removed in an upcoming release. It is recommended to remove any use of this macro. For more information, see - [AMDGPU support](https://rocm.docs.amd.com/projects/llvm-project/en/docs-6.4.3/LLVM/clang/html/AMDGPUSupport.html). + [AMDGPU support](https://rocm.docs.amd.com/projects/llvm-project/en/docs-7.0.0/LLVM/clang/html/AMDGPUSupport.html). * `warpSize` is only available as a non-`constexpr` variable. Where required, the wavefront size should be queried via the `warpSize` variable in device code, - or via `hipGetDeviceProperties` in host code. Neither of these will result in a compile-time constant. For more information, see [warpSize](https://rocm.docs.amd.com/projects/HIP/en/docs-6.4.3/how-to/hip_cpp_language_extensions.html#warpsize). + or via `hipGetDeviceProperties` in host code. Neither of these will result in a compile-time constant. For more information, see [warpSize](https://rocm.docs.amd.com/projects/HIP/en/docs-7.0.0/how-to/hip_cpp_language_extensions.html#warpsize). * For cases where compile-time evaluation of the wavefront size cannot be avoided, uses of `__AMDGCN_WAVEFRONT_SIZE`, `__AMDGCN_WAVEFRONT_SIZE__`, or `warpSize` can be replaced with a user-defined macro or `constexpr` variable with the wavefront diff --git a/docs/conf.py b/docs/conf.py index 85c6943fc..0be79cd66 100644 --- a/docs/conf.py +++ b/docs/conf.py @@ -97,7 +97,7 @@ all_article_info_author = "" # pages with specific settings article_pages = [ - {"file": "about/release-notes", "os": ["linux"], "date": "2025-08-28"}, + {"file": "about/release-notes", "os": ["linux"], "date": "2025-09-16"}, {"file": "release/changelog", "os": ["linux"],}, {"file": "compatibility/compatibility-matrix", "os": ["linux"]}, {"file": "compatibility/ml-compatibility/pytorch-compatibility", "os": ["linux"]}, diff --git a/docs/sphinx/_toc.yml.in b/docs/sphinx/_toc.yml.in index b8cbd0e31..a8028d50c 100644 --- a/docs/sphinx/_toc.yml.in +++ b/docs/sphinx/_toc.yml.in @@ -12,14 +12,14 @@ subtrees: - file: compatibility/compatibility-matrix.rst title: Compatibility matrix entries: - - url: https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/reference/system-requirements.html + - url: https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/system-requirements.html title: Linux system requirements - url: https://rocm.docs.amd.com/projects/install-on-windows/en/${branch}/reference/system-requirements.html title: Windows system requirements - caption: Install entries: - - url: https://rocm.docs.amd.com/projects/install-on-linux-internal/en/latest/ + - url: https://rocm.docs.amd.com/projects/install-on-linux/en/${branch}/ title: ROCm on Linux - url: https://rocm.docs.amd.com/projects/install-on-windows/en/latest/ title: HIP SDK on Windows diff --git a/manifest_700.xml b/manifest_700.xml deleted file mode 100644 index 4f7d505d8..000000000 --- a/manifest_700.xml +++ /dev/null @@ -1,80 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - \ No newline at end of file