Mirror of https://github.com/ROCm/ROCm.git, synced 2026-01-10 15:18:11 -05:00
Merge pull request #2344 from RadeonOpenCompute/docs/5.6.0
Sync 5.6 branches
CHANGELOG.md (39 lines changed)
@@ -39,6 +39,20 @@ ROCm 5.6 consists of several AI software ecosystem improvements to our fast-grow
- Bug fixes will not be back ported to older ROCm releases for this SKU
- Distro / Operating system updates will continue as per the ROCm release cadence for gfx906 GPUs till EOM.

#### AMDSMI CLI 23.0.0.4

##### Added

- AMDSMI CLI tool enabled for Linux Bare Metal & Guest

- Package: amd-smi-lib

##### Known Issues

- not all Error Correction Code (ECC) fields are currently supported

- RHEL 8 & SLES 15 have extra install steps

#### Kernel Modules (DKMS)

##### Fixes
@@ -77,7 +91,7 @@ ROCm 5.6 consists of several AI software ecosystem improvements to our fast-grow
- Limited testing on xnack+ configuration
- Multiple HIP tests failures (gpuvm fault or hangs)
- hipSetDevice and hipSetDeviceFlags APIs return hipErrorInvalidDevice instead of hipErrorNoDevice, on a system without GPU
- Known memory leak when code object files are loaded/unloaded via hipModuleLoad/hipModuleUnload APIs. Issue will be fixed in ROCm 5.6.1
- Known memory leak when code object files are loaded/unloaded via hipModuleLoad/hipModuleUnload APIs. Issue will be fixed in a future ROCm release

##### Upcoming changes in future release
@@ -636,6 +650,29 @@ The following hipcc changes are implemented in this release:
- `hipCommander` at <https://github.com/ROCm-Developer-Tools/hip-tests/tree/develop/samples/1_Utils/hipCommander>

Note that the samples will continue to be available in previous release branches.
- Removal of gcnarch from hipDeviceProp_t structure
- Addition of new fields in hipDeviceProp_t structure
  - maxTexture1D
  - maxTexture2D
  - maxTexture1DLayered
  - maxTexture2DLayered
  - sharedMemPerMultiprocessor
  - deviceOverlap
  - asyncEngineCount
  - surfaceAlignment
  - unifiedAddressing
  - computePreemptionSupported
  - hostRegisterSupported
  - uuid
- Removal of deprecated code
  - hip-hcc codes from hip code tree
- Correct hipArray usage in HIP APIs such as hipMemcpyAtoH and hipMemcpyHtoA
- HIPMEMCPY_3D fields correction to avoid truncation of "size_t" to "unsigned int" inside hipMemcpy3D()
- Renaming of 'memoryType' in hipPointerAttribute_t structure to 'type'
- Correct hipGetLastError to return the last error instead of last API call's return code
- Update hipExternalSemaphoreHandleDesc to add "unsigned int reserved[16]"
- Correct handling of flag values in hipIpcOpenMemHandle for hipIpcMemLazyEnablePeerAccess
- Remove hiparray* and make it opaque with hipArray_t
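
The `HIPMEMCPY_3D` item in the list above concerns a `size_t` value being narrowed to `unsigned int`. The following is a minimal Python sketch of that kind of narrowing, not HIP code: it assumes a 64-bit `size_t` and a 32-bit `unsigned int` (typical on 64-bit Linux), and the helper name `truncate_to_uint32` and the sample extent are hypothetical.

```python
import ctypes


def truncate_to_uint32(n: int) -> int:
    """Return n as it would read back from a 32-bit unsigned field.

    ctypes integer types do no overflow checking, so the value is
    reduced modulo 2**32, which mirrors a C narrowing conversion.
    """
    return ctypes.c_uint32(n).value


# Hypothetical extent just over 4 GiB; it fits in a 64-bit size_t
# but not in a 32-bit unsigned int.
extent = (1 << 32) + 4096
narrowed = truncate_to_uint32(extent)

print(f"requested extent: {extent}")
print(f"after narrowing:  {narrowed}")
```

Values below 2**32 pass through unchanged, which is why this class of bug tends to appear only with very large copies.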

##### New HIP APIs in This Release
RELEASE.md (20 lines changed)
@@ -39,6 +39,20 @@ ROCm 5.6 consists of several AI software ecosystem improvements to our fast-grow
- Bug fixes will not be back ported to older ROCm releases for this SKU
- Distro / Operating system updates will continue as per the ROCm release cadence for gfx906 GPUs till EOM.

#### AMDSMI CLI 23.0.0.4

##### Added

- AMDSMI CLI tool enabled for Linux Bare Metal & Guest

- Package: amd-smi-lib

##### Known Issues

- not all Error Correction Code (ECC) fields are currently supported

- RHEL 8 & SLES 15 have extra install steps

#### Kernel Modules (DKMS)

##### Fixes
@@ -77,7 +91,7 @@ ROCm 5.6 consists of several AI software ecosystem improvements to our fast-grow
- Limited testing on xnack+ configuration
- Multiple HIP tests failures (gpuvm fault or hangs)
- hipSetDevice and hipSetDeviceFlags APIs return hipErrorInvalidDevice instead of hipErrorNoDevice, on a system without GPU
- Known memory leak when code object files are loaded/unloaded via hipModuleLoad/hipModuleUnload APIs. Issue will be fixed in ROCm 5.6.1
- Known memory leak when code object files are loaded/unloaded via hipModuleLoad/hipModuleUnload APIs. Issue will be fixed in a future ROCm release

##### Upcoming changes in future release
@@ -566,7 +580,3 @@ Tensile 4.37.0 for ROCm 5.6.0
- Fixed ASEM + GSU + NoTailLoop opt mismatch
- Fixed AssertSummationElementMultiple + GlobalSplitU issues
- Fixed ASEM + GSU + TailLoop inner unroll

## Older versions

The release notes for older versions can be found in [the changelog](./CHANGELOG.md).
docs/conf.py (13 lines changed)
@@ -14,12 +14,25 @@ shutil.copy2('../RELEASE.md','./release.md')
# Keep capitalization due to similar linking on GitHub's markdown preview.
shutil.copy2('../CHANGELOG.md','./CHANGELOG.md')

# configurations for PDF output by Read the Docs
project = "ROCm Documentation"
author = "Advanced Micro Devices, Inc."
copyright = "Copyright (c) 2023 Advanced Micro Devices, Inc. All rights reserved."
version = "5.6.0"
release = "5.6.0"

setting_all_article_info = True
all_article_info_os = ["linux"]
all_article_info_author = ""

# pages with specific settings
article_pages = [
{
"file":"release",
"os":["linux"],
"date":"2023-06-28"
},

{"file":"deploy/linux/index", "os":["linux"]},
{"file":"deploy/linux/install_overview", "os":["linux"]},
{"file":"deploy/linux/prerequisites", "os":["linux"]},
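
The `article_pages` entries in this hunk are plain dictionaries with a `"file"` key, an `"os"` list, and an optional `"date"`. As a rough sketch of how such entries could be sanity-checked before a docs build, the hypothetical helper below (`check_article_pages` is not part of `rocm-docs-core`, and the rules it enforces are assumptions drawn only from the entries shown here) reports missing `"file"` keys and dates that are not in ISO `YYYY-MM-DD` form.

```python
from datetime import date


def check_article_pages(pages):
    """Return a list of human-readable problems found in the entries.

    Assumed rules (not confirmed by rocm-docs-core): every entry needs
    a "file" key, and "date", when present, is an ISO calendar date.
    """
    problems = []
    for index, page in enumerate(pages):
        if "file" not in page:
            problems.append(f"entry {index}: missing 'file'")
        if "date" in page:
            try:
                date.fromisoformat(page["date"])
            except ValueError:
                problems.append(f"entry {index}: bad date {page['date']!r}")
    return problems


# Entries copied from the hunk above.
article_pages = [
    {"file": "release", "os": ["linux"], "date": "2023-06-28"},
    {"file": "deploy/linux/index", "os": ["linux"]},
    {"file": "deploy/linux/install_overview", "os": ["linux"]},
    {"file": "deploy/linux/prerequisites", "os": ["linux"]},
]

print(check_article_pages(article_pages))
```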
@@ -7,20 +7,26 @@
AMD's library for high performance machine learning primitives.

- {doc}`Documentation <miopen:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/MIOpen)
- [Changelog](https://github.com/ROCmSoftwarePlatform/MIOpen/blob/develop/CHANGELOG.md)

:::

:::{grid-item-card} {doc}`Composable Kernel <composable-kernel:index>`
:::{grid-item-card} {doc}`Composable Kernel <composable_kernel:index>`
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

- {doc}`Documentation <composable-kernel:index>`
- {doc}`Documentation <composable_kernel:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/composable_kernel)
- [Changelog](https://github.com/ROCmSoftwarePlatform/composable_kernel/blob/develop/CHANGELOG.md)

:::

:::{grid-item-card} {doc}`MIGraphX <migraphx:index>`
:::{grid-item-card} {doc}`MIGraphX <amdmigraphx:index>`
AMD MIGraphX is AMD's graph inference engine that accelerates machine learning model inference.

- {doc}`Documentation <migraphx:index>`
- {doc}`Documentation <amdmigraphx:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX)
- [Changelog](https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/blob/develop/CHANGELOG.md)

:::
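
This hunk renames the project keys used inside `{doc}` roles (`composable-kernel` to `composable_kernel`, `migraphx` to `amdmigraphx`). A bulk rename of this kind could be scripted roughly as follows. This is a sketch rather than the tooling used for the commit: `rename_doc_targets` and the `RENAMES` table are hypothetical names, and the pattern only handles the `` {doc}`Title <key:path>` `` form seen in this diff.

```python
import re

# Old project key -> new project key, taken from the renames in this commit.
RENAMES = {
    "composable-kernel": "composable_kernel",
    "migraphx": "amdmigraphx",
    "rocm-validation-suite": "rocmvalidationsuite",
}

# Matches {doc}`Title <key:path>` and captures the key separately.
_DOC_ROLE = re.compile(r"(\{doc\}`[^`<]*<)([^:>`]+)(:[^>`]*>`)")


def rename_doc_targets(text, renames=RENAMES):
    """Rewrite the project key of each titled {doc} role found in text."""

    def _swap(match):
        key = match.group(2)
        return match.group(1) + renames.get(key, key) + match.group(3)

    return _DOC_ROLE.sub(_swap, text)


line = "- {doc}`Composable Kernel <composable-kernel:index>`"
print(rename_doc_targets(line))
```

Roles without an explicit title, such as `` {doc}`hip:.doxygen/docBin/html/index` ``, are deliberately left untouched by this pattern.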
@@ -8,8 +8,9 @@
:::{grid-item-card} [HIP](./hip)
HIP is both AMD's GPU programming language extension and the GPU runtime.

- {doc}`hip:.doxygen/docBin/html/index`
- [Examples](https://github.com/amd/rocm-examples/tree/develop/HIP-Basic)
- {doc}`HIP <hip:index>`
- [HIP Examples](https://github.com/amd/rocm-examples/tree/develop/HIP-Basic)
- {doc}`HIPIFY <hipify:index>`

:::
@@ -42,8 +43,8 @@ Inter and intra-node communication is supported by the following projects:
Libraries related to AI.

- {doc}`MIOpen <miopen:index>`
- {doc}`Composable Kernel <composable-kernel:index>`
- {doc}`MIGraphX <migraphx:index>`
- {doc}`Composable Kernel <composable_kernel:index>`
- {doc}`MIGraphX <amdmigraphx:index>`

:::
@@ -64,6 +65,7 @@ Computer vision related projects.
:::{grid-item-card} [Compilers and Tools](compilers)

- [ROCmCC](/reference/rocmcc/rocmcc)
- {doc}`ROCdbgapi <rocdbgapi:index>`
- {doc}`ROCgdb <rocgdb:index>`
- {doc}`ROCProfiler <rocprofiler:rocprof>`
- {doc}`ROCTracer <roctracer:index>`
@@ -72,15 +74,15 @@ Computer vision related projects.

:::{grid-item-card} [Management Tools](management_tools)

- AMD SMI
- ROCm SMI
- ROCm Data Center Tool
- {doc}`AMD SMI <amdsmi:index>`
- {doc}`ROCm SMI <rocm_smi_lib:index>`
- {doc}`ROCm Data Center Tool <rdc:index>`

:::

:::{grid-item-card} [Validation Tools](validation_tools)

- {doc}`ROCm Validation Suite <rocm-validation-suite:index>`
- {doc}`ROCm Validation Suite <rocmvalidationsuite:index>`
- {doc}`TransferBench <transferbench:index>`

:::
@@ -3,42 +3,46 @@
:::::{grid} 1 1 2 2
:gutter: 1

:::{grid-item-card} ROCmCC
:link: /reference/rocmcc/rocmcc
:link-type: doc
:::{grid-item-card} {doc}`ROCdbgapi <rocdbgapi:index>`
The AMD Debugger API is a library that provides all the support necessary for a
debugger and other tools to perform low level control of the execution and
inspection of execution state of AMD's commercially available GPU architectures.

- {doc}`Documentation <rocdbgapi:index>`
- [GitHub](https://github.com/ROCm-Developer-Tools/ROCdbgapi/)

:::

:::{grid-item-card} [ROCmCC](./rocmcc/rocmcc)
ROCmCC is a Clang/LLVM-based compiler. It is optimized for high-performance
computing on AMD GPUs and CPUs and supports various heterogeneous programming
models such as HIP, OpenMP, and OpenCL.

- [Documentation](./rocmcc/rocmcc)

:::

:::{grid-item-card} ROCgdb
:link: rocgdb:index
:link-type: doc
:::{grid-item-card} {doc}`ROCgdb <rocgdb:index>`
This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.

- {doc}`Documentation <rocgdb:index>`
- [GitHub](https://github.com/ROCm-Developer-Tools/ROCgdb/)

:::

:::{grid-item-card} ROCProfiler
:link: rocprofiler:rocprof
:link-type: doc
:::{grid-item-card} {doc}`ROCProfiler <rocprofiler:rocprof>`
ROC profiler library. Profiling with performance counters and derived metrics. Library supports GFX8/GFX9. Hardware specific low-level performance analysis interface for profiling of GPU compute applications. The profiling includes hardware performance counters with complex performance metrics.

:::

:::{grid-item-card} ROCTracer
:link: roctracer:index
:link-type: doc
Callback/Activity Library for Performance tracing AMD GPU's
- {doc}`Documentation <rocprofiler:rocprof>`
- [GitHub](https://github.com/ROCm-Developer-Tools/rocprofiler/)

:::

:::{grid-item-card} ROCdbgapi
:link: rocdbgapi:index
:link-type: doc
The AMD Debugger API is a library that provides all the support necessary for a
debugger and other tools to perform low level control of the execution and
inspection of execution state of AMD's commercially available GPU architectures.
:::{grid-item-card} {doc}`ROCTracer <roctracer:index>`
Callback/Activity Library for Performance tracing AMD GPUs

- {doc}`Documentation <roctracer:index>`
- [GitHub](https://github.com/ROCm-Developer-Tools/roctracer)

:::
@@ -7,6 +7,8 @@
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

- {doc}`Documentation <mivisionx:README>`
- [GitHub](https://github.com/GPUOpen-ProfessionalCompute-Libraries/MIVisionX/)
- [Changelog](https://github.com/GPUOpen-ProfessionalCompute-Libraries/MIVisionX/blob/master/CHANGELOG.md)

:::
@@ -11,6 +11,7 @@ transforms, reductions, scans, etc. It also serves as a common back-end for
similar libraries found inside ROCm.

- {doc}`Documentation <rocprim:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocPRIM/)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocPRIM/blob/develop/CHANGELOG.md)
- [Examples](https://github.com/amd/rocm-examples/tree/develop/Libraries/rocPRIM)
@@ -22,6 +23,7 @@ interface. Their CPU back-ends are identical, while the GPU back-end calls into
rocPRIM.

- {doc}`Documentation <rocthrust:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocThrust)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocThrust/blob/develop/CHANGELOG.md)
- [Examples](https://github.com/amd/rocm-examples/tree/develop/Libraries/rocThrust)
@@ -32,6 +34,7 @@ hipCUB is a template library of algorithm primitives with a CUB-compatible
interface. It's back-end is rocPRIM.

- {doc}`Documentation <hipcub:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/hipCUB)
- [Changelog](https://github.com/ROCmSoftwarePlatform/hipCUB/blob/develop/CHANGELOG.md)
- [Examples](https://github.com/amd/rocm-examples/tree/develop/Libraries/hipCUB)
@@ -10,6 +10,7 @@ The collective operations are implemented using ring and tree algorithms and hav
throughput and latency.

- {doc}`Documentation <rccl:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rccl)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocFFT/blob/develop/CHANGELOG.md)
- [Examples](https://github.com/ROCmSoftwarePlatform/rccl/tree/develop/tools)
@@ -9,6 +9,7 @@ ROCm libraries for FFT are as follows:
rocFFT is an AMD GPU optimized library for FFT.

- {doc}`Documentation <rocfft:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocFFT)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocFFT/blob/develop/CHANGELOG.md)

:::
@@ -19,6 +20,7 @@ using rocFFT. hipFFT allows for a common interface for other non AMD GPU
FFT libraries.

- {doc}`Documentation <hipfft:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/hipFFT)
- [Changelog](https://github.com/ROCmSoftwarePlatform/hipFFT/blob/develop/CHANGELOG.md)

:::
@@ -9,6 +9,7 @@ ROCm libraries for linear algebra are as follows:
`rocBLAS` is an AMD GPU optimized library for BLAS (Basic Linear Algebra Subprograms).

- {doc}`Documentation <rocblas:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocBLAS)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocBLAS/blob/develop/CHANGELOG.md)
- [Examples](https://github.com/amd/rocm-examples/tree/develop/Libraries/rocBLAS)
@@ -20,6 +21,7 @@ via `rocBLAS` and `rocSOLVER`. `hipBLAS` allows for a common interface for other
BLAS libraries.

- {doc}`Documentation <hipblas:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/hipBLAS)
- [Changelog](https://github.com/ROCmSoftwarePlatform/hipBLAS/blob/develop/CHANGELOG.md)

:::
@@ -31,6 +33,7 @@ flexible API and extends functionalities beyond traditional BLAS library.
optimized generator as a back-end kernel provider.

- {doc}`Documentation <hipblaslt:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/hipBLASLt)
- [Changelog](https://github.com/ROCmSoftwarePlatform/hipBLASLt/blob/develop/CHANGELOG.md)

:::
@@ -41,6 +44,7 @@ fine-grained parallelism on top of AMD's ROCm runtime and toolchains, targeting
modern CPU and GPU platforms.

- {doc}`Documentation <rocalution:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocALUTION)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocALUTION/blob/develop/CHANGELOG.md)

:::
@@ -50,6 +54,7 @@ modern CPU and GPU platforms.
(MMA) problems into fragments and distributes these over GPU wavefronts.

- {doc}`Documentation <rocwmma:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocWMMA)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocWMMA/blob/develop/CHANGELOG.md)

:::
@@ -58,6 +63,7 @@ modern CPU and GPU platforms.
`rocSOLVER` provides a subset of LAPACK (Linear Algebra Package) functionality on the ROCm platform.

- {doc}`Documentation <rocsolver:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocSOLVER)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocSOLVER/blob/develop/CHANGELOG.md)

:::
@@ -67,6 +73,7 @@ modern CPU and GPU platforms.
as backends whilst exporting a unified interface.

- {doc}`Documentation <hipsolver:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/hipSOLVER)
- [Changelog](https://github.com/ROCmSoftwarePlatform/hipSOLVER/blob/develop/CHANGELOG.md)

:::
@@ -75,6 +82,7 @@ as backends whilst exporting a unified interface.
`rocSPARSE` is a library to provide BLAS for sparse computations.

- {doc}`Documentation <rocsparse:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocSPARSE)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocSOLVER/blob/develop/CHANGELOG.md)

:::
@@ -84,6 +92,7 @@ as backends whilst exporting a unified interface.
supporting both `rocSPARSE` and `cuSPARSE` as backends.

- {doc}`Documentation <hipsparse:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/hipSPARSE)
- [Changelog](https://github.com/ROCmSoftwarePlatform/hipSOLVER/blob/develop/CHANGELOG.md)

:::
@@ -7,6 +7,7 @@
rocRAND is an AMD GPU optimized library for pseudo-random number generators (PRNG).

- {doc}`Documentation <rocrand:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/rocRAND/)
- [Changelog](https://github.com/ROCmSoftwarePlatform/rocRAND/blob/develop/CHANGELOG.md)
- [Examples](https://github.com/amd/rocm-examples/tree/develop/Libraries/rocRAND)
@@ -18,6 +19,7 @@ generation (PRNG) optimized for AMD GPUs using rocRAND. hipRAND allows for a
common interface for other non AMD GPU PRNG libraries.

- {doc}`Documentation <hiprand:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/hipRAND/)
- [Changelog](https://github.com/ROCmSoftwarePlatform/hipRAND/blob/develop/CHANGELOG.md)

:::
@@ -12,7 +12,8 @@ page introduces the HIP runtime and other HIP libraries and tools.
The HIP Runtime is used to enable GPU acceleration for all HIP language based
products.

- {doc}`API Reference Manual <hip:.doxygen/docBin/html/index>`
- {doc}`Documentation <hip:index>`
- [GitHub](https://github.com/ROCm-Developer-Tools/HIP)
- [Examples](https://github.com/amd/rocm-examples/tree/develop/HIP-Basic)

:::
@@ -28,7 +29,9 @@ products.
HIPIFY assists with porting applications from based on CUDA to the HIP Runtime.
Supported CUDA APIs are documented here as well.

- {doc}`Reference Manual <hipify:index>`
- {doc}`Documentation <hipify:index>`
- [GitHub](https://github.com/ROCm-Developer-Tools/HIPIFY/)
- [Changelog](https://github.com/ROCm-Developer-Tools/HIPIFY/blob/amd-staging/CHANGELOG.md)

:::
@@ -3,26 +3,30 @@
:::::{grid} 1 1 3 3
:gutter: 1

:::{grid-item-card} AMD SMI
:::{grid-item-card} {doc}`AMD SMI <amdsmi:index>`
The AMD System Management Interface Library, or AMD SMI library, is a C library for Linux that provides a user space interface for applications to monitor and control AMD devices.

- {doc}`Documentation <amdsmi:index>`
- [GitHub](https://github.com/RadeonOpenCompute/amdsmi)
- [Examples](https://github.com/amd/go_amd_smi#example)

:::

:::{grid-item-card} ROCm SMI
:::{grid-item-card} {doc}`ROCm SMI LIB <rocm_smi_lib:index>`
This tool acts as a command line interface for manipulating and monitoring the AMD GPU kernel, and is intended to replace and deprecate the existing `rocm_smi.py` CLI tool. It uses `ctypes` to call the `rocm_smi_lib` API.

- {doc}`Documentation <rocm_smi_lib:index>`
- [GitHub](https://github.com/RadeonOpenCompute/rocm_smi_lib)
- [Examples](https://github.com/RadeonOpenCompute/rocm_smi_lib/tree/master/python_smi_tools)

:::

:::{grid-item-card} ROCm Data Center Tool
:::{grid-item-card} {doc}`ROCm Data Center Tool <rdc:index>`
The ROCm™ Data Center Tool simplifies the administration and addresses key infrastructure challenges in AMD GPUs in cluster and data center environments.

- {doc}`Documentation <rdc:index>`
- [GitHub](https://github.com/RadeonOpenCompute/rdc)
- [Changelog](https://github.com/RadeonOpenCompute/rdc/blob/master/CHANGELOG.md)
- [Examples](https://github.com/RadeonOpenCompute/rdc/tree/master/example)

:::
@@ -3,10 +3,12 @@
:::::{grid} 1 1 2 2
:gutter: 1

:::{grid-item-card} {doc}`RVS <rocm-validation-suite:index>`
:::{grid-item-card} {doc}`RVS <rocmvalidationsuite:index>`
The ROCm Validation Suite is a system administrator’s and cluster manager's tool for detecting and troubleshooting common problems affecting AMD GPU(s) running in a high-performance computing environment, enabled using the ROCm software stack on a compatible platform.

- {doc}`Documentation <rocm-validation-suite:index>`
- {doc}`Documentation <rocmvalidationsuite:index>`
- [GitHub](https://github.com/ROCm-Developer-Tools/ROCmValidationSuite)
- [Changelog](https://github.com/ROCm-Developer-Tools/ROCmValidationSuite/blob/master/CHANGELOG.md)

:::
@@ -14,6 +16,7 @@ The ROCm Validation Suite is a system administrator’s and cluster manager's to
TransferBench is a simple utility capable of benchmarking simultaneous transfers between user-specified devices (CPUs/GPUs).

- {doc}`Documentation <transferbench:index>`
- [GitHub](https://github.com/ROCmSoftwarePlatform/TransferBench/)
- [Changelog](https://github.com/ROCmSoftwarePlatform/TransferBench/blob/develop/CHANGELOG.md)
- {doc}`transferbench:examples/index`
@@ -95,7 +95,7 @@ Use Driver Shipped with ROCm

:::

:::{tab-item} Radeon Pro™
:::{tab-item} Radeon™
:sync: radeonpro

[Use Radeon Pro Driver](https://www.amd.com/en/support/linux-drivers)
@@ -61,20 +61,6 @@ subtrees:
- caption: APIs and Reference
entries:
- file: reference/all
- file: reference/compilers
title: Compilers and Tools
subtrees:
- entries:
- file: reference/rocmcc/rocmcc
title: ROCmCC
- url: ${project:rocgdb}
title: ROCgdb
- url: ${project:rocprofiler}
title: rocprofiler
- url: ${project:roctracer}
title: roctracer
- url: ${project:rocdbgapi}
title: ROCdbgapi
- file: reference/hip
subtrees:
- entries:
@@ -82,8 +68,6 @@ subtrees:
url: ${project:hip}
- title: HIPify - Port Your Code
url: ${project:hipify}
- file: reference/openmp/openmp
title: OpenMP
- file: reference/gpu_libraries/math
title: Math Libraries
subtrees:
@@ -148,9 +132,9 @@ subtrees:
- title: MIOpen - Machine Intelligence
url: ${project:miopen}
- title: Composable Kernel
url: ${project:composable-kernel}
url: ${project:composable_kernel}
- title: MIGraphX - Graph Optimization
url: ${project:migraphx}
url: ${project:amdmigraphx}
- file: reference/computer_vision
subtrees:
- entries:
@@ -159,6 +143,22 @@ subtrees:
- entries:
- url: ${project:rocal}
title: rocAL
- file: reference/openmp/openmp
title: OpenMP
- file: reference/compilers
title: Compilers and Tools
subtrees:
- entries:
- file: reference/rocmcc/rocmcc
title: ROCmCC
- url: ${project:rocgdb}
title: ROCgdb
- url: ${project:rocprofiler}
title: rocprofiler
- url: ${project:roctracer}
title: roctracer
- url: ${project:rocdbgapi}
title: ROCdbgapi
- file: reference/management_tools
title: Management Tools
subtrees:
@@ -173,7 +173,7 @@ subtrees:
title: Validation Tools
subtrees:
- entries:
- url: ${project:rocm-validation-suite}
- url: ${project:rocmvalidationsuite}
title: RVS
- url: ${project:transferbench}
title: TransferBench
@@ -1 +1 @@
rocm-docs-core==0.18.2
rocm-docs-core==0.18.3
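
The pin above moves `rocm-docs-core` from 0.18.2 to 0.18.3. As a small illustration of confirming that a pin change is an upgrade rather than a downgrade, the sketch below compares the two pins as integer tuples. The helper `parse_pin` is hypothetical and only understands plain `name==X.Y.Z` lines with numeric components; it is not a general requirements parser.

```python
def parse_pin(line):
    """Split 'name==1.2.3' into ('name', (1, 2, 3)).

    Only exact pins with purely numeric components are supported.
    """
    name, _, version = line.strip().partition("==")
    return name, tuple(int(part) for part in version.split("."))


old_name, old_version = parse_pin("rocm-docs-core==0.18.2")
new_name, new_version = parse_pin("rocm-docs-core==0.18.3")

# Tuples compare element by element, so (0, 18, 3) > (0, 18, 2).
is_upgrade = old_name == new_name and new_version > old_version
print(is_upgrade)
```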
@@ -2,7 +2,7 @@
# This file is autogenerated by pip-compile with Python 3.8
# by the following command:
#
# pip-compile requirements.in
# pip-compile --resolver=backtracking requirements.in
#
accessible-pygments==0.0.3
# via pydata-sphinx-theme
@@ -100,7 +100,7 @@ requests==2.28.1
# via
# pygithub
# sphinx
rocm-docs-core==0.18.2
rocm-docs-core==0.18.3
# via -r requirements.in
smmap==5.0.0
# via gitdb
@@ -95,8 +95,6 @@ The `hipcc` and `hipconfig` Perl scripts are deprecated. In a future release, co
>
> There will be a transition period where the Perl scripts and compiled binaries are available before the scripts are removed. There will be no functional difference between the Perl scripts and their compiled binary counterpart. No user action is required. Once these are available, users can optionally switch to `hipcc.bin` and `hipconfig.bin`. The `hipcc`/`hipconfig` soft link will be assimilated to point from `hipcc`/`hipconfig` to the respective compiled binaries as the default option.

(5_4_0_filesystem_reorg_deprecation_notice)=

##### Linux Filesystem Hierarchy Standard for ROCm

ROCm packages have adopted the Linux foundation filesystem hierarchy standard in this release to ensure ROCm components follow open source conventions for Linux-based distributions. While moving to a new filesystem hierarchy, ROCm ensures backward compatibility with its 5.1 version or older filesystem hierarchy. See below for a detailed explanation of the new filesystem hierarchy and backward compatibility.
@@ -26,6 +26,29 @@ The following hipcc changes are implemented in this release:
- `hipCommander` at <https://github.com/ROCm-Developer-Tools/hip-tests/tree/develop/samples/1_Utils/hipCommander>

Note that the samples will continue to be available in previous release branches.
- Removal of gcnarch from hipDeviceProp_t structure
- Addition of new fields in hipDeviceProp_t structure
  - maxTexture1D
  - maxTexture2D
  - maxTexture1DLayered
  - maxTexture2DLayered
  - sharedMemPerMultiprocessor
  - deviceOverlap
  - asyncEngineCount
  - surfaceAlignment
  - unifiedAddressing
  - computePreemptionSupported
  - hostRegisterSupported
  - uuid
- Removal of deprecated code
  - hip-hcc codes from hip code tree
- Correct hipArray usage in HIP APIs such as hipMemcpyAtoH and hipMemcpyHtoA
- HIPMEMCPY_3D fields correction to avoid truncation of "size_t" to "unsigned int" inside hipMemcpy3D()
- Renaming of 'memoryType' in hipPointerAttribute_t structure to 'type'
- Correct hipGetLastError to return the last error instead of last API call's return code
- Update hipExternalSemaphoreHandleDesc to add "unsigned int reserved[16]"
- Correct handling of flag values in hipIpcOpenMemHandle for hipIpcMemLazyEnablePeerAccess
- Remove hiparray* and make it opaque with hipArray_t
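
The `hipGetLastError` item in the list above distinguishes "the last error" from "the last API call's return code". The toy Python model below illustrates only that distinction. It is not the HIP API: `ToyRuntime`, its fields, and the numeric codes are hypothetical, and it deliberately does not model whether reading the error resets it.

```python
class ToyRuntime:
    """Toy model of two ways a runtime could report errors."""

    SUCCESS = 0

    def __init__(self):
        self.last_return_code = self.SUCCESS  # overwritten by every call
        self.last_error = self.SUCCESS        # overwritten only by failures

    def call(self, code):
        """Record the outcome of one simulated API call."""
        self.last_return_code = code
        if code != self.SUCCESS:
            self.last_error = code
        return code


rt = ToyRuntime()
rt.call(1)                   # a simulated failing call (code 1 is arbitrary)
rt.call(ToyRuntime.SUCCESS)  # a later call that succeeds

# The return-code view has forgotten the failure; the last-error view has not.
print(rt.last_return_code, rt.last_error)
```

Under the corrected behaviour described in the changelog, a caller that checks for errors after a batch of calls would still see the earlier failure rather than the success of the most recent call.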

##### New HIP APIs in This Release
@@ -21,6 +21,20 @@ ROCm 5.6 consists of several AI software ecosystem improvements to our fast-grow
- Bug fixes will not be back ported to older ROCm releases for this SKU
- Distro / Operating system updates will continue as per the ROCm release cadence for gfx906 GPUs till EOM.

#### AMDSMI CLI 23.0.0.4

##### Added

- AMDSMI CLI tool enabled for Linux Bare Metal & Guest

- Package: amd-smi-lib

##### Known Issues

- not all Error Correction Code (ECC) fields are currently supported

- RHEL 8 & SLES 15 have extra install steps

#### Kernel Modules (DKMS)

##### Fixes
@@ -59,7 +73,7 @@ ROCm 5.6 consists of several AI software ecosystem improvements to our fast-grow
- Limited testing on xnack+ configuration
- Multiple HIP tests failures (gpuvm fault or hangs)
- hipSetDevice and hipSetDeviceFlags APIs return hipErrorInvalidDevice instead of hipErrorNoDevice, on a system without GPU
- Known memory leak when code object files are loaded/unloaded via hipModuleLoad/hipModuleUnload APIs. Issue will be fixed in ROCm 5.6.1
- Known memory leak when code object files are loaded/unloaded via hipModuleLoad/hipModuleUnload APIs. Issue will be fixed in a future ROCm release

##### Upcoming changes in future release