diff --git a/docs/compatibility/compatibility-matrix.rst b/docs/compatibility/compatibility-matrix.rst new file mode 100644 index 000000000..3b8819883 --- /dev/null +++ b/docs/compatibility/compatibility-matrix.rst @@ -0,0 +1,125 @@ +.. meta:: + :description: ROCm compatibility matrix + :keywords: AMD, GPU, architecture, hardware, compatibility, requirements + +************************************************************************************** +Compatibility matrix +************************************************************************************** + +Use this matrix to view the ROCm compatibility across successive major and minor releases. + + +.. container:: format-big-table + + .. csv-table:: + :header: "ROCm Version", "6.1.0", "6.0.0" + :stub-columns: 1 + + :doc:`Operating Systems `, Ubuntu 22.04.z (z=3-4),Ubuntu 22.04.z (z=3-4) + ,Ubuntu 20.04.z (z=5-6),Ubuntu 20.04.z (z=5-6) + ,RHEL 9.y (y=2-3),RHEL 9.y (y=2-3) + ,RHEL 8.y (y=8-9),RHEL 8.y (y=8-9) + ,SLES 15 SPy (y=4-5),SLES 15 SPy (y=4-5) + ,CentOS 7.9,CentOS 7.9 + ,, + :doc:`GFX Architecture `,CDNA3,CDNA3 + ,CDNA2,CDNA2 + ,CDNA,CDNA + ,RDNA3,RDNA3 + ,RDNA2,RDNA2 + ,, + :doc:`GFX Card `,gfx1100,gfx1100 + ,gfx1030,gfx1030 + ,gfx942 [#]_, gfx942 [#]_ + ,gfx90a,gfx90a + ,gfx908,gfx908 + ,, + ECOSYSTEM SUPPORT:,, + :doc:`PyTorch `,2.y (y=0-1) and 1.13,2.y (y=0-1) and 1.13 + :doc:`Tensorflow `,2.1y (y=3-5),2.1y (y=2-4) + :doc:`JAX `,0.4.26,0.4.26 + `ONNX-RT `_,1.17.3,1.14.1 + ,, + 3RD PARTY COMMUNICATION LIBS:,, + `UCC `_,>=1.2.0,>=1.2.0 + `UCX `_,>=1.14.1,>=1.14.1 + ,, + 3RD PARTY ALGORITHM LIBS:,, + Thrust,2.1.0,2.0.1 + CUB,2.1.0,2.0.1 + ,, + ML & COMPUTER VISION LIBS:,, + :doc:`Composable Kernel `,1.1.0,1.1.0 + :doc:`MIGraphX `,2.9.0,2.8.0 + :doc:`MIOpen `,3.1.0,3.0.0 + :doc:`MIVisionX `,2.5.0,2.5.0 + :doc:`rocDecode `,0.5.0,N/A + :doc:`RPP `,1.5.0,1.4.0 + ,, + COMMUNICATION:,, + :doc:`rccl `,2.18.6,2.18.3 + ,, + MATH LIBS:,, + `half `_ ,1.12.0,1.12.0 + :doc:`hipBLAS `,2.1.0,2.0.0 + :doc:`hipBLASLt 
`,0.7.0,0.6.0 + :doc:`hipFFT `,1.0.14,1.0.13 + :doc:`hipFORT `,0.4.0,0.4.0 + :doc:`hipRAND `,2.10.16,2.10.16 + :doc:`hipSOLVER `,2.1.0,2.0.0 + :doc:`hipSPARSE `,3.0.1,3.0.0 + :doc:`hipSPARSELt `,0.1.0,0.1.0 + :doc:`rocALUTION `,3.1.1,3.0.3 + :doc:`rocBLAS `,4.1.0,4.0.0 + :doc:`rocFFT `,1.0.27,1.0.23 + :doc:`rocRAND `,3.0.1,2.10.17 + :doc:`rocSOLVER `,3.25.0,3.24.0 + :doc:`rocSPARSE `,3.1.2,3.0.2 + :doc:`rocWMMA `,1.4.0,1.3.0 + `Tensile `_,4.40.0,4.39.0 + ,, + PRIMITIVES:,, + :doc:`hipCUB `,3.1.0,3.0.0 + :doc:`hipTensor `,1.2.0,1.1.0 + :doc:`rocPRIM `,3.1.0,3.0.0 + :doc:`rocThrust `,3.0.1,3.0.0 + ,, + SUPPORT LIBS:,, + `hipother `_,6.1.40091,6.0.32830 + `rocm-cmake `_,0.12.0,0.11.0 + `rocm-core `_,6.1.0,6.0.0 + `ROCT-Thunk-Interface `_,20240125.3.30,20231016.2.245 + ,, + TOOLS:,, + :doc:`HIPIFY `,17.0.0,17.0.0 + :doc:`ROCdbgapi `,0.71.0,0.71.0 + `ROCdebug-Agent `_,2.0.3,2.0.3 + :doc:`rocGDB `,14.1.0,13.2.0 + :doc:`rocProfiler `,2.0.60100,2.0.0 + `rocprofiler-register `_,0.3.0,N/A + :doc:`rocTracer `,4.1.60100,4.1.0 + `rocm_bandwidth_test `_,1.4.0,1.4.0 + :doc:`ROCm Data Center Tool `,0.3.0,0.3.0 + `rocminfo `_,1.0.0,1.0.0 + :doc:`ROCm SMI Lib `,7.0.0,6.0.0 + :doc:`ROCm Validation Suite `,rocm-6.1.0,rocm-6.0.0 + :doc:`TransferBench `,1.48,1.46 + ,, + COMPILERS:,, + `AOMP `_,17.60.0,17.60.0 + `clang-ocl `_,0.5.0,0.5.0 + `Flang `_,17.0.0.24103,17.0.0.23483 + `llvm-project `_,17.0.0.24103,17.0.0.23483 + `OpenMP `_,17.0.0.24103,17.0.0.23483 + ,, + RUNTIMES:,, + :doc:`HIP `,6.1.40091,6.0.32830 + `OpenCL Runtime `_,2.0.0,2.0.0 + `ROCR Runtime `_,1.13.0,1.12.0 + + +.. rubric:: Footnotes +.. [#] **For ROCm 6.1** - MI300A (gfx942) is supported on Ubuntu 22.04.4, RHEL 9.3 & 8.9 and SLES 15 SP5. MI300X (gfx942) is only supported on Ubuntu 22.04.4. +.. [#] **For ROCm 6.0** - MI300A (gfx942) is supported on Ubuntu 22.04.3, RHEL 8.9 and SLES 15 SP5. MI300X (gfx942) is only supported on Ubuntu 22.04.3. 
+ + diff --git a/docs/about/compatibility/precision-support.rst b/docs/compatibility/precision-support.rst similarity index 99% rename from docs/about/compatibility/precision-support.rst rename to docs/compatibility/precision-support.rst index fa0a385ad..3474dd67a 100644 --- a/docs/about/compatibility/precision-support.rst +++ b/docs/compatibility/precision-support.rst @@ -47,7 +47,7 @@ Floating-point types The floating-point types that are supported by ROCm are listed in the following table, together with their corresponding HIP type and a short description. -.. image:: ../../data/about/compatibility/floating-point-data-types.png +.. image:: ../data/about/compatibility/floating-point-data-types.png :alt: Supported floating-point types .. list-table:: diff --git a/docs/conf.py b/docs/conf.py index eade37a8f..3d31982d2 100644 --- a/docs/conf.py +++ b/docs/conf.py @@ -98,6 +98,9 @@ external_projects_current_project = "rocm" html_theme = "rocm_docs_theme" html_theme_options = {"flavor": "rocm-docs-home"} +html_static_path = ["sphinx/static/css"] +html_css_files = ["rocm_custom.css"] + html_title = "ROCm Documentation" html_theme_options = { diff --git a/docs/index.md b/docs/index.md index 862330ac8..b85ab88f3 100644 --- a/docs/index.md +++ b/docs/index.md @@ -51,13 +51,14 @@ Our documentation is organized into the following categories: :img-alt: Compatibility information :padding: 2 +* [Compatibility matrix](./compatibility/compatibility-matrix.rst) * {doc}`System requirements (Linux)` * {doc}`System requirements (Windows)` * {doc}`Third-party support` * {doc}`User/kernel space` * {doc}`Docker` * [OpenMP](./about/compatibility/openmp.md) -* [Precision support](./about/compatibility/precision-support.rst) +* [Precision support](./compatibility/precision-support.rst) * {doc}`ROCm on Radeon GPUs` ::: diff --git a/docs/reference/gpu-arch-specs.rst b/docs/reference/gpu-arch-specs.rst index 3cc70ff5a..e830bc251 100644 --- a/docs/reference/gpu-arch-specs.rst +++ 
b/docs/reference/gpu-arch-specs.rst @@ -7,655 +7,657 @@ Accelerator and GPU hardware specifications The following tables provide an overview of the hardware specifications for AMD Instinct™ accelerators, and AMD Radeon™ PRO and Radeon™ GPUs. -.. tab-set:: +.. container:: horizontal-scrolling-container - .. tab-item:: AMD Instinct accelerators + .. tab-set:: - .. list-table:: - :header-rows: 1 - :name: instinct-arch-spec-table + .. tab-item:: AMD Instinct accelerators - * - - Model - - Architecture - - LLVM target name - - VRAM (GiB) - - Compute Units - - Wavefront Size - - LDS (KiB) - - L3 Cache (MiB) - - L2 Cache (MiB) - - L1 Vector Cache (KiB) - - L1 Scalar Cache (KiB) - - L1 Instruction Cache (KiB) - - VGPR File (KiB) - - SGPR File (KiB) - * - - MI300X - - CDNA3 - - gfx941 or gfx942 - - 192 - - 304 - - 64 - - 64 - - 256 - - 32 - - 32 - - 16 per 2 CUs - - 64 per 2 CUs - - 512 - - 12.5 - * - - MI300A - - CDNA3 - - gfx940 or gfx942 - - 128 - - 228 - - 64 - - 64 - - 256 - - 24 - - 32 - - 16 per 2 CUs - - 64 per 2 CUs - - 512 - - 12.5 - * - - MI250X - - CDNA2 - - gfx90a - - 128 - - 220 (110 per GCD) - - 64 - - 64 - - - - 16 (8 per GCD) - - 16 - - 16 per 2 CUs - - 32 per 2 CUs - - 512 - - 12.5 - * - - MI250 - - CDNA2 - - gfx90a - - 128 - - 208 - - 64 - - 64 - - - - 16 (8 per GCD) - - 16 - - 16 per 2 CUs - - 32 per 2 CUs - - 512 - - 12.5 - * - - MI210 - - CDNA2 - - gfx90a - - 64 - - 104 - - 64 - - 64 - - - - 8 - - 16 - - 16 per 2 CUs - - 32 per 2 CUs - - 512 - - 12.5 - * - - MI100 - - CDNA - - gfx908 - - 32 - - 120 - - 64 - - 64 - - - - 8 - - 16 - - 16 per 3 CUs - - 32 per 3 CUs - - 256 VGPR and 256 AccVGPR - - 12.5 - * - - MI60 - - GCN5.1 - - gfx906 - - 32 - - 64 - - 64 - - 64 - - - - 4 - - 16 - - 16 per 3 CUs - - 32 per 3 CUs - - 256 - - 12.5 - * - - MI50 (32GB) - - GCN5.1 - - gfx906 - - 32 - - 60 - - 64 - - 64 - - - - 4 - - 16 - - 16 per 3 CUs - - 32 per 3 CUs - - 256 - - 12.5 - * - - MI50 (16GB) - - GCN5.1 - - gfx906 - - 16 - - 60 - - 64 - - 64 - - - - 4 - - 16 - - 
16 per 3 CUs - - 32 per 3 CUs - - 256 - - 12.5 - * - - MI25 - - GCN5.0 - - gfx900 - - 16  - - 64 - - 64 - - 64  - - - - 4  - - 16  - - 16 per 3 CUs - - 32 per 3 CUs - - 256 - - 12.5 - * - - MI8 - - GCN3.0 - - gfx803 - - 4 - - 64 - - 64 - - 64 - - - - 2 - - 16 - - 16 per 4 CUs - - 32 per 4 CUs - - 256 - - 12.5 - * - - MI6 - - GCN4.0 - - gfx803 - - 16 - - 36 - - 64 - - 64 - - - - 2 - - 16 - - 16 per 4 CUs - - 32 per 4 CUs - - 256 - - 12.5 + .. list-table:: + :header-rows: 1 + :name: instinct-arch-spec-table - .. tab-item:: AMD Radeon PRO GPUs + * + - Model + - Architecture + - LLVM target name + - VRAM (GiB) + - Compute Units + - Wavefront Size + - LDS (KiB) + - L3 Cache (MiB) + - L2 Cache (MiB) + - L1 Vector Cache (KiB) + - L1 Scalar Cache (KiB) + - L1 Instruction Cache (KiB) + - VGPR File (KiB) + - SGPR File (KiB) + * + - MI300X + - CDNA3 + - gfx941 or gfx942 + - 192 + - 304 + - 64 + - 64 + - 256 + - 32 + - 32 + - 16 per 2 CUs + - 64 per 2 CUs + - 512 + - 12.5 + * + - MI300A + - CDNA3 + - gfx940 or gfx942 + - 128 + - 228 + - 64 + - 64 + - 256 + - 24 + - 32 + - 16 per 2 CUs + - 64 per 2 CUs + - 512 + - 12.5 + * + - MI250X + - CDNA2 + - gfx90a + - 128 + - 220 (110 per GCD) + - 64 + - 64 + - + - 16 (8 per GCD) + - 16 + - 16 per 2 CUs + - 32 per 2 CUs + - 512 + - 12.5 + * + - MI250 + - CDNA2 + - gfx90a + - 128 + - 208 + - 64 + - 64 + - + - 16 (8 per GCD) + - 16 + - 16 per 2 CUs + - 32 per 2 CUs + - 512 + - 12.5 + * + - MI210 + - CDNA2 + - gfx90a + - 64 + - 104 + - 64 + - 64 + - + - 8 + - 16 + - 16 per 2 CUs + - 32 per 2 CUs + - 512 + - 12.5 + * + - MI100 + - CDNA + - gfx908 + - 32 + - 120 + - 64 + - 64 + - + - 8 + - 16 + - 16 per 3 CUs + - 32 per 3 CUs + - 256 VGPR and 256 AccVGPR + - 12.5 + * + - MI60 + - GCN5.1 + - gfx906 + - 32 + - 64 + - 64 + - 64 + - + - 4 + - 16 + - 16 per 3 CUs + - 32 per 3 CUs + - 256 + - 12.5 + * + - MI50 (32GB) + - GCN5.1 + - gfx906 + - 32 + - 60 + - 64 + - 64 + - + - 4 + - 16 + - 16 per 3 CUs + - 32 per 3 CUs + - 256 + - 12.5 + * + - MI50 
(16GB) + - GCN5.1 + - gfx906 + - 16 + - 60 + - 64 + - 64 + - + - 4 + - 16 + - 16 per 3 CUs + - 32 per 3 CUs + - 256 + - 12.5 + * + - MI25 + - GCN5.0 + - gfx900 + - 16  + - 64 + - 64 + - 64  + - + - 4  + - 16  + - 16 per 3 CUs + - 32 per 3 CUs + - 256 + - 12.5 + * + - MI8 + - GCN3.0 + - gfx803 + - 4 + - 64 + - 64 + - 64 + - + - 2 + - 16 + - 16 per 4 CUs + - 32 per 4 CUs + - 256 + - 12.5 + * + - MI6 + - GCN4.0 + - gfx803 + - 16 + - 36 + - 64 + - 64 + - + - 2 + - 16 + - 16 per 4 CUs + - 32 per 4 CUs + - 256 + - 12.5 - .. list-table:: - :header-rows: 1 - :name: radeon-pro-arch-spec-table + .. tab-item:: AMD Radeon PRO GPUs - * - - Model - - Architecture - - LLVM target name - - VRAM (GiB) - - Compute Units - - Wavefront Size - - LDS (KiB) - - Infinity Cache (MiB) - - L2 Cache (MiB) - - Graphics L1 Cache (KiB) - - L0 Vector Cache (KiB) - - L0 Scalar Cache (KiB) - - L0 Instruction Cache (KiB) - - VGPR File (KiB) - - SGPR File (KiB) - * - - Radeon PRO W7900 - - RDNA3 - - gfx1100 - - 48 - - 96 - - 32 - - 128 - - 96 - - 6 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon PRO W7800 - - RDNA3 - - gfx1100 - - 32 - - 70 - - 32 - - 128 - - 64 - - 6 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon PRO W7700 - - RDNA3 - - gfx1101 - - 16 - - 48 - - 32 - - 128 - - 64 - - 4 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon PRO W6800 - - RDNA2 - - gfx1030 - - 32 - - 60 - - 32 - - 128 - - 128 - - 4 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon PRO W6600 - - RDNA2 - - gfx1032 - - 8 - - 28 - - 32 - - 128 - - 32 - - 2 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon PRO V620 - - RDNA2 - - gfx1030 - - 32 - - 72 - - 32 - - 128 - - 128 - - 4 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon Pro W5500 - - RDNA - - gfx1012 - - 8 - - 22 - - 32 - - 128 - - - - 4 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon Pro VII - - GCN5.1 - - gfx906 - - 16 - - 60 - - 64 - - 64 - - - - 4 - - - - 16 - - 16 per 3 CUs - - 32 per 3 CUs - - 
256 - - 12.5 + .. list-table:: + :header-rows: 1 + :name: radeon-pro-arch-spec-table - .. tab-item:: AMD Radeon GPUs + * + - Model + - Architecture + - LLVM target name + - VRAM (GiB) + - Compute Units + - Wavefront Size + - LDS (KiB) + - Infinity Cache (MiB) + - L2 Cache (MiB) + - Graphics L1 Cache (KiB) + - L0 Vector Cache (KiB) + - L0 Scalar Cache (KiB) + - L0 Instruction Cache (KiB) + - VGPR File (KiB) + - SGPR File (KiB) + * + - Radeon PRO W7900 + - RDNA3 + - gfx1100 + - 48 + - 96 + - 32 + - 128 + - 96 + - 6 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon PRO W7800 + - RDNA3 + - gfx1100 + - 32 + - 70 + - 32 + - 128 + - 64 + - 6 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon PRO W7700 + - RDNA3 + - gfx1101 + - 16 + - 48 + - 32 + - 128 + - 64 + - 4 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon PRO W6800 + - RDNA2 + - gfx1030 + - 32 + - 60 + - 32 + - 128 + - 128 + - 4 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon PRO W6600 + - RDNA2 + - gfx1032 + - 8 + - 28 + - 32 + - 128 + - 32 + - 2 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon PRO V620 + - RDNA2 + - gfx1030 + - 32 + - 72 + - 32 + - 128 + - 128 + - 4 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon Pro W5500 + - RDNA + - gfx1012 + - 8 + - 22 + - 32 + - 128 + - + - 4 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon Pro VII + - GCN5.1 + - gfx906 + - 16 + - 60 + - 64 + - 64 + - + - 4 + - + - 16 + - 16 per 3 CUs + - 32 per 3 CUs + - 256 + - 12.5 - .. list-table:: - :header-rows: 1 - :name: radeon-arch-spec-table + .. 
tab-item:: AMD Radeon GPUs - * - - Model - - Architecture - - LLVM target name - - VRAM (GiB) - - Compute Units - - Wavefront Size - - LDS (KiB) - - Infinity Cache (MiB) - - L2 Cache (MiB) - - Graphics L1 Cache (KiB) - - L0 Vector Cache (KiB) - - L0 Scalar Cache (KiB) - - L0 Instruction Cache (KiB) - - VGPR File (KiB) - - SGPR File (KiB) - * - - Radeon RX 7900 XTX - - RDNA3 - - gfx1100 - - 24 - - 96 - - 32 - - 128 - - 96 - - 6 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon RX 7900 XT - - RDNA3 - - gfx1100 - - 20 - - 84 - - 32 - - 128 - - 80 - - 6 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon RX 7900 GRE - - RDNA3 - - gfx1100 - - 16 - - 80 - - 32 - - 128 - - 64 - - 6 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon RX 7800 XT - - RDNA3 - - gfx1101 - - 16 - - 60 - - 32 - - 128 - - 64 - - 4 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon RX 7700 XT - - RDNA3 - - gfx1101 - - 12 - - 54 - - 32 - - 128 - - 48 - - 4 - - 256 - - 32 - - 16 - - 32 - - 384 - - 20 - * - - Radeon RX 7600 - - RDNA3 - - gfx1102 - - 8 - - 32 - - 32 - - 128 - - 32 - - 2 - - 256 - - 32 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6950 XT - - RDNA2 - - gfx1030 - - 16 - - 80 - - 32 - - 128 - - 128 - - 4 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6900 XT - - RDNA2 - - gfx1030 - - 16 - - 80 - - 32 - - 128 - - 128 - - 4 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6800 XT - - RDNA2 - - gfx1030 - - 16 - - 72 - - 32 - - 128 - - 128 - - 4 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6800 - - RDNA2 - - gfx1030 - - 16 - - 60 - - 32 - - 128 - - 128 - - 4 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6750 XT - - RDNA2 - - gfx1031 - - 12 - - 40 - - 32 - - 128 - - 96 - - 3 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6700 XT - - RDNA2 - - gfx1031 - - 12 - - 40 - - 32 - - 128 - - 96 - - 3 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6700 - - RDNA2 - - gfx1031 - - 10 
- - 36 - - 32 - - 128 - - 80 - - 3 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6650 XT - - RDNA2 - - gfx1032 - - 8 - - 32 - - 32 - - 128 - - 32 - - 2 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6600 XT - - RDNA2 - - gfx1032 - - 8 - - 32 - - 32 - - 128 - - 32 - - 2 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon RX 6600 - - RDNA2 - - gfx1032 - - 8 - - 28 - - 32 - - 128 - - 32 - - 2 - - 128 - - 16 - - 16 - - 32 - - 256 - - 20 - * - - Radeon VII - - GCN5.1 - - gfx906 - - 16 - - 60 - - 64 - - 64 per CU - - - - 4 - - - - 16 - - 16 per 3 CUs - - 32 per 3 CUs - - 256 - - 12.5 + .. list-table:: + :header-rows: 1 + :name: radeon-arch-spec-table + + * + - Model + - Architecture + - LLVM target name + - VRAM (GiB) + - Compute Units + - Wavefront Size + - LDS (KiB) + - Infinity Cache (MiB) + - L2 Cache (MiB) + - Graphics L1 Cache (KiB) + - L0 Vector Cache (KiB) + - L0 Scalar Cache (KiB) + - L0 Instruction Cache (KiB) + - VGPR File (KiB) + - SGPR File (KiB) + * + - Radeon RX 7900 XTX + - RDNA3 + - gfx1100 + - 24 + - 96 + - 32 + - 128 + - 96 + - 6 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon RX 7900 XT + - RDNA3 + - gfx1100 + - 20 + - 84 + - 32 + - 128 + - 80 + - 6 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon RX 7900 GRE + - RDNA3 + - gfx1100 + - 16 + - 80 + - 32 + - 128 + - 64 + - 6 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon RX 7800 XT + - RDNA3 + - gfx1101 + - 16 + - 60 + - 32 + - 128 + - 64 + - 4 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon RX 7700 XT + - RDNA3 + - gfx1101 + - 12 + - 54 + - 32 + - 128 + - 48 + - 4 + - 256 + - 32 + - 16 + - 32 + - 384 + - 20 + * + - Radeon RX 7600 + - RDNA3 + - gfx1102 + - 8 + - 32 + - 32 + - 128 + - 32 + - 2 + - 256 + - 32 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6950 XT + - RDNA2 + - gfx1030 + - 16 + - 80 + - 32 + - 128 + - 128 + - 4 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6900 XT + - RDNA2 + - gfx1030 + - 16 + 
- 80 + - 32 + - 128 + - 128 + - 4 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6800 XT + - RDNA2 + - gfx1030 + - 16 + - 72 + - 32 + - 128 + - 128 + - 4 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6800 + - RDNA2 + - gfx1030 + - 16 + - 60 + - 32 + - 128 + - 128 + - 4 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6750 XT + - RDNA2 + - gfx1031 + - 12 + - 40 + - 32 + - 128 + - 96 + - 3 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6700 XT + - RDNA2 + - gfx1031 + - 12 + - 40 + - 32 + - 128 + - 96 + - 3 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6700 + - RDNA2 + - gfx1031 + - 10 + - 36 + - 32 + - 128 + - 80 + - 3 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6650 XT + - RDNA2 + - gfx1032 + - 8 + - 32 + - 32 + - 128 + - 32 + - 2 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6600 XT + - RDNA2 + - gfx1032 + - 8 + - 32 + - 32 + - 128 + - 32 + - 2 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon RX 6600 + - RDNA2 + - gfx1032 + - 8 + - 28 + - 32 + - 128 + - 32 + - 2 + - 128 + - 16 + - 16 + - 32 + - 256 + - 20 + * + - Radeon VII + - GCN5.1 + - gfx906 + - 16 + - 60 + - 64 + - 64 per CU + - + - 4 + - + - 16 + - 16 per 3 CUs + - 32 per 3 CUs + - 256 + - 12.5 For more information on the terms used here, see the :ref:`specific documents and guides ` or :doc:`Understanding the HIP programming model`. 
diff --git a/docs/reference/rocm-tools.md b/docs/reference/rocm-tools.md index 60801dd92..1a53a3273 100644 --- a/docs/reference/rocm-tools.md +++ b/docs/reference/rocm-tools.md @@ -19,6 +19,7 @@ :img-alt: Development tools :padding: 2 +* {doc}`HIPIFY ` * {doc}`ROCdbgapi ` * [ROCmCC](./rocmcc.md) * [ROCm Debug Agent](https://github.com/ROCm/rocr_debug_agent) @@ -35,6 +36,7 @@ * [RocBandwidthTest](https://github.com/ROCm/rocm_bandwidth_test) * {doc}`ROCProfiler ` +* [rocprofiler-register](https://github.com/ROCm/rocprofiler-register) * {doc}`ROCTracer ` ::: diff --git a/docs/sphinx/_toc.yml.in b/docs/sphinx/_toc.yml.in index af68deae1..433e30e81 100644 --- a/docs/sphinx/_toc.yml.in +++ b/docs/sphinx/_toc.yml.in @@ -25,11 +25,13 @@ subtrees: - caption: Compatibility entries: + - file: compatibility/compatibility-matrix.rst + title: Compatibility matrix - url: https://rocm.docs.amd.com/projects/install-on-linux/en/${branch}/reference/system-requirements.html title: Linux - url: https://rocm.docs.amd.com/projects/install-on-windows/en/${branch}/reference/system-requirements.html title: Windows - - file: about/compatibility/precision-support.rst + - file: compatibility/precision-support.rst title: Precision support - url: https://rocm.docs.amd.com/projects/install-on-linux/en/${branch}/reference/3rd-party-support-matrix.html title: Third-party diff --git a/docs/sphinx/static/css/rocm_custom.css b/docs/sphinx/static/css/rocm_custom.css new file mode 100644 index 000000000..7d14bb961 --- /dev/null +++ b/docs/sphinx/static/css/rocm_custom.css @@ -0,0 +1,53 @@ + +/* Adds container for big tables, used for Compatibility Matrix */ + + +/* Header row to have opaque background colour when sticky */ +.format-big-table th { + background-color: var(--pst-color-background); + white-space: nowrap; + } + + /* Turn on borders for whole table */ + .format-big-table th, + .format-big-table td { + border-width: 1px; + } + + /* .format-big-table th.head { */ + /* background-color: 
var(--pst-color-on-surface); */ + /* } */ + + /* Sticky header for table excluding the stub*/ + .format-big-table th.head:not(.stub) { + position: sticky; + top: 3rem; + z-index: 1; + } + + /* Sticky header for the head & stub: top left cell */ + .format-big-table th.head.stub { + position: sticky; + top: 3rem; + z-index: 1; + background-color: var(--pst-color-background); + white-space: nowrap; + } + + /* Sticky for the stub column */ + /*.format-big-table tbody th:not(:empty) { + position: sticky; + top: 3rem; + z-index: 2; + }*/ + + /* Removes borders for stub column */ + .format-big-table tbody th { + border-top: none; + border-bottom: none; + } + + /* For horizontal scrolling only. Can't be combined with format-big-table container */ + .horizontal-scrolling-container { + overflow-x: scroll; + }
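
Review note: the conf.py hunk registers `rocm_custom.css` through `html_css_files` and adds `sphinx/static/css` to `html_static_path`. A plain filename in `html_css_files` is resolved relative to a static path, so a quick pre-build check that the file really exists there can catch a path typo early. The sketch below is only an illustration: the helper name `missing_css_files` is hypothetical (it is not part of Sphinx or of this change), and it handles plain relative filenames only, not full URLs or `(filename, attributes)` tuples.

```python
from pathlib import Path


def missing_css_files(conf_dir, static_paths, css_files):
    """Return the entries of css_files not found under any static path.

    Hypothetical helper for illustration only; the arguments mirror the
    values of html_static_path and html_css_files set in docs/conf.py.
    """
    conf_dir = Path(conf_dir)
    missing = []
    for css in css_files:
        # Static paths are relative to the directory holding conf.py,
        # and each stylesheet name is relative to a static path.
        found = any((conf_dir / static / css).is_file() for static in static_paths)
        if not found:
            missing.append(css)
    return missing


if __name__ == "__main__":
    # Values taken from the conf.py hunk in this diff; "docs" is assumed
    # to be the directory containing conf.py when run from the repo root.
    print(missing_css_files("docs", ["sphinx/static/css"], ["rocm_custom.css"]))
```

An empty list means every listed stylesheet was found; any name that is returned points at a file missing from the static directories.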