ROCm/docs/reference/gpu-arch-specs.rst
Young Hui - AMD a29a457691 Merge develop into docs/6.1.0 (#3011)
2024-04-12 17:05:14 -04:00

.. meta::
   :description: AMD Instinct™ GPU architecture information
   :keywords: Instinct, CDNA, GPU, architecture, VRAM, Compute Units, Cache, Registers, LDS, Register File

AMD hardware specifications
########################################

The following tables provide an overview of the hardware specifications for AMD Instinct™ accelerators, AMD Radeon™ PRO GPUs, and AMD Radeon™ GPUs.


.. tab-set::

  .. tab-item:: AMD Instinct Accelerators

    .. list-table::
        :header-rows: 1
        :name: instinct-arch-spec-table

        *
          - Model
          - Architecture
          - LLVM target name
          - VRAM
          - Compute Units
          - Warp Size
          - LDS
          - L3 Cache
          - L2 Cache
          - L1 Vector Cache
          - L1 Scalar Cache
          - L1 Instruction Cache
          - VGPR File
          - SGPR File
        *
          - MI300X
          - CDNA3
          - gfx941 or gfx942
          - 192 GiB
          - 304
          - 64
          - 64 KiB
          - 256 MiB
          - 32 MiB
          - 32 KiB
          - 16 KiB per 2 CUs
          - 64 KiB per 2 CUs
          - 512 KiB
          - 12.5 KiB
        *
          - MI300A
          - CDNA3
          - gfx940 or gfx942
          - 128 GiB
          - 228
          - 64
          - 64 KiB
          - 256 MiB
          - 24 MiB
          - 32 KiB
          - 16 KiB per 2 CUs
          - 64 KiB per 2 CUs
          - 512 KiB
          - 12.5 KiB
        *
          - MI250X
          - CDNA2
          - gfx90a
          - 128 GiB
          - 220 (110 per GCD)
          - 64
          - 64 KiB
          -
          - 16 MiB (8 MiB per GCD)
          - 16 KiB
          - 16 KiB per 2 CUs
          - 32 KiB per 2 CUs
          - 512 KiB
          - 12.5 KiB
        *
          - MI250
          - CDNA2
          - gfx90a
          - 128 GiB
          - 208
          - 64
          - 64 KiB
          -
          - 16 MiB (8 MiB per GCD)
          - 16 KiB
          - 16 KiB per 2 CUs
          - 32 KiB per 2 CUs
          - 512 KiB
          - 12.5 KiB
        *
          - MI210
          - CDNA2
          - gfx90a
          - 64 GiB
          - 104
          - 64
          - 64 KiB
          -
          - 8 MiB
          - 16 KiB
          - 16 KiB per 2 CUs
          - 32 KiB per 2 CUs
          - 512 KiB
          - 12.5 KiB
        *
          - MI100
          - CDNA
          - gfx908
          - 32 GiB
          - 120
          - 64
          - 64 KiB
          -
          - 8 MiB
          - 16 KiB
          - 16 KiB per 3 CUs
          - 32 KiB per 3 CUs
          - 256 KiB VGPR and 256 KiB AccVGPR
          - 12.5 KiB
        *
          - MI60
          - GCN5.1
          - gfx906
          - 32 GiB
          - 64
          - 64
          - 64 KiB
          -
          - 4 MiB
          - 16 KiB
          - 16 KiB per 3 CUs
          - 32 KiB per 3 CUs
          - 256 KiB
          - 12.5 KiB
        *
          - MI50 (32GB)
          - GCN5.1
          - gfx906
          - 32 GiB
          - 60
          - 64
          - 64 KiB
          -
          - 4 MiB
          - 16 KiB
          - 16 KiB per 3 CUs
          - 32 KiB per 3 CUs
          - 256 KiB
          - 12.5 KiB
        *
          - MI50 (16GB)
          - GCN5.1
          - gfx906
          - 16 GiB
          - 60
          - 64
          - 64 KiB
          -
          - 4 MiB
          - 16 KiB
          - 16 KiB per 3 CUs
          - 32 KiB per 3 CUs
          - 256 KiB
          - 12.5 KiB
        *
          - MI25
          - GCN5.0
          - gfx900
          - 16 GiB
          - 64
          - 64
          - 64 KiB
          -
          - 4 MiB
          - 16 KiB
          - 16 KiB per 3 CUs
          - 32 KiB per 3 CUs
          - 256 KiB
          - 12.5 KiB
        *
          - MI8
          - GCN3.0
          - gfx803
          - 4 GiB
          - 64
          - 64
          - 64 KiB
          -
          - 2 MiB
          - 16 KiB
          - 16 KiB per 4 CUs
          - 32 KiB per 4 CUs
          - 256 KiB
          - 12.5 KiB
        *
          - MI6
          - GCN4.0
          - gfx803
          - 16 GiB
          - 36
          - 64
          - 64 KiB
          -
          - 2 MiB
          - 16 KiB
          - 16 KiB per 4 CUs
          - 32 KiB per 4 CUs
          - 256 KiB
          - 12.5 KiB

  .. tab-item:: AMD Radeon Pro GPUs

    .. list-table::
        :header-rows: 1
        :name: radeon-pro-arch-spec-table

        *
          - Model
          - Architecture
          - LLVM target name
          - VRAM
          - Compute Units
          - Warp Size
          - LDS
          - Infinity Cache
          - L2 Cache
          - Graphics L1 Cache
          - L0 Vector Cache
          - L0 Scalar Cache
          - L0 Instruction Cache
          - VGPR File
          - SGPR File
        *
          - Radeon PRO W7900
          - RDNA3
          - gfx1100
          - 48 GiB
          - 96
          - 32
          - 128 KiB
          - 96 MiB
          - 6 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon PRO W7800
          - RDNA3
          - gfx1100
          - 32 GiB
          - 70
          - 32
          - 128 KiB
          - 64 MiB
          - 6 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon PRO W7700
          - RDNA3
          - gfx1101
          - 16 GiB
          - 48
          - 32
          - 128 KiB
          - 64 MiB
          - 4 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon PRO W6800
          - RDNA2
          - gfx1030
          - 32 GiB
          - 60
          - 32
          - 128 KiB
          - 128 MiB
          - 4 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon PRO W6600
          - RDNA2
          - gfx1032
          - 8 GiB
          - 28
          - 32
          - 128 KiB
          - 32 MiB
          - 2 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon PRO V620
          - RDNA2
          - gfx1030
          - 32 GiB
          - 72
          - 32
          - 128 KiB
          - 128 MiB
          - 4 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon Pro W5500
          - RDNA
          - gfx1012
          - 8 GiB
          - 22
          - 32
          - 128 KiB
          -
          - 4 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon Pro VII
          - GCN5.1
          - gfx906
          - 16 GiB
          - 60
          - 64
          - 64 KiB
          -
          - 4 MiB
          -
          - 16 KiB
          - 16 KiB per 3 CUs
          - 32 KiB per 3 CUs
          - 256 KiB
          - 12.5 KiB

  .. tab-item:: AMD Radeon GPUs

    .. list-table::
        :header-rows: 1
        :name: radeon-arch-spec-table

        *
          - Model
          - Architecture
          - LLVM target name
          - VRAM
          - Compute Units
          - Warp Size
          - LDS
          - Infinity Cache
          - L2 Cache
          - Graphics L1 Cache
          - L0 Vector Cache
          - L0 Scalar Cache
          - L0 Instruction Cache
          - VGPR File
          - SGPR File
        *
          - Radeon RX 7900 XTX
          - RDNA3
          - gfx1100
          - 24 GiB
          - 96
          - 32
          - 128 KiB
          - 96 MiB
          - 6 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon RX 7900 XT
          - RDNA3
          - gfx1100
          - 20 GiB
          - 84
          - 32
          - 128 KiB
          - 80 MiB
          - 6 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon RX 7900 GRE
          - RDNA3
          - gfx1100
          - 16 GiB
          - 80
          - 32
          - 128 KiB
          - 64 MiB
          - 6 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon RX 7800 XT
          - RDNA3
          - gfx1101
          - 16 GiB
          - 60
          - 32
          - 128 KiB
          - 64 MiB
          - 4 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon RX 7700 XT
          - RDNA3
          - gfx1101
          - 12 GiB
          - 54
          - 32
          - 128 KiB
          - 48 MiB
          - 4 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 384 KiB
          - 20 KiB
        *
          - Radeon RX 7600
          - RDNA3
          - gfx1102
          - 8 GiB
          - 32
          - 32
          - 128 KiB
          - 32 MiB
          - 2 MiB
          - 256 KiB
          - 32 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6950 XT
          - RDNA2
          - gfx1030
          - 16 GiB
          - 80
          - 32
          - 128 KiB
          - 128 MiB
          - 4 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6900 XT
          - RDNA2
          - gfx1030
          - 16 GiB
          - 80
          - 32
          - 128 KiB
          - 128 MiB
          - 4 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6800 XT
          - RDNA2
          - gfx1030
          - 16 GiB
          - 72
          - 32
          - 128 KiB
          - 128 MiB
          - 4 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6800
          - RDNA2
          - gfx1030
          - 16 GiB
          - 60
          - 32
          - 128 KiB
          - 128 MiB
          - 4 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6750 XT
          - RDNA2
          - gfx1031
          - 12 GiB
          - 40
          - 32
          - 128 KiB
          - 96 MiB
          - 3 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6700 XT
          - RDNA2
          - gfx1031
          - 12 GiB
          - 40
          - 32
          - 128 KiB
          - 96 MiB
          - 3 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6700
          - RDNA2
          - gfx1031
          - 10 GiB
          - 36
          - 32
          - 128 KiB
          - 80 MiB
          - 3 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6650 XT
          - RDNA2
          - gfx1032
          - 8 GiB
          - 32
          - 32
          - 128 KiB
          - 32 MiB
          - 2 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6600 XT
          - RDNA2
          - gfx1032
          - 8 GiB
          - 32
          - 32
          - 128 KiB
          - 32 MiB
          - 2 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon RX 6600
          - RDNA2
          - gfx1032
          - 8 GiB
          - 28
          - 32
          - 128 KiB
          - 32 MiB
          - 2 MiB
          - 128 KiB
          - 16 KiB
          - 16 KiB
          - 32 KiB
          - 256 KiB
          - 20 KiB
        *
          - Radeon VII
          - GCN5.1
          - gfx906
          - 16 GiB
          - 60
          - 64
          - 64 KiB per CU
          -
          - 4 MiB
          -
          - 16 KiB
          - 16 KiB per 3 CUs
          - 32 KiB per 3 CUs
          - 256 KiB
          - 12.5 KiB

For a detailed explanation of the terms, refer to the :ref:`specific documents and guides <gpu-arch-documentation>` or the :ref:`HIP programming guide <HIP:user_guide/hip_rtc>`.
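
As a rough illustration of how the tables can be used programmatically, the sketch below copies a handful of rows into a small lookup structure and answers two questions: which listed models share an LLVM target name, and which wavefront ("warp") size the tables give for that target. The names ``GpuSpec``, ``SPECS``, ``models_for_target`` and ``wavefront_size`` are hypothetical helpers invented for this example and are not part of ROCm; only the row values come from the tables, and the subset is deliberately incomplete.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class GpuSpec:
    """One table row, reduced to the columns used below (hypothetical type)."""

    model: str
    architecture: str
    llvm_target: str
    compute_units: int
    wavefront_size: int


# Hand-copied subset of the rows in the tables above; not the full list.
SPECS = (
    GpuSpec("MI210", "CDNA2", "gfx90a", 104, 64),
    GpuSpec("MI100", "CDNA", "gfx908", 120, 64),
    GpuSpec("Radeon PRO W7900", "RDNA3", "gfx1100", 96, 32),
    GpuSpec("Radeon RX 7900 XTX", "RDNA3", "gfx1100", 96, 32),
    GpuSpec("Radeon RX 6800", "RDNA2", "gfx1030", 60, 32),
)


def models_for_target(llvm_target: str) -> list[str]:
    """Return the model names in SPECS that share the given LLVM target."""
    return [spec.model for spec in SPECS if spec.llvm_target == llvm_target]


def wavefront_size(llvm_target: str) -> int:
    """Return the wavefront size listed for an LLVM target in SPECS."""
    sizes = {s.wavefront_size for s in SPECS if s.llvm_target == llvm_target}
    if len(sizes) != 1:
        # Either the target is not in the subset or the rows disagree.
        raise KeyError(f"no unique entry for target {llvm_target!r}")
    return sizes.pop()
```

A lookup keyed on the LLVM target name rather than the model name reflects the fact that several models in the tables share one target, for example the gfx1100 entries.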