mirror of
https://github.com/ROCm/ROCm.git
synced 2026-01-08 06:13:59 -05:00
Update precision support page part I. (#5127)
This commit is contained in:
@@ -176,6 +176,7 @@ HBM
|
||||
HCA
|
||||
HGX
|
||||
HIPCC
|
||||
hipDataType
|
||||
HIPExtension
|
||||
HIPIFY
|
||||
HIPification
|
||||
|
||||
@@ -55,7 +55,7 @@ The floating-point types supported by ROCm are listed in the following table.
|
||||
|
||||
.. list-table::
|
||||
:header-rows: 1
|
||||
:widths: 15,15,70
|
||||
:widths: 15,25,60
|
||||
|
||||
*
|
||||
- Type name
|
||||
@@ -63,18 +63,19 @@ The floating-point types supported by ROCm are listed in the following table.
|
||||
- Description
|
||||
*
|
||||
- float8 (E4M3)
|
||||
- ``__hip_fp8_e4m3_fnuz``
|
||||
- An 8-bit floating-point number that mostly follows IEEE-754 conventions
|
||||
and **S1E4M3** bit layout, as described in `8-bit Numerical Formats for Deep Neural Networks <https://arxiv.org/abs/2206.02915>`_,
|
||||
with expanded range and no infinity or signed zero. NaN is represented
|
||||
as negative zero.
|
||||
- | ``__hip_fp8_e4m3_fnuz``,
|
||||
| ``__hip_fp8_e4m3``
|
||||
- An 8-bit floating-point number with **S1E4M3** bit layout, as described in :doc:`low precision floating point types page <hip:reference/low_fp_types>`.
|
||||
The FNUZ variant has expanded range with no infinity or signed zero (NaN represented as negative zero),
|
||||
while the OCP variant follows the Open Compute Project specification.
|
||||
*
|
||||
- float8 (E5M2)
|
||||
- ``__hip_fp8_e5m2_fnuz``
|
||||
- An 8-bit floating-point number mostly following IEEE-754 conventions and
|
||||
**S1E5M2** bit layout, as described in `8-bit Numerical Formats for Deep Neural Networks <https://arxiv.org/abs/2206.02915>`_,
|
||||
with expanded range and no infinity or signed zero. NaN is represented
|
||||
as negative zero.
|
||||
- | ``__hip_fp8_e5m2_fnuz``,
|
||||
| ``__hip_fp8_e5m2``
|
||||
- An 8-bit floating-point number with **S1E5M2** bit layout, as described in :doc:`low precision floating point types page <hip:reference/low_fp_types>`.
|
||||
The FNUZ variant has expanded range with no infinity or signed zero (NaN represented as negative zero),
|
||||
while the OCP variant follows the Open Compute Project specification.
|
||||
|
||||
*
|
||||
- float16
|
||||
- ``half``
|
||||
@@ -107,9 +108,8 @@ The floating-point types supported by ROCm are listed in the following table.
|
||||
* The float8 and tensorfloat32 types are internal types used in calculations
|
||||
in Matrix Cores and can be stored in any type of the same size.
|
||||
|
||||
* The encodings for FP8 (E5M2) and FP8 (E4M3) that the
|
||||
MI300 series natively supports differ from the FP8 (E5M2) and FP8 (E4M3)
|
||||
encodings used in NVIDIA H100
|
||||
* CNDA3 natively supports FP8 FNUZ (E4M3 and E5M2), which differs from the customised
|
||||
FP8 format used in NVIDIA's H100
|
||||
(`FP8 Formats for Deep Learning <https://arxiv.org/abs/2209.05433>`_).
|
||||
|
||||
* In some AMD documents and articles, float8 (E5M2) is referred to as bfloat8.
|
||||
@@ -128,7 +128,7 @@ pages.
|
||||
:header-rows: 1
|
||||
|
||||
*
|
||||
- Icon
|
||||
- Icon
|
||||
- Definition
|
||||
|
||||
*
|
||||
@@ -163,12 +163,137 @@ pages.
|
||||
* Any type can be emulated by software, but this page does not cover such
|
||||
cases.
|
||||
|
||||
Data type support by Hardware Architecture
|
||||
Data type support by hardware architecture
|
||||
==========================================
|
||||
|
||||
The MI200 series GPUs, which include MI210, MI250, and MI250X, are based on the
|
||||
CDNA2 architecture. The MI300 series GPUs, consisting of MI300A, MI300X, and
|
||||
MI325X, are based on the CDNA3 architecture.
|
||||
AMD's GPU lineup spans multiple architecture generations:
|
||||
|
||||
* CDNA1 architecture: includes models such as MI100
|
||||
* CDNA2 architecture: includes models such as MI210, MI250, and MI250X
|
||||
* CDNA3 architecture: includes models such as MI300A, MI300X, and MI325X
|
||||
* RDNA3 architecture: includes models such as RX 7900XT and RX 7900XTX
|
||||
* RDNA4 architecture: includes models such as RX 9070 and RX 9070XT
|
||||
|
||||
HIP C++ type implementation support
|
||||
-----------------------------------
|
||||
|
||||
The HIP C++ types available on different hardware platforms are listed in the
|
||||
following table.
|
||||
|
||||
.. list-table::
|
||||
:header-rows: 1
|
||||
|
||||
*
|
||||
- HIP C++ Type
|
||||
- CDNA1
|
||||
- CDNA2
|
||||
- CDNA3
|
||||
- RDNA3
|
||||
- RDNA4
|
||||
|
||||
*
|
||||
- ``int8_t``, ``uint8_t``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``int16_t``, ``uint16_t``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``int32_t``, ``uint32_t``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``int64_t``, ``uint64_t``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``__hip_fp8_e4m3_fnuz``
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
|
||||
*
|
||||
- ``__hip_fp8_e5m2_fnuz``
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
|
||||
*
|
||||
- ``__hip_fp8_e4m3``
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``__hip_fp8_e5m2``
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``half``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``bfloat16``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``float``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- ``double``
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
.. note::
|
||||
|
||||
Library support for specific data types is contingent upon hardware support.
|
||||
Even if a ROCm library indicates support for a particular data type, that type
|
||||
will only be fully functional if the underlying hardware architecture (as shown
|
||||
in the table above) also supports it. For example, fp8 types are only available
|
||||
on architectures shown with a checkmark in the relevant rows.
|
||||
|
||||
Compute units support
|
||||
---------------------
|
||||
@@ -190,19 +315,33 @@ The following table lists data type support for compute units.
|
||||
- int32
|
||||
- int64
|
||||
*
|
||||
- MI100
|
||||
- CDNA1
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI200 series
|
||||
- CDNA2
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI300 series
|
||||
- CDNA3
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA3
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA4
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
@@ -224,7 +363,7 @@ The following table lists data type support for compute units.
|
||||
- float32
|
||||
- float64
|
||||
*
|
||||
- MI100
|
||||
- CDNA1
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -233,7 +372,7 @@ The following table lists data type support for compute units.
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI200 series
|
||||
- CDNA2
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -242,7 +381,27 @@ The following table lists data type support for compute units.
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI300 series
|
||||
- CDNA3
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA3
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA4
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -271,19 +430,33 @@ The following table lists data type support for AMD GPU matrix cores.
|
||||
- int32
|
||||
- int64
|
||||
*
|
||||
- MI100
|
||||
- CDNA1
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
*
|
||||
- MI200 series
|
||||
- CDNA2
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
*
|
||||
- MI300 series
|
||||
- CDNA3
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
|
||||
*
|
||||
- RDNA3
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
|
||||
*
|
||||
- RDNA4
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
@@ -305,7 +478,7 @@ The following table lists data type support for AMD GPU matrix cores.
|
||||
- float32
|
||||
- float64
|
||||
*
|
||||
- MI100
|
||||
- CDNA1
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -314,7 +487,7 @@ The following table lists data type support for AMD GPU matrix cores.
|
||||
- ✅
|
||||
- ❌
|
||||
*
|
||||
- MI200 series
|
||||
- CDNA2
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -323,7 +496,7 @@ The following table lists data type support for AMD GPU matrix cores.
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI300 series
|
||||
- CDNA3
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
@@ -332,6 +505,26 @@ The following table lists data type support for AMD GPU matrix cores.
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA3
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
|
||||
*
|
||||
- RDNA4
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ✅
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
|
||||
Atomic operations support
|
||||
-------------------------
|
||||
|
||||
@@ -357,19 +550,33 @@ page.
|
||||
- int32
|
||||
- int64
|
||||
*
|
||||
- MI100
|
||||
- CDNA1
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI200 series
|
||||
- CDNA2
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI300 series
|
||||
- CDNA3
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA3
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA4
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -391,7 +598,7 @@ page.
|
||||
- float32
|
||||
- float64
|
||||
*
|
||||
- MI100
|
||||
- CDNA1
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -400,7 +607,7 @@ page.
|
||||
- ✅
|
||||
- ❌
|
||||
*
|
||||
- MI200 series
|
||||
- CDNA2
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -409,7 +616,7 @@ page.
|
||||
- ✅
|
||||
- ✅
|
||||
*
|
||||
- MI300 series
|
||||
- CDNA3
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
@@ -418,6 +625,26 @@ page.
|
||||
- ✅
|
||||
- ✅
|
||||
|
||||
*
|
||||
- RDNA3
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ❌
|
||||
|
||||
*
|
||||
- RDNA4
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ✅
|
||||
- ❌
|
||||
- ✅
|
||||
- ❌
|
||||
|
||||
.. note::
|
||||
|
||||
You can emulate atomic operations using software for cases that are not
|
||||
@@ -452,36 +679,98 @@ detailed description.
|
||||
- int16
|
||||
- int32
|
||||
- int64
|
||||
|
||||
*
|
||||
- :doc:`hipSPARSELt <hipsparselt:reference/data-type-support>`
|
||||
- :doc:`Composable Kernel <composable_kernel:reference/Composable_Kernel_supported_scalar_types>`
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
*
|
||||
- :doc:`rocRAND <rocrand:api-reference/data-type-support>`
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
*
|
||||
- :doc:`hipRAND <hiprand:api-reference/data-type-support>`
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
*
|
||||
- :doc:`rocPRIM <rocprim:reference/data-type-support>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`hipCUB <hipcub:api-reference/data-type-support>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`hipRAND <hiprand:api-reference/data-type-support>`
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
|
||||
*
|
||||
- :doc:`hipSOLVER <hipsolver:reference/precision>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
|
||||
*
|
||||
- :doc:`hipSPARSELt <hipsparselt:reference/data-type-support>`
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
|
||||
*
|
||||
- :doc:`hipTensor <hiptensor:api-reference/api-reference>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
|
||||
*
|
||||
- :doc:`MIGraphX <amdmigraphx:reference/cpp>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`MIOpen <miopen:reference/datatypes>`
|
||||
- ⚠️/⚠️
|
||||
- ❌/❌
|
||||
- ⚠️/⚠️
|
||||
- ❌/❌
|
||||
|
||||
*
|
||||
- :doc:`RCCL <rccl:api-reference/library-specification>`
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocFFT <rocfft:reference/api>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
|
||||
*
|
||||
- :doc:`rocPRIM <rocprim:reference/data-type-support>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocRAND <rocrand:api-reference/data-type-support>`
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
|
||||
*
|
||||
- :doc:`rocSOLVER <rocsolver:reference/precision>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
|
||||
*
|
||||
- :doc:`rocThrust <rocthrust:data-type-support>`
|
||||
- ✅/✅
|
||||
@@ -489,6 +778,14 @@ detailed description.
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocWMMA <rocwmma:api-reference/api-reference-guide>`
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ❌/✅
|
||||
- ❌/❌
|
||||
|
||||
|
||||
.. tab-item:: Floating-point types
|
||||
:sync: floating-point-type
|
||||
|
||||
@@ -504,42 +801,17 @@ detailed description.
|
||||
- tensorfloat32
|
||||
- float32
|
||||
- float64
|
||||
|
||||
*
|
||||
- :doc:`hipSPARSELt <hipsparselt:reference/data-type-support>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- :doc:`Composable Kernel <composable_kernel:reference/Composable_Kernel_supported_scalar_types>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
*
|
||||
- :doc:`rocRAND <rocrand:api-reference/data-type-support>`
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
*
|
||||
- :doc:`hipRAND <hiprand:api-reference/data-type-support>`
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
*
|
||||
- :doc:`rocPRIM <rocprim:reference/data-type-support>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`hipCUB <hipcub:api-reference/data-type-support>`
|
||||
- ❌/❌
|
||||
@@ -549,6 +821,117 @@ detailed description.
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`hipRAND <hiprand:api-reference/data-type-support>`
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
|
||||
*
|
||||
- :doc:`hipSOLVER <hipsolver:reference/precision>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`hipSPARSELt <hipsparselt:reference/data-type-support>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
|
||||
*
|
||||
- :doc:`hipTensor <hiptensor:api-reference/api-reference>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`MIGraphX <amdmigraphx:reference/cpp>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`MIOpen <miopen:reference/datatypes>`
|
||||
- ⚠️/⚠️
|
||||
- ⚠️/⚠️
|
||||
- ✅/✅
|
||||
- ⚠️/⚠️
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ⚠️/⚠️
|
||||
|
||||
*
|
||||
- :doc:`RCCL <rccl:api-reference/library-specification>`
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocFFT <rocfft:reference/api>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocPRIM <rocprim:reference/data-type-support>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocRAND <rocrand:api-reference/data-type-support>`
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/❌
|
||||
- NA/❌
|
||||
- NA/✅
|
||||
- NA/✅
|
||||
|
||||
*
|
||||
- :doc:`rocSOLVER <rocsolver:reference/precision>`
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ❌/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocThrust <rocthrust:data-type-support>`
|
||||
- ❌/❌
|
||||
@@ -559,62 +942,123 @@ detailed description.
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
*
|
||||
- :doc:`rocWMMA <rocwmma:api-reference/api-reference-guide>`
|
||||
- ✅/❌
|
||||
- ✅/❌
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
- ✅/✅
|
||||
|
||||
.. note::
|
||||
|
||||
|
||||
As random number generation libraries, rocRAND and hipRAND only specify output
|
||||
data types for the random values they generate, with no need for input data
|
||||
types.
|
||||
|
||||
Libraries internal calculations type support
|
||||
--------------------------------------------
|
||||
hipDataType enumeration
|
||||
-----------------------
|
||||
|
||||
The following tables list ROCm library support for specific internal data types.
|
||||
Refer to the corresponding library data type support page for a detailed
|
||||
description.
|
||||
The ``hipDataType`` enumeration defines data precision types and is primarily
|
||||
used when the data reference itself does not include type information, such as
|
||||
in ``void*`` pointers. This enumeration is mainly utilized in BLAS libraries.
|
||||
The HIP type equivalents of the ``hipDataType`` enumeration are listed in the
|
||||
following table with descriptions and values.
|
||||
|
||||
.. tab-set::
|
||||
.. list-table::
|
||||
:header-rows: 1
|
||||
:widths: 25,25,10,40
|
||||
|
||||
.. tab-item:: Integral types
|
||||
:sync: integral-type
|
||||
*
|
||||
- hipDataType
|
||||
- HIP type
|
||||
- Value
|
||||
- Description
|
||||
|
||||
.. list-table::
|
||||
:header-rows: 1
|
||||
*
|
||||
- ``HIP_R_8I``
|
||||
- ``int8_t``
|
||||
- 3
|
||||
- 8-bit real signed integer.
|
||||
|
||||
*
|
||||
- Library internal data type name
|
||||
- int8
|
||||
- int16
|
||||
- int32
|
||||
- int64
|
||||
*
|
||||
- :doc:`hipSPARSELt <hipsparselt:reference/data-type-support>`
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ❌
|
||||
*
|
||||
- ``HIP_R_8U``
|
||||
- ``uint8_t``
|
||||
- 8
|
||||
- 8-bit real unsigned integer.
|
||||
|
||||
*
|
||||
- ``HIP_R_16I``
|
||||
- ``int16_t``
|
||||
- 20
|
||||
- 16-bit real signed integer.
|
||||
|
||||
.. tab-item:: Floating-point types
|
||||
:sync: floating-point-type
|
||||
*
|
||||
- ``HIP_R_16U``
|
||||
- ``uint16_t``
|
||||
- 22
|
||||
- 16-bit real unsigned integer.
|
||||
|
||||
.. list-table::
|
||||
:header-rows: 1
|
||||
*
|
||||
- ``HIP_R_32I``
|
||||
- ``int32_t``
|
||||
- 10
|
||||
- 32-bit real signed integer.
|
||||
|
||||
*
|
||||
- Library internal data type name
|
||||
- float8 (E4M3)
|
||||
- float8 (E5M2)
|
||||
- float16
|
||||
- bfloat16
|
||||
- tensorfloat32
|
||||
- float32
|
||||
- float64
|
||||
*
|
||||
- :doc:`hipSPARSELt <hipsparselt:reference/data-type-support>`
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ❌
|
||||
- ✅
|
||||
- ❌
|
||||
*
|
||||
- ``HIP_R_32U``
|
||||
- ``uint32_t``
|
||||
- 12
|
||||
- 32-bit real unsigned integer.
|
||||
|
||||
*
|
||||
- ``HIP_R_32F``
|
||||
- ``float``
|
||||
- 0
|
||||
- 32-bit real single precision floating-point.
|
||||
|
||||
*
|
||||
- ``HIP_R_64F``
|
||||
- ``double``
|
||||
- 1
|
||||
- 64-bit real double precision floating-point.
|
||||
|
||||
*
|
||||
- ``HIP_R_16F``
|
||||
- ``half``
|
||||
- 2
|
||||
- 16-bit real half precision floating-point.
|
||||
|
||||
*
|
||||
- ``HIP_R_16BF``
|
||||
- ``bfloat16``
|
||||
- 14
|
||||
- 16-bit real bfloat16 precision floating-point.
|
||||
|
||||
*
|
||||
- ``HIP_R_8F_E4M3``
|
||||
- ``__hip_fp8_e4m3``
|
||||
- 28
|
||||
- 8-bit real float8 precision floating-point (OCP version).
|
||||
|
||||
*
|
||||
- ``HIP_R_8F_E5M2``
|
||||
- ``__hip_fp8_e5m2``
|
||||
- 29
|
||||
- 8-bit real bfloat8 precision floating-point (OCP version).
|
||||
|
||||
*
|
||||
- ``HIP_R_8F_E4M3_FNUZ``
|
||||
- ``__hip_fp8_e4m3_fnuz``
|
||||
- 1000
|
||||
- 8-bit real float8 precision floating-point (FNUZ version).
|
||||
|
||||
*
|
||||
- ``HIP_R_8F_E5M2_FNUZ``
|
||||
- ``__hip_fp8_e5m2_fnuz``
|
||||
- 1001
|
||||
- 8-bit real bfloat8 precision floating-point (FNUZ version).
|
||||
|
||||
The full list of the ``hipDataType`` enumeration listed in `library_types.h <https://github.com/ROCm/hip/blob/amd-staging/include/hip/library_types.h>`_ .
|
||||
|
||||
Reference in New Issue
Block a user