* Add C++ algorithm primitive lib cards * Add PRNG section * API Reference Manuals first * Add Tensile and rocWMMA * Change rocFFT and hipFFT order for consistency * Add RCCL * Fix PRNG links * Add rocSOLVER and hipSOLVER * Add general note on rocLIB vs hipLIB
1.8 KiB
Matrix Multiplication
ROCm libraries for BLAS are as follows:
:::::{grid} 1 1 2 2 :gutter: 1
:::{grid-item-card} hipBLAS hipBLAS is a compatiblity layer for GPU accelerated BLAS optimized for AMD GPUs via rocBLAS and rocSOLVER. hipBLAS allows for a common interface for other GPU BLAS libraries.
:::
:::{grid-item-card} rocBLAS rocBLAS is an AMD GPU optimized library for BLAS.
:::
:::{grid-item-card} Tensile Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as backend library to rocBLAS. Tensile acts as the performance backbone for a wide variety of 'compute' applications running on AMD GPUs.
:::
:::{grid-item-card} rocWMMA AMD's C++ library for accelerating mixed-precision matrix multiply-accumulate (MMA) operations leveraging AMD GPU hardware.
:::
:::::