mirror of
https://github.com/ROCm/ROCm.git
synced 2026-04-05 03:01:17 -04:00
site restructure phase 1 - file reorganization (#2428)
This commit is contained in:
@@ -1,4 +1,4 @@
|
||||
# Compilers and Tools
|
||||
# Compilers and tools
|
||||
|
||||
:::::{grid} 1 1 2 2
|
||||
:gutter: 1
|
||||
@@ -13,12 +13,12 @@ inspection of execution state of AMD's commercially available GPU architectures.
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [ROCmCC](./rocmcc/rocmcc)
|
||||
:::{grid-item-card} [ROCmCC](../rocmcc/rocmcc)
|
||||
ROCmCC is a Clang/LLVM-based compiler. It is optimized for high-performance
|
||||
computing on AMD GPUs and CPUs and supports various heterogeneous programming
|
||||
models such as HIP, OpenMP, and OpenCL.
|
||||
|
||||
- [Documentation](./rocmcc/rocmcc)
|
||||
- [Documentation](../rocmcc/rocmcc)
|
||||
|
||||
:::
|
||||
|
||||
@@ -50,4 +50,4 @@ Callback/Activity Library for Performance tracing AMD GPUs
|
||||
|
||||
## See Also
|
||||
|
||||
- [Compiler Disambiguation](../understand/compiler_disambiguation.md)
|
||||
- [Compiler Disambiguation](../../conceptual/compiler_disambiguation.md)
|
||||
3
docs/reference/compilers_tools/index.md
Normal file
3
docs/reference/compilers_tools/index.md
Normal file
@@ -0,0 +1,3 @@
|
||||
# ROCm compilers and tools
|
||||
|
||||
add links...
|
||||
@@ -1,4 +1,4 @@
|
||||
# All Reference Material
|
||||
# Reference material
|
||||
|
||||
## ROCm Software Groups
|
||||
|
||||
@@ -14,16 +14,16 @@ HIP is both AMD's GPU programming language extension and the GPU runtime.
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [Math Libraries](./gpu_libraries/math)
|
||||
:::{grid-item-card} [Math Libraries](./libraries/gpu_libraries/math)
|
||||
HIP Math Libraries support the following domains:
|
||||
|
||||
- [Linear Algebra Libraries](./gpu_libraries/linear_algebra)
|
||||
- [Fast Fourier Transforms](./gpu_libraries/fft)
|
||||
- [Random Numbers](./gpu_libraries/rand)
|
||||
- [Linear Algebra Libraries](./libraries/gpu_libraries/linear_algebra)
|
||||
- [Fast Fourier Transforms](./libraries/gpu_libraries/fft)
|
||||
- [Random Numbers](./libraries/gpu_libraries/rand)
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [C++ Primitive Libraries](./gpu_libraries/c++_primitives)
|
||||
:::{grid-item-card} [C++ Primitive Libraries](./libraries/gpu_libraries/c++_primitives)
|
||||
ROCm template libraries for C++ primitives and algorithms are as follows:
|
||||
|
||||
- {doc}`rocPRIM <rocprim:index>`
|
||||
@@ -33,14 +33,14 @@ ROCm template libraries for C++ primitives and algorithms are as follows:
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [Communication Libraries](gpu_libraries/communication)
|
||||
:::{grid-item-card} [Communication Libraries](./libraries/gpu_libraries/communication)
|
||||
Inter and intra-node communication is supported by the following projects:
|
||||
|
||||
- {doc}`RCCL <rccl:index>`
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [AI Libraries](./ai_tools)
|
||||
:::{grid-item-card} [Artificial intelligence](../rocm_ai/rocm_ai)
|
||||
Libraries related to AI.
|
||||
|
||||
- {doc}`MIOpen <miopen:index>`
|
||||
@@ -63,9 +63,9 @@ Computer vision related projects.
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [Compilers and Tools](compilers)
|
||||
:::{grid-item-card} [Compilers and Tools](compilers_tools/index)
|
||||
|
||||
- [ROCmCC](/reference/rocmcc/rocmcc)
|
||||
- [ROCmCC](./rocmcc/rocmcc)
|
||||
- {doc}`ROCdbgapi <rocdbgapi:index>`
|
||||
- {doc}`ROCgdb <rocgdb:index>`
|
||||
- {doc}`ROCProfiler <rocprofiler:rocprof>`
|
||||
@@ -73,7 +73,7 @@ Computer vision related projects.
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [Management Tools](management_tools)
|
||||
:::{grid-item-card} [Management Tools](./compilers_tools/management_tools)
|
||||
|
||||
- {doc}`AMD SMI <amdsmi:index>`
|
||||
- {doc}`ROCm SMI <rocm_smi_lib:index>`
|
||||
@@ -81,17 +81,17 @@ Computer vision related projects.
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [Validation Tools](validation_tools)
|
||||
:::{grid-item-card} [Validation Tools](./compilers_tools/validation_tools)
|
||||
|
||||
- {doc}`ROCm Validation Suite <rocmvalidationsuite:index>`
|
||||
- {doc}`TransferBench <transferbench:index>`
|
||||
|
||||
:::
|
||||
|
||||
:::{grid-item-card} [GPU Architectures](gpu_arch)
|
||||
:::{grid-item-card} GPU Architectures
|
||||
|
||||
- [AMD Instinct MI200](./gpu_arch/mi250.md)
|
||||
- [AMD Instinct MI100](./gpu_arch/mi100.md)
|
||||
- [AMD Instinct MI200](../conceptual/gpu_arch/mi250.md)
|
||||
- [AMD Instinct MI100](../conceptual/gpu_arch/mi100.md)
|
||||
|
||||
:::
|
||||
|
||||
@@ -1,4 +1,4 @@
|
||||
# Math Libraries
|
||||
# Math libraries
|
||||
|
||||
AMD provides various math domain and support libraries as part of ROCm.
|
||||
|
||||
8
docs/reference/libraries/index.md
Normal file
8
docs/reference/libraries/index.md
Normal file
@@ -0,0 +1,8 @@
|
||||
# ROCm libraries
|
||||
|
||||
add links...
|
||||
|
||||
* Math
|
||||
* C++ primitive
|
||||
* Communication
|
||||
* Artificial intelligence
|
||||
@@ -9,12 +9,14 @@ Along with host APIs, the OpenMP compilers support offloading code and data onto
|
||||
GPU devices. This document briefly describes the installation location of the
|
||||
OpenMP toolchain, example usage of device offloading, and usage of `rocprof`
|
||||
with OpenMP applications. The GPUs supported are the same as those supported by
|
||||
this ROCm release. See the list of supported GPUs in {doc}`/release/gpu_os_support`.
|
||||
this ROCm release. See the list of supported GPUs in {doc}`../../about/release/linux_support`.
|
||||
|
||||
The ROCm OpenMP compiler is implemented using LLVM compiler technology.
|
||||
{numref}`openmp-toolchain` illustrates the internal steps taken to translate a user’s application into an executable that can offload computation to the AMDGPU. The compilation is a two-pass process. Pass 1 compiles the application to generate the CPU code and Pass 2 links the CPU code to the AMDGPU device code.
|
||||
The following image illustrates the internal steps taken to translate a user’s application into an executable that can offload computation to the AMDGPU. The compilation is a two-pass process. Pass 1 compiles the application to generate the CPU code and Pass 2 links the CPU code to the AMDGPU device code.
|
||||
|
||||

|
||||
```{figure} ../../data/reference/openmp/openmp_toolchain.svg
|
||||
:name: openmp-toolchain
|
||||
```
|
||||
|
||||
### Installation
|
||||
|
||||
|
||||
@@ -497,7 +497,6 @@ offload-arch gfx906 -v
|
||||
|
||||
The options are listed below:
|
||||
|
||||
:::{program} offload-arch
|
||||
:::{option} -h
|
||||
Prints the help message.
|
||||
:::
|
||||
|
||||
@@ -1 +0,0 @@
|
||||
# Management Tools
|
||||
Reference in New Issue
Block a user