mirror of
https://github.com/ROCm/ROCm.git
synced 2026-04-27 03:01:52 -04:00
Improve "tuning guides" landing page (#4504)
* Improve "tuning guides" landing page
* Update docs/how-to/gpu-performance/mi300x.rst
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
* Update docs/how-to/gpu-performance/mi300x.rst
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
* change tuning to optimization
---------
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
(cherry picked from commit 58d42ec50b)
This commit is contained in:
27
docs/how-to/gpu-performance/mi300x.rst
Normal file
27
docs/how-to/gpu-performance/mi300x.rst
Normal file
@@ -0,0 +1,27 @@
|
||||
.. meta::
|
||||
:description: How to configure MI300X accelerators to fully leverage their capabilities and achieve optimal performance.
|
||||
:keywords: ROCm, AI, machine learning, MI300X, LLM, usage, tutorial, optimization, tuning
|
||||
|
||||
**************************************
|
||||
AMD Instinct MI300X performance guides
|
||||
**************************************
|
||||
|
||||
The following performance guides provide essential guidance on the necessary
|
||||
steps to properly :doc:`configure your system for AMD Instinct™ MI300X
|
||||
accelerators <../system-optimization/mi300x>`. They include detailed
|
||||
instructions on system settings and application :doc:`workload tuning
|
||||
<../rocm-for-ai/inference-optimization/workload>` to help you
|
||||
leverage the maximum capabilities of these accelerators and achieve superior
|
||||
performance.
|
||||
|
||||
* :doc:`../system-optimization/mi300x` covers essential system settings and
|
||||
system management practices to configure your AMD Instinct MI300X system for
|
||||
performance.
|
||||
|
||||
* :doc:`../rocm-for-ai/inference-optimization/workload` covers steps to
|
||||
optimize the performance of AMD Instinct MI300X series accelerators for HPC
|
||||
and deep learning operations.
|
||||
|
||||
* :doc:`../rocm-for-ai/inference/vllm-benchmark` introduces a preconfigured
|
||||
environment for LLM inference, designed to help you test performance with
|
||||
popular models on AMD Instinct MI300X series accelerators.
|
||||
@@ -90,7 +90,7 @@ subtrees:
|
||||
- file: how-to/rocm-for-ai/inference-optimization/profiling-and-debugging.rst
|
||||
title: Profile and debug
|
||||
- file: how-to/rocm-for-ai/inference-optimization/workload.rst
|
||||
title: Workload tuning
|
||||
title: Workload optimization
|
||||
|
||||
- url: https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/
|
||||
title: AI tutorials
|
||||
@@ -103,8 +103,8 @@ subtrees:
|
||||
- entries:
|
||||
- file: how-to/system-optimization/w6000-v620.md
|
||||
title: AMD RDNA 2
|
||||
- file: how-to/tuning-guides/mi300x/index.rst
|
||||
title: AMD MI300X performance validation and tuning
|
||||
- file: how-to/gpu-performance/mi300x.rst
|
||||
title: AMD Instinct MI300X performance guides
|
||||
- file: how-to/system-debugging.md
|
||||
- file: conceptual/compiler-topics.md
|
||||
title: Use advanced compiler features
|
||||
|
||||
Reference in New Issue
Block a user