From 5ce76d3b68cd349e101dc931639fbb70b5d698d2 Mon Sep 17 00:00:00 2001 From: Peter Park Date: Tue, 25 Mar 2025 13:54:27 -0400 Subject: [PATCH] Improve "tuning guides" landing page (#4504) * Improve "tuning guides" landing page * Update docs/how-to/gpu-performance/mi300x.rst Co-authored-by: Pratik Basyal * Update docs/how-to/gpu-performance/mi300x.rst Co-authored-by: Pratik Basyal * change tuning to optimization --------- Co-authored-by: Pratik Basyal (cherry picked from commit 58d42ec50b412784cf5d8373541374c99d6e369b) --- docs/how-to/gpu-performance/mi300x.rst | 27 ++++++++++++++++++++++++++ docs/sphinx/_toc.yml.in | 6 +++--- 2 files changed, 30 insertions(+), 3 deletions(-) create mode 100644 docs/how-to/gpu-performance/mi300x.rst diff --git a/docs/how-to/gpu-performance/mi300x.rst b/docs/how-to/gpu-performance/mi300x.rst new file mode 100644 index 000000000..cc65f14ec --- /dev/null +++ b/docs/how-to/gpu-performance/mi300x.rst @@ -0,0 +1,27 @@ +.. meta:: + :description: How to configure MI300X accelerators to fully leverage their capabilities and achieve optimal performance. + :keywords: ROCm, AI, machine learning, MI300X, LLM, usage, tutorial, optimization, tuning + +************************************** +AMD Instinct MI300X performance guides +************************************** + +The following performance guides provide essential guidance on the necessary +steps to properly :doc:`configure your system for AMD Instinctâ„¢ MI300X +accelerators <../system-optimization/mi300x>`. They include detailed +instructions on system settings and application :doc:`workload tuning +<../rocm-for-ai/inference-optimization/workload>` to help you +leverage the maximum capabilities of these accelerators and achieve superior +performance. + +* :doc:`../system-optimization/mi300x` covers essential system settings and + system management practices to configure your AMD Instinct MI300X system for + performance. + +* :doc:`../rocm-for-ai/inference-optimization/workload` covers steps to + optimize the performance of AMD Instinct MI300X series accelerators for HPC + and deep learning operations. + +* :doc:`../rocm-for-ai/inference/vllm-benchmark` introduces a preconfigured + environment for LLM inference, designed to help you test performance with + popular models on AMD Instinct MI300X series accelerators. diff --git a/docs/sphinx/_toc.yml.in b/docs/sphinx/_toc.yml.in index 1c8bee75c..feaf3d7fb 100644 --- a/docs/sphinx/_toc.yml.in +++ b/docs/sphinx/_toc.yml.in @@ -90,7 +90,7 @@ subtrees: - file: how-to/rocm-for-ai/inference-optimization/profiling-and-debugging.rst title: Profile and debug - file: how-to/rocm-for-ai/inference-optimization/workload.rst - title: Workload tuning + title: Workload optimization - url: https://rocm.docs.amd.com/projects/ai-developer-hub/en/latest/ title: AI tutorials @@ -103,8 +103,8 @@ subtrees: - entries: - file: how-to/system-optimization/w6000-v620.md title: AMD RDNA 2 - - file: how-to/tuning-guides/mi300x/index.rst - title: AMD MI300X performance validation and tuning + - file: how-to/gpu-performance/mi300x.rst + title: AMD Instinct MI300X performance guides - file: how-to/system-debugging.md - file: conceptual/compiler-topics.md title: Use advanced compiler features