ROCm/docs/how-to at revert-4372-wavesize-632 - ROCm

mirror of https://github.com/ROCm/ROCm.git synced 2026-02-19 02:34:19 -05:00

Files

Peter Park 9b28bc4f09 Update vLLM benchmarking guide (#4347 )

* update vllm-benchmark

fix hlist overflow

update standalone benchmarking options

update list of models

fix typo and model name

unnecessary duplicate info

update formatting

update vllm benchmark guide

- remove Llama 2 FP8
- add Jais 13B
- update commands

update docker pull tag

update MAD available models

remove extra mad models not relevant to vllm

update PyTorch version

add changelog

add model names to .wordlist.txt

* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst

Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>

* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst

Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>

* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst

Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>

* fix typo

* update link

* fix link text

* change changelog to previous versions

* fix typo

* remove "for"

---------

Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
(cherry picked from commit 2751a17cf0)

2025-02-05 17:19:58 -05:00

rocm-for-ai

Update vLLM benchmarking guide (#4347 )

2025-02-05 17:19:58 -05:00

rocm-for-hpc

2nd POC for How to Use ROCm for AI (#282 ) (#4299 )

2025-01-27 15:49:21 -05:00

system-optimization

Updated ROCm install on Linux installation method link (#4313 ) (#4324 )

2025-01-31 16:59:54 -05:00

tuning-guides/mi300x

2nd POC for How to Use ROCm for AI (#282 ) (#4299 )