mirror of https://github.com/ROCm/ROCm.git (synced 2026-02-19 02:34:19 -05:00)
* update vllm-benchmark
  fix hlist overflow
  update standalone benchmarking options
  update list of models
  fix typo and model name
  unnecessary duplicate info
  update formatting
  update vllm benchmark guide
  - remove Llama 2 FP8
  - add Jais 13B
  - update commands
  update docker pull tag
  update MAD available models
  remove extra mad models not relevant to vllm
  update PyTorch version
  add changelog
  add model names to .wordlist.txt
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
* Update docs/how-to/rocm-for-ai/inference/vllm-benchmark.rst
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
* fix typo
* update link
* fix link text
* change changelog to previous versions
* fix typo
* remove "for"
---------
Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
(cherry picked from commit 2751a17cf0)