Files
ROCm/docs/data
Peter Park 36b6ffaf7c Add QwQ 32B to vllm-benchmark.rst (#4685)
* Add Qwen2 MoE 2.7B to vllm-benchmark-models.yaml

* Add QwQ-32B-Preview to vllm-benchmark-models.yaml

* add links to performance results

words

* change "performance validation" to "performance testing"

* remove "-Preview" from QwQ-32B

* move qwen2 MoE after qwen2

* add TunableOp section

* fix formatting

* add link to TunableOp doc

* add tunableop note

* fix vllm-benchmark template

* remove cmdline option for --tunableop on

* update docker details

* remove "training"

* remove qwen2
2025-04-24 16:44:34 -04:00