ROCm/scripts/amd
Shucai Xiao fb3f2d6feb refine gemm tuning scripts (#309)
* refine the gemm tuning scripts to reduce the tuning space and get better perf numbers

* added code to support tuning in the full tuning space

* add a function to get best tuning config

* refine the matmul tutorial example to print out best tuning config for each input

* added even_k to gemm kernel heuristic for better performance

* address review comments
2023-09-07 08:09:11 -05:00