ROCm/include at 94c83d30ce63c9684ca3f9c4c3825d83818991fb - ROCm

mirror of https://github.com/ROCm/ROCm.git synced 2026-04-05 03:01:17 -04:00

Files

Philippe Tillet 94c83d30ce [GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 )

- Removed driver module -- accelerator runtime is handled by pytorch
- Added basic support for ROCM based on @micmelesse 's PR -- now can execute empty kernel on AMD devices without any compile-time changes
- Now only using PREFER_SHARED for kernels when the size of shared memory is greater than 49k. Otherwise there can be poor L1 performance for broadcast tensors

2021-09-09 00:04:28 -07:00

triton

[GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 )

2021-09-09 00:04:28 -07:00