mirror of
https://github.com/ROCm/ROCm.git
synced 2026-04-05 03:01:17 -04:00
* [MFMA] Introduce dot operand loading fast path

  This PR introduces a fast path for code generation of MFMA dot operand loading from LDS. The fast path is used when the operand is not swizzled and is not a slice of a bigger LDS object (i.e. it is not a slice of a tensor). This is the case for the current FA and GEMM kernels compiled with num_stages=1, i.e. with software pipelining disabled.

* cleanup swizzle info
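
The fast-path condition described in the commit message can be sketched as a simple predicate. This is only an illustration: `LdsOperand`, its fields, and `can_use_fast_path` are hypothetical names invented for this sketch and are not taken from the compiler's actual code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LdsOperand:
    """Hypothetical descriptor of a dot operand stored in LDS (illustrative only)."""
    swizzled: bool  # True if a swizzle pattern is applied to the LDS layout
    is_slice: bool  # True if the operand is a slice of a bigger LDS object


def can_use_fast_path(op: LdsOperand) -> bool:
    """Return True when both conditions from the commit message hold:
    the operand is not swizzled and is not a slice of a bigger LDS object."""
    return not op.swizzled and not op.is_slice


# Assumed example: a kernel compiled with num_stages=1 (no software pipelining)
# is described as producing a plain, unswizzled, non-sliced operand.
plain = LdsOperand(swizzled=False, is_slice=False)
sliced = LdsOperand(swizzled=False, is_slice=True)
print(can_use_fast_path(plain), can_use_fast_path(sliced))
```

Under these assumptions, any operand that fails either check would fall back to the general loading path.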
The file is empty.