ROCm/python
Alexander Efimov 0073bb98f4 [MFMA] Introduce dot operand loading fast path (#269)
* [MFMA] Introduce dot operand loading fast path

This PR introduces a fast path for code generation of MFMA dot operand
loading from LDS.

The fast path is used when the operand is not swizzled and is not a slice
of a larger LDS object (i.e. it is not a slice of a tensor).
This is the case for current FA and GEMM kernels compiled with
num_stages=1, i.e. with software pipelining disabled.
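
The eligibility condition described above can be sketched as a small
predicate. This is only an illustration, not the code from this PR: the
LdsOperand class, its fields, and can_use_fast_path are hypothetical names,
and treating max_phase == 1 as "not swizzled" is an assumption about how
swizzling is encoded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LdsOperand:
    """Hypothetical description of a dot operand stored in LDS."""
    max_phase: int  # assumed swizzle parameter; 1 is taken to mean "no swizzling"
    is_slice: bool  # True if the operand is a view into a larger LDS object


def can_use_fast_path(op: LdsOperand) -> bool:
    """Return True when both conditions from the commit message hold."""
    not_swizzled = op.max_phase == 1  # assumption, see the note above
    return not_swizzled and not op.is_slice


# Illustrative operands (all values are made up for the example).
plain = LdsOperand(max_phase=1, is_slice=False)      # unswizzled, whole object
sliced = LdsOperand(max_phase=1, is_slice=True)      # slice of a larger object
swizzled = LdsOperand(max_phase=4, is_slice=False)   # swizzled layout

for name, op in [("plain", plain), ("sliced", sliced), ("swizzled", swizzled)]:
    print(name, can_use_fast_path(op))
```

Only the first operand qualifies in this sketch; the other two would take
the general loading path.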

* cleanup swizzle info
2023-07-27 13:46:50 -05:00
..
2023-07-26 10:45:21 -05:00