mirror of
https://github.com/ROCm/ROCm.git
synced 2026-04-05 03:01:17 -04:00
The pipeliner was overallocating shared memory for the inputs for current schedule. This reduces the shared memory usage to only what is needed. Note that improving membar analysis could allow taking advantage of allocating extra buffers to remove barriers.