George Hotz
|
cac8bcf8b5
|
use Ops.REDUCE (#9721)
* decrease bert python time [pr]
* order copies
* Revert "order copies"
This reverts commit 3f62c8693b.
* rewrite count
* Ops.REDUCE
* acc first in the add chain
* Fix tensor core acc
* arange patterns look good
* fix multireduce gate
* reduce rewrite rule
* bump that to 15 minutes
* multiwmma isn't fusing
* gep through wmma is gep pushing
* bump that timeout too, it's all env setup
* add failing test
|
2025-04-04 10:14:34 +08:00 |
|