Files
tinygrad/test
Alexander Edwards 59af9b81c5 Match Torch speed for sum reduction on M1 (#1187)
* Add additional kernel when reducing multiple dimensions at once.

* Faster for smaller inputs

* Whitespace and naming

* Cleaner, guard for Metal only, and max 1 split rather than N

* Draft of different approach

* One additional kernel call for this test (as expected)
2023-07-19 09:18:58 -07:00
..
2023-07-12 12:52:06 -07:00
2020-12-15 23:44:08 -08:00
2023-06-25 10:38:58 -07:00
2023-06-03 09:40:43 -07:00
2023-07-12 12:52:06 -07:00
2023-02-27 06:53:18 -08:00
2023-06-25 15:22:56 -07:00
2023-07-15 00:42:42 -07:00
2023-07-19 09:08:38 -07:00
2023-06-03 09:40:43 -07:00
2023-07-12 12:52:06 -07:00
2023-07-12 12:52:06 -07:00