Files
tinygrad/accel
Ollin Boer Bohan 3b1767e013 Fix OpenCL Metal texture issues (#378)
* Fix OpenCL Metal texture issues

Tile CL images when needed, to fit into the 16384 max Metal image size;
gets me to ~4.8s/iteration for SD on M1 Pro with OPENCL=1 FLOAT16=1.

* Minor cleanup

* Fix mish in CI, or no-op?

* Is mish being framed?

* It would help if any of this reproduced locally

* ???

* OPT is reverted; use original mish

* Cleanup post-review

* Fix some shape usage

* Tiler tests, shouldn't oom or overflow either

* Can't CL if there's no CL?

* Run tiler tests even if GPU=1

* relu6 segfault binary chop; revert test

* relu6 segfault binary chop; revert accel

* relu6 segfault binary chop; revert . (???)

* end relu6 segfault binary chop; repo's haunted
2022-09-29 01:21:54 -04:00
..
2022-05-14 21:25:30 -07:00
2022-07-02 22:29:09 -07:00
2021-12-30 13:53:08 -05:00
2022-08-18 07:41:00 -07:00
2021-10-30 16:41:05 -07:00
2022-08-06 19:10:22 +00:00
2021-10-30 17:02:17 -07:00

This is where we scope out adding accelerators to tinygrad

ane -- Apple Neural Engine, in the M1 + newer iPhones
cherry -- Largely defunct custom hardware based on a RISC-V extension
tpu -- Google's TPU, available for rent in Google Cloud