tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-15 01:48:23 -05:00

Files

Eli Frigo 801564f31b Remove POW llop and add SQRT llop (#1104 )

* fixed division by zero for fast operations

* made et closer to 0

* replace POW llop with SQRT

* updated mlops to swap SQRT and POW llops

* updated hlops to swap POW and SQRT

* added sqrt llop to cpu runtime

* added sqrt llop to cstyle codegen

* added POW llop to llvm ir codegen

* added SQRT llop to torch runtime

* moved pow from mlops to hlops

* found a better way to do reverse pow

* fixed indentation

* added SQRT llop to triton

* update docs to match new llops

* removed POW operator from assembly codegen

* added sqrt and rsqrt to pow hlop

* rewrote pow function in tensor.py

* Adjust tolerance

* Adjust for adamw

* Reduce for Adam too

* removed accidental leftover code

* removed all of accidental code

* added rsqrt test

* removed pow from mlops again

it was added back when resolving merge conflicts

---------

Co-authored-by: Jacky Lee <jla524@sfu.ca>

2023-07-05 18:07:58 -07:00

ane

move accel into extra

2023-06-23 16:38:15 -07:00

tpu

move accel into extra

2023-06-23 16:38:15 -07:00

triton

Remove POW llop and add SQRT llop (#1104 )

2023-07-05 18:07:58 -07:00

MAPPING

move accel into extra

2023-06-23 16:38:15 -07:00

README

move accel into extra

2023-06-23 16:38:15 -07:00

README

This is where we scope out adding accelerators to tinygrad

ane -- Apple Neural Engine, in the M1 + newer iPhones
tpu -- Google's TPU, available for rent in Google Cloud