tinygrad

Mirror of https://github.com/tinygrad/tinygrad.git, synced 2026-01-24 06:18:01 -05:00

Files

George Hotz cd97b036cc A Triton backend for tinygrad (#470)

* triton can add

* print stuff from triton

* write out file

* ops triton working

* reduce ops

* sort of works

* Triton bugfixes & implementation of remaining ops (#490)

* padding

* support pow, max, relu, gt0

* allocate return buffer

* Fix reduce

* Add tests for power op

* Fix triton illegal memory accesses and memory leak (#512)

* Fix mypy issue

* Add triton to setup.py

* Replace torch with pycuda

* Use one cuda stream for data transfer and kernels

* Remove triton submodule

* Fix memory leak by using weakrefs for caching

* Fix memory access by adding valid as mask for load

* Fix invalid kernel launches by flattening the grid (#515)

---------

Co-authored-by: Martin Loretz <20306567+martinloretzzz@users.noreply.github.com>

2023-02-01 11:53:57 -08:00

ane

Refactor getenv into helpers (#508)

2023-01-31 15:09:09 -08:00

cherry

Refactor getenv into helpers (#508)

2023-01-31 15:09:09 -08:00

cuda

Accel/cuda (#319)

2022-05-14 21:25:30 -07:00

llvm

Replace SIGN with GT0 (#511)

2023-02-01 11:01:39 -08:00

metal

a bit of work on metal

2021-12-30 13:53:08 -05:00

tpu

header

2021-10-30 16:41:05 -07:00

triton

A Triton backend for tinygrad (#470)

2023-02-01 11:53:57 -08:00

MAPPING

fix ane on new mac os x

2022-08-06 19:10:22 +00:00

README

refactor efficientnet loading

2021-10-30 17:02:17 -07:00

README

This is where we scope out adding accelerators to tinygrad.

ane -- Apple Neural Engine, in the M1 and newer iPhones
cherry -- Largely defunct custom hardware based on a RISC-V extension
tpu -- Google's TPU, available for rent in Google Cloud