Diogo a9a1df785f Webgpu support (#1077)
* initial commit

* 81 passing

* 105 passing tests

* 148 passing

* CI tests

* install dep on ci

* try opencl pkgs

* try using vulkan

* down to only 6 failing

* refactor

* cleaning up

* another test skipped due to buffer limit

* linter

* segfault

* indent fix

* another segfault found

* small touchups

* Fix max and maxpool tests

* Add constant folding

* Add javascript export script

* better asserts in codegen

* manual upcasting

* reverted token type change

* skip safetensor test due to unsupported type

* Fix efficientnet and all other model tests

* Remove np copy

* fixed indent and missing import

* manually destroy the buffer

* revert back to length

* linter errors

* removed extra val

* skip broken tests

* skipping more tests

* Make the page pretty

* Save model weights as safetensor

* Fix imagenet to c test

* Fix second imagenet to c bug

* Async and parallel kernel compilation

* workgroup support

* reversed local size

* fixed non local bug

* correct local groups

* ci experiment

* removed typo

* Fix define local by using shared memory

* Refactor

* try running on mac

* match metal tests

* add more workers

* scope down tests

* trying windows runner

* fixed windows env

* see how many it can do

* merged master

* refactor

* missed refactor

* increase test suite coverage

* missing import

* whitespace in test_efficientnet.py

* getting there

* fixed reset

* fixed bufs

* switched to cstyle

* cleanup

* min/max rename

* one more linter issue

* fixed demo

* linter

* testing ci chrome

* add unsafe webgpu arg

* add build step

* remove WEBGPU from cmd line

* use module

* try forcing directx

* trying forced metal backend

* temp disable conv2d for CI

* disable conv_transpose2d

---------

Co-authored-by: 0x4d - Martin Loretz <20306567+martinloretzzz@users.noreply.github.com>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2023-07-12 12:52:06 -07:00

tinygrad: For something between PyTorch and karpathy/micrograd. Maintained by tiny corp.

Homepage | Documentation | Examples | Showcase | Discord

This may not be the best deep learning framework, but it is a deep learning framework.

Due to its extreme simplicity, it aims to be the easiest framework to add new accelerators to, with support for both inference and training. If XLA is CISC, tinygrad is RISC.

tinygrad is still alpha software, but we raised some money to make it good. Someday, we will tape out chips.

Features

LLaMA and Stable Diffusion

tinygrad can run LLaMA and Stable Diffusion!

Laziness

Try a matmul. See how, despite being written as several separate tensor operations, it is fused into one kernel with the power of laziness.

DEBUG=3 python3 -c "from tinygrad.tensor import Tensor;
N = 1024; a, b = Tensor.rand(N, N), Tensor.rand(N, N);
c = (a.reshape(N, 1, N) * b.permute(1,0).reshape(1, N, N)).sum(axis=2);
print((c.numpy() - (a.numpy() @ b.numpy())).mean())"

And we can change DEBUG to 4 to see the generated code.
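
Because the multiply and the sum are composed lazily, tinygrad sees the whole expression before it generates code: the intermediate N×N×N broadcasted product is never materialized, and the printed mean difference against the NumPy matmul should be on the order of floating-point rounding error.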

Neural networks

As it turns out, 90% of what you need for neural networks is a decent autograd/tensor library. Throw in an optimizer, a data loader, and some compute, and you have all you need.

Neural network example (from test/models/test_mnist.py)

from tinygrad.tensor import Tensor
import tinygrad.nn.optim as optim

class TinyBobNet:
  def __init__(self):
    self.l1 = Tensor.uniform(784, 128)
    self.l2 = Tensor.uniform(128, 10)

  def forward(self, x):
    return x.dot(self.l1).relu().dot(self.l2).log_softmax()

model = TinyBobNet()
opt = optim.SGD([model.l1, model.l2], lr=0.001)  # don't shadow the optim module

# ... complete data loader here (a random-data smoke test follows below)

out = model.forward(x)
loss = out.mul(y).mean()
opt.zero_grad()
loss.backward()
opt.step()
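
To smoke-test the model without a real data loader, you can drive one training step on random stand-in data. The sketch below is not part of the original example: the batch size, the Tensor.rand inputs, and the -1.0 one-hot target encoding (which makes out.mul(y).mean() act as a scaled negative log-likelihood against the log_softmax output) are all illustrative assumptions.

import numpy as np

# hypothetical stand-in batch: 64 flattened 28x28 images
x = Tensor.rand(64, 784)

# random labels, encoded so that out.mul(y).mean() picks out
# -log p(true class) from the log_softmax output (up to scaling)
labels = np.random.randint(0, 10, size=64)
y_np = np.zeros((64, 10), dtype=np.float32)
y_np[range(64), labels] = -1.0
y = Tensor(y_np)

out = model.forward(x)
loss = out.mul(y).mean()
opt.zero_grad()
loss.backward()
opt.step()
print(loss.numpy())  # loss on the random batch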

Accelerators

tinygrad already supports numerous accelerators, including:

  • CPU
  • GPU (OpenCL)
  • C Code (Clang)
  • LLVM
  • METAL
  • CUDA
  • Triton
  • PyTorch

And it is easy to add more! Your accelerator of choice only needs to support a total of 26 (optionally 27) low-level ops. More information can be found in the documentation for adding new accelerators.
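
As a toy illustration of why such a small op set suffices (this is not tinygrad's actual backend interface; the TOY_OPS table and helper names below are hypothetical), a handful of primitives over NumPy arrays already composes into higher-level operations like ReLU and matmul:

import numpy as np

# hypothetical "backend": a table of primitive ops over numpy arrays
TOY_OPS = {
  "MUL":     lambda a, b: a * b,
  "MAX":     lambda a, b: np.maximum(a, b),
  "SUM":     lambda a, axis: a.sum(axis=axis),
  "RESHAPE": lambda a, shape: a.reshape(shape),
  "PERMUTE": lambda a, order: a.transpose(order),
}

def toy_relu(x):
  # ReLU is one primitive: max(x, 0)
  return TOY_OPS["MAX"](x, np.zeros_like(x))

def toy_matmul(a, b):
  # the same construction as the lazy matmul example above:
  # broadcasted MUL followed by a SUM reduction over the shared axis
  N = a.shape[0]
  ap = TOY_OPS["RESHAPE"](a, (N, 1, N))
  bp = TOY_OPS["RESHAPE"](TOY_OPS["PERMUTE"](b, (1, 0)), (1, N, N))
  return TOY_OPS["SUM"](TOY_OPS["MUL"](ap, bp), 2)

a, b = np.random.rand(4, 4), np.random.rand(4, 4)
assert np.allclose(toy_matmul(a, b), a @ b)

Unlike this eager toy, tinygrad composes its primitives lazily and generates code for them per accelerator, which is what enables the kernel fusion shown earlier.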

Installation

The current recommended way to install tinygrad is from source.

From source

git clone https://github.com/tinygrad/tinygrad.git
cd tinygrad
python3 -m pip install -e .

Don't forget the . at the end!

Documentation

Documentation along with a quick start guide can be found in the docs/ directory.

Quick example comparing to PyTorch

from tinygrad.tensor import Tensor

x = Tensor.eye(3, requires_grad=True)
y = Tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

The same thing but in PyTorch:

import torch

x = torch.eye(3, requires_grad=True)
y = torch.tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy
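
Both versions print the same gradients: dz/dx is [[2. 2. 2.] [0. 0. 0.] [-2. -2. -2.]] and dz/dy is [[1. 1. 1.]].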

Contributing

There has been a lot of interest in tinygrad lately. Here are some basic guidelines for contributing:

  • Bug fixes are the best and always welcome! Like this one.
  • If you don't understand the code you are changing, don't change it!
  • All code golf PRs will be closed, but conceptual cleanups are great.
  • Features are welcome, though if you are adding one, you need to include tests.
  • Improving test coverage is great, with reliable non-brittle tests.

Additional guidelines can be found in CONTRIBUTING.md.

Running tests

For more examples of how to run the full test suite, please refer to the CI workflow.

Some examples:

python3 -m pip install -e '.[testing]'
python3 -m pytest
python3 -m pytest -v -k TestTrain
python3 ./test/models/test_train.py TestTrain.test_efficientnet