tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-23 05:48:08 -05:00

Go to file

JaSpa99 2fd7004980 Implementation of SoftVC VITS SVC model (#1371 )

* [WIP]: implementation of SoftVC VITS SVC model

* fix typo

* fix whitespace

* Fully implement Generator & Synthesizer

- implement SineGen & SourceHnNSF to reconstruct source signal from F0
- source signal is added during Generator
- fix various typos
- start loading state dict for synthesizer

* Load Synthesizer weights

- Fix typos in Synthesizer
- Slightly modify vits::load_checkpoint to skip a specified layer
- Test with Saul Goodman model because Drake weights are on mega

* start work on ContentVec

- implement ConvFeatureExtractionModel for ContentVec
- start work on TransformerEncoder for ContentVec:
- this transformer probably needs its own MultiheadAttention implementation
- fix various typos in synthesizer
- add helpers to mask behavior of ~ and % operator of torch

* use normal and kaiming_normal

* Implement ContentVec

- load ContentVec weights and config from fairseq hyperparams
- use MultiHeadAttention from whisper.py
- TransformerSentenceEncoderLayer might still need some tweaking, will see during inference testing
- redid tilde()
- some cleanup

* rename the file so it can be imported

* forgot to lint

* use float() instead of cast()

* add contentvec256l9 and cleanup

* Implement SoVITS fully and run it

- Fully run sovits with .wav file
- Drake weights need to be manually downloaded for now
- Fix bugs
- Add examples/sovits_helpers
- Big TODO: INVALID Kernel for recordings > 4.5 secs

* temp fix for longer audio recordings

* Upsample no more torch

* cleanup & detailed inference time measuring

* Completely remove torch(audio)

- Implement sinc resample in tinygrad
- Load audio via Soundfile
- Some cleanups

* move stuff to helper files

* Cleanup

* fix invalid kernel

* Cleanup & add more models

* Metal sounds good after master merge

- But Synthesizer pass became much slower

* drake weights now marked save

* do load/store in numpy

* no commas needed here

* remove extra newline

* call Tensor::where on object

* use Tensor::cat instead of numpy

* pull out first iteration

* remove Sequential, Dropout, GELU, TransposeLast

* cast during loading

* clean up attention

* remove SamePad

* Major cleanup / line reduction

- Finish implementation of GroupNormMasked
- Simplify parts of TransformerEncoder
- Simplify parts of Generator
- Move all helpers to common section
- Only use repeat_expand_left for interp after SpeechEncoder
- Moved SVC-specfic ContentVec impls up (canonically)
- Proper annotations for get_encoder
- Finished all TODOs
- Squashed some whitespaces

* clean up preprocess as well

* more straightforward bool expr

* add demo mode

2023-08-13 19:43:23 -07:00

.github/workflows

distributed collectives (#1519 )

2023-08-11 10:22:07 -07:00

cache

add ff_dim to transformer

2021-11-29 12:40:52 -05:00

disassemblers/adreno

fix path linter issue

2023-04-18 19:17:41 -07:00

docs

just cmplt (#1493 )

2023-08-08 13:58:10 -07:00

examples

Implementation of SoftVC VITS SVC model (#1371 )

2023-08-13 19:43:23 -07:00

extra

Print more meaningfull hip error messages (#1530 )

2023-08-12 07:16:20 -07:00

models

Bert: use Tensor.scaled_dot_product_attention (#1528 )

2023-08-12 08:46:04 -07:00

openpilot

global -> group (#1007 )

2023-06-21 11:50:43 -07:00

test

fix casting behavior for interpreted buffers (#1525 )

2023-08-13 19:21:37 -07:00

tinygrad

fix casting behavior for interpreted buffers (#1525 )

2023-08-13 19:21:37 -07:00

weights

gitignore in weights

2023-08-02 16:26:41 +00:00

.editorconfig

Revert "update editorconfig, enforce via CI (#1343 )" (#1380 )

2023-07-31 10:35:50 -07:00

.flake8

flake8 (#1323 )

2023-07-24 11:19:58 -04:00

.gitignore

distributed world (#1481 )

2023-08-10 10:00:51 -07:00

.pre-commit-config.yaml

flake8 (#1323 )

2023-07-24 11:19:58 -04:00

.pylintrc

style: else-after-return (#1216 )

2023-07-12 10:26:38 -07:00

.tokeignore

Add a quick start guide (#900 )

2023-06-04 08:51:20 -07:00

compile.sh

stop wasting time with the compiler. tinygrad needs to just jit

2023-03-12 12:08:46 -07:00

CONTRIBUTING.md

feat: reword contributing (#1131 )

2023-07-04 22:17:47 -07:00

LICENSE

Updated LICENSE year (#760 )

2023-05-01 15:35:23 -07:00

push_pypi.sh

push pypi

2020-10-27 08:13:15 -07:00

pytest.ini

Update pytest.ini format (#1398 )

2023-08-01 18:00:51 -04:00

README.md

Outdated repository URL (#1218 )

2023-07-11 23:14:19 -07:00

rmso.sh

compile works (#688 )

2023-03-12 11:01:25 -07:00

run_multibackend.sh

convert $@ to "$@" in run_multibackend.sh (#1379 )

2023-07-31 10:39:22 -07:00

setup.py

Bert: use Tensor.scaled_dot_product_attention (#1528 )

2023-08-12 08:46:04 -07:00

strip_whitespace.sh

strip whitespace

2023-06-27 10:11:43 -07:00

sz.py

move line counter to python

2023-05-29 09:21:40 -07:00

README.md

tinygrad: For something between PyTorch and karpathy/micrograd. Maintained by tiny corp.

Homepage | Documentation | Examples | Showcase | Discord

This may not be the best deep learning framework, but it is a deep learning framework.

Due to its extreme simplicity, it aims to be the easiest framework to add new accelerators to, with support for both inference and training. If XLA is CISC, tinygrad is RISC.

tinygrad is still alpha software, but we raised some money to make it good. Someday, we will tape out chips.

Features

LLaMA and Stable Diffusion

tinygrad can run LLaMA and Stable Diffusion!

Laziness

Try a matmul. See how, despite the style, it is fused into one kernel with the power of laziness.

DEBUG=3 python3 -c "from tinygrad.tensor import Tensor;
N = 1024; a, b = Tensor.rand(N, N), Tensor.rand(N, N);
c = (a.reshape(N, 1, N) * b.permute(1,0).reshape(1, N, N)).sum(axis=2);
print((c.numpy() - (a.numpy() @ b.numpy())).mean())"

And we can change DEBUG to 4 to see the generated code.

Neural networks

As it turns out, 90% of what you need for neural networks are a decent autograd/tensor library. Throw in an optimizer, a data loader, and some compute, and you have all you need.

Neural network example (from test/models/test_mnist.py)

from tinygrad.tensor import Tensor
import tinygrad.nn.optim as optim

class TinyBobNet:
  def __init__(self):
    self.l1 = Tensor.uniform(784, 128)
    self.l2 = Tensor.uniform(128, 10)

  def forward(self, x):
    return x.dot(self.l1).relu().dot(self.l2).log_softmax()

model = TinyBobNet()
optim = optim.SGD([model.l1, model.l2], lr=0.001)

# ... complete data loader here

out = model.forward(x)
loss = out.mul(y).mean()
optim.zero_grad()
loss.backward()
optim.step()

Accelerators

tinygrad already supports numerous accelerators, including:

CPU
GPU (OpenCL)
C Code (Clang)
LLVM
METAL
CUDA
Triton
PyTorch

And it is easy to add more! Your accelerator of choice only needs to support a total of 26 (optionally 27) low level ops. More information can be found in the documentation for adding new accelerators.

Installation

The current recommended way to install tinygrad is from source.

From source

git clone https://github.com/tinygrad/tinygrad.git
cd tinygrad
python3 -m pip install -e .

Don't forget the . at the end!

Documentation

Documentation along with a quick start guide can be found in the docs/ directory.

Quick example comparing to PyTorch

from tinygrad.tensor import Tensor

x = Tensor.eye(3, requires_grad=True)
y = Tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

The same thing but in PyTorch:

import torch

x = torch.eye(3, requires_grad=True)
y = torch.tensor([[2.0,0,-2.0]], requires_grad=True)
z = y.matmul(x).sum()
z.backward()

print(x.grad.numpy())  # dz/dx
print(y.grad.numpy())  # dz/dy

Contributing

There has been a lot of interest in tinygrad lately. Here are some basic guidelines for contributing:

Bug fixes are the best and always welcome! Like this one.
If you don't understand the code you are changing, don't change it!
All code golf PRs will be closed, but conceptual cleanups are great.
Features are welcome. Though if you are adding a feature, you need to include tests.
Improving test coverage is great, with reliable non-brittle tests.

Additional guidelines can be found in CONTRIBUTING.md.

Running tests

For more examples on how to run the full test suite please refer to the CI workflow.

Some examples:

python3 -m pip install -e '.[testing]'
python3 -m pytest
python3 -m pytest -v -k TestTrain
python3 ./test/models/test_train.py TestTrain.test_efficientnet

Languages

Python 70.1%

C 17.9%

Cuda 4.8%

Assembly 2.5%

Metal 2.1%

Other 2.5%