tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-04-29 03:00:14 -04:00

Files

Jacky Lee ef5f648e2f Tensor.scaled_dot_product_attention to match torch, used in LLaMA, and tested (#1502 )

* Implement scaled_dot_product_attention and test

* Support attn_mask

* Support is_causal too

* Use in llama

* Don't forget to reshape

* Set requires_grad=False for causal

* Remove staticmethod

* Remove extra spaces

2023-08-08 23:27:13 -07:00

mlperf

Fix naming conflict with huggingface datasets (#1161 )

2023-07-07 10:43:44 -07:00

vgg7_helpers

Renamed examples/yolo to examples/vgg7_helpers because that directory contains no yolo-related code and only helper code for vgg7. This was confusing to a new user when trying to understand the examples. (#1086 )

2023-07-01 12:04:28 -07:00

__init__.py

failing llama test

2023-03-11 16:28:10 -08:00

benchmark_train_efficientnet.py

Refactor nn.optim (#1091 )

2023-07-02 15:07:30 -07:00

compile_efficientnet.py

simple exporting models (#1344 )

2023-08-01 09:35:48 -07:00

compile_tensorflow.py

moved extras/jit.py -> tinygrad/jit.py (#599 )

2023-02-25 08:32:33 -08:00

deep_deterministic_policy_gradient.py

Add pylint trailing whitespace rule (#1314 )

2023-07-21 13:37:55 -04:00

efficientnet.py

Fix plt output comment (#1428 )

2023-08-03 23:35:52 -07:00

hlb_cifar10_torch.py

Fix naming conflict with huggingface datasets (#1161 )

2023-07-07 10:43:44 -07:00

hlb_cifar10.py

CIFAR 94.03% (#1340 )

2023-08-08 15:13:24 -07:00

index.html

simple exporting models (#1344 )

2023-08-01 09:35:48 -07:00

llama.py

Tensor.scaled_dot_product_attention to match torch, used in LLaMA, and tested (#1502 )

2023-08-08 23:27:13 -07:00

mask_rcnn.py

MaskRCNN Inference (#884 )

2023-06-25 15:37:51 -07:00

mnist_gan.py

Fix discriminator balancing in mnist_gan example (#1332 )

2023-07-23 12:43:05 -07:00

serious_mnist.py

Fix naming conflict with huggingface datasets (#1161 )

2023-07-07 10:43:44 -07:00

simple_conv_bn.py

examples: simple conv bn

2023-07-04 13:50:26 -07:00

stable_diffusion.py

add stable diffusion and llama (#1471 )

2023-08-06 21:31:51 -07:00

train_efficientnet.py

Fix naming conflict with huggingface datasets (#1161 )

2023-07-07 10:43:44 -07:00

train_resnet.py

Fix naming conflict with huggingface datasets (#1161 )

2023-07-07 10:43:44 -07:00

transformer.py

fix imports for examples/transformer.py (#1136 )

2023-07-05 08:15:13 -07:00

vgg7.py

2023-07-01 12:04:28 -07:00

vit.py

Remove Tensor.data (#565 )

2023-02-18 16:36:12 -08:00

vits.py

Corrected a few misspelled words (#1435 )

2023-08-04 16:51:08 -07:00

whisper.py

Removed dep of torch, torchaudio, kept librosa only (#1264 )

2023-08-02 13:52:04 -04:00

yolov3.py

Permute examples (#731 )

2023-03-29 05:07:06 +04:00

yolov8-onnx.py

Add pylint trailing whitespace rule (#1314 )

2023-07-21 13:37:55 -04:00

yolov8.py

Add pylint trailing whitespace rule (#1314 )

2023-07-21 13:37:55 -04:00