tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-04-07 03:00:26 -04:00

Files

JaSpa99 491e85597a Run onnx commavq model (#1537 )

* try to run commavq

* fix 0 dim, start implementing new ops

- Implement EmbedLayerNormalization
- Implement Attention

* SkipLayerNormalization and FastGelu

* use original torch model, cast inputs

* fix some ops:

- properly do Cast
- Attention: bi- and unidirectional
- FastGelu: add bias before gelu

* cleanup onnx_ops.py

* add validation option to benchmark

* cleanup imports

* add checks incase onnx2torch implements ops in future

* run onnx instead of original torch

* just skip gpu on m1

* reactivate the other models

* check for strange params & squash whitespace

* cleanup

* fix causal mask Attention

* Range doesn't need int cast

* embedding vocab_counter same dtype as input

* no need to cast

* always validate, fix PosixPath ort

---------

Co-authored-by: George Hotz <george@comma.ai>

2023-08-16 12:24:40 -07:00

dist

distributed collectives (#1519 )

2023-08-11 10:22:07 -07:00

external_copy_benchmark.py

good changes from the M1 Tensor Core project (#730 )

2023-03-29 05:11:02 +04:00

external_hlb_cifar.py

Fix naming conflict with huggingface datasets (#1161 )

2023-07-07 10:43:44 -07:00

external_llama_eval.py

Add option in llama.py to quantize weights to int8 at runtime (#1289 )