tinygrad/examples at 39d962106f8145b6744a8c41951e9831526ebeef - tinygrad - AtHeartEngineering

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-07 22:23:55 -05:00

chenyu 39d962106f update llama logging (#13803)

```
REWRITE_STACK_LIMIT=1000000 SMALL=1 BASEDIR=/raid/datasets/c4-8b SAMPLES=1000 BS=8 DP=8 DEFAULT_FLOAT=bfloat16 OPTIM_DTYPE=bfloat16 LLAMA3_SIZE=8B SEQLEN=1024 PYTHONPATH=. MODEL=llama3 python3 examples/mlperf/model_train.py

    1 93.44 s run, 11.8750 loss, 0.000000000001 LR, 642.43 GB used,  19644.30 GFLOPS
    2 101.78 s run, 11.8750 loss, 0.000000000001 LR, 1454.57 GB used,  17039.35 GFLOPS
    3 7.34 s run, 11.8750 loss, 0.000000000002 LR, 1454.57 GB used, 236258.78 GFLOPS
    4 4.32 s run, 11.8750 loss, 0.000000000002 LR, 1454.57 GB used, 401488.40 GFLOPS
    5 4.36 s run, 11.9375 loss, 0.000000000003 LR, 1454.57 GB used, 398116.13 GFLOPS
    6 4.32 s run, 11.8750 loss, 0.000000000003 LR, 1454.57 GB used, 401878.60 GFLOPS
    7 4.34 s run, 11.8750 loss, 0.000000000004 LR, 1454.57 GB used, 399822.57 GFLOPS
    8 4.35 s run, 11.8750 loss, 0.000000000004 LR, 1454.57 GB used, 398512.24 GFLOPS
    9 4.36 s run, 11.8750 loss, 0.000000000005 LR, 1454.57 GB used, 397832.61 GFLOPS
   10 4.40 s run, 11.8750 loss, 0.000000000005 LR, 1454.57 GB used, 394520.83 GFLOPS
```

2025-12-22 11:28:29 -05:00
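The log in the commit message above shows the first three steps running far slower than the rest (93.44 s, 101.78 s, and 7.34 s versus roughly 4.3 s), which looks like one-time warm-up, although the source does not say so. A minimal sketch of how such output could be summarized follows. The regular expression is inferred only from the ten lines shown, and the names `parse_log`, `steady_state`, and the `warmup=3` cutoff are hypothetical helpers for illustration, not part of tinygrad.

```python
import re
from statistics import mean

# Pattern inferred from the log lines in the commit message; the actual
# logger in examples/mlperf/model_train.py may format lines differently.
LOG_LINE = re.compile(
    r"^\s*(?P<step>\d+)\s+(?P<secs>[\d.]+) s run,\s+(?P<loss>[\d.]+) loss,\s+"
    r"(?P<lr>[\d.]+) LR,\s+(?P<mem_gb>[\d.]+) GB used,\s+(?P<gflops>[\d.]+) GFLOPS\s*$"
)

FLOAT_FIELDS = ("secs", "loss", "lr", "mem_gb", "gflops")

def parse_log(text):
    """Return one dict per line that matches the step-log format."""
    rows = []
    for line in text.splitlines():
        m = LOG_LINE.match(line)
        if m is None:
            continue  # skip the command line, blank lines, and anything else
        row = {"step": int(m["step"])}
        row.update({name: float(m[name]) for name in FLOAT_FIELDS})
        rows.append(row)
    return rows

def steady_state(rows, warmup=3):
    """Average step time and throughput over steps after `warmup` (an assumed cutoff)."""
    tail = [r for r in rows if r["step"] > warmup]
    return {
        "mean_secs": mean(r["secs"] for r in tail),
        "mean_gflops": mean(r["gflops"] for r in tail),
    }

# Four lines copied from the log above.
SAMPLE = """
    1 93.44 s run, 11.8750 loss, 0.000000000001 LR, 642.43 GB used,  19644.30 GFLOPS
    3 7.34 s run, 11.8750 loss, 0.000000000002 LR, 1454.57 GB used, 236258.78 GFLOPS
    4 4.32 s run, 11.8750 loss, 0.000000000002 LR, 1454.57 GB used, 401488.40 GFLOPS
    5 4.36 s run, 11.9375 loss, 0.000000000003 LR, 1454.57 GB used, 398116.13 GFLOPS
"""

if __name__ == "__main__":
    rows = parse_log(SAMPLE)
    print(steady_state(rows))
```

Dropping the early steps keeps the slow initial iterations from skewing the averages; the cutoff of three is read off this particular log and would need adjusting for other runs.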

conversation_data

Whisper + LLAMA + VITS (#2332)

2023-12-02 15:03:46 -08:00

clean up unused imports in examples and update CI linting (#11024)

2025-06-30 08:21:27 -07:00

update llama logging (#13803)

2025-12-22 11:28:29 -05:00

JIT_BATCH_SIZE=0 in compile3 (#13245)

2025-11-12 23:12:45 -05:00

cleanup stale examples/extra (#13764)

2025-12-19 16:27:37 -04:00

rename lazydata to uop (#10698)

2025-06-08 08:42:22 -07:00

cleanup stale examples/extra (#13764)

2025-12-19 16:27:37 -04:00

leakyrelu to leaky_relu (#9270)

2025-02-26 13:22:08 -05:00

rename lazydata to uop (#10698)

2025-06-08 08:42:22 -07:00

| File | Last commit | Date |
|---|---|---|
| `__init__.py` | failing llama test | 2023-03-11 16:28:10 -08:00 |
| `beautiful_cartpole.py` | remove Tensor.no_grad, it's meaningless now [pr] (#10556) | 2025-05-28 22:20:02 -07:00 |
| `beautiful_cifar.py` | remove FUSE_ARANGE (#12511) | 2025-10-08 04:54:07 -04:00 |
| `beautiful_mnist_multigpu.py` | Fix mypy examples/beautiful_*.py (#6978) | 2024-10-10 11:34:29 -04:00 |
| `beautiful_mnist.py` | add SGD to beautiful_mnist (#13571) | 2025-12-04 12:17:29 -05:00 |
| `benchmark_onnx.py` | move frontend dir to nn [pr] (#12470) | 2025-10-07 10:42:22 +08:00 |
| `compile_efficientnet.py` | CLANG -> CPU (#9189) | 2025-02-20 18:03:09 -05:00 |
| `compile_tensorflow.py` | move frontend dir to nn [pr] (#12470) | 2025-10-07 10:42:22 +08:00 |
| `flux1_seed0.png` | Flux.1 (#6334) | 2024-09-24 10:08:04 +08:00 |
| `gpt2.py` | fix gpt2 with benchmark (#12736) | 2025-10-16 09:55:20 -04:00 |
| `gradaccum_mnist.py` | update examples/gradaccum_mnist.py to use the JIT | 2025-12-03 16:11:42 -08:00 |
| `hlb_cifar10.py` | remove FUSE_ARANGE (#12511) | 2025-10-08 04:54:07 -04:00 |
| `llama3.py` | move tiktoken import in llama3 (#13316) | 2025-11-17 14:09:37 -05:00 |
| `llama.py` | replace hardcoded GPU in llama debug msg (#12102) | 2025-09-10 13:56:40 -04:00 |
| `mamba.py` | added top k sampling to examples/mamba (#12061) | 2025-09-14 15:27:34 -04:00 |
| `minrf.py` | remove Tensor.no_grad, it's meaningless now [pr] (#10556) | 2025-05-28 22:20:02 -07:00 |
| `mixtral.py` | Subtract 1 from Variable upper bound (#10715) | 2025-06-09 09:25:53 -07:00 |
| `mnist_gan.py` | leakyrelu to leaky_relu (#9270) | 2025-02-26 13:22:08 -05:00 |
| `olmoe.py` | remove .float calls in olmoe (#11610) | 2025-08-10 20:33:22 -04:00 |
| `qwq.py` | replace hardcoded GPU in llama debug msg (#12102) | 2025-09-10 13:56:40 -04:00 |
| `sdv2.py` | fix: dead sdv2 download link (#13521) | 2025-12-01 22:50:53 -08:00 |
| `sdxl_seed0.png` | fix failed threefry (#10646) | 2025-06-05 17:17:42 -07:00 |
| `sdxl.py` | don't validate output in sdxl with fakeweights (#12160) | 2025-09-13 21:47:51 -04:00 |
| `stable_diffusion_seed0.png` | default threefry (#6116) | 2024-09-25 17:45:13 +08:00 |
| `stable_diffusion.py` | fix n^2 _apply_map_to_tensors [pr] (#13443) | 2025-11-24 18:59:16 -08:00 |
| `stunning_mnist.py` | clean up unused imports in examples and update CI linting (#11024) | 2025-06-30 08:21:27 -07:00 |
| `test_onnx_imagenet.py` | delete DONT_REALIZE_EXPAND and DONT_GROUP_REDUCES (#12744) | 2025-10-16 14:11:33 -04:00 |
| `test_pkl_imagenet.py` | more stuff from DSP (#9689) | 2025-04-02 15:27:48 +08:00 |
| `torch_cuda_kernel.py` | clean up unused imports in examples and update CI linting (#11024) | 2025-06-30 08:21:27 -07:00 |
| `train_resnet.py` | fix fromarray depreceation (#12512) | 2025-10-08 09:13:26 -04:00 |
| `transformer.py` | fixing transformer training bug (#9877) | 2025-04-13 19:34:20 -04:00 |
| `vgg7.py` | clean up unused imports in examples and update CI linting (#11024) | 2025-06-30 08:21:27 -07:00 |
| `whisper.py` | whisper: fix oob, explicit dtype (#13144) | 2025-11-07 12:55:01 -05:00 |
| `yolov3.py` | fix bugs at examples/yolov3.py (#11614) | 2025-08-11 21:14:47 -04:00 |
| `yolov8-onnx.py` | move frontend dir to nn [pr] (#12470) | 2025-10-07 10:42:22 +08:00 |
| `yolov8.py` | clean up unused imports in examples and update CI linting (#11024) | 2025-06-30 08:21:27 -07:00 |