mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-01-10 23:48:01 -05:00
* hotfix: llama start_pos vmax is max_context-1

  fixes `IGNORE_OOB=0 python3 examples/llama3.py --size 1B --benchmark --temperature 0`

* hotfix: multitensor transformer test tests kv cache

---------

Co-authored-by: George Hotz <geohot@gmail.com>
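The first hotfix bounds `start_pos` so that its maximum value (vmax) is `max_context - 1`: a KV cache holding at most `max_context` entries has no valid write index past that. A rough sketch of the invariant in plain Python (the function and parameter names here are illustrative, not tinygrad's actual API):

```python
def clamp_start_pos(start_pos: int, max_context: int) -> int:
    """Clamp a KV-cache write position into the valid range [0, max_context - 1].

    Illustrative only: mirrors the idea that start_pos's upper bound (vmax)
    must be max_context - 1, so out-of-bounds indexing can never occur.
    """
    return max(0, min(start_pos, max_context - 1))

print(clamp_start_pos(5, 8))   # in range, unchanged -> 5
print(clamp_start_pos(99, 8))  # past the cache end, clamped -> 7
```

With the bound in place, index checks such as the one exercised by `IGNORE_OOB=0` can prove the access is always within the cache.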