Files
tinygrad/extra/models
chenyu e468601226 update llama attention casting (#5096)
* update llama attention casting

updated scaled_dot_product_attention middle cast and removed hard-coded half in llama attention.

* fix that
2024-06-22 10:57:17 -04:00
..
2023-11-28 17:36:55 -08:00
2024-03-14 20:44:34 -07:00
2024-06-14 15:38:45 -04:00
2023-11-28 17:36:55 -08:00
2024-05-24 17:04:19 -04:00
2024-01-12 14:13:40 -05:00
2023-11-28 17:36:55 -08:00
2023-11-28 17:36:55 -08:00