Files
tinygrad/examples
chenyu e468601226 update llama attention casting (#5096)
* update llama attention casting

updated scaled_dot_product_attention middle cast and removed hard-coded half in llama attention.

* fix that
2024-06-22 10:57:17 -04:00
..
2024-06-15 16:29:39 -07:00
2023-03-11 16:28:10 -08:00
2024-05-24 17:04:19 -04:00
2023-10-30 18:42:26 -07:00
2024-05-23 15:35:26 -04:00
2024-05-24 17:04:19 -04:00
2024-05-22 20:43:21 -04:00
2023-11-28 17:36:55 -08:00
2023-12-08 12:59:38 -08:00
2024-06-16 20:47:29 -04:00