Files
tinygrad/examples
chenyu 61e255d197 use max for gpt2 and llama (#2949)
not using argmax yet because there's a multinomial outside of function.
2023-12-28 23:26:00 -05:00
..
2023-12-05 16:17:57 -08:00
2023-03-11 16:28:10 -08:00
2023-12-20 17:03:41 -08:00
2023-12-28 23:26:00 -05:00
2023-10-30 18:42:26 -07:00
2023-12-10 22:04:35 -08:00
2023-08-22 07:36:24 -07:00
2023-09-28 18:02:31 -07:00
2023-11-28 17:36:55 -08:00
2023-12-08 12:59:38 -08:00
2023-11-28 17:36:55 -08:00