Files
tinygrad/examples
b1tg 45e2f916a3 add quantize fp8 in llama3 (#12893)
* add quantize fp8 in llama3

* don't truncate fp8 alu result

* cast to float32 before matmul

* --model weights/LLaMA-3/8B-SF-DPO/

---------

Co-authored-by: chenyu <chenyu@fastmail.com>
2025-10-27 10:22:57 -04:00
..
2025-10-14 09:07:43 -04:00
2025-06-08 08:42:22 -07:00
2025-06-08 08:42:22 -07:00
2023-03-11 16:28:10 -08:00
2025-10-08 04:54:07 -04:00
2025-02-20 18:03:09 -05:00
2024-09-24 10:08:04 +08:00
2024-11-12 22:11:40 -05:00
2025-10-16 09:55:20 -04:00
2025-10-08 04:54:07 -04:00
2025-10-27 10:22:57 -04:00
2025-02-26 13:22:08 -05:00
2025-08-10 20:33:22 -04:00
2024-07-02 21:39:01 -04:00
2025-06-05 17:17:42 -07:00
2024-05-22 20:43:21 -04:00
2023-11-28 17:36:55 -08:00