tinygrad/examples/llama3.py at reshape_trait

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-04-29 03:00:14 -04:00

b1tg 45e2f916a3 add quantize fp8 in llama3 (#12893)

* add quantize fp8 in llama3

* don't truncate fp8 alu result

* cast to float32 before matmul

* --model weights/LLaMA-3/8B-SF-DPO/

---------

Co-authored-by: chenyu <chenyu@fastmail.com>

2025-10-27 10:22:57 -04:00

26 KiB
