tinygrad/extra/lr_scheduler.py at 70f052d2b83fa6668fb319f9a3ff98387433cdac

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-04-29 03:00:14 -04:00

chenyu 5ae252ae83 use at least float32 for optim.lr (#4297)

* use at least float32 for optim.lr

when doing mixed precision training (float32 weights, default_float=half), still use float32 to store lr.
lr would have been upcast later in the actual weight update, but by then it would already have lost precision.
this improved resnet convergence significantly

* undo type annotation

2024-04-25 14:42:28 -04:00
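The precision loss described in the commit message can be illustrated without tinygrad. The sketch below is not the tinygrad implementation: it only emulates storing a learning rate in half precision versus float32 using the standard library's `struct` module. The helper names (`round_to`, `to_half`, `to_float32`, `rel_error`) and the sample learning rates are hypothetical, chosen for illustration.

```python
import struct

def round_to(fmt: str, x: float) -> float:
    """Round a Python float (float64) to the nearest value storable in `fmt`."""
    return struct.unpack(fmt, struct.pack(fmt, x))[0]

def to_half(x: float) -> float:
    # "e" is IEEE 754 binary16 (half precision)
    return round_to("e", x)

def to_float32(x: float) -> float:
    # "f" is IEEE 754 binary32 (single precision)
    return round_to("f", x)

def rel_error(stored: float, exact: float) -> float:
    return abs(stored - exact) / abs(exact)

# Hypothetical learning rates, as a scheduler might produce over a run.
for lr in (1e-3, 3e-4, 1e-8):
    half, single = to_half(lr), to_float32(lr)
    print(f"lr={lr:g}  half={half!r} (rel err {rel_error(half, lr):.1e})  "
          f"float32={single!r} (rel err {rel_error(single, lr):.1e})")

# Upcasting the half value afterwards does not recover the lost bits:
# the rounding already happened when the value was stored.
assert to_float32(to_half(1e-3)) == to_half(1e-3)
```

Half precision keeps 11 significant bits, so a stored value can be off by up to about 0.05% in the normal range, and values far below the smallest half subnormal round to zero. Float32 keeps 24 significant bits, so the same value is stored with a relative error of at most roughly 6e-8.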

3.7 KiB
