Files
tinygrad/examples
chenyu e22d78b3d2 training cifar with BF16 on CUDA (#3905)
* training cifar with BF16 on CUDA

memory usage is between float and half due to numpy calls on dataset preprocessing, which converts into float.

* simpler bf16 functions

* bf16 cifar works for HSA too just very slow

* simpler bf16 functions, we love cuda
2024-03-24 01:37:47 -04:00
..
2023-03-11 16:28:10 -08:00
2024-03-14 20:44:34 -07:00
2023-10-30 18:42:26 -07:00
2023-08-22 07:36:24 -07:00
2023-09-28 18:02:31 -07:00
2024-01-01 14:58:48 -08:00
2024-03-14 13:34:14 -07:00
2023-11-28 17:36:55 -08:00
2023-12-08 12:59:38 -08:00
2024-03-14 17:33:45 -04:00