Files
tinygrad/examples
chenyu 18e854cdbf shrink MLB on sharded axis (#3255)
* shrink MLB on sharded axis

use onehot structure to store the real partition. goal is unsynced batchnorm2d that can be run on multigpu for training.

draft version in https://github.com/chenyuxyz/tinygrad/pull/109

* SYNCBN flag

* test unclean shrinks

* UnsyncedBatchNorm reuses BatchNorm

* more robust pad arg check

* better types

* more tests!

* 6 gpus in benchmark

* disable slow GPUS=6 benchmark
2024-01-31 21:48:25 -05:00
..
2023-03-11 16:28:10 -08:00
2023-10-30 18:42:26 -07:00
2024-01-18 14:35:18 -08:00
2023-08-22 07:36:24 -07:00
2023-09-28 18:02:31 -07:00
2024-01-01 14:58:48 -08:00
2023-11-28 17:36:55 -08:00
2024-01-01 14:58:48 -08:00
2023-12-08 12:59:38 -08:00
2023-11-28 17:36:55 -08:00