Files
tinygrad/test/external
Elias Wahl 27613dd881 MLPerf BERT: Main training loop (#4288)
* BERT language modeling head + trunc normal initializers

* add train loop + helpers

* shuffle in dataloaders + slight changes in main loop

* beam change

* Minor changes

* random.shuffle

* HParam update

* Use deque for dataloader

* wandb bert project name

* half fixes

* BENCHMARK + remove epoch

* cast + print()

---------

Co-authored-by: chenyu <chenyu@fastmail.com>
2024-04-29 14:35:27 -04:00
..
2023-09-22 07:20:27 +08:00
2024-03-29 19:35:50 -07:00
2024-03-24 11:43:12 -04:00
2024-04-23 09:00:28 +04:00
2023-09-28 18:02:31 -07:00