tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-25 06:48:22 -05:00

Files

David Hou 9f66dcf718 PolynomialDecayWithWarmup + tests (#3649 )

* working PolynomialDecayWithWarmup + tests.......

add lars_util.py, oops

* keep lars_util.py as intact as possible, simplify our interface

* whitespace

* clean up

* clean up

* asserts

* test polylr for full resnet training run

* add comment

* rename

* fix do_optim

* don't cast lr

* info

* calculate from train_files

* skip it

2024-03-07 18:53:36 -05:00

dataloader.py

use hip events (#3157 )

2024-01-17 10:39:57 -08:00

helpers.py

Add MLPerf UNet3D model (#775 )

2023-05-28 20:38:19 -07:00

lr_schedulers.py

PolynomialDecayWithWarmup + tests (#3649 )

2024-03-07 18:53:36 -05:00

metrics.py

Add MLPerf UNet3D model (#775 )

2023-05-28 20:38:19 -07:00

model_eval.py

move graph.py and jit.py into features (#3376 )

2024-02-12 17:34:34 +01:00

model_spec.py

move globalcounters to ops (#2960 )

2024-01-01 14:21:02 -08:00

model_train.py

with Tensor.train() (#1935 )

2023-09-28 18:02:31 -07:00

optimizers.py

lars optimizer + tests (#3631 )

2024-03-06 18:11:01 -05:00

README

start on mlperf models

2023-05-10 16:30:49 -07:00

README

Each model should be a clean single file.
They are imported from the top level `models` directory

It should be capable of loading weights from the reference imp.

We will focus on these 5 models:

# Resnet50-v1.5 (classic) -- 8.2 GOPS/input
# Retinanet
# 3D UNET (upconvs)
# RNNT
# BERT-large (transformer)

They are used in both the training and inference benchmark:
https://mlcommons.org/en/training-normal-21/
https://mlcommons.org/en/inference-edge-30/
And we will submit to both.

NOTE: we are Edge since we don't have ECC RAM