Files
tinygrad/examples
chenyu aa76d566c2 cleanup mamba (#4004)
make it read nicer and cleanup some movement methods and math simplification.
790m, 1.4b, 2.8b model does not really run.
sampling is not implemented.
jit is incorrect.
some deadcode / wrong code path and copied from torch stuff stuff.
2024-03-30 02:50:13 -04:00
..
2023-03-11 16:28:10 -08:00
2024-03-30 00:30:30 -04:00
2023-10-30 18:42:26 -07:00
2024-03-30 00:30:30 -04:00
2024-03-30 02:50:13 -04:00
2023-08-22 07:36:24 -07:00
2023-09-28 18:02:31 -07:00
2024-01-01 14:58:48 -08:00
2023-11-28 17:36:55 -08:00
2023-12-08 12:59:38 -08:00
2024-03-14 17:33:45 -04:00