tinygrad/test
reddyn12 9b5e15db6e Mamba Implementation (#3456)
* first commit

* state back to orig

* mamba comparisons

* rm file

* rename file

* use Tensor.einsum and make default model 370M

* Cleaned code and made a comparison test

* Simplify pull request. Only has 1 mamba implementation now.

* Update prompt

* rm whitespaces

* last space

* remove Einops dependency

* rm unused code

* add tests

* rm print statement

* rm imports

* skip CLANG

* Update skipIf description

* skip model test in CI and add CLANG fix

* rm Device import

* don't be stupid

* Fix conv assign

When the prompt is too short, the conv_state assignment logic breaks. Padding the tokenized array to a minimum length of 4 fixes this. I padded with the empty-string token, but I don't know whether the proper practice is to use the PAD token instead.

* fix p1

* temp

* fix jit import

---------

Co-authored-by: schlimeszn <schlimeszn@gmail.com>
Co-authored-by: reddyn <nikidsniper@gmail.com>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2024-03-28 17:49:12 -07:00
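
The "Fix conv assign" note in the commit message above says short prompts are padded to a minimum length of 4 tokens. Below is a minimal sketch of that padding step, not the code from the PR: the helper name `pad_tokens`, the `pad_id` parameter, and the choice to pad on the left are all assumptions made for illustration.

```python
from typing import List

MIN_PROMPT_TOKENS = 4  # minimum length stated in the commit message


def pad_tokens(token_ids: List[int], pad_id: int,
               min_len: int = MIN_PROMPT_TOKENS) -> List[int]:
    """Return token_ids padded with pad_id so it has at least min_len entries.

    Assumption: padding is prepended so the real prompt tokens stay last.
    The commit message does not say which side is padded.
    """
    missing = max(0, min_len - len(token_ids))
    return [pad_id] * missing + list(token_ids)
```

Which id to pass as `pad_id` (the empty-string token or a dedicated PAD token) is the open question raised in the commit message; the sketch leaves that choice to the caller.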