George Hotz
|
f432ec9c33
|
Bitcast hip fix + fix mixtral (#3022)
* fix bitcast in hip
* wrong dtype for precast, double COPY
|
2024-01-05 14:51:25 -08:00 |
|
chenyu
|
f88506e630
|
move gpt2/llama sampling inside the model call (#3013)
* move gpt2/llama sampling inside the model call
* argmax uses one more kernel
|
2024-01-04 17:01:50 -05:00 |
|
Ivan Vnučec
|
8d206f6bfd
|
fix help message (#2705)
llama -> mixtral
|
2023-12-10 22:04:35 -08:00 |
|
George Hotz
|
59ab3675a3
|
faster mixtral + green for new kernels (#2701)
* green for new kernels
* track ram
|
2023-12-10 19:04:58 -08:00 |
|
George Hotz
|
b01e3907a1
|
mixtral touch up: two lines
|
2023-12-10 17:21:49 -08:00 |
|
George Hotz
|
b3982187d1
|
Mixtral Example (#2691)
* mixtral
* simpler
* global counters
* simpler
* weights arg
|
2023-12-10 17:18:31 -08:00 |
|