chenyu
|
0599e86186
|
replace hardcoded GPU in llama debug msg (#12102)
|
2025-09-10 13:56:40 -04:00 |
|
George Hotz
|
32e9949052
|
rename lazydata to uop (#10698)
|
2025-06-08 08:42:22 -07:00 |
|
George Hotz
|
b3b43a82c4
|
remove Tensor.no_grad, it's meaningless now [pr] (#10556)
|
2025-05-28 22:20:02 -07:00 |
|
chenyu
|
3eba3d6ee9
|
don't pass model in convert_from_huggingface and convert_from_gguf (#10094)
it only needs n_layers
|
2025-04-28 20:11:19 -04:00 |
|
Francis Lata
|
c3187087f7
|
QwQ-32B-Preview support (#7962)
* load weights with some debugging
* start running a prompt
* cleanup
* optionally permute layers and cleanup
* add validation for simple prompt
* small cleanup
* minor cleanup with formatting download links
* add a longer prompt
* add timing option
* some typings
* remove unused arg
* reset GlobalCounters
* minor cleanups
|
2024-12-04 21:46:37 -05:00 |
|