Francis Lata
|
c3187087f7
|
QwQ-32B-Preview support (#7962)
* load weights with some debugging
* start running a prompt
* cleanup
* optionally permute layers and cleanup
* add validation for simple prompt
* small cleanup
* minor cleanup with formatting download links
* add a longer prompt
* add timing option
* some typings
* remove unused arg
* reset GlobalCounters
* minor cleanups
|
2024-12-04 21:46:37 -05:00 |
|