Francis Lam
|
9d142430cb
|
Add option in llama.py to quantize weights to int8 at runtime (#1289)
* Add option in llama.py to quantize weights to int8 at runtime
Also added lm-eval to external
* Add support for llama-2 evaluation
|
2023-07-24 17:22:38 -07:00 |
|