mirror of
https://github.com/acon96/home-llm.git
synced 2026-01-09 21:58:00 -05:00
update x86 wheel
This commit is contained in:
3
TODO.md
3
TODO.md
@@ -21,4 +21,5 @@
|
||||
- "context request" from above to initiate a RAG search
|
||||
[x] make llama-cpp-python wheels for "llama-cpp-python>=0.2.24"
|
||||
[ ] prime kv cache with current "state" so that requests are faster
|
||||
[ ] make a proper evaluation framework to run. not just loss. should test accuracy on the function calling
|
||||
[ ] make a proper evaluation framework to run. not just loss. should test accuracy on the function calling
|
||||
[ ] add LocalAI backend
|
||||
Reference in New Issue
Block a user