update x86 wheel

This commit is contained in:
Alex O'Connell
2024-01-06 18:14:51 -05:00
parent 46042ad99e
commit 35e609108c
6 changed files with 27 additions and 8 deletions

View File

@@ -21,4 +21,5 @@
- "context request" from above to initiate a RAG search
[x] make llama-cpp-python wheels for "llama-cpp-python>=0.2.24"
[ ] prime kv cache with current "state" so that requests are faster
[ ] make a proper evaluation framework to run. not just loss. should test accuracy on the function calling
[ ] make a proper evaluation framework to run. not just loss. should test accuracy on the function calling
[ ] add LocalAI backend