Stella Laurenzo
9e37e03741
Clearly differentiate phases of loading modules to better understand if things are taking a long time. ( #1733 )
20230807.873
2023-08-07 14:03:12 -07:00
Stefan Kapusniak
9b8c4401b5
gpt_langchain.py fixes for pydantic ( #1722 )
2023-08-07 00:55:38 -07:00
Ean Garvey
a9f95a218b
Remove SD from all_models.csv ( #1706 )
...
Removes SD from pytests as it has its own test suite.
20230806.872
20230805.871
2023-08-05 15:55:52 -05:00
PhaneeshB
872bd72d0b
fix name check for file existence
2023-08-05 21:33:53 +05:30
Eliasj42
fd1c4db5d0
download all mlirs ( #1727 )
...
Co-authored-by: Elias Joseph <elias@nod-labs.com >
20230804.866
20230804.869
20230805.870
2023-08-04 18:22:06 -05:00
Daniel Garvey
759664bb48
add py files to pyinstaller for shark ( #1723 )
20230804.861
2023-08-04 14:10:43 -07:00
Daniel Garvey
14fd0cdd87
add missing subprocess import ( #1721 )
20230804.860
2023-08-04 15:15:22 -05:00
Daniel Garvey
a57eccc997
fix lint ( #1720 )
2023-08-04 14:54:33 -05:00
Daniel Garvey
a686d7d89f
temporarily disable langchain stuff in webui ( #1719 )
...
its breaking the exe
2023-08-04 12:48:06 -07:00
Eliasj42
ed484b8253
added functionality for int8 vicuna and 4 shards ( #1712 )
...
combined vicuna_4_shards.py and vicuna.py to reduce code duplication
Co-authored-by: Elias Joseph <elias@nod-labs.com >
20230804.858
2023-08-04 14:05:05 -05:00
gpetters94
7fe57ebaaf
Add vector database and add support on the web UI ( #1699 )
2023-08-04 13:47:19 -04:00
Nithin Meganathan
c287fd2be8
Add GPU ID's in model_confg.json by default for manual annotation ( #1718 )
20230804.857
2023-08-04 12:46:27 -05:00
Gaurav Shukla
51ec1a1360
[vicuna] Integrate sharded vicuna in web ( #1717 )
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
20230804.856
2023-08-04 11:46:53 -05:00
Gaurav Shukla
bd30044c0b
[Shard] Add sharding generation in shark studio
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
20230804.855
2023-08-04 21:51:14 +05:30
Ean Garvey
c9de2729b2
Add flag for toggling constant folding. ( #1714 )
20230804.854
20230804.853
2023-08-04 04:55:52 -07:00
Vivek Khandelwal
a5b13fcc2f
[Langchain] Patch for fixing streaming of tokens ( #1709 )
20230803.852
20230803.851
2023-08-03 10:06:49 -07:00
Stefan Kapusniak
6bb329c4af
Unsharded Vicuna: Fix Memory Error compiling mlir for lmsys/vicuna-7b-v1.3 fp16 with 64 GiB ( #1702 )
20230803.850
20230801.845
2023-08-01 06:07:56 -07:00
Vivek Khandelwal
98fb6c52df
Expand pipelines to fix streaming of tokens
20230731.844
2023-07-31 22:11:01 +05:30
Stefan Kapusniak
206c1b70f4
UI/Web: Reorder tabs to separate SD and LLM ( #1701 )
...
Shuffle the tabs around so that:
* All the SD tabs are together
* All the LLM tabs are together
* All the experimental tabs are together
20230730.843
20230729.842
2023-07-29 22:25:30 -04:00
PhaneeshB
cdb037ee54
use shark_args for vulkan debug utils flag
2023-07-30 07:54:26 +05:30
PhaneeshB
ce2fd84538
fix cpu device name for SharkStudio
2023-07-30 07:54:26 +05:30
PhaneeshB
4684afad34
update upscalar example
20230728.841
2023-07-28 21:06:28 +05:30
PhaneeshB
8d65456b7a
Move vulkan runtime flags to shark_args
2023-07-28 21:06:28 +05:30
PhaneeshB
d6759a852b
add vulkan vma alloc flag
2023-07-28 21:06:28 +05:30
Daniel Garvey
ab57af43c1
Couple of fixes for vicuna.py ( #1696 )
...
* mega vicuna merge pt 2
* add fallback to ensure compile is called
20230727.840
2023-07-27 15:53:05 -07:00
jinchen62
4d5c55dd9f
Fix vicuna script ( #1697 )
2023-07-27 17:24:26 -05:00
Vivek Khandelwal
07399ad65c
[Langchain] Remove unused code ( #1698 )
2023-07-27 11:59:54 -05:00
Vivek Khandelwal
776a9c2293
Fix for Langchain ( #1694 )
...
For CPU, remove max time stopping criteria
Fix web UI issue
20230726.839
2023-07-26 09:00:23 -07:00
Eliasj42
9d399eb988
fixed bug where device_idx was hardcoded ( #1693 )
...
Co-authored-by: Elias Joseph <elias@nod-labs.com >
20230725.838
2023-07-25 19:00:13 -05:00
Vivek Khandelwal
927b662aa7
Add Langchain SHARK Compilation support for all paths
2023-07-25 22:15:42 +05:30
Abhishek Varma
47f8a79c75
[MiniGPT4] Add MiniGPT4 to SHARK ( #1554 )
...
* [MiniGPT4] Add MiniGPT4 to SHARK
-- This is the first installment of MiniGPT4 in SHARK.
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
* Add int8 support for MiniGPT4
-- This commit adds int8 support for MiniGPT4.
Signed-off-by: Abhishek Varma <abhishek@nod-lab.com >
* Update .spec for MiniGPT4's config files
* black format MiniGPT4
---------
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
Signed-off-by: Abhishek Varma <abhishek@nod-lab.com >
2023-07-25 09:42:27 -07:00
Stefan Kapusniak
289f983f41
SD - Implement seed arrays for batch runs ( #1690 )
...
* SD Scripts and UI tabs that support batch_count can now take a
string containing a JSON array, or a list of integers, as their seed
input.
* Each batch in a run will now take the seed specified at the
corresponding array index if one exists. If there is no seed at
that index, the seed value will be treated as -1 and a random
seed will be assigned at that position. If an integer rather than
a list or json array has been, everything works as before.
* UI seed input controls are now Textboxes with info lines about
the seed formats allowed.
* UI error handling updated to be more helpful if the seed input is
invalid.
20230725.837
2023-07-24 19:22:34 -07:00
Daniel Garvey
453e46562f
mega vicuna merge pt 2 ( #1685 )
2023-07-24 12:42:20 -05:00
Gaurav Shukla
5497af1f56
[config] Add support for uploading sharding config file in chatbot ( #1689 )
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-07-24 10:18:03 -07:00
Vivek Khandelwal
f3cb63fc9c
Fix Langchain multiple device isssue ( #1688 )
2023-07-24 08:03:46 -07:00
Vivek Khandelwal
d7092aafaa
Fix multiple issue for Langchain
...
This commit fixes the following issue for the Langchain:
1.) Web UI not able to fetch results.
2.) For each query model getting reloaded.
3.) SHARK module not using user provided device and precision.
4.) Create a class for main Langchain code.
5.) Misc issues
20230723.835
20230722.834
20230721.833
2023-07-21 21:56:27 +05:30
Vivek Khandelwal
a415f3f70e
Fix Langchain Prompt issue and add web UI support ( #1682 )
2023-07-21 06:36:55 -07:00
Vivek Khandelwal
c292e5c9d7
Add Langchain CPU support and update requirements
20230720.832
20230720.831
2023-07-20 18:53:34 +05:30
Vivek Khandelwal
03c4d9e171
Add support for Llama-2-70b for web and cli, and for hf_auth_token
2023-07-20 14:57:48 +05:30
jinchen62
3662224c04
Update brevitas requirement ( #1677 )
...
also clean up useless args
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
20230720.830
2023-07-19 22:03:32 -07:00
Vivek Khandelwal
db3f222933
Revert "Add Llama2 70B option in CLI and WebUI ( #1673 )" ( #1679 )
...
This reverts commit 41e5088908 .
2023-07-19 22:02:48 -07:00
Stefan Kapusniak
68b3021325
Fixes cosmetic problems with Gradio 3.37.0 ( #1676 )
...
* Fix nod-ai logo having a white border
* Fix control labels having a black background
* Remove extra lower border below Save Prompt checkboxes in Txt2Img UI
2023-07-19 17:28:53 -07:00
AyaanShah2204
336469154d
added copy-metadata for pyyaml ( #1678 )
2023-07-19 17:27:25 -07:00
Abhishek Varma
41e5088908
Add Llama2 70B option in CLI and WebUI ( #1673 )
2023-07-19 10:41:42 -07:00
PhaneeshB
0a8f7673f4
Add README for CodeGen server
2023-07-19 23:10:23 +05:30
PhaneeshB
c482ab78da
fix second vic clearing for low mem device
2023-07-19 23:10:23 +05:30
Vivek Khandelwal
4be80f7158
Add support for the Llama-2 model
2023-07-19 20:57:08 +05:30
AyaanShah2204
536aba1424
unpinned torch_mlir ( #1668 )
...
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
20230719.828
2023-07-19 06:28:00 -07:00
Ean Garvey
dd738a0e02
small changes to opt_perf_comparison.py ( #1670 )
...
* Use longer prompts for OPT comparison script
* small tweaks
2023-07-19 06:26:50 -07:00
Daniel Garvey
8927cb0a2c
set optional vmfb download ( #1667 )
20230718.827
2023-07-18 10:57:28 -07:00