PhaneeshB
94594542a9
remove use of vulkaninfo
2023-09-28 21:57:00 +05:30
Gaurav Shukla
82f833e87d
[vulkan] Update vmfb naming
...
Update vmfb naming for vulkan devices in order to resolve naming
conflicts in the presence of multiple vulkan devices.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-09-28 14:52:11 +05:30
Nelson Sharpe
6773278ec2
Fix checkpoint_path unexpected argument (#1832)
2023-09-24 14:17:52 -07:00
Abhishek Varma
9a0efffcca
[Llama2] Fix wrong Vulkan device ID + Add Vulkan compile flags
...
-- This commit fixes the wrong Vulkan device being selected during
runtime.
-- It also adds a couple of IREE compilation flags to target a specific
Vulkan device.
-- It also changes the Vulkan device listing to be more in tune with
lowering control flow.
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
2023-09-22 22:24:18 +05:30
zjgarvey
9eceba69b7
local_tank_cache included into clear_all (#1833)
2023-09-18 00:27:23 -05:00
Ean Garvey
684943a4a6
(SD) Fix tokenizers imports in pyinstaller builds. (#1828)
...
* Fix tokenizers metadata.
* (SD) Disable VAE lowering configs (rdna3) and add versioned tunings.
* Update sd_annotation.py
* (SD) Add cv2 to spec.
* Update stencil pipeline with the new img2img arg.
2023-09-12 12:23:48 -05:00
PhaneeshB
b817bb8455
add roles for llama2
2023-09-12 10:59:28 +05:30
Ean Garvey
780f520f02
Fix vk.target_env extensions and remove redundant SD imports. (#1826)
...
* Remove redundant IREE runtime imports.
* Fix vulkan target env extensions.
2023-09-11 13:42:52 -05:00
Abhishek Varma
bde63ee8ae
Add logging feature in WebUI (#1821)
2023-09-08 05:48:05 -07:00
Gaurav Shukla
ede6bf83e2
[vicuna] Disabling the IR generation path
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-09-06 20:13:17 +05:30
Gaurav Shukla
d2f64eefa3
[chatbot] Remove few outdated models from list (#1814)
2023-09-04 09:26:32 -07:00
jinchen62
4c3d8a0a7f
Enable downloading vmfb/mlir for webui (#1807)
2023-08-31 11:05:47 -07:00
Ean Garvey
9c8cbaf498
Add support for ROCM (Windows) in Studio + compile utils (#1770)
...
* WIP: MSVC ROCM support for SHARK Studio
* Make get_iree_rocm_args platform-agnostic.
* Update stable_args.py
* Update rocm arg handling in SD utils
* Guard quantization imports.
Co-authored-by: jam https://github.com/jammm
2023-08-25 20:56:05 -07:00
Ean Garvey
9697981004
Pipe through a debug option to iree compile utils. (#1796)
...
* Update compile_utils.py
* Pipe through a flag to toggle debug options in compile utils.
* Update SharkLLMBase.py
2023-08-25 07:11:11 -07:00
Ean Garvey
8e3860c9e6
Remove flags that are default in upstream IREE (#1785)
...
* Remove index bits flags now set by default
* Update shark_studio_imports.py
2023-08-24 11:57:54 -05:00
xzuyn
e37d6720eb
Add Hires Fix (#1787)
...
* improper test hiresfix
* add sliders & use `clear_cache`
* add resample choices & fix step adjustment
* add step adjustment to img2img
* add resample options to img2img
* simplify hiresfix
- import `img2img_inf` from `img2img_ui.py` instead of just copying it into `txt2img_ui.py`
* set `hri` to None after using
* add more resample types, and don't show output until hiresfix is done
* cleaner implementation
* ran black
* ran black again with jupyter dependencies
2023-08-24 09:01:41 -07:00
Abhishek Varma
db990826d3
Add Llama2 13B int4 fp16 support (#1784)
...
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
2023-08-23 10:00:32 -07:00
gpetters94
7ee3e4ba5d
Add stencil_unet_512 support (#1778)
...
This should fix any remaining issues with stencils and long prompts.
2023-08-22 12:23:46 -04:00
gpetters94
82b462de3a
Fix stencils for long prompts (#1777)
2023-08-19 00:26:51 -07:00
gpetters94
79bd0b84a1
Fix an issue with diffusers>0.19.3 (#1775)
2023-08-18 14:06:06 -04:00
Gaurav Shukla
9ae8bc921e
[chatbot] Fix chatbot cli and webview warning
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-17 21:27:10 +05:30
Gaurav Shukla
32eb78f0f9
[chatbot] Fix switching parameters in chatbot
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-17 19:14:17 +05:30
Ean Garvey
9dee7ae652
fix tkinter window (#1766)
2023-08-15 13:23:09 -07:00
Ean Garvey
343dfd901c
Update SHARK-Runtime links to SRT (#1765)
...
* Update nightly.yml
* Update setup_venv.ps1
* Update CMakeLists.txt
* Update shark_iree_profiling.md
* Update setup_venv.sh
* Update README.md
* Update .gitmodules
* Update CMakeLists.txt
* Update README.md
* fix signtool flags
* Update nightly.yml
* Update benchmark_utils.py
* uncomment tkinter launch
2023-08-15 12:40:44 -07:00
Ean Garvey
57260b9c37
(Studio) Add hf-hub to pyinstaller metadata (#1761)
2023-08-14 23:01:50 -05:00
Ean Garvey
18e7d2d061
Enable vae tunings for rdna3. (#1764)
2023-08-14 21:00:14 -07:00
PhaneeshB
4f61d69d86
add support passing iree flags for LLMs
2023-08-15 00:22:56 +05:30
Gaurav Shukla
4dc9c59611
[chatbot] Add tokens generated per second (#1753)
2023-08-13 11:25:41 -07:00
Gaurav Shukla
18801dcabc
[chat] Update chatbot ui
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-13 18:39:22 +05:30
Gaurav Shukla
3c577f7168
[vicuna] fix shard config generator script (#1747)
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-10 11:26:03 -07:00
Stefan Kapusniak
f5e4fa6ffe
UI/Web - Revert tab order (#1724)
...
* Revert ui tab order
* Reverts the tab order, so that SD, LLM, and Experimental are grouped
together again as far as is possible.
* Labelled "Generate Sharding Config" as experimental, as pressing
'Get Model Config' errors for me.
* Fix formatting in index.py
2023-08-10 11:25:36 -07:00
Gaurav Shukla
8e90f1b81a
[vicuna] add default config in case of sharded vicuna
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-10 21:28:08 +05:30
Vivek Khandelwal
e4d7abb519
Final patch for fixing Langchain token streaming issue (#1744)
2023-08-09 10:09:41 -07:00
powderluv
bc22a81925
re-enable constant folding (#1739)
...
Tested and works well. (modulo unrelated driver issue)
2023-08-08 17:17:38 -07:00
Daniel Garvey
759664bb48
add py files to pyinstaller for shark (#1723)
2023-08-04 14:10:43 -07:00
Daniel Garvey
a57eccc997
fix lint (#1720)
2023-08-04 14:54:33 -05:00
Daniel Garvey
a686d7d89f
temporarily disable langchain stuff in webui (#1719)
...
it's breaking the exe
2023-08-04 12:48:06 -07:00
Eliasj42
ed484b8253
added functionality for int8 vicuna and 4 shards (#1712)
...
combined vicuna_4_shards.py and vicuna.py to reduce code duplication
Co-authored-by: Elias Joseph <elias@nod-labs.com>
2023-08-04 14:05:05 -05:00
gpetters94
7fe57ebaaf
Add vector database and add support on the web UI (#1699)
2023-08-04 13:47:19 -04:00
Gaurav Shukla
51ec1a1360
[vicuna] Integrate sharded vicuna in web (#1717)
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-04 11:46:53 -05:00
Gaurav Shukla
bd30044c0b
[Shard] Add sharding generation in shark studio
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-04 21:51:14 +05:30
Ean Garvey
c9de2729b2
Add flag for toggling constant folding. (#1714)
2023-08-04 04:55:52 -07:00
Vivek Khandelwal
a5b13fcc2f
[Langchain] Patch for fixing streaming of tokens (#1709)
2023-08-03 10:06:49 -07:00
Stefan Kapusniak
206c1b70f4
UI/Web: Reorder tabs to separate SD and LLM (#1701)
...
Shuffle the tabs around so that:
* All the SD tabs are together
* All the LLM tabs are together
* All the experimental tabs are together
2023-07-29 22:25:30 -04:00
PhaneeshB
cdb037ee54
use shark_args for vulkan debug utils flag
2023-07-30 07:54:26 +05:30
PhaneeshB
ce2fd84538
fix cpu device name for SharkStudio
2023-07-30 07:54:26 +05:30
PhaneeshB
8d65456b7a
Move vulkan runtime flags to shark_args
2023-07-28 21:06:28 +05:30
PhaneeshB
d6759a852b
add vulkan vma alloc flag
2023-07-28 21:06:28 +05:30
Vivek Khandelwal
776a9c2293
Fix for Langchain (#1694)
...
For CPU, remove max time stopping criteria
Fix web UI issue
2023-07-26 09:00:23 -07:00
Abhishek Varma
47f8a79c75
[MiniGPT4] Add MiniGPT4 to SHARK (#1554)
...
* [MiniGPT4] Add MiniGPT4 to SHARK
-- This is the first installment of MiniGPT4 in SHARK.
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
* Add int8 support for MiniGPT4
-- This commit adds int8 support for MiniGPT4.
Signed-off-by: Abhishek Varma <abhishek@nod-lab.com>
* Update .spec for MiniGPT4's config files
* black format MiniGPT4
---------
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
Signed-off-by: Abhishek Varma <abhishek@nod-lab.com>
2023-07-25 09:42:27 -07:00