jinchen62
b87efe7686
Fix venv setup for brevitas ( #1779 )
20230821.905
2023-08-21 11:58:51 -07:00
gpetters94
82b462de3a
Fix stencils for long prompts ( #1777 )
20230820.903
20230820.904
20230819.902
20230819.901
2023-08-19 00:26:51 -07:00
Daniel Garvey
d8f0f7bade
replace public with private ( #1776 )
...
unload footguns
20230818.899
2023-08-18 14:22:46 -07:00
gpetters94
79bd0b84a1
Fix an issue with diffusers>0.19.3 ( #1775 )
2023-08-18 14:06:06 -04:00
jinchen62
8738571d1e
Adapt the change of brevitas custom op name ( #1772 )
20230818.898
20230817.897
20230817.896
2023-08-17 14:24:43 -07:00
Gaurav Shukla
a4c354ce54
[version] Pin diffusers==0.19.3
...
Once the latest works with LORA train, unpin it.
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-08-17 21:27:10 +05:30
Gaurav Shukla
cc53efa89f
[cli] Fix chatbot cli
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-08-17 21:27:10 +05:30
Gaurav Shukla
9ae8bc921e
[chatbot] Fix chatbot cli and webview warning
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-08-17 21:27:10 +05:30
Gaurav Shukla
32eb78f0f9
[chatbot] Fix switching parameters in chatbot
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-08-17 19:14:17 +05:30
Ean Garvey
cb509343d9
Fix pytest benchmarks and shark_tank generation. ( #1632 )
...
- fix setup_venv.sh for benchmarks/imports etc.
- fix torch benchmarks in SharkBenchmarkRunner
- generate SD artifacts using build_tools/stable_diffusion_testing.py and --import_mlir
- decouple SD gen from tank/generate_sharktank for now
20230816.895
2023-08-16 17:48:47 -05:00
powderluv
6da391c9b1
update signtool to use /fd certHash
20230816.894
20230815.893
20230815.892
2023-08-15 15:11:40 -07:00
Ean Garvey
9dee7ae652
fix tkinter window ( #1766 )
2023-08-15 13:23:09 -07:00
Ean Garvey
343dfd901c
Update SHARK-Runtime links to SRT ( #1765 )
...
* Update nightly.yml
* Update setup_venv.ps1
* Update CMakeLists.txt
* Update shark_iree_profiling.md
* Update setup_venv.sh
* Update README.md
* Update .gitmodules
* Update CMakeLists.txt
* Update README.md
* fix signtool flags
* Update nightly.yml
* Update benchmark_utils.py
* uncomment tkinter launch
2023-08-15 12:40:44 -07:00
Ean Garvey
57260b9c37
(Studio) Add hf-hub to pyinstaller metadata ( #1761 )
20230814.887
2023-08-14 23:01:50 -05:00
Ean Garvey
18e7d2d061
Enable vae tunings for rdna3. ( #1764 )
2023-08-14 21:00:14 -07:00
Stanley Winata
51a1009796
Add Forward method to SHARKRunner and fix examples. ( #1756 )
2023-08-14 19:20:37 -07:00
Daniel Garvey
045c3c3852
enable iree-opt-const-expr-hoisting in vicuna ( #1742 )
...
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
2023-08-14 18:43:42 -07:00
Ean Garvey
0139dd58d9
Specify max allocation size in IREE compile args. ( #1760 )
2023-08-14 15:43:09 -05:00
Ean Garvey
c96571855a
prevents recompiles for cuda benchmarks + update benchmark_module path ( #1759 )
...
* xfail resnet50_fp16
* Fix cuda benchmarks and prevent recompilation.
2023-08-14 15:30:32 -05:00
PhaneeshB
4f61d69d86
add support passing iree flags for LLMs
2023-08-15 00:22:56 +05:30
Phaneesh Barwaria
531d447768
set default allocator for metal device creation ( #1755 )
2023-08-14 06:17:52 -07:00
Vivek Khandelwal
16f46f8de9
Update langchain_requirements.txt
2023-08-14 14:32:19 +05:30
Vivek Khandelwal
c4723f469f
Update langchain_requirements.txt
2023-08-14 14:32:19 +05:30
Vivek Khandelwal
d804f45a61
Update langchain_requirements.txt
2023-08-14 14:32:19 +05:30
Vivek Khandelwal
d22177f936
Update requirements.txt
2023-08-14 14:32:19 +05:30
George Petterson
75e68f02f4
Remove CUDNN
2023-08-14 14:32:19 +05:30
Gaurav Shukla
4dc9c59611
[chatbot] Add tokens generated per second ( #1753 )
20230813.883
2023-08-13 11:25:41 -07:00
Gaurav Shukla
18801dcabc
[chat] Update chatbot ui
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-08-13 18:39:22 +05:30
Gaurav Shukla
3c577f7168
[vicuna] fix shard config generator script ( #1747 )
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
20230812.882
20230811.881
20230810.880
2023-08-10 11:26:03 -07:00
Stefan Kapusniak
f5e4fa6ffe
UI/Web - Revert tab order ( #1724 )
...
* Revert ui tab order
* Reverts the tab order, so that SD, LLM, and Experimental are grouped
together again as far as is possible.
* Labelled "Generate Sharding Config" as experimental as pressing the
'Get Model Config' errors for me.
* Fix formatting in index.py
2023-08-10 11:25:36 -07:00
powderluv
48de445325
Enable caching and disable vma ( #1746 )
...
* Enable caching allocator by default
Going to toggle VMA off too and this is required for performance. Will have to monitor in the wild reports.
* Disable VMA
Disable VMA
2023-08-10 10:49:44 -07:00
Gaurav Shukla
8e90f1b81a
[vicuna] add default config in case of sharded vicuna
...
Signed-Off-by: Gaurav Shukla<gaurav@nod-labs.com >
2023-08-10 21:28:08 +05:30
Vivek Khandelwal
e8c1203be2
Fix vicuna script ( #1745 )
20230810.879
2023-08-10 06:11:14 -07:00
Vivek Khandelwal
e4d7abb519
Final patch for fixing Langchain token streaming issue ( #1744 )
20230809.878
2023-08-09 10:09:41 -07:00
powderluv
96185c9dc1
pin safetensors to 0.3.1 ( #1740 )
20230808.876
20230809.877
2023-08-08 19:24:44 -07:00
powderluv
bc22a81925
re-enable constant folding ( #1739 )
...
Tested and works well. (modulo unrelated driver issue)
2023-08-08 17:17:38 -07:00
Eliasj42
5203679f1f
Bandaid fix 2 ( #1728 )
...
* download all mlirs
* fixed install method
* download all mlirs (#1727 )
Co-authored-by: Elias Joseph <elias@nod-labs.com >
* added taggs
* fix name check for file existence
* Remove SD from all_models.csv (#1706 )
Removes SD from pytests as it has its own test suite.
* gpt_langchain.py fixes for pydantic (#1722 )
* removed dead code
---------
Co-authored-by: Elias Joseph <elias@nod-labs.com >
Co-authored-by: PhaneeshB <b.phaneesh@gmail.com >
Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com >
Co-authored-by: Stefan Kapusniak <121311569+one-lithe-rune@users.noreply.github.com >
2023-08-08 12:14:57 -05:00
Vivek Khandelwal
bf073f8f37
[Langchain] Expand pipelines to fix token streaming issue
20230807.875
2023-08-08 10:27:23 +05:30
Stella Laurenzo
cec6eda6b4
Optimize device enumeration overhead and log details on long operations. ( #1734 )
...
* Optimize device enumeration overhead and log details on long operations.
* Various fixes to add `@functools.cache` to what should be one time, expensive, device enumeration and setup activities. Cuts several seconds off of initialization on my machine.
* Add detailed tracing to actual invocations if they exceed a certain timeout or have an exception.
* Add detailed tracing to loading status.
* By default detail logging is only printed if an operation takes an excessive amount of time. All logging/timing can be printed by setting the variable `$env:SHARK_DETAIL_TRACE = "1"`
* Remove cache from unhashable functions
20230807.874
2023-08-07 17:20:53 -07:00
Stella Laurenzo
9e37e03741
Clearly differentiate phases of loading modules to better understand if things are taking a long time. ( #1733 )
20230807.873
2023-08-07 14:03:12 -07:00
Stefan Kapusniak
9b8c4401b5
gpt_langchain.py fixes for pydantic ( #1722 )
2023-08-07 00:55:38 -07:00
Ean Garvey
a9f95a218b
Remove SD from all_models.csv ( #1706 )
...
Removes SD from pytests as it has its own test suite.
20230806.872
20230805.871
2023-08-05 15:55:52 -05:00
PhaneeshB
872bd72d0b
fix name check for file existence
2023-08-05 21:33:53 +05:30
Eliasj42
fd1c4db5d0
download all mlirs ( #1727 )
...
Co-authored-by: Elias Joseph <elias@nod-labs.com >
20230804.866
20230804.869
20230805.870
2023-08-04 18:22:06 -05:00
Daniel Garvey
759664bb48
add py files to pyinstaller for shark ( #1723 )
20230804.861
2023-08-04 14:10:43 -07:00
Daniel Garvey
14fd0cdd87
add missing subprocess import ( #1721 )
20230804.860
2023-08-04 15:15:22 -05:00
Daniel Garvey
a57eccc997
fix lint ( #1720 )
2023-08-04 14:54:33 -05:00
Daniel Garvey
a686d7d89f
temporarily disable langchain stuff in webui ( #1719 )
...
its breaking the exe
2023-08-04 12:48:06 -07:00
Eliasj42
ed484b8253
added functionality for int8 vicuna and 4 shards ( #1712 )
...
combined vicuna_4_shards.py and vicuna.py to reduce code duplication
Co-authored-by: Elias Joseph <elias@nod-labs.com >
20230804.858
2023-08-04 14:05:05 -05:00
gpetters94
7fe57ebaaf
Add vector database and add support on the web UI ( #1699 )
2023-08-04 13:47:19 -04:00