PhaneeshB
7963abb8ec
remove caching for rocm args
20231029.1007
20231028.1006
2023-10-29 07:07:57 +05:30
Ean Garvey
98244232dd
Add smoothquant OPT to examples. ( #1922 )
20231027.1005
2023-10-27 12:32:12 -05:00
PhaneeshB
679a452139
fix calls and remove unused imports for check_device_drivers
20231026.1004
2023-10-27 10:30:40 +05:30
PhaneeshB
72c0a8abc8
remove dependency on external commands for driver installation check
2023-10-27 10:30:40 +05:30
Vivek Khandelwal
ea920f2955
Add sharded Falcon support
2023-10-26 21:53:25 +05:30
Phaneesh Barwaria
486202377a
update dependency on rocm/hip info command ( #1900 )
...
* add support for rocm flags
* add rocm target flag to chat args
* rm rocm libs dependency message
2023-10-26 15:18:25 +05:30
Sungsoon Cho
0c38c33d0a
Add opt_causallm_samples.py. ( #1916 )
20231025.1003
2023-10-25 11:52:51 -05:00
Ean Garvey
841773fa32
Updates to opt_causallm example ( #1905 )
...
* Updates to opt_causallm example
* Fixup opt_perf_comparison.py
* Use same filenames across opt examples.
20231024.1002
2023-10-24 10:54:39 -07:00
Stefan Kapusniak
0361db46f9
SD: Fix unet untuned opt_flags ( #1912 )
...
* correct my sloppy copy/paste for the untuned unet default compilation
flags that introduced an extra 'detach' into what should have been
'iree-global-opt-convert-1x1-filter-conv2d-to-matmul'
2023-10-24 12:47:33 -05:00
xzuyn
a012433ffd
Save hiresfix info if used ( #1914 )
2023-10-24 12:45:10 -05:00
xzuyn
5061193da3
Move Generate, Randomize Seed, & Stop Batch to same positions as txt2img ( #1915 )
2023-10-24 12:44:39 -05:00
xzuyn
bff48924be
LLaMa 2 Chat template fix ( #1913 )
20231023.1001
2023-10-23 18:51:15 -05:00
Stefan Kapusniak
825b36cbdd
Fix MLIR Textual PassPipeline Error ( #1910 )
20231022.1000
2023-10-22 07:39:52 -07:00
Stefan Kapusniak
134441957d
SD - Fix civitai download on Windows +improvements ( #1907 )
20231021.999
2023-10-21 11:17:41 -07:00
Stefan Kapusniak
7cd14fdc47
SD/UI: Use a single model selection box on UI tabs ( #1906 )
...
* Allow entry of a huggingface model id or civitai download url to be
done in the main model selection dropdown on SD tabs
* Remove separate textbox for entering huggingface model id or civitai
download url on SD Tabs
* Remove 'None' option from the model selection dropdown (no longer
needed) on SD tabs
* Update png metadata drop zone on txt2img tab to work with a single
argument for model selection
* Update UI generate functions on SD tabs to work with single argument
model selection
* Update API code for changes to the UI generate functions
* Move info about the custom model path to the logging textarea on SD
tabs
2023-10-21 10:06:05 -07:00
Ean Garvey
e6cb5cef57
Add --additional_runtime_args option and use in OPT example. ( #1855 )
...
* Add --additional_runtime_args option and use in OPT example.
Fix the func name. (#1838 )
Co-authored-by: Sungsoon Cho <sungsoon.cho@gmail.com >
20231019.997
20231020.998
2023-10-19 13:29:39 -05:00
Huang Qi
66abee8e5b
SharkInference: Fix various examples and README.md ( #1903 )
...
Follow https://github.com/nod-ai/SHARK/pull/708 , remove parameter 'func_name'
for SharkInference.
2023-10-19 09:28:36 -05:00
Ean Garvey
4797bb89f5
Stringify path for ireec.compile_file ( #1901 )
...
* Stringify path for ireec.compile_file
* Update test-models.yml
20231018.994
2023-10-18 14:59:23 -05:00
Vivek Khandelwal
205e57683a
Modify Falcon-180b-GPTQ sharded pipeline
20231017.993
2023-10-17 20:26:01 +05:30
Vivek Khandelwal
2866d665ee
Fix Sharded Falcon-180b-GPTQ Pipeline
2023-10-17 20:26:01 +05:30
Stefan Kapusniak
71d25ec5d8
SD: Fix repeatable seeds when intial seed is random ( #1893 )
20231016.992
20231015.991
2023-10-14 22:50:42 -07:00
Vivek Khandelwal
202ffff67b
Add support for sharded Falcon model
20231014.990
20231013.989
2023-10-13 22:05:10 +05:30
Ean Garvey
0b77059628
Add matmul reassociation flags ( #1891 )
20231012.988
20231012.987
2023-10-12 20:12:37 -05:00
Stefan Kapusniak
a208302bb9
Fix repeatable seeds consistency over batch counts ( #1889 )
...
* Set the input seed for the random number generator when
generating repeatable seeds to exclude any negative numbers
in the parsed seed input. The makes seeds generated for
different batch counts consistent where they have the same
input for the initial seed or set of seeds.
2023-10-12 17:15:19 -05:00
Vivek Khandelwal
b83d32fafe
Fix Falcon GPTQ Pipeline
20231011.986
2023-10-11 20:09:32 +05:30
Vivek Khandelwal
0a618e1863
Add support for Falcon GPTQ
2023-10-11 10:47:48 +05:30
Phaneesh Barwaria
a731eb6ed4
Macos fixes ( #1883 )
...
* fix venv setup for MacOS
* allow stream fuse binding on mac
* clean iree metal args
20231010.985
2023-10-09 23:36:12 -07:00
Ean Garvey
2004d16945
Revert "[SDXL] Add SDXL pipeline to SHARK ( #1731 )" ( #1882 )
...
This reverts commit 9f0a421764 .
20231009.984
2023-10-09 18:01:44 -07:00
Gaurav Shukla
6e409bfb77
fix else if syntax error
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-10-10 06:23:56 +05:30
Gaurav Shukla
77727d149c
[warning] Fix dropdown warning
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-10-10 05:18:43 +05:30
Ean Garvey
66f6e79d68
Split CPU/GPU definitions conditionally outside of torch contexts. ( #1879 )
2023-10-09 16:46:41 -07:00
Ean Garvey
3b825579a7
(LLaMa-2) Point to int4 + f32 acc .mlir for cpu ( #1878 )
...
- fixes some issues with non-system prompt invocation
Co-authored-by: Gaurav Shukla <gauravshukla789@gmail.com >
2023-10-09 14:37:35 -05:00
Abhishek Varma
9f0a421764
[SDXL] Add SDXL pipeline to SHARK ( #1731 )
...
-- This commit adds SDXL pipeline to SHARK.
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
2023-10-09 13:01:37 -05:00
Gaurav Shukla
c28682110c
[chatbot] Flag to add system prompt
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-10-09 22:17:39 +05:30
Ean Garvey
caf6cc5d8f
Switch most compile flows to use ireec.compile_file. ( #1863 )
...
* Switch most compile flows to use ireec.compile_file.
* re-add input type to compile_str path.
* Check if mlir_module exists before checking if it's a path or pyobject.
* Fix some save_dir cases
20231009.983
20231006.980
2023-10-06 23:04:43 -05:00
Ean Garvey
8614a18474
Remove tf dependencies from importer path. ( #1874 )
...
* Remove tf dependencies from import path.
* Fix formatting.
20231006.979
2023-10-06 12:27:12 -07:00
Jakub Kuderski
86c1c0c215
Add aggregate statistics to microbenchmark ( #1871 )
...
Print averaged results at the end of all iterations. Increase the
default number of iterations to 5.
Example:
```
Number of iterations: 5
Prefill: avg. 0.03 s, stddev 0.00
Decode: avg. 43.34 tokens/s, stdev 0.13
```
Also remove the -2 in the number of generated tokens -- I did not find
any evidence we need it.
2023-10-06 10:03:07 -07:00
Daniel Garvey
8bb364bcb8
enforce fp32 accumulates for cpu ( #1873 )
2023-10-06 11:34:49 -05:00
Daniel Garvey
7abddd01ec
argmax inside model + brevitas pin ( #1872 )
20231005.978
2023-10-05 20:15:21 -07:00
Abhishek Varma
2a451fa0c7
[Llama2] Add a standalone utility for dynamic and combining IRs
...
-- This script adds a standalone utility for converting Llama IRs
to dynamic and combining them as well.
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
2023-10-05 20:01:06 +05:30
Jakub Kuderski
9c4610b9da
Add microbenchmark mode to vicuna CLI ( #1864 )
...
Add flags to enable a non-internactive mode for microbenchmarking llama
models. In this mode, the system and user prompts are specified with CLI
flags, and the number of generated tokens and iterations is fixed.
Also move the stats below the response and trim any response blankspace.
20231004.977
2023-10-05 00:12:08 -04:00
powderluv
a38cc9d216
Update vulkan_utils.py for Radeon 780m igpu ( #1866 )
2023-10-04 20:33:07 -07:00
Jakub Kuderski
1c382449ec
[vulkan] Print note about module load times. NFC. ( #1862 )
...
Print a note ahead of a potentially long inactivity to set the right expectations.
Separately, we should add progress to the UI and make this loading faster.
20231004.976
20231003.975
2023-10-03 17:27:27 -04:00
Gaurav Shukla
7cc9b3f8e8
[llama cli] Fix llama cli
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com >
20231003.974
2023-10-03 20:39:53 +05:30
Gaurav Shukla
e54517e967
[UI] Disable config generator, lora train and model manager ( #1858 )
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
2023-10-02 22:34:40 -07:00
Ean Garvey
326327a799
Collect pipeline submodules for diffusers ckpt preprocessing. ( #1859 )
20231002.973
20231002.972
2023-10-03 00:29:28 -04:00
Ean Garvey
785b65c7b0
Add flag for specifying device-local caching allocator heap key. ( #1856 )
2023-10-03 00:28:39 -04:00
Sungsoon Cho
0d16c81687
Remove unused import. ( #1857 )
2023-10-02 11:36:08 -05:00
Vivek Khandelwal
8dd7850c69
Add Falcon-GPTQ support
2023-10-02 16:39:57 +05:30
Gaurav Shukla
e930ba85b4
[os] Remove os dependency from vmfb naming ( #1854 )
...
Also fixes a small ui issue for chatbot.
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com >
20231001.971
20230930.970
20230930.969
20230929.968
2023-09-29 12:38:17 -05:00