Elias Joseph
16daba99fe
wip script for lowering dlrm training
2023-09-06 03:48:20 +00:00
PhaneeshB
4684afad34
update upscalar example
2023-07-28 21:06:28 +05:30
PhaneeshB
8d65456b7a
Move vulkan runtime flags to shark_args
2023-07-28 21:06:28 +05:30
jinchen62
4d5c55dd9f
Fix vicuna script ( #1697 )
2023-07-27 17:24:26 -05:00
Abhishek Varma
47f8a79c75
[MiniGPT4] Add MiniGPT4 to SHARK ( #1554 )
...
* [MiniGPT4] Add MiniGPT4 to SHARK
-- This is the first installment of MiniGPT4 in SHARK.
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
* Add int8 support for MiniGPT4
-- This commit adds int8 support for MiniGPT4.
Signed-off-by: Abhishek Varma <abhishek@nod-lab.com >
* Update .spec for MiniGPT4's config files
* black format MiniGPT4
---------
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
Signed-off-by: Abhishek Varma <abhishek@nod-lab.com >
2023-07-25 09:42:27 -07:00
Vivek Khandelwal
11f62d7fac
Minor fixes for MiniLM Training
2023-07-18 17:16:44 +05:30
Ranvir Singh Virk
9fcae4f808
Metal testing ( #1595 )
...
* Fixing metal_platform and device selection
* fixing for metal platform
* fixed for black lint formating
2023-07-08 15:22:53 -07:00
powderluv
c41f58042a
Update compile_utils.py ( #1617 )
...
* Update compile_utils.py
* Update compile_utils.py
* Update compile_utils.py
2023-07-05 10:06:48 -07:00
jinchen62
bc6fee1a0c
Add int4/int8 vicuna ( #1598 )
2023-07-05 07:01:51 -07:00
Prashant Kumar
1c32915570
Add the shark compile downstream due to https://github.com/pytorch/pytorch/pull/104185#issuecomment-1615110613 ( #1615 )
2023-07-01 08:30:58 -07:00
Abhishek Varma
d496053590
[SHARK] Add a compile API to use for quick testing of inference ( #1606 )
2023-06-28 08:40:28 -07:00
Daniel Garvey
75672c0e28
set task_topology_max_group to cpu_count ( #1594 )
...
by default. Can be overriden with a flag of the same str
2023-06-26 14:54:06 -07:00
Prashant Kumar
74a7202173
Make the tensors contiguous.
2023-06-26 17:29:54 +05:30
Prashant Kumar
27a08735db
Add the shark backend for torch.compile API. ( #1596 )
2023-06-26 03:53:32 -07:00
Prashant Kumar
6ef78ee7ba
Add cpu compile time flags. ( #1585 )
2023-06-23 07:23:26 -07:00
PhaneeshB
28e0919321
Add AMD cpu device
2023-06-23 18:47:04 +05:30
Daniel Garvey
28f4d44a6b
downloader was double downloading ( #1580 )
2023-06-22 18:30:27 -07:00
Daniel Garvey
0ca3b9fce3
fix some mmap and vicuna bugs ( #1576 )
2023-06-22 17:39:55 -05:00
Nithin Meganathan
045f2bb147
Add dispatch-level config file generator for manual annotation ( #1566 )
2023-06-22 15:11:41 -07:00
Prashant Kumar
a811b867b9
Add shark_eager mode.
...
-- Eager mode with step by step op compilation and execution.
2023-06-22 22:59:14 +05:30
Abhishek Varma
cdd505e2dd
[SharkInference-SharkRuntime] Adds capability to mmap vmfbs
...
-- This commit is based on [VmModule.mmap() API](https://github.com/openxla/iree/pull/14124 ).
-- It thereby adds capability to mmap vmfbs in SHARK.
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
2023-06-22 20:43:40 +05:30
Ranvir Singh Virk
18c8e9e51e
Metal typo fix ( #1572 )
...
* fixing typos for metal changes
* black formating
2023-06-21 21:56:11 -07:00
Ranvir Singh Virk
07c1e1d712
Adding metal_utils for iree_utils ( #1561 )
...
* Adding metal_utils for iree_utils
* Add patch for making compile API work for both MEGABYTE and MiniGPT4 (#1559 )
-- It also modifies the mega_test.py script
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
* [SD] Update unet in_channels API and add PIL metadata to spec. (#1560 )
* Fix deprecation warning for unet config.
* Include PIL metadata instead of hidden imports in SD spec.
* Fixing iree-metal-target-platform
* adding metal to txt2img pipeline
* Fixing Copyright date
* removing debug prints
* black lint formating
* fixing device dump
---------
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
Co-authored-by: Abhishek Varma <avarma094@gmail.com >
Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com >
Co-authored-by: powderluv <powderluv@users.noreply.github.com >
2023-06-21 19:09:03 -07:00
Abhishek Varma
3fb72e192e
Add patch for making compile API work for both MEGABYTE and MiniGPT4 ( #1559 )
...
-- It also modifies the mega_test.py script
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com >
2023-06-20 10:04:17 -07:00
powderluv
a544f30a8f
Move mega to the shark examples ( #1555 )
2023-06-19 11:10:51 -07:00
Vivek Khandelwal
015d0132c3
Modify falcon pipeline to add fp16 support ( #1551 )
2023-06-19 09:57:13 -07:00
Chi_Liu
f60ab1f4fa
Add Deberta to stablehlo in shark tank ( #1545 )
2023-06-16 13:24:44 -07:00
Surya Jasper
b203779462
Added Adreno target triples to vulkan_utils ( #1543 )
2023-06-15 16:42:59 -07:00
Ean Garvey
eb6d11cfed
Change mlir dialects for tf tests to stablehlo. ( #1535 )
...
* Change mlir dialects for tf tests to stablehlo
* Update shark_runner.py
2023-06-14 10:43:49 -07:00
dan
bec82a665f
mega vicuna merge
...
single endpoint in apps/language/models/scripts/vicuna.py
removed main functions from pipelines
replaced divergent utils compile with shark_importer
adds support for different precisions
2023-06-14 19:06:29 +05:30
Ean Garvey
9551490341
Remove deprecared --iree-mhlo-demote-164-to-132 flag usage. ( #1533 )
2023-06-13 22:40:47 -05:00
Ean Garvey
44273b0791
Fix conditional in transform_fx() ( #1530 )
2023-06-13 16:24:53 -05:00
Prashant Kumar
0a4c8fcb3e
Minor changes in the fx transforms.
2023-06-13 21:23:35 +05:30
Stefan Kapusniak
2fec3c8169
re-indents add_upcast in shark importer ( #1523 )
...
* The two with blocks in add_upcast appear to be underindented making
SD 1.4 break on rdna3, I've pushed them out one more tab, and then
everything appears to work again.
2023-06-12 14:41:10 -05:00
Prashant Kumar
b6dbd20250
Modify the fx transforms. ( #1521 )
...
- The bounds are set properly.
- The upcasting and downcasting is done for vicuna.
2023-06-12 09:40:14 -07:00
Nithin Meganathan
34f1295349
Add a model config generator ( #1511 )
...
Model config generator takes a PyTorch model as input and generates a JSON file with model layers and other propperties that define sharding on a particular hardware.
2023-06-09 15:32:00 -07:00
Phaneesh Barwaria
1980d7b2c3
Cpu device map ( #1515 )
...
* update cpu iree device
* fix vmfb paths vic unsharded
2023-06-09 11:27:02 -05:00
Ean Garvey
a07fe80530
Update OPT, ResNet example scripts. ( #1492 )
...
* Update API in OPT example.
* fix resnet50 script
* Add OPT1.3b test script.
2023-06-05 20:19:35 -07:00
Ean Garvey
218ed78c40
Change instances of input_type='mhlo' to 'auto' ( #1482 )
2023-06-02 16:43:47 -05:00
Foxlum
5915bf7de3
Add to and tweak vulkan configuration environments. ( #1475 )
...
* Update vulkan_target_env_utils.py
* Update vulkan_target_env_utils.py
Adjust target environment capabilities.
* Update vulkan_target_env_utils.py
black linted?
2023-06-01 22:25:20 -07:00
yzhang93
47b346d428
Modify the lowering config format for SPIRVMatmulPromoteVectorize pipeline ( #1471 )
2023-05-29 21:53:48 -07:00
Ean Garvey
54e57f7771
Revive SD downloads from shark_tank. ( #1465 )
2023-05-25 12:03:21 -05:00
Prashant Kumar
8e571d165f
Enable cpu f16 dtype tracing for the vicuna model. ( #1461 )
2023-05-24 09:37:57 -07:00
Chi_Liu
64c2b2d96b
Add gpt2 to stablehlo support in shark tank ( #1447 )
...
- Add torch decomposition support when generating shark tank
- Add gpt2 stablehlo
2023-05-22 10:45:51 -07:00
Boian Petkantchin
20114deea0
In MiniLM JAX example verify MLIR result against JAX
2023-05-16 09:54:07 -07:00
Boian Petkantchin
bdf37b5311
If device/backend is unknown pass it to IREE verbatim
2023-05-16 09:54:07 -07:00
Chi_Liu
a0097a1ead
Add mlir_type for torch_model_list.csv ( #1428 )
...
- Enable stablehlo/tosa mlir output for torch model
- Add BERT stablehlo support
2023-05-15 10:23:54 -07:00
Ean Garvey
a9bae00606
Fix vulkan device selection at compile time and adapt to IREE python changes. ( #1407 )
...
* Add support for vulkan device selection at compile time.
* Don't convert device ID to int and fix .exe imports
2023-05-12 23:31:50 -07:00
Ean Garvey
6c95ac0f37
Revert dialect registration in model annotator ( #1406 )
...
Matches https://github.com/nod-ai/SHARK-Runtime/pull/58
2023-05-09 11:50:19 -07:00
powderluv
d816cc015e
Revert "added standalone vicuna script ( #1399 )" ( #1402 )
...
This reverts commit 0e4a8ca240 .
2023-05-05 16:08:05 -07:00