Commit Graph

38 Commits

Each entry below lists the author, short SHA1, commit message, and date.
Vivek Khandelwal
ea920f2955 Add sharded Falcon support 2023-10-26 21:53:25 +05:30
Ean Garvey
e6cb5cef57 Add --additional_runtime_args option and use in OPT example. (#1855)
* Add --additional_runtime_args option and use in OPT example.

* Fix the func name. (#1838)

Co-authored-by: Sungsoon Cho <sungsoon.cho@gmail.com>
2023-10-19 13:29:39 -05:00
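A minimal sketch of how a flag like `--additional_runtime_args` might be consumed: the quoted string is split into individual flags and forwarded to whatever builds the runtime invocation. The parsing below is illustrative, not the repository's actual implementation.

```python
import argparse
import shlex

parser = argparse.ArgumentParser()
# Hypothetical flag: a single quoted string holding extra runtime flags,
# e.g. --additional_runtime_args="--device_allocator=caching".
parser.add_argument(
    "--additional_runtime_args",
    type=str,
    default="",
    help="Extra flags forwarded to the runtime invocation.",
)
args = parser.parse_args()

# Split the string into a list of flags to pass along to the runtime setup.
extra_runtime_args = shlex.split(args.additional_runtime_args)
print(extra_runtime_args)
```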
Ean Garvey
caf6cc5d8f Switch most compile flows to use ireec.compile_file. (#1863)
* Switch most compile flows to use ireec.compile_file.

* re-add input type to compile_str path.

* Check if mlir_module exists before checking if it's a path or pyobject.

* Fix some save_dir cases
2023-10-06 23:04:43 -05:00
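For context, a minimal sketch of the two `iree.compiler` entry points this commit switches between; the target backend and file names are placeholders.

```python
import iree.compiler as ireec

# Compile directly from a file on disk (the path most flows use after this change).
ireec.compile_file(
    "model.linalg.mlir",              # placeholder path
    target_backends=["llvm-cpu"],
    output_file="model.vmfb",
)

# compile_str remains available for in-memory modules; note that the input
# type still has to be supplied explicitly on this path.
with open("model.tosa.mlir") as f:
    vmfb_blob = ireec.compile_str(
        f.read(),
        input_type="tosa",
        target_backends=["llvm-cpu"],
    )
```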
Ean Garvey
9697981004 Pipe through a debug option to iree compile utils. (#1796)
* Update compile_utils.py

* Pipe through a flag to toggle debug options in compile utils.

* Update SharkLLMBase.py
2023-08-25 07:11:11 -07:00
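One plausible way such a debug toggle can reach the compiler is via `extra_args` on the `iree.compiler` helpers; the specific flags below are illustrative MLIR/IREE debugging options, not necessarily the ones this commit wires up.

```python
import iree.compiler as ireec

debug = True
# Illustrative debug flags; real usage would pick whichever options the
# debug toggle is meant to enable.
extra_args = ["--mlir-print-ir-after-all"] if debug else []

ireec.compile_file(
    "model.mlir",                     # placeholder path
    target_backends=["llvm-cpu"],
    extra_args=extra_args,
    output_file="model.vmfb",
)
```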
Stanley Winata
51a1009796 Add Forward method to SHARKRunner and fix examples. (#1756) 2023-08-14 19:20:37 -07:00
Abhishek Varma
cdd505e2dd [SharkInference-SharkRuntime] Adds capability to mmap vmfbs
-- This commit is based on the [VmModule.mmap() API](https://github.com/openxla/iree/pull/14124).
-- It thereby adds the capability to mmap vmfbs in SHARK.

Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
2023-06-22 20:43:40 +05:30
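A small sketch of loading a compiled .vmfb through the VmModule.mmap() API referenced above, assuming the `iree.runtime` Python bindings; the driver name and file path are placeholders.

```python
import iree.runtime as ireert

config = ireert.Config("local-task")          # placeholder driver
ctx = ireert.SystemContext(config=config)

# Memory-map the flatbuffer instead of reading it fully into memory.
vm_module = ireert.VmModule.mmap(config.vm_instance, "model.vmfb")
ctx.add_vm_module(vm_module)

# Functions are then looked up on the loaded module as usual, e.g.:
# f = ctx.modules.module["forward"]
```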
Ean Garvey
a504903dd5 Fix formatting issues. (#903) 2023-02-01 09:12:45 +05:30
Eliasj42
8111f8bf35 added ability to select gpu (#891)
Co-authored-by: Elias Joseph <elias@nod-labs.com>
2023-01-30 13:39:12 -08:00
Abhishek Varma
e60b4568c6 [SharkInference] Make SharkInference compile the entire module (#708)
* [SharkInference] Make SharkInference compile the entire module

-- Previously SharkInference was compiling and providing run APIs
   for a hardcoded function with function name "forward".
-- This commit makes the compiling functionality generic, so
   any function defined within the module can be run.
-- It also creates an API to fetch all the function names defined
   within the compiled module.
-- This commit updates both web and command-line execution of Stable
   Diffusion to use the new API of SharkInference.

Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
2023-01-03 23:25:23 +05:30
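A rough sketch of what the new calling convention could look like after this change, with the compiled function chosen per call rather than fixed at construction time. The parameter names and the `get_functions()` helper are assumptions and may not match SHARK's actual API.

```python
import numpy as np
from shark.shark_inference import SharkInference

# Assumption: `mlir_module` is an imported MLIR module containing several functions.
shark_module = SharkInference(mlir_module, device="cpu", mlir_dialect="linalg")
shark_module.compile()

# Hypothetical helper corresponding to "fetch all the function names" above.
print(shark_module.get_functions())

# Run a specific function from the compiled module by name.
inputs = (np.zeros((1, 4), dtype=np.float32),)
result = shark_module("forward", inputs)
```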
Quinn Dawkins
2bc6de650d [SD] Add support for a compiled version of the discrete Euler scheduler (#657)
* Add Shark version of euler scheduler

* Add Shark version of euler scheduler to web ui
2022-12-17 19:25:43 -08:00
aldesilv
f80b85f10c dump spv for dispatches (#509) 2022-11-23 22:34:27 -06:00
Phaneesh Barwaria
d9f4a9954a modify to get correct target triple (#485) 2022-11-13 20:13:44 -08:00
Eliasj42
7f37599a60 Added a dispatch benchmarking tool (#441)
To produce benchmarks of individual dispatches, you can add --dispatch_benchmarks=All --dispatch_benchmarks_dir=<output_dir> to your command-line arguments.

Co-authored-by: Elias Joseph <elias@nod-labs.com>
2022-10-28 14:31:03 -07:00
Ean Garvey
fd7baae548 Serialize torch-mlir CAPI module as bytecode instead of string. (#435)
* Serialize torch-mlir CAPI as bytecode instead of string.

* Minor fixes to MLIR data handling in SHARK python.
2022-10-27 14:37:15 -05:00
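Serializing the imported module as MLIR bytecode rather than a text string can be done with the MLIR Python bindings' `write_bytecode()`; a minimal sketch, assuming `module` is a module object produced by the torch-mlir Python API.

```python
import io

# Assumption: `module` is an MLIR module obtained from torch-mlir.
buf = io.BytesIO()
module.operation.write_bytecode(buf)
bytecode = buf.getvalue()

# The bytecode can be written to disk or handed to the compiler in place of
# the much larger textual form produced by str(module).
with open("model.mlirbc", "wb") as f:
    f.write(bytecode)
```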
Quinn Dawkins
1d33913d48 Add option to save and load precompiled flatbuffer (#425) 2022-10-23 16:24:09 -07:00
Quinn Dawkins
7be1d7d0be Add option for extra arguments through SharkInference.compile (#408) 2022-10-19 15:32:48 -05:00
Ean Garvey
d82b305781 Fix issues with loading .vmfb into SharkInference 2022-09-23 09:53:13 +05:30
Prashant Kumar
4e5c592094 Enable --devices to pass through CLI. 2022-07-27 22:13:52 +05:30
Ean Garvey
6401687733 Update TF pytest marks and fix benchmark runner import. (#196) 2022-07-19 15:42:32 -05:00
Ean Garvey
c5ae01311d Make PyTorch model tests importer-independent and add benchmarks. 2022-07-15 21:54:41 -05:00
Prashant Kumar
0dcf387089 Add shark_importer for torch_models. (#183)
All the torch_models are imported to gs::shark_tank.
Scripts have been updated.
2022-07-12 20:38:19 -07:00
Anush Elangovan
a7435973d9 Fix black formatting 2022-06-30 20:42:02 +00:00
Prashant Kumar
83855e7b08 Capture input information from mlir_graph and generate random inputs. 2022-06-29 22:51:53 +05:30
Prashant Kumar
b07377cbfd Refactor shark_runner and shark_inference to only support mlir_modules.
1. shark_inference is divided into shark_importer and
   shark_inference.
2. All the tank/pytorch tests have been updated.
2022-06-28 18:46:18 +05:30
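After this split, a typical flow imports the model to MLIR first and only then hands it to SharkInference. This is a hedged sketch; the class names, parameters, and call sequence are illustrative and may not match this exact revision.

```python
import torch
from shark.shark_importer import SharkImporter      # module path assumed
from shark.shark_inference import SharkInference

model = torch.nn.Linear(4, 4).eval()
example_input = torch.randn(1, 4)

# Step 1: import the torch model to an MLIR module (names are illustrative).
importer = SharkImporter(model, (example_input,), frontend="torch")
mlir_module, func_name = importer.import_mlir()

# Step 2: compile and run the MLIR module.
shark_module = SharkInference(mlir_module, func_name, device="cpu", mlir_dialect="linalg")
shark_module.compile()
result = shark_module.forward((example_input.numpy(),))
```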
Prashant Kumar
e8aa105b2a Divide iree_utils and do module imports on function calls. 2022-06-22 14:17:33 +05:30
Ean Garvey
08eda2ce35 Equip TF tests with save_mlir, save_vmfb, and benchmark to .csv options. (#148) 2022-06-21 21:54:44 -07:00
Chi_Liu
af582925f2 Enable tosa.mlir as input for SharkImporter inference (#145)
* Change shark_importer to use tosa.mlir as tflite model input from local gen_shark_tank
2022-06-20 23:15:14 -07:00
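Feeding a pre-generated tosa.mlir file (e.g. from a local gen_shark_tank run) into the inference path might look roughly like this; the file path, dialect flag, and constructor arguments are assumptions.

```python
from shark.shark_inference import SharkInference

# Assumption: albert_lite_base/tosa.mlir was produced by gen_shark_tank.
with open("albert_lite_base/tosa.mlir", "rb") as f:
    tosa_mlir = f.read()

# Hand the TOSA module to SharkInference instead of re-importing the tflite model.
shark_module = SharkInference(tosa_mlir, device="cpu", mlir_dialect="tosa")
shark_module.compile()
```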
Prashant Kumar
4d3acff484 Add tf masked lm models (17 models - 12 pass, 5 fail). (#135) 2022-06-16 07:28:00 -07:00
Prashant Kumar
380b0b7c54 Add seq_classification shark module to run seq_classification
on Hugging Face-supported models.
2022-06-14 22:36:19 +05:30
Stanley Winata
e48642f8a5 Update benchmark command to ToM and Add to test (#125)
-Update benchmark_cl to the latest benchmark_module API.
-Address a tensorflow OOM issue.
2022-06-13 23:57:05 -07:00
Ean Garvey
cbd131d588 Refactor testing framework. (#117) 2022-06-12 20:15:30 -07:00
Chi_Liu
8640c3ebfe Add shark_importer tflite module and albert_shark_test example (#110) 2022-06-09 06:17:22 -07:00
Prashant Kumar
9d2ce33bd1 yapf format python files.
Please use `yapf -i --style .style.yapf shark/*.py` to format python
files.
2022-06-06 14:34:59 +05:30
yzhang93
16c50cac15 Add the option to use tuned model in shark_runner (#79) 2022-06-02 18:25:33 -07:00
Prashant Kumar
cee02f6a61 Add frontend error checks. 2022-05-27 15:45:45 +05:30
Prashant Kumar
e5517f63f5 Add the refactored training module. 2022-05-27 14:21:34 +05:30
stanley
91867e1fc2 Add Shark Benchmark
-Introduce SharkBenchmark, which benchmarks models on regular torch, shark-py, and shark-c.
-Integrate iree-benchmark-module into Shark.
2022-05-27 00:07:31 +00:00
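A hedged sketch of what benchmarking the same model across the three backends described above might look like; the class, module path, and method names below are placeholders, not SHARK's confirmed API.

```python
import torch
from shark.shark_benchmark_runner import SharkBenchmarkRunner   # hypothetical import path

model = torch.nn.Linear(128, 128).eval()
example_input = torch.randn(1, 128)

# Placeholder API: benchmark plain torch, the SHARK Python runtime, and the
# SHARK C runtime (iree-benchmark-module under the hood) for one model.
benchmark = SharkBenchmarkRunner(model, (example_input,), device="cpu")
benchmark.benchmark_torch()
benchmark.benchmark_python()
benchmark.benchmark_c()
```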
Prashant Kumar
15c99adab8 Refactor the whole SharkInference API.
1. The SharkInference API is refactored to target any frontend or file,
   i.e. pytorch, tensorflow, mhlo, linalg, tosa.
2. Refactor all torch APIs.
2022-05-26 18:55:47 +05:30