AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-02-19 11:56:43 -05:00

Author	SHA1	Message	Date
Boian Petkantchin	79267931c1	Add argument --additional_compile_args (#1119) This allows passing more arguments to the IREE compiler. Example: python my-app.py --additional_compile_args="--mlir-pretty-debuginfo --mlir-timing" Co-authored-by: Boian Petkantchin <boian@nod-labs.com>	2023-09-19 11:26:03 -05:00
Gaurav Shukla	11bdce9790	[flags] Fix vulkan runtime flags as vma is dropped from iree (#1831)	2023-09-14 08:58:59 -05:00
powderluv	48de445325	Enable caching and disable vma (#1746) * Enable caching allocator by default Going to toggle VMA off too and this is required for performance. Will have to monitor in the wild reports. * Disable VMA Disable VMA	2023-08-10 10:49:44 -07:00
PhaneeshB	8d65456b7a	Move vulkan runtime flags to shark_args	2023-07-28 21:06:28 +05:30
Daniel Garvey	75672c0e28	set task_topology_max_group to cpu_count (#1594) by default. Can be overridden with a flag of the same name.	2023-06-26 14:54:06 -07:00
Ean Garvey	2191fc8952	Separate pytest benchmark modes and fix model updates for SHARK downloader / pytest. (#1264) * Only xfail windows models in CI * downloader: make model updates more robust. * Separate baseline and native benchmarks in pytest. * Fix native benchmarks * Fix torchvision model utils.	2023-04-03 08:24:21 -07:00
Boian Petkantchin	a8ce7680db	Add flag to augment the device allocator (#1182) Example: $ python my_app.py --device_allocator caching debug This wraps the device allocator first with the caching allocator, then with the debug allocator. $ python my_app.py --device_allocator caching This wraps it with only the caching allocator. Co-authored-by: Boian Petkantchin <boian@nod-labs.com>	2023-03-13 15:49:26 -07:00
Ean Garvey	1eb9436836	Fix generate_sharktank args.	2023-02-07 14:06:07 +05:30
Ean Garvey	481d0553d8	Remove unnecessary repro_dir / shark_tmp usage	2023-02-07 14:06:07 +05:30
aldesilv	b3fc0f29cc	enable additional flags for tank test models (#866) Co-authored-by: Alex <alexander@nod-labs.com>	2023-02-02 11:19:33 -08:00
Ean Garvey	a90812133b	Enable pytests on Windows (#901)	2023-02-01 18:36:41 -06:00
Ean Garvey	a14a47af12	Move most xfails to entries in tank/all_models.csv and temporarily remove multiprocessing and TF gpu support. (#646) - Adds the date variable back to nightly.yml so shark_tank uploads are dated again. - Adds a specification for nightly pytests to not run tests on metal (vulkan is sufficient). - Adds some paths/filetypes to be ignored when triggering workflow runs (no test-models on changes to .md files or anything in the shark/examples/ directory or its subdirectories). - pytest only picks up tank/test_models.py, so there is no need to specify which file to run when running pytest from the SHARK base directory. - Cleans up xfails so that they can be added to models as csv entries. Columns 7-9 in all_models.csv trigger xfails with cpu, cuda, vulkan, respectively, and column 10 can be populated with a reason for the xfails. - Fixes a few defaults for shark_args and pytest args (defined in conftest.py). - Fixes the --update_tank option in shark_downloader. - Removes some multiprocessing in pytest and TF+CUDA support because it breaks pytest and produces false passes, leaving regressions at large. - Adds xfails for albert torch and removes it from the gen_sharktank list (tank/torch_model_list.csv). - Cleans up xfails for cpu, cuda, vulkan (removing old ones).	2022-12-16 12:56:32 +05:30
powderluv	d946cffabc	Revert "Move most xfails to entries in tank/all_models.csv and temporarily remove multiprocessing and TF gpu support. (#602)" (#622) This reverts commit `fe618811ee`.	2022-12-13 21:49:46 -08:00
Ean Garvey	fe618811ee	Move most xfails to entries in tank/all_models.csv and temporarily remove multiprocessing and TF gpu support. (#602) * Move most xfails to entries in tank/all_models.csv * enable usage of pytest without specifying tank/test_models.py * add dict_configs.py to gitignore. * Pin versions for runtimes and torch-mlir for setup.	2022-12-13 18:11:17 -08:00
Ean Garvey	0225292a44	Remove print statements from compile utils (#593)	2022-12-08 13:40:47 -08:00
Ean Garvey	40eea21863	Enable conv nchw-to-nhwc flag by default for most models + minor fixes (#584)	2022-12-07 16:24:02 -08:00
Eliasj42	7f37599a60	Added a dispatch benchmarking tool (#441) To produce benchmarks of individual dispatches, you can add --dispatch_benchmarks=All --dispatch_benchmarks_dir=<output_dir> to your command line argument. Co-authored-by: Elias Joseph <elias@nod-labs.com>	2022-10-28 14:31:03 -07:00
Ean Garvey	9035a2eed3	Add --local_tank_cache flag and update requirements. (#368) * Add --local_tank_cache flag and update requirements. * Update requirements-importer.txt	2022-09-28 03:02:59 -05:00
Ean Garvey	6cf5564c84	Remove "gpu" device alias and migrate to using "cuda" for NVIDIA GPU. (#325) * Replace instances of "gpu" alias for devices with "cuda"	2022-09-13 01:16:56 -05:00
Ean Garvey	6aeda3670f	Split nightly workflow by backend (IREE / SHARK) (#313) * Fix validation for nightly builds. * Add option to generate shark_tank inside SHARK project Add shark_arg for updating tank on mismatched hash (downloader) * Fixup CI tank dir option. * Fixup work directory variable	2022-09-09 22:51:30 +05:30
Daniel Garvey	d45a496030	adds a flag to enable directory choice (#303) Individual tests will require implementation of the flag; alternatively, simply passing shark_default_sha in your individual app's download function will allow for this behavior.	2022-08-31 22:17:40 -07:00
Ean Garvey	3514822cac	Improvements to pytest benchmarks. (#267) * Add ONNX env var flags for venv setup. * Setup arguments for ONNX benchmarking via pytest. * Enable ONNX benchmarking on MiniLM via pytest (experimental) * Fix sequence lengths to 128 for TF model creation and fix issue with benchmarks. * Disable CI CPU benchmarks on A100, change some default args. * add xfails for roberta TF model tests on GPU.	2022-08-17 02:29:48 -05:00
Ean Garvey	a8b021dc8d	Add benchmarks to MHLO miniLM and resnet50 and add dialect, num_iterations (#264)	2022-08-16 13:55:40 -05:00
Ean Garvey	23619068eb	Disable passing of sm_arch to iree-compile CL args by default. (#253) * Disable passing of sm_arch to iree-compile CL args by default. * Fix formatting.	2022-08-10 01:19:24 -07:00
Prashant Kumar	0dcf387089	Add shark_importer for torch_models. (#183) All the torch_models are imported to gs::shark_tank. Scripts have been updated.	2022-07-12 20:38:19 -07:00
Anush Elangovan	a7435973d9	Fix black formatting	2022-06-30 20:42:02 +00:00
Ean Garvey	0a6bc6e17f	Generate test-specific repro path for each TF model test. (#158) Set TempFileSaver path directory to shark_args.repro_dir	2022-06-23 21:58:45 -07:00
Chi_Liu	fa0aaf63c2	Fix bug where repro_dir cannot export/write the MLIR file (#157) - Set repro_dir as a temporary dir within the current working directory.	2022-06-23 18:37:16 -07:00
powderluv	a7ca9b8d68	Update parser.py	2022-06-23 15:26:17 -07:00
Prashant Kumar	e8aa105b2a	Divide iree_utils and do module imports on function calls.	2022-06-22 14:17:33 +05:30
Prashant Kumar	9d2ce33bd1	yapf format python files. Please use `yapf -i --style .style.yapf shark/*.py` to format python files.	2022-06-06 14:34:59 +05:30
Stanley Winata	c70ab55717	Add saving and loading of mlir to run. (#89)	2022-06-05 08:42:40 -07:00
yzhang93	16c50cac15	Add the option to use tuned model in shark_runner (#79)	2022-06-02 18:25:33 -07:00
Stanley Winata	5ece8f6f5d	Fix Test Issues on macos (#57) - Added a separate pytest config for macos - Adjusted shark parser to ignore unknown args	2022-05-24 14:40:35 -07:00
Prashant Kumar	a280b0a651	Add decompositions for bert_training.	2022-05-09 16:39:11 +05:30
Prashant Kumar	08df345fbc	Add make_fx whole graph extract.	2022-05-04 20:58:59 +05:30
Vivek Khandelwal	7f6159462d	Add num_iterations and num_warmup_iterations args.	2022-05-04 20:38:19 +05:30
Prashant Kumar	db5be15310	Centralize the parser location. Also add the --device flag. Centralized the shark_args parser. Also added the --device flag that specifies the device on which the inference or training is to be done.	2022-05-02 15:35:22 +05:30

38 Commits
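
Several of the commits above describe command-line flags on a centralized parser (--device, --num_iterations, --num_warmup_iterations, --additional_compile_args, --device_allocator) and a parser that ignores unknown arguments. The following is a minimal sketch of how such flags could be wired up with Python's standard argparse. It is not the repository's actual parser: the function names `build_parser` and `parse_flags`, the default values, and the help strings are all assumptions made for illustration. Only the flag names and the example invocations come from the commit messages.

```python
import argparse
import shlex


def build_parser():
    """Hypothetical stand-in for a centralized flag parser (not the real one)."""
    parser = argparse.ArgumentParser(description="illustrative flag parser")
    # Flag names are taken from the commit messages; defaults are assumed.
    parser.add_argument("--device", type=str, default="cpu",
                        help="device on which inference or training runs")
    parser.add_argument("--num_iterations", type=int, default=1)
    parser.add_argument("--num_warmup_iterations", type=int, default=0)
    # A single string holding extra compiler flags, split later.
    parser.add_argument("--additional_compile_args", type=str, default="")
    # Zero or more allocator wrappers, applied in the order given.
    parser.add_argument("--device_allocator", type=str, nargs="*", default=[])
    return parser


def parse_flags(argv):
    """Parse known flags and return (args, unknown) without failing on extras."""
    args, unknown = build_parser().parse_known_args(argv)
    # Turn the quoted string into a list of individual compiler flags.
    args.additional_compile_args = shlex.split(args.additional_compile_args)
    return args, unknown


if __name__ == "__main__":
    parsed, extras = parse_flags([
        "--device", "vulkan",
        "--additional_compile_args=--mlir-pretty-debuginfo --mlir-timing",
        "--device_allocator", "caching", "debug",
        "--not_a_flag",
    ])
    print(parsed.device, parsed.additional_compile_args,
          parsed.device_allocator, extras)
```

Using `parse_known_args` rather than `parse_args` is one way to get the "ignore unknown args" behavior mentioned in the macOS fix, and `nargs="*"` lets `--device_allocator caching debug` arrive as an ordered list.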