github/SHARK-Studio

mirror of https://github.com/nod-ai/SHARK-Studio.git synced 2026-01-14 16:28:01 -05:00

Sungsoon Cho 51e1bd1c5d (OPT) Fix typo in the message; s/reponse/response (#1920)

2023-11-09 11:00:48 -06:00

opt_causallm_samples.py

(OPT) Fix typo in the message; s/reponse/response (#1920)

2023-11-09 11:00:48 -06:00

opt_causallm_torch_test.py

Switch most compile flows to use ireec.compile_file. (#1863)

2023-10-06 23:04:43 -05:00

opt_causallm.py

Add smoothquant OPT to examples. (#1922)

2023-10-27 12:32:12 -05:00

opt_perf_comparison_batch.py

Add a short README for the OPT examples and small tweaks. (#1793)

2023-08-24 17:26:11 -07:00

opt_perf_comparison.py

Add smoothquant OPT to examples. (#1922)

2023-10-27 12:32:12 -05:00

opt_torch_test.py

Update OPT, ResNet example scripts. (#1492)

2023-06-05 20:19:35 -07:00

opt_util.py

Add opt_causallm_samples.py. (#1916)

2023-10-25 11:52:51 -05:00

README.md

Add a short README for the OPT examples and small tweaks. (#1793)

2023-08-24 17:26:11 -07:00

shark_hf_base_opt.py

Switch most compile flows to use ireec.compile_file. (#1863)

2023-10-06 23:04:43 -05:00

shark_opt_wrapper_train.py

OPT Refactor (#1516)

2023-06-13 22:40:07 -05:00

shark_opt_wrapper.py

OPT Refactor (#1516)

2023-06-13 22:40:07 -05:00

README.md

Run OPT for sentence completion through SHARK

From the base SHARK directory, follow the instructions to set up a virtual environment with SHARK (./setup_venv.sh or ./setup_venv.ps1). Then run opt_causallm.py to get a very simple sentence-completion application running through SHARK:

python opt_causallm.py

Run OPT performance comparison on SHARK vs. PyTorch

python opt_perf_comparison.py --max-seq-len=512 --model-name=facebook/opt-1.3b \
        --platform=shark

Any OPT model from Hugging Face should work with this script. Pass --platform=shark or --platform=huggingface to benchmark OPT inference on SHARK or on PyTorch, respectively.
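To compare the two platforms on the same model, the command above can be issued once per platform. The following is a minimal sketch of that idea, not part of the repository: the helper name build_command is hypothetical, and only the three flags shown in this README are assumed to exist. The subprocess call is left commented out so the snippet only prints the commands.

```python
import sys

# The two platform values named in this README.
PLATFORMS = ("shark", "huggingface")


def build_command(model_name, max_seq_len, platform):
    """Build the argv list for one opt_perf_comparison.py run (hypothetical helper)."""
    if platform not in PLATFORMS:
        raise ValueError(f"platform must be one of {PLATFORMS}, got {platform!r}")
    return [
        sys.executable,
        "opt_perf_comparison.py",
        f"--max-seq-len={max_seq_len}",
        f"--model-name={model_name}",
        f"--platform={platform}",
    ]


# One command per platform, same model and sequence length as the example above.
commands = [build_command("facebook/opt-1.3b", 512, p) for p in PLATFORMS]

for cmd in commands:
    print(" ".join(cmd))
    # To actually run the benchmark, uncomment the next two lines:
    # import subprocess
    # subprocess.run(cmd, check=True)
```

Running both commands on the same machine keeps the SHARK and PyTorch numbers comparable.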

Run a small suite of OPT models through the benchmark script

python opt_perf_comparison_batch.py

This script will run benchmarks from a suite of OPT configurations:

Sequence Lengths: 32, 128, 256, 512
Parameter Counts: 125m, 350m, 1.3b
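The suite above is the cross product of the listed sequence lengths and parameter counts. The sketch below enumerates that grid; it is an illustration, not the code in opt_perf_comparison_batch.py. It assumes the Hugging Face model names follow the facebook/opt-<size> pattern used by facebook/opt-1.3b earlier in this README, and the function name suite_configs is hypothetical.

```python
from itertools import product

# Values taken from the lists above.
SEQ_LENS = (32, 128, 256, 512)
PARAM_COUNTS = ("125m", "350m", "1.3b")


def suite_configs():
    """Return every (model_name, max_seq_len) pair in the suite (hypothetical helper)."""
    return [
        (f"facebook/opt-{size}", seq_len)  # assumed naming pattern
        for size, seq_len in product(PARAM_COUNTS, SEQ_LENS)
    ]


configs = suite_configs()
for model_name, seq_len in configs:
    print(f"{model_name} max-seq-len={seq_len}")
```

With three model sizes and four sequence lengths, the grid contains twelve configurations.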

Note: Most of these scripts are written for use on CPU, because performance comparisons against PyTorch can otherwise be problematic across platforms.