CodeGen Setup using SHARK-server

Setup Server

  • Clone SHARK and set up the venv.
  • Host the server using python apps/stable_diffusion/web/index.py --api --server_port=<PORT>
  • The default server address is http://0.0.0.0:8080
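The server steps above can be sketched end to end. This is a minimal sketch, not a verified install script: the repo URL, venv name, and requirements file are assumptions about the usual SHARK layout — adjust them to your checkout.

```shell
# Sketch of the server setup described above (assumed repo URL and layout).
git clone https://github.com/nod-ai/SHARK.git   # assumption: adjust for your fork/mirror
cd SHARK

# Create and activate a virtual environment (the name is arbitrary).
python -m venv shark.venv
source shark.venv/bin/activate
pip install -r requirements.txt                 # assumption: dependencies listed here

# Host the API server; any free port works (8080 matches the default address).
python apps/stable_diffusion/web/index.py --api --server_port=8080
```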

Setup Client

  1. fauxpilot-vscode (VSCode extension):
  • Code for the extension can be found here.
  • Prerequisites: Node.js and npm, needed to compile and run the extension.
  • Compile and run the extension in VSCode (press F5); this opens a new VSCode window with the extension running.
  • Open the VSCode settings, search for fauxpilot, and set Server: http://<IP>:<PORT>, Model: codegen, Max Lines: 30.
  2. Others (REST API via curl, OpenAI Python bindings), as shown here.
  • Using the GitHub Copilot VSCode extension with SHARK-server needs more work to be functional.
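For the REST API route, a client request can be sketched as below. This is a hedged sketch, not the server's documented API: the endpoint path and payload fields follow the OpenAI-compatible convention that FauxPilot-style servers expose, and the SERVER/ENDPOINT values are assumptions matching the settings above — verify them against your deployment.

```python
import json

# Assumed values, mirroring the extension settings above (Server, Model, Max Lines).
SERVER = "http://0.0.0.0:8080"  # default server address from the setup section
# Assumption: an OpenAI-style completions route; confirm against your server.
ENDPOINT = SERVER + "/v1/engines/codegen/completions"


def build_completion_request(prompt: str, max_tokens: int = 30) -> str:
    """Build the JSON body for a code-completion request."""
    payload = {
        "model": "codegen",        # model name configured in the extension
        "prompt": prompt,
        "max_tokens": max_tokens,  # matches the Max Lines-style cap above
        "temperature": 0.1,
    }
    return json.dumps(payload)


body = build_completion_request("def fibonacci(n):")
print(body)

# To actually send it (requires the server to be running), e.g. with curl:
#   curl -s -H "Content-Type: application/json" -d '<body>' <ENDPOINT>
# or with the requests package:
#   requests.post(ENDPOINT, data=body,
#                 headers={"Content-Type": "application/json"})
```

The same payload shape is what the OpenAI Python bindings construct under the hood when pointed at a custom base URL.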