Files
AMD-SHARK-Studio/apps/language_models
Abhishek Varma 9a0efffcca [Llama2] Fix wrong Vulkan device ID + Add Vulkan compile flags
-- This commit fixes the wrong Vulkan device being selected during
   runtime.
-- It also adds couple of IREE compilation flags to target specific
   Vulkan device.
-- It also changes the Vulkan device listing to be more in tune with
   lowering control flow.

Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
2023-09-22 22:24:18 +05:30
..
2023-08-25 23:04:28 -07:00
2023-07-19 23:10:23 +05:30

CodeGen Setup using SHARK-server

Setup Server

  • clone SHARK and setup the venv
  • host the server using python apps/stable_diffusion/web/index.py --api --server_port=<PORT>
  • default server address is http://0.0.0.0:8080

Setup Client

  1. fauxpilot-vscode (VSCode Extension):
  • Code for the extension can be found here
  • PreReq: VSCode extension (will need nodejs and npm to compile and run the extension)
  • Compile and Run the extension on VSCode (press F5 on VSCode), this opens a new VSCode window with the extension running
  • Open VSCode settings, search for fauxpilot in settings and modify server : http://<IP>:<PORT>, Model : codegen , Max Lines : 30
  1. Others (REST API curl, OpenAI Python bindings) as shown here
  • using Github Copilot VSCode extension with SHARK-server needs more work to be functional.