AMD-SHARK-Studio/apps/shark_studio/web/utils.py at diffusers-version

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-02-19 11:56:43 -05:00

Files

Ean Garvey 05b498267e Add StreamingLLM support to studio2 chat (#2060 )

* Streaming LLM 

* Update precision and add gpu support

* (studio2) Separate weights generation for quantization support

* Adapt prompt changes to studio flow

* Remove outdated flag from llm compile flags.

* (studio2) use turbine vmfbRunner

* tweaks to prompts

* Update CPU path and llm api test.

* Change device in test to cpu.

* Fixes to runner, device names, vmfb mgmt

* Use small test without external weights.

2024-01-18 19:01:07 -06:00

326 B

Raw Permalink Blame History

View Raw

326 B Raw Permalink Blame History

326 B

Raw Permalink Blame History