AMD-SHARK-Studio

mirror of https://github.com/nod-ai/AMD-SHARK-Studio.git synced 2026-02-19 11:56:43 -05:00

Files

Quinn Dawkins ded74d09cd [vicuna.py] Keep past key values on device (#1836)

The past key values are only used within the models themselves and can
be kept on device. For vulkan int4, this gives 44 tok/s (for the first
prompt) and settles at around 26 tok/s on 7900xtx.
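
As a rough illustration of why this helps, the sketch below models a decode loop in which the past key values either make a host round trip on every step or stay on device. It is a toy under stated assumptions, not code from vicuna.py: `DeviceBuffer`, `TransferLog`, `to_device`, `step`, `decode_roundtrip`, and `decode_on_device` are all hypothetical names, and the "device" is only a Python object that counts copies.

```python
# Toy model of a decode loop. Every name here is a hypothetical stand-in,
# not an identifier from vicuna.py or any real runtime API.
from dataclasses import dataclass


@dataclass
class TransferLog:
    """Counts simulated copies between host and device."""
    to_host: int = 0
    to_device: int = 0


@dataclass
class DeviceBuffer:
    """Stands in for a tensor that lives in device memory."""
    data: list
    log: TransferLog

    def to_host(self) -> list:
        # Simulated device-to-host copy.
        self.log.to_host += 1
        return list(self.data)


def to_device(data: list, log: TransferLog) -> DeviceBuffer:
    # Simulated host-to-device copy.
    log.to_device += 1
    return DeviceBuffer(list(data), log)


def step(token: int, past: DeviceBuffer):
    """Fake model step: extends the cache on device, returns the next token."""
    new_past = DeviceBuffer(past.data + [token], past.log)
    return token + 1, new_past


def decode_roundtrip(first_token: int, n_steps: int):
    """Copies the cache to host and back on every step."""
    log = TransferLog()
    past = to_device([], log)
    token = first_token
    for _ in range(n_steps):
        token, past = step(token, past)
        host_copy = past.to_host()        # download the cache
        past = to_device(host_copy, log)  # upload it again for the next step
    return token, log


def decode_on_device(first_token: int, n_steps: int):
    """Passes the device-resident cache straight into the next step."""
    log = TransferLog()
    past = to_device([], log)
    token = first_token
    for _ in range(n_steps):
        token, past = step(token, past)
    return token, log
```

Both loops produce the same tokens; only the number of simulated transfers differs, which is the cost the commit avoids. The toy says nothing about actual throughput, so the tok/s figures above remain the only measured numbers.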

2023-09-19 18:17:41 -04:00

language_models

[vicuna.py] Keep past key values on device (#1836)

2023-09-19 18:17:41 -04:00

stable_diffusion

local_tank_cache included into clear_all (#1833)

2023-09-18 00:27:23 -05:00

__init__.py

[SD] Reorganize the stable diffusion model. (#806)

2023-01-31 14:42:41 -08:00