Brandon Rising | d328eaf743 | Remove no longer used dequantize_tensor function | 2024-10-02 18:33:05 -04:00
Ryan Dick | bc63e2acc5 | Add workaround for FLUX GGUF models with incorrect img_in.weight shape. | 2024-10-02 18:33:05 -04:00
Ryan Dick | ec7e771942 | Add a compute_dtype field to GGMLTensor. | 2024-10-02 18:33:05 -04:00
Ryan Dick | fe84013392 | Add unit tests for GGMLTensor. | 2024-10-02 18:33:05 -04:00
Ryan Dick | 710f81266b | Fix type errors in GGMLTensor. | 2024-10-02 18:33:05 -04:00
Brandon Rising | 446e2884bc | Remove no longer used code paths, general cleanup of new dequantization code, update probe | 2024-10-02 18:33:05 -04:00
Brandon Rising | 7d9f125232 | Run ruff and update imports | 2024-10-02 18:33:05 -04:00
Brandon Rising | 66bbd62758 | Run ruff and fix typing in torch patcher | 2024-10-02 18:33:05 -04:00
Brandon Rising | 0875e861f5 | Various updates to gguf performance | 2024-10-02 18:33:05 -04:00
Ryan Dick | f06765dfba | Get alternative GGUF implementation working... barely. | 2024-10-02 18:33:05 -04:00
Ryan Dick | f347b26999 | Initial experimentation with Tensor-like extension for GGUF. | 2024-10-02 18:33:05 -04:00
Brandon Rising | 2bfb0ddff5 | Initial GGUF support for flux models | 2024-10-02 18:33:05 -04:00