Ean Garvey
caf6cc5d8f
Switch most compile flows to use ireec.compile_file. (#1863)
...
* Switch most compile flows to use ireec.compile_file.
* re-add input type to compile_str path.
* Check if mlir_module exists before checking if it's a path or pyobject.
* Fix some save_dir cases
2023-10-06 23:04:43 -05:00
Abhishek Varma
db990826d3
Add Llama2 13B int4 fp16 support (#1784)
...
Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>
2023-08-23 10:00:32 -07:00
Gaurav Shukla
3c577f7168
[vicuna] fix shard config generator script (#1747)
...
Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-10 11:26:03 -07:00
Gaurav Shukla
8e90f1b81a
[vicuna] add default config in case of sharded vicuna
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-10 21:28:08 +05:30
Nithin Meganathan
c287fd2be8
Add GPU ID's in model_confg.json by default for manual annotation (#1718)
2023-08-04 12:46:27 -05:00
Gaurav Shukla
bd30044c0b
[Shard] Add sharding generation in shark studio
...
Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>
2023-08-04 21:51:14 +05:30
Nithin Meganathan
045f2bb147
Add dispatch-level config file generator for manual annotation (#1566)
2023-06-22 15:11:41 -07:00
Nithin Meganathan
34f1295349
Add a model config generator (#1511)
...
The model config generator takes a PyTorch model as input and generates a JSON file with the model's layers and other properties that define sharding on particular hardware.
2023-06-09 15:32:00 -07:00