91 Commits

Author
SHA1 Message Date
Alex O'Connell
9f51dd0e94 more training updates 2025-11-30 15:31:58 -05:00
Alex O'Connell
61140713d7 refactor dataset generation code + add new synthesis script 2025-11-27 22:19:38 -05:00
Alex O'Connell
753a990a98 add new names 2025-11-26 22:08:33 -05:00
Alex O'Connell
14640bd14b finish implementing alternate dataset generation mode 2025-11-26 22:01:08 -05:00
Alex O'Connell
07507ee5f5 more fixes 2025-11-26 19:09:09 -05:00
Alex O'Connell
a16523f9e5 start re-writing dataset generation to use the new HA Assist API 2025-11-26 19:08:46 -05:00
Alex O'Connell
73f1d82c76 fix bug in data generation script 2025-03-08 22:29:22 -05:00
Alex O'Connell
2712f605a5 fix evaluate + add train notebook 2025-02-10 17:11:44 -05:00
Alex O'Connell
fca80d0504 reorganize requirements files 2025-02-09 22:12:18 -05:00
Alex O'Connell
22c9469f66 try to support SFT for models without system prompts 2024-08-17 22:43:05 -04:00
Witold Gren
2837af8443 Added full translations for all languages during generate data and creating default prompt system (#196) 2024-08-11 22:05:20 +00:00
Witold Gren
cf89f9f478 Added support for Polish language (#193) 2024-08-04 19:19:10 +00:00
Alex O'Connell
179e794283 add cmdline arguments to translate script + add defaults for command r 2024-05-08 20:50:25 -04:00
Alex O'Connell
03c23f1a8c local translation + training update 2024-04-24 19:01:05 -04:00
Ryan Voots
eafb26267b Correct the DeviceType for todo (#121) 2024-04-24 21:37:32 +00:00
Alex O'Connell
adae87addd training fixes, default values + other fixes 2024-04-21 23:40:28 -04:00
Alex O'Connell
bdda97eb45 add date to system prompt + per language and words 2024-04-21 20:48:44 -04:00
Alex O'Connell
3326bd7d6e handle other languages in component 2024-04-21 20:38:46 -04:00
Alex O'Connell
8144267e69 fix templated action quotes + add fsdp config 2024-04-14 18:08:11 -04:00
Alex O'Connell
f5bab7b119 better dpo parameters 2024-04-14 15:42:51 -04:00
Alex O'Connell
85cd5ec036 more DPO example types 2024-04-14 08:02:20 -04:00
Alex O'Connell
ce75bf0d7c very basic DPO data generator is working now 2024-04-13 22:47:07 -04:00
Alex O'Connell
547d2d9989 add proper multi-language support to dataset generator 2024-04-13 22:15:37 -04:00
Alex O'Connell
f1659893d7 start working on dpo for the datasets 2024-03-19 21:31:34 -04:00
Alex O'Connell
b9d394f860 actually fix cover service names 2024-03-16 13:50:04 -04:00
Alex O'Connell
9b71a1860b fix "cover" type system call names 2024-03-06 20:40:25 -05:00
Alex O'Connell
841beb5e77 Release v0.2.8 2024-03-05 21:46:25 -05:00
Alex O'Connell
b197632b3e Update llama-cpp-python and text-generation-webui 2024-02-25 17:43:57 -05:00
Alex O'Connell
5dec71eae2 finalize new model version 2024-02-22 21:12:38 -05:00
Alex O'Connell
c285e3c6a9 instructions for adding personas 2024-02-17 23:13:45 -05:00
Alex O'Connell
43510d1d7c finish up dataset changes for new device types 2024-02-17 21:16:56 -05:00
Alex O'Connell
984cf2c0a3 add vacuum, todo, and timer device names 2024-02-17 19:55:19 -05:00
Alex O'Connell
6cc3b13c2a Merge branch 'develop' into feature/dataset-customization 2024-02-17 19:55:07 -05:00
colino17
e192a8aee6 Add additional data for new entity types and services (#67) 2024-02-18 00:49:58 +00:00
Alex O'Connell
bcd67aef37 start working on new entities 2024-02-16 23:21:22 -05:00
Alex O'Connell
fdfea02e1d typo 2024-02-15 00:04:45 -05:00
Alex O'Connell
4e873a873c Merge branch 'develop' into feature/dataset-customization 2024-02-13 20:23:39 -05:00
Alex O'Connell
411276408b train new models based on stablelm + properly add new response types 2024-02-13 20:21:51 -05:00
Alex O'Connell
1e0113218f move system prompts to a pile 2024-02-05 21:11:29 -05:00
Alex O'Connell
3bf674ae29 fix dataset generation 2024-02-05 21:05:28 -05:00
Alex O'Connell
cc2c21cab5 more work on making the piles easier to extend 2024-02-04 11:34:22 -05:00
Alex O'Connell
278f860e37 re-organize responses 2024-02-03 20:29:51 -05:00
Alex O'Connell
74173ec4cc Upload dataset snapshot to HF 2024-02-03 20:26:38 -05:00
Alex O'Connell
cecf9bc53e move to jsonl, finish sharegpt dataset format, and add flag to add chatml prompt template 2024-01-31 23:00:32 -05:00
Alex O'Connell
d901eaffdf start working on other base models 2024-01-30 22:12:46 -05:00
Alex O'Connell
371ac513b2 typo in pile 2024-01-30 22:12:12 -05:00
Alex O'Connell
d023cfee28 add another test prompt 2024-01-30 22:12:12 -05:00
Alex O'Connell
680ab96bb7 Upload dataset snapshot to HF 2024-01-28 21:01:59 -05:00
Alex O'Connell
038d869ded random readme fixes + format notes 2024-01-28 10:17:50 -05:00
Alex O'Connell
9723a98139 more dataset + model experiments using the evaluation script 2024-01-27 14:54:14 -05:00