Commit Graph

62 Commits

Author SHA1 Message Date
Alex O'Connell
14640bd14b finish implementing alternate dataset generation mode 2025-11-26 22:01:08 -05:00
Alex O'Connell
07507ee5f5 more fixes 2025-11-26 19:09:09 -05:00
Alex O'Connell
a16523f9e5 start re-writing dataset generation to use the new HA Assist API 2025-11-26 19:08:46 -05:00
Alex O'Connell
73f1d82c76 fix bug in data generation script 2025-03-08 22:29:22 -05:00
Alex O'Connell
2712f605a5 fix evaluate + add train notebook 2025-02-10 17:11:44 -05:00
Alex O'Connell
22c9469f66 try to support SFT for models without system prompts 2024-08-17 22:43:05 -04:00
Witold Gren
2837af8443 Added full translations for all languages during generate data and creating default prompt system (#196) 2024-08-11 22:05:20 +00:00
Witold Gren
cf89f9f478 Added support for Polish language (#193) 2024-08-04 19:19:10 +00:00
Ryan Voots
eafb26267b Correct the DeviceType for todo (#121) 2024-04-24 21:37:32 +00:00
Alex O'Connell
adae87addd training fixes, default values + other fixes 2024-04-21 23:40:28 -04:00
Alex O'Connell
bdda97eb45 add date to system prompt + per language and words 2024-04-21 20:48:44 -04:00
Alex O'Connell
f5bab7b119 better dpo parameters 2024-04-14 15:42:51 -04:00
Alex O'Connell
85cd5ec036 more DPO example types 2024-04-14 08:02:20 -04:00
Alex O'Connell
ce75bf0d7c very basic DPO data generator is working now 2024-04-13 22:47:07 -04:00
Alex O'Connell
547d2d9989 add proper multi-language support to dataset generator 2024-04-13 22:15:37 -04:00
Alex O'Connell
f1659893d7 start working on dpo for the datasets 2024-03-19 21:31:34 -04:00
Alex O'Connell
43510d1d7c finish up dataset changes for new device types 2024-02-17 21:16:56 -05:00
Alex O'Connell
6cc3b13c2a Merge branch 'develop' into feature/dataset-customization 2024-02-17 19:55:07 -05:00
Alex O'Connell
bcd67aef37 start working on new entities 2024-02-16 23:21:22 -05:00
Alex O'Connell
fdfea02e1d typo 2024-02-15 00:04:45 -05:00
Alex O'Connell
411276408b train new models based on stablelm + properly add new response types 2024-02-13 20:21:51 -05:00
Alex O'Connell
1e0113218f move system prompts to a pile 2024-02-05 21:11:29 -05:00
Alex O'Connell
3bf674ae29 fix dataset generation 2024-02-05 21:05:28 -05:00
Alex O'Connell
cc2c21cab5 more work on making the piles easier to extend 2024-02-04 11:34:22 -05:00
Alex O'Connell
278f860e37 re-organize responses 2024-02-03 20:29:51 -05:00
Alex O'Connell
cecf9bc53e move to jsonl, finish sharegpt dataset format, and add flag to add chatml prompt template 2024-01-31 23:00:32 -05:00
Alex O'Connell
d901eaffdf start working on other base models 2024-01-30 22:12:46 -05:00
Alex O'Connell
9723a98139 more dataset + model experiments using the evaluation script 2024-01-27 14:54:14 -05:00
Alex O'Connell
946623713f add "extra exposed attributes" to dataset as function call arguments + fix pile template inconsistencies 2024-01-26 22:36:34 -05:00
Alex O'Connell
e6fae06133 wizardlm merge + fix eval 2024-01-25 20:46:59 -05:00
Alex O'Connell
57634519ca move to eval script instead of during training 2024-01-25 20:46:59 -05:00
Alex O'Connell
0b9beb85ca finish trianing new models 2024-01-17 18:24:25 -05:00
Alex O'Connell
2a0dbd8806 fix training script for Phi-2 lora 2024-01-16 21:47:15 -05:00
Alex O'Connell
c1c949612e add different dataset "sizes" 2024-01-14 12:26:10 -05:00
Alex O'Connell
8f99025507 scale dataset halfway back to where it was 2024-01-13 23:36:56 -05:00
Alex O'Connell
74bd516148 resize dataset + add state attributes to status requests 2024-01-12 12:44:26 -05:00
Alex O'Connell
eef7c1b932 add RGB + brightness to lighting requests and clean up dataset generation script 2024-01-11 19:05:28 -05:00
Alex O'Connell
fac7cc2b03 start working on rgb lighting 2024-01-11 00:43:18 -05:00
Alex O'Connell
5c34c7e3b9 climate should mostly work + make media status requests better 2024-01-11 00:18:23 -05:00
Alex O'Connell
fb20caefe2 allow exposing some entity attributes + work on climate type 2024-01-06 16:06:02 -05:00
Alex O'Connell
e2a6bfa8c3 start adding new device types, handle json function calling in HA component, and add more data for underrepresented existing device types 2024-01-04 23:34:36 -05:00
Alex O'Connell
ce0ec6b48f fix chatml again + gbnf + phi1.5 2023-12-31 16:00:24 -05:00
Alex O'Connell
b1575f2512 chatml prompt tweaks, support multi-turn examples in the SFT masking step, and pre-tokenize dataset to avoid CPU bottleneck 2023-12-26 23:52:27 -05:00
Alex O'Connell
53c47b6eb7 tweak chatml format + make function calls json objects 2023-12-23 22:53:25 -05:00
Alex O'Connell
c31cb75e58 some cleanup + readmes 2023-12-20 21:22:16 -05:00
Alex O'Connell
b921a3234b lots of changes from phi-2 fine-tuning 2023-12-18 00:57:07 -05:00
Alex O'Connell
2384b75583 add merge with alpaca dataset 2023-12-13 21:53:48 -05:00
Alex O'Connell
46b6ee4f65 updates with new Phi model release 2023-12-13 18:29:47 -05:00
Alex O'Connell
412637222d start expanding dataset features 2023-11-06 21:27:58 -05:00
Alex O'Connell
5d79fe6a88 fix prompt typo 2023-11-05 08:11:28 -05:00