Alex O'Connell
|
14640bd14b
|
finish implementing alternate dataset generation mode
|
2025-11-26 22:01:08 -05:00 |
|
Alex O'Connell
|
07507ee5f5
|
more fixes
|
2025-11-26 19:09:09 -05:00 |
|
Alex O'Connell
|
a16523f9e5
|
start re-writing dataset generation to use the new HA Assist API
|
2025-11-26 19:08:46 -05:00 |
|
Alex O'Connell
|
73f1d82c76
|
fix bug in data generation script
|
2025-03-08 22:29:22 -05:00 |
|
Alex O'Connell
|
2712f605a5
|
fix evaluate + add train notebook
|
2025-02-10 17:11:44 -05:00 |
|
Alex O'Connell
|
22c9469f66
|
try to support SFT for models without system prompts
|
2024-08-17 22:43:05 -04:00 |
|
Witold Gren
|
2837af8443
|
Added full translations for all languages during generate data and creating default prompt system (#196)
|
2024-08-11 22:05:20 +00:00 |
|
Witold Gren
|
cf89f9f478
|
Added support for Polish language (#193)
|
2024-08-04 19:19:10 +00:00 |
|
Ryan Voots
|
eafb26267b
|
Correct the DeviceType for todo (#121)
|
2024-04-24 21:37:32 +00:00 |
|
Alex O'Connell
|
adae87addd
|
training fixes, default values + other fixes
|
2024-04-21 23:40:28 -04:00 |
|
Alex O'Connell
|
bdda97eb45
|
add date to system prompt + per language and words
|
2024-04-21 20:48:44 -04:00 |
|
Alex O'Connell
|
f5bab7b119
|
better dpo parameters
|
2024-04-14 15:42:51 -04:00 |
|
Alex O'Connell
|
85cd5ec036
|
more DPO example types
|
2024-04-14 08:02:20 -04:00 |
|
Alex O'Connell
|
ce75bf0d7c
|
very basic DPO data generator is working now
|
2024-04-13 22:47:07 -04:00 |
|
Alex O'Connell
|
547d2d9989
|
add proper multi-language support to dataset generator
|
2024-04-13 22:15:37 -04:00 |
|
Alex O'Connell
|
f1659893d7
|
start working on dpo for the datasets
|
2024-03-19 21:31:34 -04:00 |
|
Alex O'Connell
|
43510d1d7c
|
finish up dataset changes for new device types
|
2024-02-17 21:16:56 -05:00 |
|
Alex O'Connell
|
6cc3b13c2a
|
Merge branch 'develop' into feature/dataset-customization
|
2024-02-17 19:55:07 -05:00 |
|
Alex O'Connell
|
bcd67aef37
|
start working on new entities
|
2024-02-16 23:21:22 -05:00 |
|
Alex O'Connell
|
fdfea02e1d
|
typo
|
2024-02-15 00:04:45 -05:00 |
|
Alex O'Connell
|
411276408b
|
train new models based on stablelm + properly add new response types
|
2024-02-13 20:21:51 -05:00 |
|
Alex O'Connell
|
1e0113218f
|
move system prompts to a pile
|
2024-02-05 21:11:29 -05:00 |
|
Alex O'Connell
|
3bf674ae29
|
fix dataset generation
|
2024-02-05 21:05:28 -05:00 |
|
Alex O'Connell
|
cc2c21cab5
|
more work on making the piles easier to extend
|
2024-02-04 11:34:22 -05:00 |
|
Alex O'Connell
|
278f860e37
|
re-organize responses
|
2024-02-03 20:29:51 -05:00 |
|
Alex O'Connell
|
cecf9bc53e
|
move to jsonl, finish sharegpt dataset format, and add flag to add chatml prompt template
|
2024-01-31 23:00:32 -05:00 |
|
Alex O'Connell
|
d901eaffdf
|
start working on other base models
|
2024-01-30 22:12:46 -05:00 |
|
Alex O'Connell
|
9723a98139
|
more dataset + model experiments using the evaluation script
|
2024-01-27 14:54:14 -05:00 |
|
Alex O'Connell
|
946623713f
|
add "extra exposed attributes" to dataset as function call arguments + fix pile template inconsistencies
|
2024-01-26 22:36:34 -05:00 |
|
Alex O'Connell
|
e6fae06133
|
wizardlm merge + fix eval
|
2024-01-25 20:46:59 -05:00 |
|
Alex O'Connell
|
57634519ca
|
move to eval script instead of during training
|
2024-01-25 20:46:59 -05:00 |
|
Alex O'Connell
|
0b9beb85ca
|
finish trianing new models
|
2024-01-17 18:24:25 -05:00 |
|
Alex O'Connell
|
2a0dbd8806
|
fix training script for Phi-2 lora
|
2024-01-16 21:47:15 -05:00 |
|
Alex O'Connell
|
c1c949612e
|
add different dataset "sizes"
|
2024-01-14 12:26:10 -05:00 |
|
Alex O'Connell
|
8f99025507
|
scale dataset halfway back to where it was
|
2024-01-13 23:36:56 -05:00 |
|
Alex O'Connell
|
74bd516148
|
resize dataset + add state attributes to status requests
|
2024-01-12 12:44:26 -05:00 |
|
Alex O'Connell
|
eef7c1b932
|
add RGB + brightness to lighting requests and clean up dataset generation script
|
2024-01-11 19:05:28 -05:00 |
|
Alex O'Connell
|
fac7cc2b03
|
start working on rgb lighting
|
2024-01-11 00:43:18 -05:00 |
|
Alex O'Connell
|
5c34c7e3b9
|
climate should mostly work + make media status requests better
|
2024-01-11 00:18:23 -05:00 |
|
Alex O'Connell
|
fb20caefe2
|
allow exposing some entity attributes + work on climate type
|
2024-01-06 16:06:02 -05:00 |
|
Alex O'Connell
|
e2a6bfa8c3
|
start adding new device types, handle json function calling in HA component, and add more data for underrepresented existing device types
|
2024-01-04 23:34:36 -05:00 |
|
Alex O'Connell
|
ce0ec6b48f
|
fix chatml again + gbnf + phi1.5
|
2023-12-31 16:00:24 -05:00 |
|
Alex O'Connell
|
b1575f2512
|
chatml prompt tweaks, support multi-turn examples in the SFT masking step, and pre-tokenize dataset to avoid CPU bottleneck
|
2023-12-26 23:52:27 -05:00 |
|
Alex O'Connell
|
53c47b6eb7
|
tweak chatml format + make function calls json objects
|
2023-12-23 22:53:25 -05:00 |
|
Alex O'Connell
|
c31cb75e58
|
some cleanup + readmes
|
2023-12-20 21:22:16 -05:00 |
|
Alex O'Connell
|
b921a3234b
|
lots of changes from phi-2 fine-tuning
|
2023-12-18 00:57:07 -05:00 |
|
Alex O'Connell
|
2384b75583
|
add merge with alpaca dataset
|
2023-12-13 21:53:48 -05:00 |
|
Alex O'Connell
|
46b6ee4f65
|
updates with new Phi model release
|
2023-12-13 18:29:47 -05:00 |
|
Alex O'Connell
|
412637222d
|
start expanding dataset features
|
2023-11-06 21:27:58 -05:00 |
|
Alex O'Connell
|
5d79fe6a88
|
fix prompt typo
|
2023-11-05 08:11:28 -05:00 |
|