home-llm

mirror of https://github.com/acon96/home-llm.git synced 2026-01-09 21:58:00 -05:00

Author	SHA1	Message	Date
Alex O'Connell	14640bd14b	finish implementing alternate dataset generation mode	2025-11-26 22:01:08 -05:00
Alex O'Connell	07507ee5f5	more fixes	2025-11-26 19:09:09 -05:00
Alex O'Connell	a16523f9e5	start re-writing dataset generation to use the new HA Assist API	2025-11-26 19:08:46 -05:00
Alex O'Connell	73f1d82c76	fix bug in data generation script	2025-03-08 22:29:22 -05:00
Alex O'Connell	2712f605a5	fix evaluate + add train notebook	2025-02-10 17:11:44 -05:00
Alex O'Connell	22c9469f66	try to support SFT for models without system prompts	2024-08-17 22:43:05 -04:00
Witold Gren	2837af8443	Added full translations for all languages during generate data and creating default prompt system (#196 )	2024-08-11 22:05:20 +00:00
Witold Gren	cf89f9f478	Added support for Polish language (#193 )	2024-08-04 19:19:10 +00:00
Ryan Voots	eafb26267b	Correct the DeviceType for todo (#121 )	2024-04-24 21:37:32 +00:00
Alex O'Connell	adae87addd	training fixes, default values + other fixes	2024-04-21 23:40:28 -04:00
Alex O'Connell	bdda97eb45	add date to system prompt + per language and words	2024-04-21 20:48:44 -04:00
Alex O'Connell	f5bab7b119	better dpo parameters	2024-04-14 15:42:51 -04:00
Alex O'Connell	85cd5ec036	more DPO example types	2024-04-14 08:02:20 -04:00
Alex O'Connell	ce75bf0d7c	very basic DPO data generator is working now	2024-04-13 22:47:07 -04:00
Alex O'Connell	547d2d9989	add proper multi-language support to dataset generator	2024-04-13 22:15:37 -04:00
Alex O'Connell	f1659893d7	start working on dpo for the datasets	2024-03-19 21:31:34 -04:00
Alex O'Connell	43510d1d7c	finish up dataset changes for new device types	2024-02-17 21:16:56 -05:00
Alex O'Connell	6cc3b13c2a	Merge branch 'develop' into feature/dataset-customization	2024-02-17 19:55:07 -05:00
Alex O'Connell	bcd67aef37	start working on new entities	2024-02-16 23:21:22 -05:00
Alex O'Connell	fdfea02e1d	typo	2024-02-15 00:04:45 -05:00
Alex O'Connell	411276408b	train new models based on stablelm + properly add new response types	2024-02-13 20:21:51 -05:00
Alex O'Connell	1e0113218f	move system prompts to a pile	2024-02-05 21:11:29 -05:00
Alex O'Connell	3bf674ae29	fix dataset generation	2024-02-05 21:05:28 -05:00
Alex O'Connell	cc2c21cab5	more work on making the piles easier to extend	2024-02-04 11:34:22 -05:00
Alex O'Connell	278f860e37	re-organize responses	2024-02-03 20:29:51 -05:00
Alex O'Connell	cecf9bc53e	move to jsonl, finish sharegpt dataset format, and add flag to add chatml prompt template	2024-01-31 23:00:32 -05:00
Alex O'Connell	d901eaffdf	start working on other base models	2024-01-30 22:12:46 -05:00
Alex O'Connell	9723a98139	more dataset + model experiments using the evaluation script	2024-01-27 14:54:14 -05:00
Alex O'Connell	946623713f	add "extra exposed attributes" to dataset as function call arguments + fix pile template inconsistencies	2024-01-26 22:36:34 -05:00
Alex O'Connell	e6fae06133	wizardlm merge + fix eval	2024-01-25 20:46:59 -05:00
Alex O'Connell	57634519ca	move to eval script instead of during training	2024-01-25 20:46:59 -05:00
Alex O'Connell	0b9beb85ca	finish trianing new models	2024-01-17 18:24:25 -05:00
Alex O'Connell	2a0dbd8806	fix training script for Phi-2 lora	2024-01-16 21:47:15 -05:00
Alex O'Connell	c1c949612e	add different dataset "sizes"	2024-01-14 12:26:10 -05:00
Alex O'Connell	8f99025507	scale dataset halfway back to where it was	2024-01-13 23:36:56 -05:00
Alex O'Connell	74bd516148	resize dataset + add state attributes to status requests	2024-01-12 12:44:26 -05:00
Alex O'Connell	eef7c1b932	add RGB + brightness to lighting requests and clean up dataset generation script	2024-01-11 19:05:28 -05:00
Alex O'Connell	fac7cc2b03	start working on rgb lighting	2024-01-11 00:43:18 -05:00
Alex O'Connell	5c34c7e3b9	climate should mostly work + make media status requests better	2024-01-11 00:18:23 -05:00
Alex O'Connell	fb20caefe2	allow exposing some entity attributes + work on climate type	2024-01-06 16:06:02 -05:00
Alex O'Connell	e2a6bfa8c3	start adding new device types, handle json function calling in HA component, and add more data for underrepresented existing device types	2024-01-04 23:34:36 -05:00
Alex O'Connell	ce0ec6b48f	fix chatml again + gbnf + phi1.5	2023-12-31 16:00:24 -05:00
Alex O'Connell	b1575f2512	chatml prompt tweaks, support multi-turn examples in the SFT masking step, and pre-tokenize dataset to avoid CPU bottleneck	2023-12-26 23:52:27 -05:00
Alex O'Connell	53c47b6eb7	tweak chatml format + make function calls json objects	2023-12-23 22:53:25 -05:00
Alex O'Connell	c31cb75e58	some cleanup + readmes	2023-12-20 21:22:16 -05:00
Alex O'Connell	b921a3234b	lots of changes from phi-2 fine-tuning	2023-12-18 00:57:07 -05:00
Alex O'Connell	2384b75583	add merge with alpaca dataset	2023-12-13 21:53:48 -05:00
Alex O'Connell	46b6ee4f65	updates with new Phi model release	2023-12-13 18:29:47 -05:00
Alex O'Connell	412637222d	start expanding dataset features	2023-11-06 21:27:58 -05:00
Alex O'Connell	5d79fe6a88	fix prompt typo	2023-11-05 08:11:28 -05:00

1 2

62 Commits