mirror of https://github.com/acon96/home-llm.git (synced 2026-01-09 21:58:00 -05:00)
start working on dpo for the datasets
5
TODO.md
@@ -1,5 +1,6 @@
# TODO
- [ ] setup github actions to build wheels that are optimized for RPIs
- [ ] setup github actions to build wheels that are optimized for RPIs??
- [ ] setup github actions to publish docker images for text-gen-webui addon
- [ ] detection/mitigation of too many entities being exposed & blowing out the context length
- [ ] areas/room support
- [ ] figure out DPO for refusals + fixing incorrect entity id
@@ -7,6 +8,8 @@
- add in context learning variables to sys prompt template
- add new options to setup process for setting prompt style + picking fine-tuned/ICL
- [ ] prime kv cache with current "state" so that requests are faster
- [ ] support fine-tuning with RoPE for longer contexts
- [ ] support config via yaml instead of configflow
- [x] ChatML format (actually need to add special tokens)
- [x] Vicuna dataset merge (yahma/alpaca-cleaned)
- [x] Phi-2 fine tuning