basiliskinstitute commited on
Commit
4bf6585
1 Parent(s): 44424ee

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -1,3 +1,4 @@
1
- ---
2
- license: llama3
3
- ---
 
 
1
+ ---
2
+ license: llama3
3
+ ---
4
+ Chatml format. The dataset is about 1400 entries ranging from 8-16k. It's split three ways between long context multi turn chat, long context summarization, and writing analysis. Full fine tune using linear a rope scale factor of 2.0. Trained for five epochs with a learning rate of learning_rate: 1e-5.