kangqi-ni committed on
Commit
bfab303
1 Parent(s): 8533dc5

Create README.md

Files changed (1)
  1. README.md +23 -0
README.md ADDED
@@ -0,0 +1,23 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ tags:
+ - mistral
+ - dpo
+ - biology
+ - education
+ ---
+ This model is fine-tuned from zephyr-7b-beta, using FastChat for supervised fine-tuning (SFT) and TRL for Direct Preference Optimization (DPO). It is intended as a more capable educational chatbot that helps students learn biology.
+
+ If you use this work, please cite [Pedagogical Alignment of Large Language Models](https://arxiv.org/abs/2402.05000):
+ ```
+ @misc{sonkar2024pedagogical,
+ title={Pedagogical Alignment of Large Language Models},
+ author={Shashank Sonkar and Kangqi Ni and Sapana Chaudhary and Richard G. Baraniuk},
+ year={2024},
+ eprint={2402.05000},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL}
+ }
+ ```
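For readers unfamiliar with the DPO step the README mentions, the following is a minimal sketch of the standard DPO loss for a single preference pair. It is not the authors' training code: the README does not state any hyperparameters, so `beta=0.1` and the log-probabilities in the usage lines are hypothetical values chosen for illustration. In practice, TRL computes this over batches of tensors rather than plain floats.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # assumed value; not taken from the README
) -> float:
    """DPO loss for one (chosen, rejected) response pair.

    Each argument is the summed log-probability of a full response
    under either the policy being trained or the frozen reference model.
    """
    # How much more the policy likes each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # The loss shrinks as the policy favors the chosen response
    # over the rejected one, relative to the reference model.
    margin = beta * (chosen_logratio - rejected_logratio)
    return -log_sigmoid(margin)


# Hypothetical log-probabilities, for illustration only.
baseline = dpo_loss(0.0, 0.0, 0.0, 0.0)          # policy identical to reference
improved = dpo_loss(-12.0, -15.0, -13.0, -14.0)  # policy favors the chosen response
print(baseline, improved)
```

When the policy matches the reference, the margin is zero and the loss equals log 2; any shift toward the chosen response lowers it below that value.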