chtmp223
/

suri-sft

chtmp223 commited on Jun 28

Commit

e172057

•

1 Parent(s): de20bc7

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -9,7 +9,7 @@ datasets:
 ---
 # Suri-SFT
-Suri-SFT is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using supervised fine-tuning with LoRA. Please check [our paper](TODO) for more details on the method.
 ## 📒 Model Details
@@ -22,7 +22,7 @@ Suri-SFT is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using sup
 ### Model Sources
 - **Repository:** [Github repository](https://github.com/chtmp223/suri) -- contains code to reconstruct books3 subset.
-- **Paper:** TODO
 - **Demo:** [Website](https://chtmp223.github.io/suri)
 ## ⚠️ Getting Started
@@ -100,7 +100,15 @@ print(tokenizer.decode(output[0]))
 ## 📜 Citation
 ```
-TODO
 ```
 ### ⚙️ Framework versions

 ---
 # Suri-SFT
+Suri-SFT is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using supervised fine-tuning with LoRA. Please check [our paper](https://arxiv.org/abs/2406.19371) for more details on the method.
 ## 📒 Model Details
 ### Model Sources
 - **Repository:** [Github repository](https://github.com/chtmp223/suri) -- contains code to reconstruct books3 subset.
+- **Paper:** [Link](https://arxiv.org/abs/2406.19371)
 - **Demo:** [Website](https://chtmp223.github.io/suri)
 ## ⚠️ Getting Started
 ## 📜 Citation
 ```
+@misc{pham2024surimulticonstraintinstructionfollowing,
+      title={Suri: Multi-constraint Instruction Following for Long-form Text Generation},
+      author={Chau Minh Pham and Simeng Sun and Mohit Iyyer},
+      year={2024},
+      eprint={2406.19371},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2406.19371},
+}
 ```
 ### ⚙️ Framework versions