Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ datasets:
|
|
9 |
---
|
10 |
|
11 |
# Suri-SFT
|
12 |
-
Suri-SFT is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using supervised fine-tuning with LoRA. Please check [our paper](
|
13 |
|
14 |
## π Model Details
|
15 |
|
@@ -22,7 +22,7 @@ Suri-SFT is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using sup
|
|
22 |
### Model Sources
|
23 |
|
24 |
- **Repository:** [Github repository](https://github.com/chtmp223/suri) -- contains code to reconstruct books3 subset.
|
25 |
-
- **Paper:**
|
26 |
- **Demo:** [Website](https://chtmp223.github.io/suri)
|
27 |
|
28 |
## β οΈ Getting Started
|
@@ -100,7 +100,15 @@ print(tokenizer.decode(output[0]))
|
|
100 |
## π Citation
|
101 |
|
102 |
```
|
103 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
104 |
```
|
105 |
|
106 |
### βοΈ Framework versions
|
|
|
9 |
---
|
10 |
|
11 |
# Suri-SFT
|
12 |
+
Suri-SFT is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using supervised fine-tuning with LoRA. Please check [our paper](https://arxiv.org/abs/2406.19371) for more details on the method.
|
13 |
|
14 |
## π Model Details
|
15 |
|
|
|
22 |
### Model Sources
|
23 |
|
24 |
- **Repository:** [Github repository](https://github.com/chtmp223/suri) -- contains code to reconstruct books3 subset.
|
25 |
+
- **Paper:** [Link](https://arxiv.org/abs/2406.19371)
|
26 |
- **Demo:** [Website](https://chtmp223.github.io/suri)
|
27 |
|
28 |
## β οΈ Getting Started
|
|
|
100 |
## π Citation
|
101 |
|
102 |
```
|
103 |
+
@misc{pham2024surimulticonstraintinstructionfollowing,
|
104 |
+
title={Suri: Multi-constraint Instruction Following for Long-form Text Generation},
|
105 |
+
author={Chau Minh Pham and Simeng Sun and Mohit Iyyer},
|
106 |
+
year={2024},
|
107 |
+
eprint={2406.19371},
|
108 |
+
archivePrefix={arXiv},
|
109 |
+
primaryClass={cs.CL},
|
110 |
+
url={https://arxiv.org/abs/2406.19371},
|
111 |
+
}
|
112 |
```
|
113 |
|
114 |
### βοΈ Framework versions
|