haritzpuerto
commited on
Commit
•
6307cd1
1
Parent(s):
e3cb8a8
Update README.md
Browse files
README.md
CHANGED
@@ -22,7 +22,7 @@ widget:
|
|
22 |
|
23 |
This is the official model from the publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" (arXiv, 2024).
|
24 |
|
25 |
-
> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer
|
26 |
|
27 |
|
28 |
Stay tuned for the release of the paper!
|
|
|
22 |
|
23 |
This is the official model from the publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" (arXiv, 2024).
|
24 |
|
25 |
+
> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
|
26 |
|
27 |
|
28 |
Stay tuned for the release of the paper!
|