euclaise
/

ReMask-3B

Text Generation

Inference Endpoints

Model card Files Files and versions Community

euclaise commited on Apr 2, 2024

Commit

00f4710

·

verified ·

1 Parent(s): 01e78cf

Update README.md

Files changed (1) hide show

README.md +24 -1

README.md CHANGED Viewed

@@ -116,4 +116,27 @@ As I expected, it improves GSM8K, but doesn't do much to ARC.
 - Epochs: 6
 - Learning rate: 1e-5
 - Learning rate schedule: One Cycle, cosine, no cycle_momentum
-- Regularization weight: 0.1

 - Epochs: 6
 - Learning rate: 1e-5
 - Learning rate schedule: One Cycle, cosine, no cycle_momentum
+- Regularization weight: 0.1
+## Prompt format
+The format for reddit-instruct and oasst2 was:
+```
+<|user|>
+[insert instruction here]
+<|assistant|>
+[insert response here]
+<|user|>
+...
+```
+The format for TinyCoT was:
+```
+<|user|>
+[insert instruction here]
+<|rationale|>
+[insert reasoning here]
+<|answer|>
+[insert direct answer here]
+```