Update README.md
Browse files
README.md
CHANGED
@@ -64,7 +64,7 @@ The defautl `GenerationConfig` uses contrastive search with `top_k=4` and `penal
|
|
64 |
## Intended uses & limitations
|
65 |
|
66 |
- **Intended use:** research/exploration into comparing RLHF tuning vs. "guided"/specific tuning on "quality" datasets/responses of _"what the human would want as answer anyway"_
|
67 |
-
- This is **not** trained/fine-tuned with RLHF and therefore will not be as helpful/generalizable/safe as chatGPT
|
68 |
|
69 |
## Training and evaluation data
|
70 |
|
|
|
64 |
## Intended uses & limitations
|
65 |
|
66 |
- **Intended use:** research/exploration into comparing RLHF tuning vs. "guided"/specific tuning on "quality" datasets/responses of _"what the human would want as answer anyway"_
|
67 |
+
- This is **not** trained/fine-tuned with RLHF and therefore will not be as helpful/generalizable/safe as chatGPT (_outside of the fact that this model is ~30x smaller_)
|
68 |
|
69 |
## Training and evaluation data
|
70 |
|