Transformers
English
trl
rlhf
lvwerra HF staff commited on
Commit
df0e302
1 Parent(s): 2f1a0d2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -28,13 +28,13 @@ Answer: <Response>
28
  ```
29
 
30
  ## Intended Uses & Limitations
31
- The **Llama-se-rl** model was trained for long form QA using [Stack Exchange](https://stackexchange.com) data wich is released under a [CC-BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/), and covers topics such as programming, mathematics, and physics.
32
  It is intended to demonstrate a Large Language Model's ability to follow a target behavior (in this case, generating answers to a question that would have been rated more highly on SE).
33
  It is not intended to replace human expertise, and answers should be validated through the use of external sources.
34
  Further research is also needed to attribute model generations to sources in the training data, especially in cases where the model may copy answers from the training data *verbatim*.
35
 
36
  ## Limitations and Bias
37
- The **Llama-se-rl** model inherits limitations and biases from the Llama model and also those contained in the Stack Exchange dataset.
38
  In particular, per the [latest developer survey for Stack Overflow](https://survey.stackoverflow.co/2022/),
39
  which constitutes a significant part of the StackExchange data,
40
  most users who answered the survey identified themselves as [White or European, men, between 25 and 34 years old, and based in the US (with a significant part of responders from India).](https://survey.stackoverflow.co/2022/#developer-profile-demographics)
 
28
  ```
29
 
30
  ## Intended Uses & Limitations
31
+ The **Llama-se** model was trained for long form QA using [Stack Exchange](https://stackexchange.com) data wich is released under a [CC-BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/), and covers topics such as programming, mathematics, and physics.
32
  It is intended to demonstrate a Large Language Model's ability to follow a target behavior (in this case, generating answers to a question that would have been rated more highly on SE).
33
  It is not intended to replace human expertise, and answers should be validated through the use of external sources.
34
  Further research is also needed to attribute model generations to sources in the training data, especially in cases where the model may copy answers from the training data *verbatim*.
35
 
36
  ## Limitations and Bias
37
+ The **Llama-se** model inherits limitations and biases from the Llama model and also those contained in the Stack Exchange dataset.
38
  In particular, per the [latest developer survey for Stack Overflow](https://survey.stackoverflow.co/2022/),
39
  which constitutes a significant part of the StackExchange data,
40
  most users who answered the survey identified themselves as [White or European, men, between 25 and 34 years old, and based in the US (with a significant part of responders from India).](https://survey.stackoverflow.co/2022/#developer-profile-demographics)