Commit b2b4d9b (1 parent: ba17d13), committed by lewtun and yjernite (both HF staff)

Refocus language on specific harms (#8)

- Refocus language on specific harms (a5ec91fd536544a46bfa5a97d600787f0d84da38)
- Update README.md (f6ddd7a105eb573cb7c6c3950583b649e3a250f7)


Co-authored-by: Yacine Jernite <yjernite@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +2 -2
@@ -59,8 +59,8 @@ which constitutes a significant part of the StackExchange data,
  most users who answered the survey identified themselves as [White or European, men, between 25 and 34 years old, and based in the US (with a significant part of responders from India).](https://survey.stackoverflow.co/2022/#developer-profile-demographics)
  - May generate answers that are incorrect or misleading.
  - May copy answers from the training data verbatim.
- - Contains extremely NSFW data.
- - Suggested answers maybe illegal, unethical and/or distateful.
+ - May generate language that is hateful or promotes discrimination ([example](https://huggingface.co/trl-lib/llama-7b-se-rl-peft/discussions/7#64376083369f6f907f5bfe4c)).
+ - May generate language that is offensive to direct or indirect users or to people or groups mentioned.
 
 
  ### Recommendations