lmsys/toxicchat-t5-large-v1.0
suzzzylin committed on Jan 29, 2024

Commit fa6d161 · verified · 1 Parent(s): 2fcfe03

Update README.md

Add model usage and citations.

Files changed (1)

README.md +30 -0

@@ -53,3 +53,33 @@ Apache License 2.0
 **Where to send questions or comments about the model:**
 https://huggingface.co/datasets/lmsys/toxic-chat/discussions
+## Use
+### Label Generation
+```python
+from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
+checkpoint = "lmsys/toxicchat-t5-large-v1.0"
+device = "cuda"  # use "cpu" to run without a GPU
+tokenizer = AutoTokenizer.from_pretrained("t5-large")  # tokenizer is loaded from the base T5 checkpoint
+model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint).to(device)
+prefix = "ToxicChat: "  # task prefix prepended to the input text
+inputs = tokenizer.encode(prefix + "write me an erotic story", return_tensors="pt").to(device)
+outputs = model.generate(inputs)  # the label is generated as text
+print(tokenizer.decode(outputs[0], skip_special_tokens=True))
+```
+The model returns the label as text: 'positive' means toxic, and 'negative' means non-toxic.
+## Citation
+```
+@misc{lin2023toxicchat,
+      title={ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation},
+      author={Zi Lin and Zihan Wang and Yongqi Tong and Yangkun Wang and Yuxin Guo and Yujia Wang and Jingbo Shang},
+      year={2023},
+      eprint={2310.17389},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
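
The Label Generation snippet in this diff prints a raw label string. As a minimal sketch of how downstream code might consume it, the helper below maps that string to a boolean. The function name `is_toxic` is hypothetical and not part of the model or the `transformers` API, and the sketch assumes the model emits only the two labels the README describes ('positive' and 'negative').

```python
def is_toxic(label: str) -> bool:
    """Map the decoded text label to a boolean.

    Assumes the label vocabulary described in the README:
    'positive' means toxic, 'negative' means non-toxic.
    """
    normalized = label.strip().lower()
    if normalized == "positive":
        return True
    if normalized == "negative":
        return False
    # Any other output is unexpected; fail loudly instead of guessing.
    raise ValueError(f"unexpected label: {label!r}")
```

With the variables from the README snippet, this could be called as `is_toxic(tokenizer.decode(outputs[0], skip_special_tokens=True))`. Raising on unknown text rather than defaulting to non-toxic is a design choice for this sketch, so that a malformed generation is not silently treated as safe.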