Update README.md
Browse files
README.md
CHANGED
@@ -71,6 +71,14 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
|
71 |
```
|
72 |
You should get a text output representing the label ('positive' means 'toxic', and 'negative' means 'non-toxic').
|
73 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
74 |
## Citation
|
75 |
```
|
76 |
@misc{lin2023toxicchat,
|
|
|
71 |
```
|
72 |
You should get a text output representing the label ('positive' means 'toxic', and 'negative' means 'non-toxic').
|
73 |
|
74 |
+
## Evaluation
|
75 |
+
We report precision, recall, F1 score and AUPRC on ToxicChat (0124) test set:
|
76 |
+
|
77 |
+
| Model | Precision | Recall | F1 | AUPRC |
|
78 |
+
| --- | --- | --- | --- | --- |
|
79 |
+
| ToxicChat-T5-large | 0.7983 | 0.8475 | 0.8221 | 0.8850 |
|
80 |
+
| OpenAI Moderation (Updated Jan 25, 2024, threshold=0.02) | 0.5476 | 0.6989 | 0.6141 | 0.6313 |
|
81 |
+
|
82 |
## Citation
|
83 |
```
|
84 |
@misc{lin2023toxicchat,
|