Commit
·
57a43c5
1
Parent(s):
5d94d19
Update README.md
Browse files
README.md
CHANGED
@@ -14,9 +14,9 @@ tags:
|
|
14 |
---
|
15 |
# ToxicityModelPT (Portuguese)
|
16 |
|
17 |
-
The `ToxicityModelPT` is a modified BERT model that can be used to score the toxicity of
|
18 |
|
19 |
-
The `
|
20 |
|
21 |
The model was trained with a dataset composed of `demonstrations`, and annotated `toxicity scores`.
|
22 |
|
|
|
14 |
---
|
15 |
# ToxicityModelPT (Portuguese)
|
16 |
|
17 |
+
The `ToxicityModelPT` is a modified BERT model that can be used to score the toxicity of a sentence (prompt + completion). It is based on a [BERT model](https://huggingface.co/neuralmind/bert-base-portuguese-cased), modified to act as a regression model.
|
18 |
|
19 |
+
The `ToxicityModelPT` allows the specification of an `alpha` parameter, which is a multiplier to the toxicity score. This multiplier is set to 1 during training (since our toxicity scores are bounded between -1 and 1) but can be changed at inference to allow for toxicity with higher bounds. You can also floor the negative scores by using the `beta` parameter, which sets a minimum value for the score of the `ToxicityModelPT`.
|
20 |
|
21 |
The model was trained with a dataset composed of `demonstrations`, and annotated `toxicity scores`.
|
22 |
|