DarwinAnim8or committed on
Commit
783ffdf
1 Parent(s): f016188

Update README.md

  1. README.md +45 -7
README.md CHANGED
@@ -3,19 +3,57 @@ tags:
  - autotrain
  - text-classification
  language:
- - en
  widget:
- - text: "I love AutoTrain"
  datasets:
  - DarwinAnim8or/autotrain-data-text-moderation-v2-small
  co2_eq_emissions:
    emissions: 0.03967468113268738
  ---
 
- # Model Trained Using AutoTrain
 
  - Problem type: Multi-class Classification
- - Model ID: 93240145801
  - CO2 Emissions (in grams): 0.0397
 
  ## Validation Metrics
@@ -38,7 +76,7 @@ co2_eq_emissions:
  You can use cURL to access this model:
 
  ```
- $ curl -X POST -H "Authorization: Bearer YOUR_API_KEY" -H "Content-Type: application/json" -d '{"inputs": "I love AutoTrain"}' https://api-inference.huggingface.co/models/DarwinAnim8or/autotrain-text-moderation-v2-small-93240145801
  ```
 
  Or Python API:
@@ -46,9 +84,9 @@ Or Python API:
  ```
  from transformers import AutoModelForSequenceClassification, AutoTokenizer
 
- model = AutoModelForSequenceClassification.from_pretrained("DarwinAnim8or/autotrain-text-moderation-v2-small-93240145801", use_auth_token=True)
 
- tokenizer = AutoTokenizer.from_pretrained("DarwinAnim8or/autotrain-text-moderation-v2-small-93240145801", use_auth_token=True)
 
  inputs = tokenizer("I love AutoTrain", return_tensors="pt")
 
  - autotrain
  - text-classification
  language:
+ - unk
  widget:
+ - text: 'I love AutoTrain'
+ - text: 'I absolutely hate those people'
+ - text: 'I love cake!'
+ - text: 'lets build the wall and deport illegals "they walk across the border like this is Central park"'
+ - text: 'EU offers to pay countries 6,000 euros per person to take in migrants'
  datasets:
+ - mmathys/openai-moderation-api-evaluation
  - DarwinAnim8or/autotrain-data-text-moderation-v2-small
  co2_eq_emissions:
    emissions: 0.03967468113268738
+ license: openrail
  ---
 
+ # Text Moderation Large
+ This is a text classification model based on DeBERTa-v3 that predicts whether a text contains content that could be considered offensive.
+ Its predictions are split into the following labels:
 
+ | Category | Label | Definition |
+ | -------- | ----- | ---------- |
+ | sexual | `S` | Content meant to arouse sexual excitement, such as the description of sexual activity, or that promotes sexual services (excluding sex education and wellness). |
+ | hate | `H` | Content that expresses, incites, or promotes hate based on race, gender, ethnicity, religion, nationality, sexual orientation, disability status, or caste. |
+ | violence | `V` | Content that promotes or glorifies violence or celebrates the suffering or humiliation of others. |
+ | harassment | `HR` | Content that may be used to torment or annoy individuals in real life, or make harassment more likely to occur. |
+ | self-harm | `SH` | Content that promotes, encourages, or depicts acts of self-harm, such as suicide, cutting, and eating disorders. |
+ | sexual/minors | `S3` | Sexual content that includes an individual who is under 18 years old. |
+ | hate/threatening | `H2` | Hateful content that also includes violence or serious harm towards the targeted group. |
+ | violence/graphic | `V2` | Violent content that depicts death, violence, or serious physical injury in extreme graphic detail. |
+ | OK | `OK` | Not offensive. |
+
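The label table added in this commit can be read as a lookup from label code to category. The following is a minimal sketch of that idea, not part of the commit: it assumes the classifier's scores have already been collected into a dictionary keyed by label code, and the helper names `top_category` and `is_flagged`, as well as the 0.5 threshold, are hypothetical choices for illustration.

```python
from typing import Dict, Tuple

# Label codes and category names copied from the table in the new README.
LABEL_TO_CATEGORY: Dict[str, str] = {
    "S": "sexual",
    "H": "hate",
    "V": "violence",
    "HR": "harassment",
    "SH": "self-harm",
    "S3": "sexual/minors",
    "H2": "hate/threatening",
    "V2": "violence/graphic",
    "OK": "OK",
}


def top_category(scores: Dict[str, float]) -> Tuple[str, float]:
    """Return the (category name, score) pair for the highest-scoring label.

    `scores` maps label codes such as "H" or "OK" to probabilities.
    """
    label = max(scores, key=scores.get)
    return LABEL_TO_CATEGORY[label], scores[label]


def is_flagged(scores: Dict[str, float], threshold: float = 0.5) -> bool:
    """Hypothetical policy: flag when any non-OK label reaches the threshold."""
    return any(
        score >= threshold for label, score in scores.items() if label != "OK"
    )
```

For example, `top_category({"OK": 0.1, "H": 0.7, "V": 0.2})` picks the `hate` category. A real deployment would likely tune the threshold per category rather than use a single value.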
+ ## Ethical Considerations
+ This model deals with sensitive and potentially harmful language. Users should consider the ethical implications and potential risks of using or deploying this model in their applications or contexts. Some of the ethical issues that may arise are:
+
+ - The model may reinforce or amplify existing biases or stereotypes in the data or in society. For example, the model may associate certain words or topics with offensive language based on their frequency or co-occurrence in the data, without considering the meaning or intent behind them. This may result in unfair or inaccurate predictions for some groups or individuals.
+
+ Users should carefully consider the purpose, context, and impact of using this model, and take appropriate measures to prevent or mitigate any potential harm. Users should also respect the privacy and consent of the data subjects, and adhere to the relevant laws and regulations in their jurisdictions.
+
+ ## License
+
+ This model is licensed under the CodeML OpenRAIL-M 0.1 license, which is a variant of the BigCode OpenRAIL-M license. This license allows you to freely access, use, modify, and distribute this model and its derivatives, for research, commercial or non-commercial purposes, as long as you comply with the following conditions:
+
+ - You must include a copy of the license and the original source of the model in any copies or derivatives of the model that you distribute.
+ - You must not use the model or its derivatives for any unlawful, harmful, abusive, discriminatory, or offensive purposes, or to cause or contribute to any social or environmental harm.
+ - You must respect the privacy and consent of the data subjects whose data was used to train or evaluate the model, and adhere to the relevant laws and regulations in your jurisdiction.
+ - You must acknowledge that the model and its derivatives are provided "as is", without any warranties or guarantees of any kind, and that the licensor is not liable for any damages or losses arising from your use of the model or its derivatives.
+
+ By accessing or using this model, you agree to be bound by the terms of this license. If you do not agree with the terms of this license, you must not access or use this model.
+
+ ## Training Details
  - Problem type: Multi-class Classification
  - CO2 Emissions (in grams): 0.0397
 
  ## Validation Metrics
 
  You can use cURL to access this model:
 
  ```
+ $ curl -X POST -H "Authorization: Bearer YOUR_API_KEY" -H "Content-Type: application/json" -d '{"inputs": "I love AutoTrain"}' https://api-inference.huggingface.co/models/KoalaAI/Text-Moderation
  ```
 
  Or Python API:
 
  ```
  from transformers import AutoModelForSequenceClassification, AutoTokenizer
 
+ model = AutoModelForSequenceClassification.from_pretrained("KoalaAI/Text-Moderation", use_auth_token=True)
 
+ tokenizer = AutoTokenizer.from_pretrained("KoalaAI/Text-Moderation", use_auth_token=True)
 
  inputs = tokenizer("I love AutoTrain", return_tensors="pt")