ManRo's picture
Update README.md
9e9e2c0
---
license: apache-2.0
tags:
-
datasets:
- EXIST Dataset
- MeTwo Machismo and Sexism Twitter Identification dataset
metrics:
- accuracy
model-index:
- name: twitter_sexismo-finetuned-exist2021
results:
- task:
name: Text Classification
type: text-classification
dataset:
name: EXIST Dataset
type: EXIST Dataset
args: es
metrics:
- name: Accuracy
type: accuracy
value: 0.83
---
# twitter_sexismo-finetuned-exist2021
This model is a fine-tuned version of [pysentimiento/robertuito-hate-speech](https://huggingface.co/pysentimiento/robertuito-hate-speech) on the EXIST dataset and MeTwo: Machismo and Sexism Twitter Identification dataset https://github.com/franciscorodriguez92/MeTwo.
It achieves the following results on the evaluation set:
- Loss: 0.54
- Accuracy: 0.83
## Model description
Modelo para el Hackaton de Somos NLP para detección de sexismo en twitts en español. Creado por:
medardodt
MariaIsabel
ManRo
lucel172
robertou2
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- my_learning_rate = 5E-5
- my_adam_epsilon = 1E-8
- my_number_of_epochs = 8
- my_warmup = 3
- my_mini_batch_size = 32
- optimizer: AdamW with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 8
### Training results
Epoch Training Loss Validation Loss Accuracy F1 Precision Recall
1 0.389900 0.397857 0.827133 0.699620 0.786325 0.630137
2 0.064400 0.544625 0.831510 0.707224 0.794872 0.636986
3 0.004800 0.837723 0.818381 0.704626 0.733333 0.678082
4 0.000500 1.045066 0.820569 0.702899 0.746154 0.664384
5 0.000200 1.172727 0.805252 0.669145 0.731707 0.616438
6 0.000200 1.202422 0.827133 0.720848 0.744526 0.698630
7 0.000000 1.195012 0.827133 0.718861 0.748148 0.691781
8 0.000100 1.215515 0.824945 0.705882 0.761905 0.657534
9 0.000100 1.233099 0.827133 0.710623 0.763780 0.664384
10 0.000100 1.237268 0.829322 0.713235 0.769841 0.664384
|Epoch|Training Loss|Validation Loss|Accuracy|F1|Precision|Precision|
|----|-------|-------|-------|-------|-------|-------|
|1|0.389900 |0.397857 |0.827133 |0.699620 |0.786325 |0.630137 |
|2|0.064400 |0.544625 |0.831510 |0.707224 |0.794872 |0.636986 |
|3|0.004800 |0.837723 |0.818381 |0.704626 |0.733333 |0.678082 |
|4|0.000500 |1.045066 |0.820569 | 0.702899 |0.746154 |0.664384 |
|5|0.000200 |1.172727 |0.805252 |0.669145 |0.731707 |0.616438 |
|6|0.000200 |1.202422 |0.827133 |0.720848 |0.744526 |0.698630 |
|7|0.000000 |1.195012 |0.827133 |0.718861 |0.748148 |0.691781 |
|8|0.000100 |1.215515 |0.824945 |0.705882 |0.761905 |0.657534 |
|9|0.000100|1.233099 |0.827133 |0.710623 |0.763780 |0.664384 |
|10|0.000100|1.237268 |0.829322 |0.713235 |0.769841 |0.664384 |
### Framework versions
- Transformers 4.17.0
- Pytorch 1.10.0+cu111
- Tokenizers 0.11.6
## Model in Action
Fast usage with pipelines:
``` python
###libraries required
!pip install transformers
from transformers import pipeline
### usage pipelines
model_checkpoint = "hackathon-pln-es/twitter_sexismo-finetuned-exist2021-metwo"
pipeline_nlp = pipeline("text-classification", model=model_checkpoint)
pipeline_nlp("mujer al volante peligro!")
#pipeline_nlp("¡me encanta el ipad!")
#pipeline_nlp (["mujer al volante peligro!", "Los hombre tienen más manias que las mujeres", "me encanta el ipad!"] )
# OUTPUT MODEL #
# LABEL_0: "NON SEXISM"or LABEL_1: "SEXISM" and score: probability of accuracy per model.
# [{'label': 'LABEL_1', 'score': 0.9967633485794067}]
# [{'label': 'LABEL_0', 'score': 0.9934417009353638}]
#[{‘label': 'LABEL_1', 'score': 0.9967633485794067},
# {'label': 'LABEL_1', 'score': 0.9755664467811584},
# {'label': 'LABEL_0', 'score': 0.9955045580863953}]
```