Update README.md

9e9e2c0 about 2 years ago

No virus

3.95 kB

	---
	license: apache-2.0
	tags:
	-
	datasets:
	- EXIST Dataset
	- MeTwo Machismo and Sexism Twitter Identification dataset

	metrics:
	- accuracy
	model-index:
	- name: twitter_sexismo-finetuned-exist2021
	results:
	- task:
	name: Text Classification
	type: text-classification
	dataset:
	name: EXIST Dataset
	type: EXIST Dataset
	args: es
	metrics:
	- name: Accuracy
	type: accuracy
	value: 0.83
	---

	# twitter_sexismo-finetuned-exist2021

	This model is a fine-tuned version of [pysentimiento/robertuito-hate-speech](https://huggingface.co/pysentimiento/robertuito-hate-speech) on the EXIST dataset and MeTwo: Machismo and Sexism Twitter Identification dataset https://github.com/franciscorodriguez92/MeTwo.
	It achieves the following results on the evaluation set:
	- Loss: 0.54
	- Accuracy: 0.83

	## Model description

	Modelo para el Hackaton de Somos NLP para detección de sexismo en twitts en español. Creado por:

	medardodt

	MariaIsabel

	ManRo

	lucel172

	robertou2

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- my_learning_rate = 5E-5
	- my_adam_epsilon = 1E-8
	- my_number_of_epochs = 8
	- my_warmup = 3
	- my_mini_batch_size = 32
	- optimizer: AdamW with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- num_epochs: 8

	### Training results
	Epoch Training Loss Validation Loss Accuracy F1 Precision Recall

	1 0.389900 0.397857 0.827133 0.699620 0.786325 0.630137

	2 0.064400 0.544625 0.831510 0.707224 0.794872 0.636986

	3 0.004800 0.837723 0.818381 0.704626 0.733333 0.678082

	4 0.000500 1.045066 0.820569 0.702899 0.746154 0.664384

	5 0.000200 1.172727 0.805252 0.669145 0.731707 0.616438

	6 0.000200 1.202422 0.827133 0.720848 0.744526 0.698630

	7 0.000000 1.195012 0.827133 0.718861 0.748148 0.691781

	8 0.000100 1.215515 0.824945 0.705882 0.761905 0.657534

	9 0.000100 1.233099 0.827133 0.710623 0.763780 0.664384

	10 0.000100 1.237268 0.829322 0.713235 0.769841 0.664384

	\|Epoch\|Training Loss\|Validation Loss\|Accuracy\|F1\|Precision\|Precision\|
	\|----\|-------\|-------\|-------\|-------\|-------\|-------\|
	\|1\|0.389900 \|0.397857 \|0.827133 \|0.699620 \|0.786325 \|0.630137 \|
	\|2\|0.064400 \|0.544625 \|0.831510 \|0.707224 \|0.794872 \|0.636986 \|
	\|3\|0.004800 \|0.837723 \|0.818381 \|0.704626 \|0.733333 \|0.678082 \|
	\|4\|0.000500 \|1.045066 \|0.820569 \| 0.702899 \|0.746154 \|0.664384 \|
	\|5\|0.000200 \|1.172727 \|0.805252 \|0.669145 \|0.731707 \|0.616438 \|
	\|6\|0.000200 \|1.202422 \|0.827133 \|0.720848 \|0.744526 \|0.698630 \|
	\|7\|0.000000 \|1.195012 \|0.827133 \|0.718861 \|0.748148 \|0.691781 \|
	\|8\|0.000100 \|1.215515 \|0.824945 \|0.705882 \|0.761905 \|0.657534 \|
	\|9\|0.000100\|1.233099 \|0.827133 \|0.710623 \|0.763780 \|0.664384 \|
	\|10\|0.000100\|1.237268 \|0.829322 \|0.713235 \|0.769841 \|0.664384 \|

	### Framework versions

	- Transformers 4.17.0
	- Pytorch 1.10.0+cu111
	- Tokenizers 0.11.6


	## Model in Action
	Fast usage with pipelines:
	``` python
	###libraries required
	!pip install transformers
	from transformers import pipeline

	### usage pipelines
	model_checkpoint = "hackathon-pln-es/twitter_sexismo-finetuned-exist2021-metwo"
	pipeline_nlp = pipeline("text-classification", model=model_checkpoint)
	pipeline_nlp("mujer al volante peligro!")
	#pipeline_nlp("¡me encanta el ipad!")
	#pipeline_nlp (["mujer al volante peligro!", "Los hombre tienen más manias que las mujeres", "me encanta el ipad!"] )

	# OUTPUT MODEL #
	# LABEL_0: "NON SEXISM"or LABEL_1: "SEXISM" and score: probability of accuracy per model.

	# [{'label': 'LABEL_1', 'score': 0.9967633485794067}]
	# [{'label': 'LABEL_0', 'score': 0.9934417009353638}]

	#[{‘label': 'LABEL_1', 'score': 0.9967633485794067},
	# {'label': 'LABEL_1', 'score': 0.9755664467811584},
	# {'label': 'LABEL_0', 'score': 0.9955045580863953}]
	```