FpOliveira
/

tupi-bert-base-portuguese-cased

Text Classification

Inference Endpoints

Model card Files Files and versions Community

tupi-bert-base-portuguese-cased / README.md

FpOliveira's picture

Update README.md

4b709df about 1 year ago

|

1.64 kB

	---
	license: mit
	datasets:
	- FpOliveira/TuPi-Portuguese-Hate-Speech-Dataset-Binary
	language:
	- pt
	metrics:
	- accuracy
	- precision
	- recall
	- f1
	pipeline_tag: text-classification
	---

	## Introduction


	Tupi-BERT-Base represents a fine-tuned BERT model designed specifically for binary classification of hate speech in Portuguese. Derived from the [BERTimbau base](https://huggingface.co/neuralmind/bert-base-portuguese-cased), TuPi-Base is refinde solution for addressing hate speech concerns.
	For more details or specific inquiries, please refer to the [BERTimbau repository](https://github.com/neuralmind-ai/portuguese-bert/).

	The efficacy of Language Models can exhibit notable variations when confronted with a shift in domain between training and test data. In the creation of a specialized Portuguese Language Model tailored for hate speech classification, the original BERTimbau model underwent fine-tuning processe carried out on the [TuPi Hate Speech DataSet](https://huggingface.co/datasets/FpOliveira/TuPi-Portuguese-Hate-Speech-Dataset-Binary), sourced from diverse social networks.

	## Available models

	\| Model \| Arch. \| #Layers \| #Params \|
	\| ---------------------------------------- \| ---------- \| ------- \| ------- \|
	\| `FpOliveira/tupi-bert-base-portuguese-cased` \| BERT-Base \|12 \|109M\|
	\| `FpOliveira/tupi-bert-large-portuguese-cased` \| BERT-Large \| 24 \| 334M \|
	\| `FpOliveira/tupi-bert-base-portuguese-cased-multiclass-multilabel` \| BERT-Base \| 24 \| 109M \|
	\| `FpOliveira/tupi-bert-large-portuguese-cased-multiclass-multilabel` \| BERT-Large \| 24 \| 334M \|