Edit model card

BETO_Galen-cantemist

This model is a finetuned version of BETO_Galen for the cantemist dataset used in a benchmark in the paper TODO. The model has a F1 of 0.802

Please refer to the original publication for more information TODO LINK

Parameters used

parameter Value
batch size 16
learning rate 3e05
classifier dropout 0.1
warmup ratio 0
warmup steps 0
weight decay 0
optimizer AdamW
epochs 10
early stopping patience 3

BibTeX entry and citation info

TODO
Downloads last month
3
Safetensors
Model size
110M params
Tensor type
I64
·
F32
·

Dataset used to train IIC/BETO_Galen-cantemist

Collection including IIC/BETO_Galen-cantemist

Evaluation results