Edit model card

mdeberta-v3-base-cantemist

This model is a finetuned version of mdeberta-v3-base for the cantemist dataset used in a benchmark in the paper TODO. The model has a F1 of 0.89

Please refer to the original publication for more information TODO LINK

Parameters used

parameter Value
batch size 16
learning rate 3e-05
classifier dropout 0.2
warmup ratio 0
warmup steps 0
weight decay 0
optimizer AdamW
epochs 10
early stopping patience 3

BibTeX entry and citation info

TODO
Downloads last month
3
Safetensors
Model size
279M params
Tensor type
I64
·
F32
·

Dataset used to train IIC/mdeberta-v3-base-cantemist

Collection including IIC/mdeberta-v3-base-cantemist

Evaluation results