crosloengual-bert-si-nli

CroSloEngual BERT model finetuned on the SI-NLI dataset for Slovene natural language inference.
Fine-tuned in a classic sequence pair classification setting on the official training/validation/test split for 10 epochs, using validation set accuracy for model selection. Optimized using the AdamW optimizer (learning rate 2e-5) and cross-entropy loss.
Using batch size 82 (selected based on the available GPU memory) and maximum sequence length 107 (99th percentile of the lengths in the training set).

Achieves the following metrics:

best validation accuracy: 0.660
test accuracy = 0.673