cjvt
/

File size: 690 Bytes
d1e01e4
cdad94d
 
 
 
 
d1e01e4
 
6d2c5e4
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
---
language: 
- sl
- hr
- en

license: cc-by-4.0
---

# crosloengual-bert-si-nli

CroSloEngual BERT model finetuned on the SI-NLI dataset for Slovene natural language inference.  
Fine-tuned in a classic sequence pair classification setting on the official training/validation/test split for 10 epochs, using validation set accuracy for model selection.
Optimized using the AdamW optimizer (learning rate 2e-5) and cross-entropy loss.  
Using batch size `82` (selected based on the available GPU memory) and maximum sequence length `107` (99th percentile of the lengths in the training set).

Achieves the following metrics:  
- best validation accuracy: `0.660`
- test accuracy = `0.673`