cointegrated committed
Commit 3000577
1 Parent(s): f3b1f58

Update README.md

Files changed (1):
  1. README.md +46 -5
README.md CHANGED
@@ -12,12 +12,11 @@ widget:
  candidate_labels: "спорт,путешествия,музыка,кино,книги,наука,политика"
  hypothesis_template: "Тема текста - {}."
  ---
- # RuBERT base model (cased) fine-tuned for NLI (natural language inference)
- The model has been trained on a series of NLI datasets automatically translated to Russian from English [from this repo](https://github.com/felipessalvatore/NLI_datasets).
-
- It predicts the logical relationship between two short texts: entailment, contradiction, or neutral.
+ # RuBERT for NLI (natural language inference)
+
+ This is the [DeepPavlov/rubert-base-cased](https://huggingface.co/DeepPavlov/rubert-base-cased) model fine-tuned to predict the logical relationship between two short texts: entailment, contradiction, or neutral.
 
+ ## Usage
  How to run the model for NLI:
  ```python
  # !pip install transformers sentencepiece --quiet
@@ -59,4 +58,46 @@ predict_zero_shot('Какая вкусная эта ваша заливная р
  # array([0.9059292 , 0.09407079], dtype=float32)
  ```
 
- Alternatively, you can use [Huggingface pipelines](https://huggingface.co/transformers/main_classes/pipelines.html) for inference.
+ Alternatively, you can use [Huggingface pipelines](https://huggingface.co/transformers/main_classes/pipelines.html) for inference.
+
+ ## Sources
+ The model has been trained on a series of NLI datasets automatically translated to Russian from English.
+
+ Most of the datasets were taken [from the repo of Felipe Salvatore](https://github.com/felipessalvatore/NLI_datasets):
+ [JOCI](https://github.com/sheng-z/JOCI),
+ [MNLI](https://cims.nyu.edu/~sbowman/multinli/),
+ [MPE](https://aclanthology.org/I17-1011/),
+ [SICK](http://www.lrec-conf.org/proceedings/lrec2014/pdf/363_Paper.pdf),
+ [SNLI](https://nlp.stanford.edu/projects/snli/).
+
+ Some datasets were obtained from their original sources:
+ [ANLI](https://github.com/facebookresearch/anli),
+ [NLI-style FEVER](https://github.com/easonnie/combine-FEVER-NSMN/blob/master/other_resources/nli_fever.md),
+ [IMPPRES](https://github.com/facebookresearch/Imppres).
+
+ ## Performance
+
+ The table below shows ROC AUC for three models, measured on small samples of the DEV sets:
+ - [tiny](https://huggingface.co/cointegrated/rubert-tiny-bilingual-nli): a small BERT predicting entailment vs not_entailment
+ - [twoway](https://huggingface.co/cointegrated/rubert-base-cased-nli-twoway): a base-sized BERT predicting entailment vs not_entailment
+ - [threeway](https://huggingface.co/cointegrated/rubert-base-cased-nli-threeway) (**this model**): a base-sized BERT predicting entailment vs contradiction vs neutral
+
+ |dataset    |tiny/entailment|twoway/entailment|threeway/entailment|threeway[3]/contradiction|threeway[3]/neutral|
+ |-----------|---------------|-----------------|-------------------|-------------------------|-------------------|
+ |add_one_rte|0.82           |0.90             |0.92               |                         |                   |
+ |anli_r1    |0.50           |0.68             |0.66               |0.70                     |0.75               |
+ |anli_r2    |0.55           |0.62             |0.62               |0.62                     |0.69               |
+ |anli_r3    |0.50           |0.63             |0.59               |0.62                     |0.64               |
+ |copa       |0.55           |0.60             |0.62               |                         |                   |
+ |fever      |0.88           |0.94             |0.94               |0.91                     |0.92               |
+ |help       |0.74           |0.87             |0.46               |                         |                   |
+ |iie        |0.79           |0.85             |0.54               |                         |                   |
+ |imppres    |0.94           |0.99             |0.99               |0.99                     |0.99               |
+ |joci       |0.87           |0.93             |0.93               |0.85                     |0.80               |
+ |mnli       |0.87           |0.92             |0.93               |0.89                     |0.86               |
+ |monli      |0.94           |1.00             |0.67               |                         |                   |
+ |mpe        |0.82           |0.90             |0.90               |0.91                     |0.80               |
+ |scitail    |0.80           |0.96             |0.85               |                         |                   |
+ |sick       |0.97           |0.99             |0.99               |0.98                     |0.96               |
+ |snli       |0.95           |0.98             |0.98               |0.99                     |0.97               |
+ |terra      |0.73           |0.93             |0.93               |                         |                   |
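
The first hunk truncates the usage snippet right after the `!pip install` comment (README lines 23-58 are unchanged and therefore elided by the diff). For readers of this page, here is a minimal sketch of what running the threeway model looks like, assuming only the standard transformers sequence-classification API; the premise and hypothesis strings are illustrative, not the card's exact example:

```python
# A minimal sketch, not the card's exact snippet: scoring one
# premise/hypothesis pair with the threeway model.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_id = 'cointegrated/rubert-base-cased-nli-threeway'
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)

text1 = 'Сегодня на улице идёт дождь.'  # premise: "It is raining outside today." (illustrative)
text2 = 'Погода сегодня плохая.'        # hypothesis: "The weather is bad today." (illustrative)

with torch.inference_mode():
    inputs = tokenizer(text1, text2, return_tensors='pt', truncation=True)
    proba = torch.softmax(model(**inputs).logits, -1)[0]

# id2label comes from the model config, so no particular class index
# is assumed for entailment/contradiction/neutral.
print({model.config.id2label[i]: round(p.item(), 4) for i, p in enumerate(proba)})
```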
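The added "Alternatively, you can use Huggingface pipelines" line gives no code. A minimal sketch of that route, assuming the standard zero-shot-classification pipeline and reusing the `candidate_labels` and `hypothesis_template` from the widget config at the top of the diff; the input sentence is illustrative:

```python
# A minimal sketch of inference through the zero-shot pipeline.
from transformers import pipeline

classifier = pipeline('zero-shot-classification',
                      model='cointegrated/rubert-base-cased-nli-threeway')

result = classifier(
    'Сегодня мы выиграли важный матч',  # "Today we won an important match" (illustrative)
    candidate_labels=['спорт', 'путешествия', 'музыка', 'кино', 'книги', 'наука', 'политика'],
    hypothesis_template='Тема текста - {}.',
)
# The pipeline returns labels sorted by score, highest first.
print(result['labels'][0], result['scores'][0])
```

Under the hood, the pipeline builds one premise-hypothesis pair per candidate label using the template and scores each with the NLI model, which is what the widget on the model page does.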
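The Performance table reports per-class ROC AUC. Below is a hedged sketch of how such one-vs-rest numbers can be computed with scikit-learn; the labels and probabilities here are random placeholders, not the card's evaluation data:

```python
# A hedged sketch of per-class ROC AUC (one class vs the rest).
import numpy as np
from sklearn.metrics import roc_auc_score

def per_class_roc_auc(y_true, probas, class_names):
    """y_true: (n,) integer labels; probas: (n, k) predicted class probabilities."""
    return {name: roc_auc_score((y_true == i).astype(int), probas[:, i])
            for i, name in enumerate(class_names)}

rng = np.random.default_rng(0)
y = rng.integers(0, 3, size=200)         # placeholder gold labels
p = rng.dirichlet(np.ones(3), size=200)  # placeholder model probabilities
print(per_class_roc_auc(y, p, ['entailment', 'contradiction', 'neutral']))
```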