cross-encoder
/

nli-deberta-base

@@ -1,16 +1,30 @@
-# Cross-Encoder for Quora Duplicate Questions Detection
 This model was trained using [SentenceTransformers](https://sbert.net) [Cross-Encoder](https://www.sbert.net/examples/applications/cross-encoder/README.html) class.
 ## Training Data
 The model was trained on the [SNLI](https://nlp.stanford.edu/projects/snli/) and [MultiNLI](https://cims.nyu.edu/~sbowman/multinli/) datasets. For a given sentence pair, it will output three scores corresponding to the labels: contradiction, entailment, neutral.
 ## Usage
 Pre-trained models can be used like this:
 ```python
 from sentence_transformers import CrossEncoder
-model = CrossEncoder('model_name')
 scores = model.predict([('A man is eating pizza', 'A man eats something'), ('A black race car starts up in front of a crowd of people.', 'A man is driving down a lonely road.')])
 #Convert scores to labels
@@ -24,8 +38,8 @@ You can use the model also directly with Transformers library (without SentenceT
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
 import torch
-model = AutoModelForSequenceClassification.from_pretrained('model_name')
-tokenizer = AutoTokenizer.from_pretrained('model_name')
 features = tokenizer(['A man is eating pizza', 'A black race car starts up in front of a crowd of people.'], ['A man eats something', 'A man is driving down a lonely road.'],  padding=True, truncation=True, return_tensors="pt")
@@ -35,4 +49,17 @@ with torch.no_grad():
     label_mapping = ['contradiction', 'entailment', 'neutral']
     labels = [label_mapping[score_max] for score_max in scores.argmax(dim=1)]
     print(labels)
-```

+---
+language: en
+pipeline_tag: zero-shot-classification
+tags:
+- deberta-base-base
+datasets:
+- multi_nli
+- snli
+metrics:
+- accuracy
+---
+# Cross-Encoder for Natural Language Inference
 This model was trained using [SentenceTransformers](https://sbert.net) [Cross-Encoder](https://www.sbert.net/examples/applications/cross-encoder/README.html) class.
 ## Training Data
 The model was trained on the [SNLI](https://nlp.stanford.edu/projects/snli/) and [MultiNLI](https://cims.nyu.edu/~sbowman/multinli/) datasets. For a given sentence pair, it will output three scores corresponding to the labels: contradiction, entailment, neutral.
+## Performance
+For evaluation results, see [SBERT.net - Pretrained Cross-Encoder](https://www.sbert.net/docs/pretrained_cross-encoders.html#nli).
 ## Usage
 Pre-trained models can be used like this:
 ```python
 from sentence_transformers import CrossEncoder
+model = CrossEncoder('cross-encoder/nli-deberta-base')
 scores = model.predict([('A man is eating pizza', 'A man eats something'), ('A black race car starts up in front of a crowd of people.', 'A man is driving down a lonely road.')])
 #Convert scores to labels
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
 import torch
+model = AutoModelForSequenceClassification.from_pretrained('cross-encoder/nli-deberta-base')
+tokenizer = AutoTokenizer.from_pretrained('cross-encoder/nli-deberta-base')
 features = tokenizer(['A man is eating pizza', 'A black race car starts up in front of a crowd of people.'], ['A man eats something', 'A man is driving down a lonely road.'],  padding=True, truncation=True, return_tensors="pt")
     label_mapping = ['contradiction', 'entailment', 'neutral']
     labels = [label_mapping[score_max] for score_max in scores.argmax(dim=1)]
     print(labels)
+```
+## Zero-Shot Classification
+This model can also be used for zero-shot-classification:
+```python
+from transformers import pipeline
+classifier = pipeline("zero-shot-classification", model='cross-encoder/nli-deberta-base')
+sent = "Apple just announced the newest iPhone X"
+candidate_labels = ["technology", "sports", "politics"]
+res = classifier(sent, candidate_labels)
+print(res)
+```

config.json CHANGED Viewed

@@ -8,16 +8,16 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
-    "LABEL_0": 0,
-    "LABEL_1": 1,
-    "LABEL_2": 2
   },
   "layer_norm_eps": 1e-07,
   "max_position_embeddings": 512,

   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
+    "0": "contradiction",
+    "1": "entailment",
+    "2": "neutral"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
+    "contradiction": 0,
+    "entailment": 1,
+    "neutral": 2
   },
   "layer_norm_eps": 1e-07,
   "max_position_embeddings": 512,