cmarkea/bloomz-560m-nli

Cyrile committed on Mar 22

Commit 3549489 • 1 Parent(s): 9d33a33

Update README.md

Files changed (1)

README.md +27 -0

@@ -25,6 +25,24 @@ It should be noted that hypotheses and premises are randomly chosen between Engl
 ### Performance
+| **class**          | **precision (%)** | **f1-score (%)** | **support** |
+| :----------------: | :---------------: | :--------------: | :---------: |
+| **global**         | 69.20             | 68.35            | 5,010       |
+| **contradiction**  | 63.66             | 70.60            | 1,670       |
+| **entailment**     | 73.45             | 73.01            | 1,670       |
+| **neutral**        | 70.75             | 61.45            | 1,670       |
+### Benchmark
+| **model**          | **accuracy (%)** | **MCC (x100)** |
+| :----------------: | :--------------: | :------------: |
+| [cmarkea/distilcamembert-base-nli](https://huggingface.co/cmarkea/distilcamembert-base-nli) | 77.45 | 66.24 |
+| [BaptisteDoyen/camembert-base-xnli](https://huggingface.co/BaptisteDoyen/camembert-base-xnli) | 81.72 | 72.67 |
+| [MoritzLaurer/mDeBERTa-v3-base-mnli-xnli](https://huggingface.co/MoritzLaurer/mDeBERTa-v3-base-mnli-xnli) | 83.43 | 75.15 |
+| [cmarkea/bloomz-560m-nli](https://huggingface.co/cmarkea/bloomz-560m-nli) | 68.70 | 53.57 |
+| [cmarkea/bloomz-3b-nli](https://huggingface.co/cmarkea/bloomz-3b-nli) | 81.08 | 71.66 |
+| [cmarkea/bloomz-7b1-mt-nli](https://huggingface.co/cmarkea/bloomz-7b1-mt-nli) | 83.13 | 74.89 |
 # Zero-shot Classification
 The primary appeal of training such models lies in their zero-shot classification performance. This means the model is capable of classifying any text with any label
 without specific training. What sets the Bloomz-560m-NLI LLMs apart in this realm is their ability to model and extract information from significantly more complex
@@ -38,6 +56,15 @@ is the sentence we aim to classify.
 ### Performance
+| **model**          | **accuracy (%)** | **MCC (x100)** |
+| :----------------: | :--------------: | :------------: |
+| [cmarkea/distilcamembert-base-nli](https://huggingface.co/cmarkea/distilcamembert-base-nli) | 80.59 | 63.71 |
+| [BaptisteDoyen/camembert-base-xnli](https://huggingface.co/BaptisteDoyen/camembert-base-xnli) | 86.37 | 73.74 |
+| [MoritzLaurer/mDeBERTa-v3-base-mnli-xnli](https://huggingface.co/MoritzLaurer/mDeBERTa-v3-base-mnli-xnli) | 84.97 | 70.05 |
+| [cmarkea/bloomz-560m-nli](https://huggingface.co/cmarkea/bloomz-560m-nli) | 71.13 | 46.30 |
+| [cmarkea/bloomz-3b-nli](https://huggingface.co/cmarkea/bloomz-3b-nli) | 89.06 | 78.10 |
+| [cmarkea/bloomz-7b1-mt-nli](https://huggingface.co/cmarkea/bloomz-7b1-mt-nli) | 95.12 | 90.27 |
 # How to use Bloomz-560m-NLI
 ```python