MoritzLaurer (HF staff) committed on
Commit 535a9f1 (parent: 7a91fe7)

Update README.md

Files changed (1): README.md (+3 −2)
README.md CHANGED
@@ -11,7 +11,6 @@ license: mit
 ---
 
 
-
 # xtremedistil-l6-h256-zeroshot-v1.1-all-33
 
 This model was fine-tuned using the same pipeline as described in
@@ -19,7 +18,9 @@ the model card for [MoritzLaurer/deberta-v3-large-zeroshot-v1.1-all-33](https://
 and in this [paper](https://arxiv.org/pdf/2312.17543.pdf).
 
 The foundation model is [microsoft/xtremedistil-l6-h256-uncased](https://huggingface.co/microsoft/xtremedistil-l6-h256-uncased).
-The model only has 22 million parameters and is 51 MB small, providing a significant speedup over larger models.
+The model only has 22 million backbone parameters and 30 million vocabulary parameters.
+The backbone parameters are the main parameters active during inference, providing a significant speedup over larger models.
+The model is only 51 MB in size.
 
 This model was trained to provide a very small and highly efficient zeroshot option,
 especially for edge devices or in-browser use-cases with transformers.js.
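For reference, the README's zero-shot use case can be sketched with the standard `transformers` zero-shot-classification pipeline. This is a minimal sketch, not part of the commit itself; the model id below is assumed from the README title, and the example text and candidate labels are illustrative.

```python
# Minimal sketch: zero-shot classification with the model this commit
# documents. The model id is assumed from the README title
# ("xtremedistil-l6-h256-zeroshot-v1.1-all-33" under MoritzLaurer).
from transformers import pipeline

classifier = pipeline(
    "zero-shot-classification",
    model="MoritzLaurer/xtremedistil-l6-h256-zeroshot-v1.1-all-33",
)

result = classifier(
    "Angela Merkel is a politician in Germany and leader of the CDU",
    candidate_labels=["politics", "economy", "entertainment", "environment"],
)
# result is a dict with "sequence", "labels" (sorted by score), and "scores"
print(result["labels"][0], result["scores"][0])
```

For the in-browser use case the README mentions, the same model would instead be loaded through transformers.js; the Python pipeline above only illustrates the task format.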