Intel
/

bert-mini-sst2-distilled-sparse-90-1X4-block

Text Classification

Inference Endpoints

Model card Files Files and versions Community

bert-mini-sst2-distilled-sparse-90-1X4-block / README.md

BoDong's picture

First model version

4d03cd5 almost 2 years ago

|

No virus

769 Bytes

	---
	license: mit
	---

	# Sparse BERT mini model (uncased)

	Finetuned model pruned to 1:4 structured sparsity.
	The model is a pruned version of the [BERT mini model](https://huggingface.co/prajjwal1/bert-mini).

	## Intended Use

	The model can be used for inference with sparsity optimisztion.
	For further details on the model and its usage, see our repo and our implementation available [here](https://github.com/intel-innersource/frameworks.ai.nlp-toolkit.intel-nlp-toolkit).
	We also upload the quanted int8 BERT mini sparse Neural Engine IR (acc 87.15) here, could be directly used by NLP Toolkit ref inference.

	## Evaluation Results
	We get the following results on the sst2 tasks development set:

	\| Task \| SST-2 (Acc) \|
	\|------\|-------------\|
	\| \| 87.2 \|