This model is a pre-trained XLNet with 12 layers.
It accompanies the paper "SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models".
Project Page: SBERT-WK