carlesoctav commited on
Commit
1347b6a
1 Parent(s): 1a0827b

Add SetFit model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,288 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ license: apache-2.0
4
+ library_name: setfit
5
+ tags:
6
+ - setfit
7
+ - sentence-transformers
8
+ - text-classification
9
+ - generated_from_setfit_trainer
10
+ base_model: sentence-transformers/paraphrase-mpnet-base-v2
11
+ metrics:
12
+ - accuracy
13
+ - precision
14
+ - recall
15
+ - f1
16
+ widget:
17
+ - text: a cinematic masterpiece.. i can't believe my eyes what just i experienced
18
+ in theatre, this movie is one of the best cinematic experience i've ever had..
19
+ from opening scene to end credit every second is worth it.. epic score by hans
20
+ zimmer elevates the experience, filled with beautiful shots and cinematography
21
+ to a great scrrenplay and some top notch acting by every cast member dune 2 will
22
+ be remembered as one of the best films of this generation even after 50 years
23
+ later that's my guarantee.. denis villeneuve is one of the best director of this
24
+ generation who gave us masterpiece like blade runner 2049 and currently working
25
+ on 3rd part of this trilogy which i can't wait to see.. action scenes were great
26
+ featuring big ships, blasters laser guns and every other kind of weapons, sound
27
+ effects were amazing but most importantly screenplay was well paced and story
28
+ featuring blend of sci fi and fantasy elements is absolute best of the best even
29
+ though it is an adaptation of 1965 novel of same name it is one of the best adaptations
30
+ ever.. i loved both movies and can't wait to see what happens in future because
31
+ i enjoy films about the chosen one prophecy like star wars, the matrix and now
32
+ this.. i won't give any spoiler here and will not talk about anything shown in
33
+ the movie since you need to see this movie for yourself to feel the magic.. don't
34
+ miss it at any cost.. watch it on the biggest screen possible especially imax
35
+ to enjoy epic sound effects and see the larger than life picture you'll love it..
36
+ - text: hopeful for the future of this franchise this one's very personal for me since
37
+ i've been invested in this franchise for years now and all the waiting throughout
38
+ the production of both parts, considering the multiple delays, have been worthwhile
39
+ and it only enriched the experience, so it's really hard for me to not sound like
40
+ an absolute fanboy.every tidbit of news from the start of the production only
41
+ fueled my excitement, from cast and crew announcements to first look reveals to
42
+ teasers and trailers which i probably watched and obsessively dissected dozens
43
+ of times. which always put at rest my lore obsessed brain because it all made
44
+ sense, i could tell from the beginning that both parts were crafted by people
45
+ who understood and cared about the source material without approaching it like
46
+ some sort of sacred text. on that note i was quite surprised and pleased by the
47
+ changes they did with characters, timeline and most specifically the ending, which
48
+ only left me wanting more.despite the fact that unfortunately my local cinema
49
+ doesn't have imax screens the experience was an absolute blast and i was nothing
50
+ short of euphoric with a stupid grin the entire movie it felt surreal and overwhelming
51
+ at times taking it all in. i loved how it expanded from the themes established
52
+ in the first one, particularly the detailed exploration of fremen culture, how
53
+ ominous and mysterious felt the bene gesserit, as for the harkonnens besides the
54
+ obvious praise for how brutal, weird and cool they're portrayed i particularly
55
+ appreciated the h. r giger inspired look of their black sun lit giedi prime. all
56
+ while staying true to the essence of the story.with a chunky and lengthy runtime
57
+ its impossible for me to pinpoint favorite moments. my jaw dropped at the sheer
58
+ scale of many sequences, namely the opening scene with the orange hues and the
59
+ levitating harkonnens, also the gladiator scene, the big feyd-rautha entrance
60
+ was bonkers the fighting choreography felt natural and realistic and well thought
61
+ out, tho i gotta admit i was iffy at first but ultimately pleasantly surprised
62
+ by austin's feyd-rautha.overall, the performances, direction, music, sound design,
63
+ cinematography, and production design were outstanding. the themes of fanaticism
64
+ and the difficulty of reasoning with true believers resonated deeply with me.
65
+ plus, getting to watch it with three of my favorite people made the movie watching
66
+ experience even more lovely. dune part two exceeded my every expectations and
67
+ left me eagerly awaiting the next installment. i can confidently say that this
68
+ is the perfect dune adaptation.
69
+ - text: disappointing like seriously... what the hell was that? :-( first part was
70
+ mostly on point... but the second part?? why alter perfection?it totally missed
71
+ in my opinion. the nukes were used for the shields, not for the spice. the water
72
+ of life was meant to be used for the spice... the great houses came together with
73
+ the emperor, why was this changed? where is the baby? where is the sister? why
74
+ was the guild cut out all together? where is the scene between. gurney and lady
75
+ jessica. all this gave context to the story what now went totally missing.as it
76
+ was cut together now, the only thing truly great about the movie was the sound
77
+ editing. this gave a great atmosphere. i in general like villeneuve alot, sicario
78
+ is one of the top3 the greatest movies ever made, but this was disappointing.
79
+ - text: dune 1 & 2 are probably the best sci-fi movies ever... yes, dune 1 & 2 are
80
+ probably the best sci-fi movies ever. i can't really come up with anything else
81
+ that would beat these anyway.part two was a nice continuation to part one, however
82
+ i felt the last part of the movie could have been better or maybe it just went
83
+ too fast. without spoiling its hard to explain. also missed some more insight
84
+ about the mother and a lot of other things as well... imagine if this could have
85
+ been a 20 episode serie with same quality!anyhow visually and the main cast were
86
+ just excellent. i did expect more out of the emperor christofer walken.just love
87
+ the tonality of the entire movie and the sound and music made it masterful.please
88
+ more of these "adult" epic sci-fi movies and series!
89
+ - text: great movie with holes in the plot dune 2 is a great movie that can be enjoyed
90
+ despite some holes in the plot and questionable story elements. the movie is great
91
+ visually and serves as an example of how cgi doesn't have to make a movie seem
92
+ sterile and fake, or like watching a video game like the new brand of marvel movies
93
+ and aquaman do.the cast is great - christopher walken should have had more scenes
94
+ and other than it being him, his performance was not unique in any way - and there
95
+ is a non credited appearance that i'm not going to spoil that is a wow!the movie
96
+ is long, but there's a difference between long and too long, and any movie that
97
+ wants to be an epic. 2 out of 5 found this helpful. was this review helpful? sign
98
+ in to vote. permalink
99
+ pipeline_tag: text-classification
100
+ inference: true
101
+ model-index:
102
+ - name: SetFit with sentence-transformers/paraphrase-mpnet-base-v2 on data/raw/15239678.jsonl
103
+ results:
104
+ - task:
105
+ type: text-classification
106
+ name: Text Classification
107
+ dataset:
108
+ name: data/raw/15239678.jsonl
109
+ type: unknown
110
+ split: test
111
+ metrics:
112
+ - type: accuracy
113
+ value: 0.8246753246753247
114
+ name: Accuracy
115
+ - type: precision
116
+ value: 0.9915611814345991
117
+ name: Precision
118
+ - type: recall
119
+ value: 0.818815331010453
120
+ name: Recall
121
+ - type: f1
122
+ value: 0.8969465648854962
123
+ name: F1
124
+ ---
125
+
126
+ # SetFit with sentence-transformers/paraphrase-mpnet-base-v2 on data/raw/15239678.jsonl
127
+
128
+ This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Text Classification. This SetFit model uses [sentence-transformers/paraphrase-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-mpnet-base-v2) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
129
+
130
+ The model has been trained using an efficient few-shot learning technique that involves:
131
+
132
+ 1. Fine-tuning a [Sentence Transformer](https://www.sbert.net) with contrastive learning.
133
+ 2. Training a classification head with features from the fine-tuned Sentence Transformer.
134
+
135
+ ## Model Details
136
+
137
+ ### Model Description
138
+ - **Model Type:** SetFit
139
+ - **Sentence Transformer body:** [sentence-transformers/paraphrase-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-mpnet-base-v2)
140
+ - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
141
+ - **Maximum Sequence Length:** 512 tokens
142
+ - **Number of Classes:** 2 classes
143
+ <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
144
+ - **Language:** en
145
+ - **License:** apache-2.0
146
+
147
+ ### Model Sources
148
+
149
+ - **Repository:** [SetFit on GitHub](https://github.com/huggingface/setfit)
150
+ - **Paper:** [Efficient Few-Shot Learning Without Prompts](https://arxiv.org/abs/2209.11055)
151
+ - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
152
+
153
+ ### Model Labels
154
+ | Label | Examples |
155
+ |:---------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
156
+ | negative | <ul><li>"it's like watching a cult and not really a good movie. sorry not for me. all the characters gave justice on their roles, well executed. the pace of the story is so slow that it made me so sleepy in the middle of the film. actually the film made me bored coz i expected more but a lousy story. i would like to mention rebecca ferguson, she's a great actress and really fit to be the lady jessica itself. in the middle of the film, me and three of my friends wanted to go home coz it feels like we're watching a cult movie. not a good one for children's to watch. in this movie, nothing much to expect, such a waste of time and it was a very boring film fighting for a spice. 19 out of 100 found this helpful. was this review helpful? sign in to vote. permalink"</li><li>'the avengers-era "cinema" for a call of duty generation i watched dune: part two on imax (as seems befitting that sort of big-budget picture), in pretty good seats considering the house was packed with a diverse crowd of young adults for the late-night showing.my review of the first of denis villeneuve\'s adaptations was that it was pretty to look at, but basically uninteresting in its plot, a major let-down considering how hyped the books have been, when on the screen they\'re no better than star wars. this second one exhibits those same features, but worse.this is basically an action movie--a genre which i\'m fundamentally uninterested in--which seems at odds with all the talk of dune being characterized by complex political & economic maneuvering--of which this film does include some, albeit parenthetically. the scenes are impressionistic in their pacing and content, and expressionistic in that the photography & score are more about attempting to convey emotionality, or at least get the adrenaline pumping, than any sort of profound characterizations or themes. this is lawrence of arabia in space but with the cold war as the contemporary allegory. that sort of story would still have been resonant during the post-9/11 era, but we\'ve moved on from those wars by now into an altogether different geopolitics and political economy.the monumentalism of the shots can be impressive, and certainly the design of the mechanical vehicles is (in addition to the still-eye candy costumes), but there are too many moments where it\'s like watching a video game cut-scene, or even a video game playing out (c.f. the entire opening sequence). cinema is going backwards when that\'s the case. and there are some egregious dissonances in the sound design, most of all in the various accents the actors were allowed to use, which takes us out of the suspension of disbelief, but also in the over-application of non-diegetic score, which treats the audience as dumb emotions-receptacles, during scenes that by-and-large do not earn such responses.(the only outstanding sequence of the film is when paul attempts his first worm-ride, which is almost as exhilirating as the roller coaster ride it attempts to be for us on screen.)there are also weird dissonances in this film attempting to be humorous, as the on-again off-again nature of those lines (and how they\'re delivered) feel like a wink to the audience that this allegory of abrahamic religions co-opting a people\'s revolution--albeit against a totalitarian capitalist empire, mixing all kinds of economic stages of history, much as the genres of swords & planets and sci-fi and fantasy are mixed--is not to be taken too seriously in our secular times. but then, what are we to take from this "epic"?'</li><li>"try as hard as you like i left feeling bored, and that's a bad sign for an epic sci-fi. i watched dune i and loved it, the deep sonorous sounds of an alien nation were exciting, and the dialogue kept minimal and thick. dune ii blasts you with ott noise, over the top effects, and piss poor dialogue... 'for the duke, and my friends' i felt this movie was aimed at a pre-teen audience with spiderman fetish and a heavy hand of americanism... the emperor was not an emperor. recommend putting a human written storyline first and leave ai/cgi to a bare minimum. i watched dune 20 years ago and loved it. i read dune 30 years ago and loved it. the actors tried hard with what they got but it wasn't enough to make it a masterpiece, it's more like an aprenticepiece."</li></ul> |
157
+ | positive | <ul><li>"best movie in recent years. the movie is a more impressive, more exciting and deeper production than the first part. give it 9 out of 10.the acting, visual effects, music and atmosphere in the movie are amazing. paul's evolution is perfectly demonstrated by timothee chalamet.the visual effects deliver a breathtaking rendering of arrakis' desert landscapes, giant worms, and battle scenes. you can smell the scent of a spice behind curtain.music adds soul to the film with the mastery of hans zimmer. the atmosphere is to keep tension and emotion at a high level throughout the film.while the film's script remains faithful to the book, it also offers some differences. while some changes make the film better, others do not fit the spirit of the book. however, i can honestly say that the changes made do not harm the film at all.dune: part two is one of the masterpieces of science fiction cinema. it's the best movie i've seen in nearly 15 years. maybe oppenheimer.nah, this movie is better."</li><li>"best movie ever what a movie!!!!!!!!!!the whole movie is so good! the setting is amazing. the story is so good. and the plot and the premise is amazing my favorite characters are paul and chani. this whole movie is seeking revenge from the darkness and the evil that was causing the world to collapse. and following that they decide just like on star wars to choose what side they're on the light side or the dark side. and man the cliffhanger of the movie was the most shocking thing ever. the whole movie is a very well done movie. i love this movie so much it gave me chills. this movie will get the best picture next year for the oscars. 0 out of 0 found this helpful. was this review helpful? sign in to vote. permalink"</li><li>"very spicey! undoubtedly one of the greatest cinematic experiences i've ever had. most movies nowadays try to wow you with either visuals, action, character development, and seemingly less and less so everyday with a reasonable plot. this movie manages to check off every box. it is hard to pinpoint an actor in it that wasn't superb in their role. the visual effects, especially in imax , are stunning and are not simply artistic in nature, but actually work to advance the plot.the film is pieced together so brilliantly with each act seamlessly fitting in with the last. the score by zimmer is unsurprisingly absolute perfection as well.for the length of the movie it actually feels quite short which is a huge achievement and shows how immersed you become in villneuve's world.however, i would have liked to see more of feyd rautha. they had an epic scene of his in the harkonnen arena, but not too much else to develop him for his final battle against paul. i also would have liked to see the sardaukar put up a little better of a fight. they somehow went from the empires greatest fighters to pretty pathetic. perhaps this was to show how strong paul and his forces had become, but it just seemed a bit too easy. still these are just personal aspects i would have liked to see and are not issues in the movie itself.overall, an epic movie, and the trilogy is likely to be sitting right alongside the lord of the rings and the original star wars movies when all is said and done.this is a can't miss in imax."</li></ul> |
158
+
159
+ ## Evaluation
160
+
161
+ ### Metrics
162
+ | Label | Accuracy | Precision | Recall | F1 |
163
+ |:--------|:---------|:----------|:-------|:-------|
164
+ | **all** | 0.8247 | 0.9916 | 0.8188 | 0.8969 |
165
+
166
+ ## Uses
167
+
168
+ ### Direct Use for Inference
169
+
170
+ First install the SetFit library:
171
+
172
+ ```bash
173
+ pip install setfit
174
+ ```
175
+
176
+ Then you can load this model and run inference.
177
+
178
+ ```python
179
+ from setfit import SetFitModel
180
+
181
+ # Download from the 🤗 Hub
182
+ model = SetFitModel.from_pretrained("carlesoctav/SentimentClassifierDune-8shot")
183
+ # Run inference
184
+ preds = model("great movie with holes in the plot dune 2 is a great movie that can be enjoyed despite some holes in the plot and questionable story elements. the movie is great visually and serves as an example of how cgi doesn't have to make a movie seem sterile and fake, or like watching a video game like the new brand of marvel movies and aquaman do.the cast is great - christopher walken should have had more scenes and other than it being him, his performance was not unique in any way - and there is a non credited appearance that i'm not going to spoil that is a wow!the movie is long, but there's a difference between long and too long, and any movie that wants to be an epic. 2 out of 5 found this helpful. was this review helpful? sign in to vote. permalink")
185
+ ```
186
+
187
+ <!--
188
+ ### Downstream Use
189
+
190
+ *List how someone could finetune this model on their own dataset.*
191
+ -->
192
+
193
+ <!--
194
+ ### Out-of-Scope Use
195
+
196
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
197
+ -->
198
+
199
+ <!--
200
+ ## Bias, Risks and Limitations
201
+
202
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
203
+ -->
204
+
205
+ <!--
206
+ ### Recommendations
207
+
208
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
209
+ -->
210
+
211
+ ## Training Details
212
+
213
+ ### Training Set Metrics
214
+ | Training set | Min | Median | Max |
215
+ |:-------------|:----|:-------|:----|
216
+ | Word count | 121 | 271.5 | 993 |
217
+
218
+ | Label | Training Sample Count |
219
+ |:---------|:----------------------|
220
+ | negative | 8 |
221
+ | positive | 8 |
222
+
223
+ ### Training Hyperparameters
224
+ - batch_size: (16, 16)
225
+ - num_epochs: (1, 1)
226
+ - max_steps: -1
227
+ - sampling_strategy: oversampling
228
+ - body_learning_rate: (2e-05, 1e-05)
229
+ - head_learning_rate: 0.01
230
+ - loss: CosineSimilarityLoss
231
+ - distance_metric: cosine_distance
232
+ - margin: 0.25
233
+ - end_to_end: False
234
+ - use_amp: False
235
+ - warmup_proportion: 0.1
236
+ - seed: 42
237
+ - eval_max_steps: -1
238
+ - load_best_model_at_end: True
239
+
240
+ ### Training Results
241
+ | Epoch | Step | Training Loss | Validation Loss |
242
+ |:-------:|:-----:|:-------------:|:---------------:|
243
+ | 0.1111 | 1 | 0.2058 | - |
244
+ | **1.0** | **9** | **-** | **0.2368** |
245
+
246
+ * The bold row denotes the saved checkpoint.
247
+ ### Framework Versions
248
+ - Python: 3.10.11
249
+ - SetFit: 1.0.3
250
+ - Sentence Transformers: 2.5.1
251
+ - Transformers: 4.38.2
252
+ - PyTorch: 2.0.1
253
+ - Datasets: 2.18.0
254
+ - Tokenizers: 0.15.2
255
+
256
+ ## Citation
257
+
258
+ ### BibTeX
259
+ ```bibtex
260
+ @article{https://doi.org/10.48550/arxiv.2209.11055,
261
+ doi = {10.48550/ARXIV.2209.11055},
262
+ url = {https://arxiv.org/abs/2209.11055},
263
+ author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
264
+ keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
265
+ title = {Efficient Few-Shot Learning Without Prompts},
266
+ publisher = {arXiv},
267
+ year = {2022},
268
+ copyright = {Creative Commons Attribution 4.0 International}
269
+ }
270
+ ```
271
+
272
+ <!--
273
+ ## Glossary
274
+
275
+ *Clearly define terms in order to be accessible across audiences.*
276
+ -->
277
+
278
+ <!--
279
+ ## Model Card Authors
280
+
281
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
282
+ -->
283
+
284
+ <!--
285
+ ## Model Card Contact
286
+
287
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
288
+ -->
config.json ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "exp/dune2-sent-classfier-8-shot/step_9",
3
+ "architectures": [
4
+ "MPNetModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "bos_token_id": 0,
8
+ "eos_token_id": 2,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 768,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 3072,
14
+ "layer_norm_eps": 1e-05,
15
+ "max_position_embeddings": 514,
16
+ "model_type": "mpnet",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 12,
19
+ "pad_token_id": 1,
20
+ "relative_attention_num_buckets": 32,
21
+ "torch_dtype": "float32",
22
+ "transformers_version": "4.38.2",
23
+ "vocab_size": 30527
24
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "2.0.0",
4
+ "transformers": "4.7.0",
5
+ "pytorch": "1.9.0+cu102"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null
9
+ }
config_setfit.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "normalize_embeddings": false,
3
+ "labels": [
4
+ "negative",
5
+ "positive"
6
+ ]
7
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3b23d44a901b90d56e7f1c2f67e3d2082dede6e4c7fe8b4c42e080b978b65b80
3
+ size 437967672
model_head.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4ab948312a0da79000ae118a0f9ef0933b96a21d5412aa2814d43f7688bb0700
3
+ size 6997
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "[UNK]",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "104": {
28
+ "content": "[UNK]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "30526": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "cls_token": "<s>",
47
+ "do_basic_tokenize": true,
48
+ "do_lower_case": true,
49
+ "eos_token": "</s>",
50
+ "mask_token": "<mask>",
51
+ "max_length": 512,
52
+ "model_max_length": 512,
53
+ "never_split": null,
54
+ "pad_to_multiple_of": null,
55
+ "pad_token": "<pad>",
56
+ "pad_token_type_id": 0,
57
+ "padding_side": "right",
58
+ "sep_token": "</s>",
59
+ "stride": 0,
60
+ "strip_accents": null,
61
+ "tokenize_chinese_chars": true,
62
+ "tokenizer_class": "MPNetTokenizer",
63
+ "truncation_side": "right",
64
+ "truncation_strategy": "longest_first",
65
+ "unk_token": "[UNK]"
66
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff