Corran committed on
Commit
eebdeb4
1 Parent(s): 57e95ce

Add SetFit model

1_Pooling/config.json CHANGED
@@ -1,5 +1,5 @@
 {
-  "word_embedding_dimension": 384,
+  "word_embedding_dimension": 768,
   "pooling_mode_cls_token": false,
   "pooling_mode_mean_tokens": true,
   "pooling_mode_max_tokens": false,
README.md CHANGED
@@ -8,25 +8,26 @@ tags:
 metrics:
 - accuracy
 widget:
-- text: This paper focuses on mining association rules between sets of items in large
-  databases, which can reveal interesting patterns and relationships among the data.
-- text: In this paper, the authors explore the economic concepts of fairness and retaliation
-  within the context of reciprocity, demonstrating how these principles shape market
-  behaviors and interactions.
-- text: Further research is needed to explore the applicability of the proposed model
-  to more complex multi-echelon inventory systems with additional features, such
-  as lead time variability and supplier reliability.
-- text: The NCEP/NCAR 40-Year Reanalysis Project provides retrospective atmospheric
-  data sets by assimilating observational data into a model, resulting in improved
-  estimates of historical weather patterns for meteorological research and applications.
-- text: This study aims to assess the accuracy of aerosol optical properties retrieved
-  from Aerosol Robotic Network (AERONET) Sun and sky radiance measurements using
-  ground-based reference data.
+- text: Further research is needed to develop more effective methods for the detection
+  and inhibition of ESBLs in clinical settings.
+- text: Although the phosphomolybdenum method presents high accuracy and precision
+  for vitamin E quantitation, its applicability to other antioxidants may require
+  further investigation.
+- text: The persistent inflammation observed in Interleukin-10-deficient mice provides
+  insight into the role of this cytokine in maintaining intestinal homeostasis and
+  highlights the potential implications for human diseases, such as inflammatory
+  bowel syndrome.
+- text: The proposed algorithms in this paper utilize Hamilton-Jacobi formulations
+  to calculate the front propagation speed, which depends on the curvature of the
+  front.
+- text: The IC50 values obtained from the semiautomated microdilution assay suggest
+  that artesunate and dihydroartemisinin exhibit comparable antimalarial activity
+  against the Plasmodium falciparum strains tested.
 pipeline_tag: text-classification
 inference: true
-base_model: sentence-transformers/paraphrase-MiniLM-L3-v2
+base_model: kaisugi/scitoricsbert
 model-index:
-- name: SetFit with sentence-transformers/paraphrase-MiniLM-L3-v2
+- name: SetFit with kaisugi/scitoricsbert
   results:
   - task:
       type: text-classification
@@ -37,13 +38,13 @@ model-index:
       split: test
     metrics:
     - type: accuracy
-      value: 0.7407692307692307
+      value: 0.8833333333333333
       name: Accuracy
 ---
 
-# SetFit with sentence-transformers/paraphrase-MiniLM-L3-v2
+# SetFit with kaisugi/scitoricsbert
 
-This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Text Classification. This SetFit model uses [sentence-transformers/paraphrase-MiniLM-L3-v2](https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L3-v2) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
+This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Text Classification. This SetFit model uses [kaisugi/scitoricsbert](https://huggingface.co/kaisugi/scitoricsbert) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification.
 
 The model has been trained using an efficient few-shot learning technique that involves:
 
@@ -54,10 +55,10 @@ The model has been trained using an efficient few-shot learning technique that i
 
 ### Model Description
 - **Model Type:** SetFit
-- **Sentence Transformer body:** [sentence-transformers/paraphrase-MiniLM-L3-v2](https://huggingface.co/sentence-transformers/paraphrase-MiniLM-L3-v2)
+- **Sentence Transformer body:** [kaisugi/scitoricsbert](https://huggingface.co/kaisugi/scitoricsbert)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
-- **Maximum Sequence Length:** 128 tokens
-- **Number of Classes:** 13 classes
+- **Maximum Sequence Length:** 512 tokens
+- **Number of Classes:** 12 classes
 <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
 <!-- - **Language:** Unknown -->
 <!-- - **License:** Unknown -->
@@ -76,10 +77,9 @@ The model has been trained using an efficient few-shot learning technique that i
 | Hypothesis | <ul><li>'Despite having average cholesterol levels, patients who received Pravastatin experienced a significant reduction in coronary events, suggesting a potential role for statins in preventing cardiovascular events beyond cholesterol level management in internal medicine.'</li><li>'This prospective observational study aimed to investigate the association between glycaemia levels and the risk of developing macrovascular and microvascular complications in individuals with type 2 diabetes, as previously identified in the UKPDS 35 study.'</li><li>'The results suggest that self-regulatory skills, particularly in the area of attention, significantly impact academic performance in elementary school students.'</li></ul> |
 | Implications | <ul><li>'From 1995 to 1998, the UK Prospective Diabetes Study (UKPDS) 35 observed a significant association between higher glycaemia levels and increased risk of both macrovascular and microvascular complications in patients with type 2 diabetes.'</li><li>'The UKPDS 35 study provides robust evidence that every 1 mmol/L increase in HbA1c is associated with a 25% increased risk of macrovascular events and a 37% increased risk of microvascular complications in patients with type 2 diabetes, highlighting the importance of strict glycaemic control in internal medicine.'</li><li>"This study provides valuable insights into the early dynamics of the COVID-19 outbreak in Italy, contributing to the understanding of the disease's transmission patterns and impact on public health."</li></ul> |
 | Importance | <ul><li>'Stroke and transient ischemic attack (TIA) are leading causes of long-term disability and mortality in internal medicine, with an estimated 15 million survivors worldwide.'</li><li>'The accurate assessment of insulin resistance and beta-cell function is crucial in the diagnosis and management of various metabolic disorders, including type 2 diabetes and metabolic syndrome.'</li><li>'The COVID-19 outbreak in Italy, which began in late February 2020, quickly became one of the most severe epidemic hotspots in Europe.'</li></ul> |
-| Keywords | <ul><li>'Pravastatin is a statin drug commonly used in the treatment of hypercholesterolemia, specifically to lower low-density lipoprotein (LDL) cholesterol levels and reduce the risk of cardiovascular events in internal medicine.'</li><li>'Self-regulation refers to the ability of students to manage their emotions, behavior, and cognitive processes to achieve optimal learning (Zimmerman & Kitsantas, 2005).'</li><li>'The proposed method utilizes deep convolutional neural networks to extract rich features from input images, enabling both object detection and semantic segmentation with high accuracy in the field of artificial intelligence.'</li></ul> |
 | Limitations | <ul><li>'However, it is important to note that the Homeostasis Model Assessment (HOMA) index does not directly measure insulin sensitivity or β-cell function, but rather provides an estimate based on fasting plasma glucose and insulin concentrations.'</li><li>'Despite providing a useful estimate of insulin resistance and beta-cell function, the Homeostasis Model Assessment has limitations in its applicability to individuals with extreme glucose or insulin levels, as well as those with certain diseases such as liver disease or pregnancy.'</li><li>'Despite the large sample size and long follow-up period, the observational nature of the study limits the ability to establish causality between glycaemia and the observed complications in type 2 diabetes.'</li></ul> |
 | Method | <ul><li>'The study employed a randomized, double-blind, placebo-controlled design to investigate the effect of Pravastatin on coronary events in patients with average cholesterol levels.'</li><li>'Patients with a history of myocardial infarction and an average cholesterol level between 180 and 240 mg/dL were included in the study.'</li><li>'The study aimed to assess the impact of Pravastatin administration on the incidence of coronary events in internal medicine patients with average cholesterol levels.'</li></ul> |
-| None | <ul><li>'The study enrolled patients with a recent myocardial infarction and an average cholesterol level, who were then randomly assigned to receive either pravastatin or placebo.'</li><li>'This systematic review and meta-analysis aimed to assess the efficacy and safety of dual antiplatelet therapy with aspirin and clopidogrel in the secondary prevention of stroke and transient ischemic attack in the field of internal medicine.'</li><li>'This study aims to evaluate the effectiveness of the Homeostasis Model Assessment (HOMA) in estimating insulin resistance and pancreatic beta-cell function in internal medicine, offering valuable insights for the diagnosis and management of metabolic disorders.'</li></ul> |
+| None | <ul><li>'Pravastatin is a statin drug commonly used in the treatment of hypercholesterolemia, specifically to lower low-density lipoprotein (LDL) cholesterol levels and reduce the risk of cardiovascular events in internal medicine.'</li><li>'The study enrolled patients with a recent myocardial infarction and an average cholesterol level, who were then randomly assigned to receive either pravastatin or placebo.'</li><li>'This systematic review and meta-analysis aimed to assess the efficacy and safety of dual antiplatelet therapy with aspirin and clopidogrel in the secondary prevention of stroke and transient ischemic attack in the field of internal medicine.'</li></ul> |
 | Purpose | <ul><li>'This study investigates the impact of Pravastatin on reducing coronary events in internal medicine patients with average cholesterol levels after a myocardial infarction.'</li><li>'This systematic review and meta-analysis aimed to assess the efficacy and safety of dual antiplatelet therapy with aspirin and clopidogrel in the secondary prevention of stroke and transient ischemic attack in internal medicine.'</li><li>'This study aims to evaluate the effectiveness of the Homeostasis Model Assessment (HOMA) in estimating insulin resistance and beta-cell function in internal medicine patients, addressing the need for a simple and widely applicable method for diagnosing and monitoring these conditions.'</li></ul> |
 | Reccomendations | <ul><li>'Further studies are needed to investigate the optimal duration of dual antiplatelet therapy in secondary prevention of stroke and transient ischemic attack, as well as the role of individual patient characteristics in determining the most effective treatment regimen.'</li><li>'Further research is warranted to explore the underlying mechanisms linking glycaemia to macrovascular and microvascular complications in type 2 diabetes, particularly in multi-ethnic populations.'</li><li>'Further studies are needed to investigate the potential role of IL-6 signaling in the prevention of bone loss in postmenopausal women.'</li></ul> |
 | Result | <ul><li>'Despite having average cholesterol levels, patients treated with Pravastatin did not experience a significant reduction in coronary events compared to the placebo group.'</li><li>'In interviews with patients who experienced a reduction in coronary events after Pravastatin treatment, themes included improved energy levels and increased confidence in managing their heart health.'</li><li>'The study found that Pravastatin significantly reduced the risk of coronary events in patients with average cholesterol levels, consistent with previous research suggesting that statins benefit a wider population beyond those with hypercholesterolemia.'</li></ul> |
@@ -90,7 +90,7 @@ The model has been trained using an efficient few-shot learning technique that i
 ### Metrics
 | Label | Accuracy |
 |:--------|:---------|
-| **all** | 0.7408 |
+| **all** | 0.8833 |
 
 ## Uses
 
@@ -110,7 +110,7 @@ from setfit import SetFitModel
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("Corran/SciGenSetfit2")
 # Run inference
-preds = model("This paper focuses on mining association rules between sets of items in large databases, which can reveal interesting patterns and relationships among the data.")
+preds = model("Further research is needed to develop more effective methods for the detection and inhibition of ESBLs in clinical settings.")
 ```
 
 <!--
@@ -142,30 +142,29 @@ preds = model("This paper focuses on mining association rules between sets of it
 ### Training Set Metrics
 | Training set | Min | Median | Max |
 |:-------------|:----|:--------|:----|
-| Word count | 11 | 28.3123 | 71 |
+| Word count | 11 | 28.3767 | 60 |
 
 | Label | Training Sample Count |
 |:----------------|:----------------------|
-| Aims | 200 |
-| Background | 200 |
-| Hypothesis | 200 |
-| Implications | 200 |
-| Importance | 200 |
-| Keywords | 200 |
-| Limitations | 200 |
-| Method | 200 |
-| None | 200 |
-| Purpose | 200 |
-| Reccomendations | 200 |
-| Result | 200 |
-| Uncertainty | 200 |
+| Aims | 100 |
+| Background | 100 |
+| Hypothesis | 100 |
+| Implications | 100 |
+| Importance | 100 |
+| Limitations | 100 |
+| Method | 100 |
+| None | 100 |
+| Purpose | 100 |
+| Reccomendations | 100 |
+| Result | 100 |
+| Uncertainty | 100 |
 
 ### Training Hyperparameters
 - batch_size: (256, 256)
 - num_epochs: (1, 1)
 - max_steps: -1
 - sampling_strategy: oversampling
-- num_iterations: 40
+- num_iterations: 20
 - body_learning_rate: (2e-05, 1e-05)
 - head_learning_rate: 0.01
 - loss: CosineSimilarityLoss
@@ -181,23 +180,10 @@ preds = model("This paper focuses on mining association rules between sets of it
 ### Training Results
 | Epoch | Step | Training Loss | Validation Loss |
 |:------:|:----:|:-------------:|:---------------:|
-| 0.0012 | 1 | 0.4201 | - |
-| 0.0615 | 50 | 0.2562 | - |
-| 0.1230 | 100 | 0.2334 | - |
-| 0.1845 | 150 | 0.1974 | - |
-| 0.2460 | 200 | 0.195 | - |
-| 0.3075 | 250 | 0.1768 | - |
-| 0.3690 | 300 | 0.146 | - |
-| 0.4305 | 350 | 0.1541 | - |
-| 0.4920 | 400 | 0.1647 | - |
-| 0.5535 | 450 | 0.154 | - |
-| 0.6150 | 500 | 0.1568 | - |
-| 0.6765 | 550 | 0.1494 | - |
-| 0.7380 | 600 | 0.1554 | - |
-| 0.7995 | 650 | 0.1456 | - |
-| 0.8610 | 700 | 0.1527 | - |
-| 0.9225 | 750 | 0.1488 | - |
-| 0.9840 | 800 | 0.1312 | - |
+| 0.0053 | 1 | 0.2248 | - |
+| 0.2660 | 50 | 0.1239 | - |
+| 0.5319 | 100 | 0.1105 | - |
+| 0.7979 | 150 | 0.0665 | - |
 
 ### Framework Versions
 - Python: 3.10.12
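The Epoch column in the Training Results tables can be cross-checked against the sample counts and hyperparameters in the same README. The sketch below assumes that each training sample contributes `2 * num_iterations` sentence pairs (an assumption about how SetFit builds its contrastive pairs, not something this commit states); under that assumption the computed epoch fractions match the values reported before and after the change. The helper names `steps_per_epoch` and `epoch_at` are hypothetical.

```python
import math


def steps_per_epoch(num_samples: int, num_iterations: int, batch_size: int) -> int:
    """Estimate optimizer steps in one epoch of contrastive training.

    Assumption: every sample yields 2 * num_iterations sentence pairs.
    """
    num_pairs = 2 * num_iterations * num_samples
    return math.ceil(num_pairs / batch_size)


def epoch_at(step: int, total_steps: int) -> float:
    """Fraction of the epoch completed at a given step, rounded like the README."""
    return round(step / total_steps, 4)


# Before: 13 labels x 200 samples, num_iterations=40, batch_size=256
old_steps = steps_per_epoch(13 * 200, 40, 256)
# After: 12 labels x 100 samples, num_iterations=20, batch_size=256
new_steps = steps_per_epoch(12 * 100, 20, 256)

print(old_steps, epoch_at(50, old_steps), epoch_at(800, old_steps))
print(new_steps, epoch_at(50, new_steps), epoch_at(150, new_steps))
```

Halving both the per-label sample count and `num_iterations`, and dropping one label, shrinks the pair pool to roughly a quarter of its previous size, which is why the new log ends at step 150 instead of step 800.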
config.json CHANGED
@@ -1,26 +1,25 @@
 {
-  "_name_or_path": "/root/.cache/torch/sentence_transformers/sentence-transformers_paraphrase-MiniLM-L3-v2/",
+  "_name_or_path": "/root/.cache/torch/sentence_transformers/kaisugi_scitoricsbert",
   "architectures": [
     "BertModel"
   ],
   "attention_probs_dropout_prob": 0.1,
   "classifier_dropout": null,
-  "gradient_checkpointing": false,
   "hidden_act": "gelu",
   "hidden_dropout_prob": 0.1,
-  "hidden_size": 384,
+  "hidden_size": 768,
   "initializer_range": 0.02,
-  "intermediate_size": 1536,
+  "intermediate_size": 3072,
   "layer_norm_eps": 1e-12,
   "max_position_embeddings": 512,
   "model_type": "bert",
   "num_attention_heads": 12,
-  "num_hidden_layers": 3,
+  "num_hidden_layers": 12,
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
   "transformers_version": "4.36.2",
   "type_vocab_size": 2,
   "use_cache": true,
-  "vocab_size": 30522
+  "vocab_size": 31090
 }
config_sentence_transformers.json CHANGED
@@ -1,7 +1,7 @@
 {
   "__version__": {
-    "sentence_transformers": "2.0.0",
-    "transformers": "4.7.0",
-    "pytorch": "1.9.0+cu102"
+    "sentence_transformers": "2.2.2",
+    "transformers": "4.36.2",
+    "pytorch": "2.1.0+cu121"
   }
 }
config_setfit.json CHANGED
@@ -6,7 +6,6 @@
     "Hypothesis",
     "Implications",
     "Importance",
-    "Keywords",
     "Limitations",
     "Method",
     "None",
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:782421e8a8f86650f5c4c24184bb8cde66eb095e4f2bce737ad3508d1c844bd8
-size 69565312
+oid sha256:8e7b341f8e3b8fedb613b442497d31738e71ba61a696c7f6c6afb4d5f9356ed5
+size 439696224
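The jump in `model.safetensors` from 69565312 to 439696224 bytes is consistent with the `config.json` changes. As a rough sanity check, the sketch below counts the parameters of a standard BERT encoder (embeddings, encoder layers, pooler) from the config values and multiplies by 4 bytes for `float32`. It assumes the checkpoint stores exactly those tensors and nothing else apart from the file header; `bert_param_count` is a hypothetical helper, not part of any library.

```python
def bert_param_count(vocab_size: int, hidden_size: int, intermediate_size: int,
                     num_layers: int, max_positions: int = 512,
                     type_vocab_size: int = 2) -> int:
    """Count the weights of a standard BERT encoder with a pooler."""
    h, i = hidden_size, intermediate_size
    # Word, position and token-type embeddings plus one LayerNorm (weight + bias)
    embeddings = (vocab_size + max_positions + type_vocab_size) * h + 2 * h
    # Q, K, V and output projections (weights + biases) plus LayerNorm
    attention = 4 * (h * h + h) + 2 * h
    # Two dense layers (weights + biases) plus LayerNorm
    feed_forward = (h * i + i) + (i * h + h) + 2 * h
    # Pooler dense layer
    pooler = h * h + h
    return embeddings + num_layers * (attention + feed_forward) + pooler


old_params = bert_param_count(30522, 384, 1536, 3)
new_params = bert_param_count(31090, 768, 3072, 12)

# Compare the float32 weight bytes with the LFS pointer sizes from the diff
for name, params, file_size in [("old", old_params, 69565312),
                                ("new", new_params, 439696224)]:
    weight_bytes = params * 4
    print(name, params, weight_bytes, file_size - weight_bytes)
```

In both cases the estimate lands a few kilobytes below the reported file size, a gap small enough to be plausibly explained by the safetensors header and metadata.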
model_head.pkl CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6790c7fffe6c2ab476607806d7b8ab06f8b147b2dce5a6a6eba84ea624ba05b8
-size 41647
+oid sha256:7e64694fc9f6f4206f0376da8dbcc8dafcf28de90cb1968623575fa434fb57f6
+size 75367
sentence_bert_config.json CHANGED
@@ -1,4 +1,4 @@
 {
-  "max_seq_length": 128,
+  "max_seq_length": 512,
   "do_lower_case": false
 }
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json CHANGED
@@ -8,7 +8,7 @@
       "single_word": false,
       "special": true
     },
-    "100": {
+    "101": {
       "content": "[UNK]",
       "lstrip": false,
       "normalized": false,
@@ -16,7 +16,7 @@
       "single_word": false,
       "special": true
     },
-    "101": {
+    "102": {
       "content": "[CLS]",
       "lstrip": false,
       "normalized": false,
@@ -24,7 +24,7 @@
       "single_word": false,
       "special": true
     },
-    "102": {
+    "103": {
       "content": "[SEP]",
       "lstrip": false,
       "normalized": false,
@@ -32,7 +32,7 @@
       "single_word": false,
       "special": true
     },
-    "103": {
+    "104": {
       "content": "[MASK]",
       "lstrip": false,
       "normalized": false,
@@ -47,12 +47,9 @@
   "do_lower_case": true,
   "mask_token": "[MASK]",
   "max_length": 128,
-  "model_max_length": 512,
+  "model_max_length": 1000000000000000019884624838656,
   "never_split": null,
-  "pad_to_multiple_of": null,
   "pad_token": "[PAD]",
-  "pad_token_type_id": 0,
-  "padding_side": "right",
   "sep_token": "[SEP]",
   "stride": 0,
   "strip_accents": null,
vocab.txt CHANGED
The diff for this file is too large to render. See raw diff
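One detail in the `tokenizer_config.json` diff is worth a note: the new `model_max_length` of 1000000000000000019884624838656 is not a real limit. It equals `int(1e30)`, which, to my knowledge, is the placeholder `transformers` writes when a tokenizer declares no maximum length. The sketch below shows the equality and how a caller might derive a usable limit from the other values in this commit (`max_position_embeddings` in `config.json`, `max_seq_length` in `sentence_bert_config.json`). `effective_max_length` is a hypothetical helper for illustration only.

```python
# Value written to tokenizer_config.json in this commit
MODEL_MAX_LENGTH = 1000000000000000019884624838656

# The same number is what Python produces for int(1e30)
SENTINEL = int(1e30)


def effective_max_length(model_max_length: int,
                         max_position_embeddings: int,
                         max_seq_length: int) -> int:
    """Pick the tightest of the three limits so inputs never exceed the model."""
    return min(model_max_length, max_position_embeddings, max_seq_length)


# max_position_embeddings=512 and max_seq_length=512 come from the diffs above
limit = effective_max_length(MODEL_MAX_LENGTH, 512, 512)
print(MODEL_MAX_LENGTH == SENTINEL, limit)
```

Because `sentence_bert_config.json` now sets `max_seq_length` to 512, inputs encoded through Sentence Transformers should still be truncated to the model's positional limit, but code that calls the tokenizer directly may want an explicit `max_length`.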