Add SetFit model

Browse files

Files changed (5) hide show

README.md +43 -40
config.json +1 -1
model.safetensors +1 -1
model_head.pkl +1 -1
sentencepiece.bpe.model +3 -0

README.md CHANGED Viewed

@@ -9,16 +9,16 @@ base_model: BAAI/bge-m3
 metrics:
 - accuracy
 widget:
-- text: What is the primary difference between a Bayesian neural network and a traditional
-    feedforward neural network in the context of machine learning?
-- text: What is the difference betweensupervised and unsupervised machine learning
-    algorithms in terms of data labeling and model training?
-- text: What is the primary application of Natural Language Processing (NLP) in Google's
-    BERT language model, and how does it utilize masked language modeling to improve
-    contextual understanding?
-- text: What is the main advantage of using GraphQL over traditional RESTful APIs,
-    as demonstrated by social media giant Facebook in their Facebook ADS API?
-- text: Qui est Robin Mancini ?
 pipeline_tag: text-classification
 inference: true
 model-index:
@@ -65,10 +65,10 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label    | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
-|:---------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| lexical  | <ul><li>'What is the definition of semantics in the context ofontology-based data integration, and how does it differ from outright data normalization, as implementented in graph databases like neo4j orAmazon Neptune?'</li><li>'What is the primary application of graph convolutional neural networks (GCNNs) in natural language processing (NLP) for modeling syntactic dependencies in parsing?'</li><li>"What is the distinguising feature of Apache Hive's Metadata Tables, used for maintaining and managingtables in Hadoop Distributed File System (HDFS)?"</li></ul> |
-| semantic | <ul><li>'What is a key challenge faced by managers in sustaining a work culture that encourages creativity, innovation, and critical thinking within the technological industry globally?'</li><li>'How might shifting societal values influence the dynamics between multinational corporations and governments, leading to Changes in the global economic landscape?'</li><li>'How does the allocation of limited resources affect the allocation of decision-making power within an organization?'</li></ul>                                                                    |
 ## Evaluation
@@ -95,7 +95,7 @@ from setfit import SetFitModel
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("yaniseuranova/setfit-paraphrase-mpnet-base-v2-sst2")
 # Run inference
-preds = model("Qui est Robin Mancini ?")
 ```
 <!--
@@ -127,12 +127,12 @@ preds = model("Qui est Robin Mancini ?")
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
-| Word count   | 4   | 19.1392 | 56  |
 | Label    | Training Sample Count |
 |:---------|:----------------------|
-| lexical  | 36                    |
-| semantic | 43                    |
 ### Training Hyperparameters
 - batch_size: (16, 16)
@@ -154,27 +154,30 @@ preds = model("Qui est Robin Mancini ?")
 ### Training Results
 | Epoch   | Step    | Training Loss | Validation Loss |
 |:-------:|:-------:|:-------------:|:---------------:|
-| 0.0050  | 1       | 0.1549        | -               |
-| 0.2475  | 50      | 0.0045        | -               |
-| 0.4950  | 100     | 0.0009        | -               |
-| 0.7426  | 150     | 0.0005        | -               |
-| 0.9901  | 200     | 0.0005        | -               |
-| 1.0     | 202     | -             | 0.0001          |
-| 1.2376  | 250     | 0.0006        | -               |
-| 1.4851  | 300     | 0.0006        | -               |
-| 1.7327  | 350     | 0.0005        | -               |
-| 1.9802  | 400     | 0.0004        | -               |
-| 2.0     | 404     | -             | 0.0             |
-| 2.2277  | 450     | 0.0003        | -               |
-| 2.4752  | 500     | 0.0003        | -               |
-| 2.7228  | 550     | 0.0003        | -               |
-| 2.9703  | 600     | 0.0003        | -               |
-| **3.0** | **606** | **-**         | **0.0**         |
-| 3.2178  | 650     | 0.0003        | -               |
-| 3.4653  | 700     | 0.0004        | -               |
-| 3.7129  | 750     | 0.0003        | -               |
-| 3.9604  | 800     | 0.0002        | -               |
-| 4.0     | 808     | -             | 0.0             |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions
@@ -182,7 +185,7 @@ preds = model("Qui est Robin Mancini ?")
 - SetFit: 1.0.3
 - Sentence Transformers: 2.6.1
 - Transformers: 4.39.0
-- PyTorch: 2.3.0+cu121
 - Datasets: 2.18.0
 - Tokenizers: 0.15.2

 metrics:
 - accuracy
 widget:
+- text: How doCompaniesbalanceIndividualCreativitywithTeamCollaboration to driveInnovationinthe
+    WORKPlace?
+- text: How do the values of a learning organization impact its ability to innovate
+    and respond to constant change?
+- text: What is the primary function of the Domain Name System (DNS) layer in the
+    Internet Protocol Stack, as defined by ICANN?
+- text: What distinguishes a transforming industry from one that merely innovates
+    to existing practices?
+- text: How can artificial intelligence systems balance individual autonomy with collective
+    responsibility in decision-making processes?
 pipeline_tag: text-classification
 inference: true
 model-index:
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label    | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                             |
+|:---------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| lexical  | <ul><li>'What is the primary function of the Apache Kafka distributed streaming platform in Big Data processing?'</li><li>"What is the primary difference between Hadoop's FileSystem-based architecture and Apache Cassandra's distributed, masterlessArchitecture in scale-out design?"</li><li>'What is the main difference between optimistic concurrency control and pessimistic concurrency control in database management systems?'</li></ul> |
+| semantic | <ul><li>"How does organizational morale impact the competitiveness of a company in today's fast-paced market?"</li><li>'How do organizations balance individual creativity with collective goal achievement in a dynamic environment?'</li><li>'What is a key challenge faced by managers in sustaining a work culture that encourages creativity, innovation, and critical thinking within the technological industry globally?'</li></ul>          |
 ## Evaluation
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("yaniseuranova/setfit-paraphrase-mpnet-base-v2-sst2")
 # Run inference
+preds = model("What distinguishes a transforming industry from one that merely innovates to existing practices?")
 ```
 <!--
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
+| Word count   | 4   | 19.1839 | 42  |
 | Label    | Training Sample Count |
 |:---------|:----------------------|
+| lexical  | 43                    |
+| semantic | 44                    |
 ### Training Hyperparameters
 - batch_size: (16, 16)
 ### Training Results
 | Epoch   | Step    | Training Loss | Validation Loss |
 |:-------:|:-------:|:-------------:|:---------------:|
+| 0.0041  | 1       | 0.2391        | -               |
+| 0.2066  | 50      | 0.0033        | -               |
+| 0.4132  | 100     | 0.0007        | -               |
+| 0.6198  | 150     | 0.0007        | -               |
+| 0.8264  | 200     | 0.0007        | -               |
+| **1.0** | **242** | **-**         | **0.0001**      |
+| 1.0331  | 250     | 0.0005        | -               |
+| 1.2397  | 300     | 0.0004        | -               |
+| 1.4463  | 350     | 0.0004        | -               |
+| 1.6529  | 400     | 0.0003        | -               |
+| 1.8595  | 450     | 0.0004        | -               |
+| 2.0     | 484     | -             | 0.0001          |
+| 2.0661  | 500     | 0.0003        | -               |
+| 2.2727  | 550     | 0.0003        | -               |
+| 2.4793  | 600     | 0.0002        | -               |
+| 2.6860  | 650     | 0.0003        | -               |
+| 2.8926  | 700     | 0.0002        | -               |
+| 3.0     | 726     | -             | 0.0001          |
+| 3.0992  | 750     | 0.0003        | -               |
+| 3.3058  | 800     | 0.0002        | -               |
+| 3.5124  | 850     | 0.0002        | -               |
+| 3.7190  | 900     | 0.0002        | -               |
+| 3.9256  | 950     | 0.0003        | -               |
+| 4.0     | 968     | -             | 0.0001          |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions
 - SetFit: 1.0.3
 - Sentence Transformers: 2.6.1
 - Transformers: 4.39.0
+- PyTorch: 2.3.1+cu121
 - Datasets: 2.18.0
 - Tokenizers: 0.15.2

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "checkpoints/step_606",
   "architectures": [
     "XLMRobertaModel"
   ],

 {
+  "_name_or_path": "checkpoints/step_242",
   "architectures": [
     "XLMRobertaModel"
   ],

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b6b631443242ee9cb7eaa44b335a5b8a0932d0f7730c1e523a2972f095dd5fe6
 size 2271064456

 version https://git-lfs.github.com/spec/v1
+oid sha256:b1b888990ee5269d1f4c3795f8aeeb46da209d188e03543ea23b7fa884aaf2b5
 size 2271064456

model_head.pkl CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8f26505603a392dfebb8ca914a16aa7e94aeeb8b35f89376e80a56616a8b08a4
 size 9087

 version https://git-lfs.github.com/spec/v1
+oid sha256:96d22b0c74de93b5a70d706bf42366826ce9a80c2d3a555a2fadaed9e3d0c5e3
 size 9087

sentencepiece.bpe.model ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cfc8146abe2a0488e9e2a0c56de7952f7c11ab059eca145a0a727afce0db2865
+size 5069051