SetFit with BAAI/bge-base-en-v1.5

This is a SetFit model that can be used for Text Classification. This SetFit model uses BAAI/bge-base-en-v1.5 as the Sentence Transformer embedding model. A LogisticRegression instance is used for classification.

The model has been trained using an efficient few-shot learning technique that involves:

Fine-tuning a Sentence Transformer with contrastive learning.
Training a classification head with features from the fine-tuned Sentence Transformer.

Model Details

Model Description

Model Type: SetFit
Sentence Transformer body: BAAI/bge-base-en-v1.5
Classification head: a LogisticRegression instance
Maximum Sequence Length: 512 tokens
Number of Classes: 2 classes

Model Sources

Repository: SetFit on GitHub
Paper: Efficient Few-Shot Learning Without Prompts
Blogpost: SetFit: Efficient Few-Shot Learning Without Prompts

Model Labels

Label	Examples
1	'Reasoning:\nThe answer addresses the question about resolving the issue of not discovering pages via sitemaps by following the instructions provided in the document. It accurately lists the relevant steps, including checking the sitemap URL path, resubmitting the URL if it was submitted incorrectly, and using the inspection tool to speed up the process. The answer is directly related to the question and avoids unnecessary information. Moreover, it mentions that the "couldn't fetch" error is known to occur and can resolve itself, fitting the context provided in the document.\n\nEvaluation: Good' 'Reasoning:\nThe answer is mostly well-supported by the document and provides a step-by-step process that is directly related to the question about enabling clients to book multiple participants for a service in Bookings. However, there is a minor but notable error: the instruction to "Enter the John Youngimum number of participants allowed per booking" should correctly say "Enter the maximum number of participants allowed per booking." This typo affects the clarity and correctness of the answer. Additionally, the specific document mentions "Tips" that are omitted in the answer, but this does not significantly detract from the overall utility of the response.\n\nEvaluation: Bad' 'Reasoning:\nThe answer provided is not supported by the provided document, which does not mention any information related to booking services, changing locations, or known issues with service locations. Therefore, the answer lacks context grounding. Additionally, while it states that a known issue has been resolved, it fails to give clear steps or instructions on what to do if an error is encountered.\n\nEvaluation: Bad'
0	"Reasoning:\nThe answer accurately reflects the information provided in the document, which states that you cannot transfer or update the booking app on your site. It also correctly includes the suggestion to vote for the desired feature for future updates, which is mentioned in the provided document. The answer is concise, directly addressing the question without unnecessary information, and provides a correct and detailed response based on the document's content.\n\nEvaluation: Good" 'Reasoning:\nThe answer is directly related to the question and supported by the provided document. It succinctly outlines the steps needed to add a service and sets up service lists on separate pages, aligning with the guidelines in the document. Additionally, it includes instructions on setting up a service page exclusive to site members, which is also supported by the document. The response is clear, detailed, and avoids unnecessary information, providing correct and step-by-step instructions.\n\nEvaluation: Good' 'Reasoning:\nThe provided answer aligns well with the document's content by offering detailed instructions on how to display blog categories using datasets and repeaters or tables. All the steps mentioned (creating datasets, connecting them, adding repeaters or tables, linking datasets, and setting up filters) are grounded in the document. The answer, however, contains placeholder text "95593638" instead of the word "create," which detracts from clarity and correctness. Although minor, the repetition of these placeholders indicates a lack of attention to detail, impacting the conciseness and correctness of the instructions.\n\nEvaluation: Bad'

Label

Examples

'Reasoning:\nThe answer addresses the question about resolving the issue of not discovering pages via sitemaps by following the instructions provided in the document. It accurately lists the relevant steps, including checking the sitemap URL path, resubmitting the URL if it was submitted incorrectly, and using the inspection tool to speed up the process. The answer is directly related to the question and avoids unnecessary information. Moreover, it mentions that the "couldn't fetch" error is known to occur and can resolve itself, fitting the context provided in the document.\n\nEvaluation: Good'
'Reasoning:\nThe answer is mostly well-supported by the document and provides a step-by-step process that is directly related to the question about enabling clients to book multiple participants for a service in Bookings. However, there is a minor but notable error: the instruction to "Enter the John Youngimum number of participants allowed per booking" should correctly say "Enter the maximum number of participants allowed per booking." This typo affects the clarity and correctness of the answer. Additionally, the specific document mentions "Tips" that are omitted in the answer, but this does not significantly detract from the overall utility of the response.\n\nEvaluation: Bad'
'Reasoning:\nThe answer provided is not supported by the provided document, which does not mention any information related to booking services, changing locations, or known issues with service locations. Therefore, the answer lacks context grounding. Additionally, while it states that a known issue has been resolved, it fails to give clear steps or instructions on what to do if an error is encountered.\n\nEvaluation: Bad'

"Reasoning:\nThe answer accurately reflects the information provided in the document, which states that you cannot transfer or update the booking app on your site. It also correctly includes the suggestion to vote for the desired feature for future updates, which is mentioned in the provided document. The answer is concise, directly addressing the question without unnecessary information, and provides a correct and detailed response based on the document's content.\n\nEvaluation: Good"
'Reasoning:\nThe answer is directly related to the question and supported by the provided document. It succinctly outlines the steps needed to add a service and sets up service lists on separate pages, aligning with the guidelines in the document. Additionally, it includes instructions on setting up a service page exclusive to site members, which is also supported by the document. The response is clear, detailed, and avoids unnecessary information, providing correct and step-by-step instructions.\n\nEvaluation: Good'
'Reasoning:\nThe provided answer aligns well with the document's content by offering detailed instructions on how to display blog categories using datasets and repeaters or tables. All the steps mentioned (creating datasets, connecting them, adding repeaters or tables, linking datasets, and setting up filters) are grounded in the document. The answer, however, contains placeholder text "95593638" instead of the word "create," which detracts from clarity and correctness. Although minor, the repetition of these placeholders indicates a lack of attention to detail, impacting the conciseness and correctness of the instructions.\n\nEvaluation: Bad'

Evaluation

Metrics

Label	Accuracy
all	0.5208

Uses

Direct Use for Inference

First install the SetFit library:

pip install setfit

Then you can load this model and run inference.

from setfit import SetFitModel

# Download from the 🤗 Hub
model = SetFitModel.from_pretrained("Netta1994/setfit_baai_wix_qa_gpt-4o_improved-cot-instructions_chat_few_shot_generated_only_reas")
# Run inference
preds = model("Reasoning:
The answer provided is detailed and outlines the steps to block off time slots in the Wix Booking Calendar. However, the question specifically asks about removing the time from showing on the booking button, not about blocking off time slots. The instructions given do not address the question directly. The document also does not mention any method for removing the time from the booking button, so the answer lacks context grounding and relevance to both the question and the document.

Evaluation: Bad")

Training Details

Training Set Metrics

Training set	Min	Median	Max
Word count	63	88.8667	151

Label	Training Sample Count
0	22
1	23

Training Hyperparameters

batch_size: (16, 16)
num_epochs: (5, 5)
max_steps: -1
sampling_strategy: oversampling
num_iterations: 20
body_learning_rate: (2e-05, 2e-05)
head_learning_rate: 2e-05
loss: CosineSimilarityLoss
distance_metric: cosine_distance
margin: 0.25
end_to_end: False
use_amp: False
warmup_proportion: 0.1
l2_weight: 0.01
seed: 42
eval_max_steps: -1
load_best_model_at_end: False

Training Results

Epoch	Step	Training Loss	Validation Loss
0.0088	1	0.1951	-
0.4425	50	0.2544	-
0.8850	100	0.152	-
1.3274	150	0.0046	-
1.7699	200	0.0023	-
2.2124	250	0.0019	-
2.6549	300	0.0017	-
3.0973	350	0.0015	-
3.5398	400	0.0014	-
3.9823	450	0.0014	-
4.4248	500	0.0013	-
4.8673	550	0.0013	-

Framework Versions

Python: 3.10.14
SetFit: 1.1.0
Sentence Transformers: 3.1.0
Transformers: 4.44.0
PyTorch: 2.4.1+cu121
Datasets: 2.19.2
Tokenizers: 0.19.1

Citation

BibTeX

@article{https://doi.org/10.48550/arxiv.2209.11055,
    doi = {10.48550/ARXIV.2209.11055},
    url = {https://arxiv.org/abs/2209.11055},
    author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
    keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
    title = {Efficient Few-Shot Learning Without Prompts},
    publisher = {arXiv},
    year = {2022},
    copyright = {Creative Commons Attribution 4.0 International}
}

Netta1994
/

setfit_baai_wix_qa_gpt-4o_improved-cot-instructions_chat_few_shot_generated_only_reas