gabrielchua
commited on
Commit
•
806cf24
1
Parent(s):
ba6803f
Update README to add further results
Browse files
README.md
CHANGED
@@ -2,6 +2,17 @@
|
|
2 |
license: other
|
3 |
license_name: govtech-singapore
|
4 |
license_link: LICENSE
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
---
|
6 |
|
7 |
# Off-Topic Classification Model
|
@@ -16,9 +27,18 @@ This repository contains a fine-tuned **Jina Embeddings model** designed to perf
|
|
16 |
|
17 |
## Performance
|
18 |
|
|
|
|
|
19 |
| Approach | Model | ROC-AUC | F1 | Precision | Recall |
|
20 |
|---------------------------------------|--------------------------------|---------|------|-----------|--------|
|
21 |
-
| Fine-tuned bi-encoder classifier | jina-embeddings-v2-small-en | 0.99 | 0.97 | 0.99 | 0.95 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
22 |
|
23 |
## Usage
|
24 |
1. Clone this repository and install the required dependencies:
|
|
|
2 |
license: other
|
3 |
license_name: govtech-singapore
|
4 |
license_link: LICENSE
|
5 |
+
datasets:
|
6 |
+
- gabrielchua/off-topic
|
7 |
+
language:
|
8 |
+
- en
|
9 |
+
metrics:
|
10 |
+
- roc_auc
|
11 |
+
- f1
|
12 |
+
- precision
|
13 |
+
- recall
|
14 |
+
base_model:
|
15 |
+
- jinaai/jina-embeddings-v2-small-en
|
16 |
---
|
17 |
|
18 |
# Off-Topic Classification Model
|
|
|
27 |
|
28 |
## Performance
|
29 |
|
30 |
+
We evaluated our fine-tuned models on synthetic data modelling system and user prompt pairs reflecting real world enterprise use cases of LLMs. The dataset is available [here](https://huggingface.co/datasets/gabrielchua/off-topic).
|
31 |
+
|
32 |
| Approach | Model | ROC-AUC | F1 | Precision | Recall |
|
33 |
|---------------------------------------|--------------------------------|---------|------|-----------|--------|
|
34 |
+
| [Fine-tuned bi-encoder classifier](https://huggingface.co/govtech/jina-embeddings-v2-small-en-off-topic) | jina-embeddings-v2-small-en | 0.99 | 0.97 | 0.99 | 0.95 |
|
35 |
+
| 👉 [Fine-tuned cross-encoder classifier](https://huggingface.co/govtech/stsb-roberta-base-off-topic) | stsb-roberta-base | 0.99 | 0.99 | 0.99 | 0.99 |
|
36 |
+
| Pre-trained cross-encoder | stsb-roberta-base | 0.73 | 0.68 | 0.53 | 0.93 |
|
37 |
+
| Prompt Engineering | GPT 4o (2024-08-06) | - | 0.95 | 0.94 | 0.97 |
|
38 |
+
| Prompt Engineering | GPT 4o Mini (2024-07-18) | - | 0.91 | 0.85 | 0.91 |
|
39 |
+
| Zero-shot Classification | GPT 4o Mini (2024-07-18) | 0.99 | 0.97 | 0.95 | 0.99 |
|
40 |
+
|
41 |
+
Further evaluation results on additional synthetic and external datasets (e.g.,`JailbreakBench`, `HarmBench`, `TrustLLM`) are available in our [technical report](https://arxiv.org/abs/2411.12946).
|
42 |
|
43 |
## Usage
|
44 |
1. Clone this repository and install the required dependencies:
|