[up]: model card
Browse files- Calibration_plot.png +0 -0
- README.md +68 -1
- SA_logo.png +0 -0
Calibration_plot.png
ADDED
README.md
CHANGED
@@ -13,5 +13,72 @@ tags:
|
|
13 |
datasets:
|
14 |
- Hello-SimpleAI/HC3
|
15 |
- tum-nlp/IDMGSP
|
|
|
16 |
library_name: transformers
|
17 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
datasets:
|
14 |
- Hello-SimpleAI/HC3
|
15 |
- tum-nlp/IDMGSP
|
16 |
+
- mlabonne/Evol-Instruct-Python-26k
|
17 |
library_name: transformers
|
18 |
+
---
|
19 |
+
|
20 |
+
<p style="text-align:center;">
|
21 |
+
<img src="SA_logo.png" alt="SuperAnnotate Logo" width="100" height="100"/>
|
22 |
+
</p>
|
23 |
+
|
24 |
+
<h1 align="center">SuperAnnotate</h1>
|
25 |
+
<h3 align="center">
|
26 |
+
LLM Content Detector<br/>
|
27 |
+
Fine-Tuned RoBERTa Large<br/>
|
28 |
+
</h3>
|
29 |
+
|
30 |
+
## Description
|
31 |
+
|
32 |
+
The model designed to detect generated/synthetic text. \
|
33 |
+
At the moment, such functionality is critical for check your training data and detecting fraud and cheating in scientific and educational areas.
|
34 |
+
|
35 |
+
## Model Details
|
36 |
+
|
37 |
+
### Model Description
|
38 |
+
|
39 |
+
- **Model type:** The custom architecture for binary sequence classification based on pre-trained RoBERTa, with a single output label.
|
40 |
+
- **Language(s):** Primarily English.
|
41 |
+
- **License:** Apache 2.0
|
42 |
+
- **Finetuned from model:** [RoBERTa Large](https://huggingface.co/FacebookAI/roberta-large)
|
43 |
+
|
44 |
+
### Model Sources
|
45 |
+
|
46 |
+
- **Repository:** [GitHub](https://github.com/superannotateai/generated_text_detector) for HTTP service
|
47 |
+
|
48 |
+
### Training data
|
49 |
+
|
50 |
+
The training data was sourced from three open datasets with different proportions and underwent filtering:
|
51 |
+
|
52 |
+
1. [**HC3**](https://huggingface.co/datasets/Hello-SimpleAI/HC3) | **50%**
|
53 |
+
1. [**IDMGSP**](https://huggingface.co/datasets/tum-nlp/IDMGSP) | **30%**
|
54 |
+
1. [**Evol-Instruct-Python-26k**](https://huggingface.co/datasets/mlabonne/Evol-Instruct-Python-26k) | **20%**
|
55 |
+
|
56 |
+
As a result, the training dataset contained approximately ***25k*** pairs of text-label with an approximate balance of classes. \
|
57 |
+
It's worth noting that the dataset's texts follow a logical structure: \
|
58 |
+
Human-written and model-generated texts refer to a single prompt/instruction, though the prompts themselves were not used during training.
|
59 |
+
|
60 |
+
### Peculiarity
|
61 |
+
|
62 |
+
During training, one of the priorities was not only maximizing the quality of predictions but also avoiding overfitting and obtaining an adequately confident predictor. \
|
63 |
+
We are pleased to achieve the following state of model calibration:
|
64 |
+
|
65 |
+
<img src="Calibration_plot.png" alt="SuperAnnotate Logo" width="390" height="300"/>
|
66 |
+
|
67 |
+
## Usage
|
68 |
+
|
69 |
+
TODO
|
70 |
+
|
71 |
+
## Performance
|
72 |
+
|
73 |
+
The model was evaluated on a benchmark collected from the same datasets used for training, alongside a closed subset of SuperAnnotate. \
|
74 |
+
However, there are no direct intersections of samples between the training data and the benchmark. \
|
75 |
+
The benchmark comprises 1k samples, with 200 samples per category. \
|
76 |
+
The model's performance is compared with open-source solutions and popular API detectors in the table below:
|
77 |
+
|
78 |
+
| Model/API | Wikipedia | Reddit QA | SA instruction | Papers | Code | Average |
|
79 |
+
|--------------------------------------------------------------------------------------------------|----------:|----------:|---------------:|-------:|-------:|--------:|
|
80 |
+
| [Hello-SimpleAI](https://huggingface.co/Hello-SimpleAI/chatgpt-detector-roberta) | **0.97**| 0.95 | 0.82 | 0.69 | 0.47 | 0.78 |
|
81 |
+
| [RADAR](https://huggingface.co/spaces/TrustSafeAI/RADAR-AI-Text-Detector) | 0.47 | 0.84 | 0.59 | 0.82 | 0.65 | 0.68 |
|
82 |
+
| [GPTZero](https://gptzero.me) | 0.72 | 0.79 | **0.90**| 0.67 | 0.74 | 0.76 |
|
83 |
+
| [Originality.ai](https://originality.ai) | 0.91 | **0.97**| 0.77 |**0.93**| 0.46 | 0.81 |
|
84 |
+
| [LLM content detector](https://huggingface.co/SuperAnnotate/roberta-large-llm-content-detector) | 0.88 | 0.95 | 0.84 | 0.81 |**0.96**| **0.89**|
|
SA_logo.png
ADDED