SA-Yur-or committed on
Commit
4b27fb4
1 Parent(s): 079419a

[up]: model card

Files changed (3)
  1. Calibration_plot.png +0 -0
  2. README.md +68 -1
  3. SA_logo.png +0 -0
Calibration_plot.png ADDED
README.md CHANGED
@@ -13,5 +13,72 @@ tags:
  datasets:
  - Hello-SimpleAI/HC3
  - tum-nlp/IDMGSP
+ - mlabonne/Evol-Instruct-Python-26k
  library_name: transformers
- ---
+ ---
+
+ <p style="text-align:center;">
+ <img src="SA_logo.png" alt="SuperAnnotate Logo" width="100" height="100"/>
+ </p>
+
+ <h1 align="center">SuperAnnotate</h1>
+ <h3 align="center">
+ LLM Content Detector<br/>
+ Fine-Tuned RoBERTa Large<br/>
+ </h3>
+
+ ## Description
+
+ The model is designed to detect generated/synthetic text. \
+ At the moment, such functionality is critical for checking your training data and for detecting fraud and cheating in scientific and educational settings.
+
+ ## Model Details
+
+ ### Model Description
+
+ - **Model type:** A custom architecture for binary sequence classification based on pre-trained RoBERTa, with a single output label.
+ - **Language(s):** Primarily English.
+ - **License:** Apache 2.0
+ - **Finetuned from model:** [RoBERTa Large](https://huggingface.co/FacebookAI/roberta-large)
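
To illustrate what a "single output label" usually implies: a binary classifier with one output typically emits a raw logit, which a sigmoid maps to a probability. The sketch below is plain Python and makes several assumptions that this model card does not state: the function names are hypothetical, the positive class is taken to mean "generated", and the 0.5 decision threshold is only a common default.

```python
import math


def logit_to_probability(logit: float) -> float:
    """Map a raw single-output logit to a probability with a sigmoid.

    Assumption: the positive class means "generated text".
    The two branches avoid overflow in math.exp for large |logit|.
    """
    if logit >= 0:
        return 1.0 / (1.0 + math.exp(-logit))
    z = math.exp(logit)
    return z / (1.0 + z)


def classify(logit: float, threshold: float = 0.5) -> str:
    """Turn a logit into a label; the 0.5 threshold is an assumed default."""
    return "generated" if logit_to_probability(logit) >= threshold else "human"


if __name__ == "__main__":
    for raw in (-3.0, 0.0, 3.0):
        print(raw, round(logit_to_probability(raw), 3), classify(raw))
```

In practice the threshold would be tuned on held-out data rather than fixed at 0.5.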
+
+ ### Model Sources
+
+ - **Repository:** [GitHub](https://github.com/superannotateai/generated_text_detector) for the HTTP service
+
+ ### Training data
+
+ The training data was sourced from three open datasets in different proportions and then filtered:
+
+ 1. [**HC3**](https://huggingface.co/datasets/Hello-SimpleAI/HC3) | **50%**
+ 1. [**IDMGSP**](https://huggingface.co/datasets/tum-nlp/IDMGSP) | **30%**
+ 1. [**Evol-Instruct-Python-26k**](https://huggingface.co/datasets/mlabonne/Evol-Instruct-Python-26k) | **20%**
+
+ As a result, the training dataset contained approximately ***25k*** text-label pairs with roughly balanced classes. \
+ It's worth noting that the dataset's texts follow a logical structure: \
+ Human-written and model-generated texts are paired, with both responding to the same prompt/instruction, though the prompts themselves were not used during training.
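
As a rough worked example of the mix above, the listed shares can be turned into approximate per-source sample counts. This assumes the percentages apply to the final set of about 25k pairs, which the card does not say explicitly; the names `TOTAL_SAMPLES`, `MIX`, and `samples_per_source` are illustrative only.

```python
# Approximate size of the training set, taken from the "25k" figure above.
TOTAL_SAMPLES = 25_000

# Shares as listed in the model card.
MIX = {
    "HC3": 0.50,
    "IDMGSP": 0.30,
    "Evol-Instruct-Python-26k": 0.20,
}


def samples_per_source(total: int, mix: dict[str, float]) -> dict[str, int]:
    """Split a total sample count across sources according to their shares."""
    if abs(sum(mix.values()) - 1.0) > 1e-9:
        raise ValueError("shares must sum to 1")
    return {name: round(total * share) for name, share in mix.items()}


counts = samples_per_source(TOTAL_SAMPLES, MIX)

if __name__ == "__main__":
    for name, count in counts.items():
        print(f"{name}: ~{count} samples")
```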
+
+ ### Peculiarity
+
+ During training, one of the priorities was not only to maximize prediction quality but also to avoid overfitting and to obtain a predictor whose confidence is well calibrated. \
+ The resulting model calibration is shown below:
+
+ <img src="Calibration_plot.png" alt="Calibration plot" width="390" height="300"/>
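
For readers unfamiliar with calibration plots, here is a minimal sketch of how such a curve and a summary score (expected calibration error) are commonly computed. It is not the procedure used to produce the plot above, which the card does not describe; the function names, the equal-width binning, and the choice of 10 bins are assumptions.

```python
def reliability_bins(probs, labels, n_bins=10):
    """Group predictions into equal-width confidence bins.

    Returns (mean predicted probability, observed positive rate, count)
    for every non-empty bin. A well-calibrated model has the first two
    values close to each other in each bin.
    """
    stats = [[0.0, 0, 0] for _ in range(n_bins)]  # prob sum, positives, count
    for p, y in zip(probs, labels):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes to the last bin
        stats[idx][0] += p
        stats[idx][1] += y
        stats[idx][2] += 1
    return [(s / n, pos / n, n) for s, pos, n in stats if n > 0]


def expected_calibration_error(probs, labels, n_bins=10):
    """Count-weighted mean gap between confidence and observed frequency."""
    total = len(probs)
    bins = reliability_bins(probs, labels, n_bins)
    return sum(n / total * abs(mean_p - rate) for mean_p, rate, n in bins)


if __name__ == "__main__":
    # Toy data: predictions of 0.9 that are right only half of the time.
    overconfident = expected_calibration_error([0.9] * 10, [1] * 5 + [0] * 5)
    print(f"ECE for the overconfident toy predictor: {overconfident:.2f}")
```

Plotting the first two values of each bin against each other gives the usual reliability diagram, where the diagonal corresponds to perfect calibration.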
+
+ ## Usage
+
+ TODO
+
+ ## Performance
+
+ The model was evaluated on a benchmark drawn from the same datasets used for training, together with a closed SuperAnnotate subset. \
+ However, there is no direct overlap of samples between the training data and the benchmark. \
+ The benchmark comprises 1k samples, with 200 samples per category. \
+ The model's performance is compared with open-source solutions and popular API detectors in the table below:
+
+ | Model/API | Wikipedia | Reddit QA | SA instruction | Papers | Code | Average |
+ |--------------------------------------------------------------------------------------------------|----------:|----------:|---------------:|-------:|-------:|--------:|
+ | [Hello-SimpleAI](https://huggingface.co/Hello-SimpleAI/chatgpt-detector-roberta) | **0.97**| 0.95 | 0.82 | 0.69 | 0.47 | 0.78 |
+ | [RADAR](https://huggingface.co/spaces/TrustSafeAI/RADAR-AI-Text-Detector) | 0.47 | 0.84 | 0.59 | 0.82 | 0.65 | 0.68 |
+ | [GPTZero](https://gptzero.me) | 0.72 | 0.79 | **0.90**| 0.67 | 0.74 | 0.76 |
+ | [Originality.ai](https://originality.ai) | 0.91 | **0.97**| 0.77 |**0.93**| 0.46 | 0.81 |
+ | [LLM content detector](https://huggingface.co/SuperAnnotate/roberta-large-llm-content-detector) | 0.88 | 0.95 | 0.84 | 0.81 |**0.96**| **0.89**|
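
The Average column can be sanity-checked from the per-category scores, assuming it is an unweighted mean (consistent with 200 samples in each of the five categories). The sketch below copies the table values; the names `SCORES` and `check_averages` are illustrative. A tolerance of 0.01 is used because the per-category values are themselves rounded to two decimals.

```python
# Per-category scores (Wikipedia, Reddit QA, SA instruction, Papers, Code)
# and the reported average, copied from the table above.
SCORES = {
    "Hello-SimpleAI": ([0.97, 0.95, 0.82, 0.69, 0.47], 0.78),
    "RADAR": ([0.47, 0.84, 0.59, 0.82, 0.65], 0.68),
    "GPTZero": ([0.72, 0.79, 0.90, 0.67, 0.74], 0.76),
    "Originality.ai": ([0.91, 0.97, 0.77, 0.93, 0.46], 0.81),
    "LLM content detector": ([0.88, 0.95, 0.84, 0.81, 0.96], 0.89),
}


def mean(values):
    """Unweighted arithmetic mean."""
    return sum(values) / len(values)


def check_averages(scores, tolerance=0.01):
    """Report whether each reported average matches the recomputed mean."""
    return {
        name: abs(mean(per_category) - reported) <= tolerance
        for name, (per_category, reported) in scores.items()
    }


consistent = check_averages(SCORES)
best_model = max(SCORES, key=lambda name: SCORES[name][1])

if __name__ == "__main__":
    for name, ok in consistent.items():
        print(f"{name}: recomputed {mean(SCORES[name][0]):.3f}, consistent={ok}")
    print("Highest reported average:", best_model)
```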
SA_logo.png ADDED