grgera commited on
Commit
ca15627
1 Parent(s): b8c9770

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -10
README.md CHANGED
@@ -11,11 +11,13 @@ base_model:
11
  - microsoft/deberta-v3-base
12
 
13
  ---
14
- # The solution for COLING 2025 Workshop on Detecting AI Generated Content (DAIGenC)
15
 
16
- The model that won first place in the [competition](https://genai-content-detection.gitlab.io) in the task-1A.
17
 
18
- # Get Usage
 
 
19
 
20
  ```python
21
  import torch
@@ -45,8 +47,7 @@ model.eval()
45
 
46
  inputs = tokenizer(
47
  ['Hello, Thanks for sharing your health concern with us. I have gone through your query and here are your answers: 1. If you have regular cycles, there is no further need to use any medication to regulate cycles. 2. Establishment of regular ovulation and timing of intercourse properly is necessary. 3. If you want to conceive quickly, you have to get further evaluation and plan management. Hope this helps.',
48
- 'He might have small intestinal TB rather than stomach TB. Amoebas also involves small intestine/some part of large intestine. If he has taken medicines for both diseases in form of a Complete Course, he should be fine. U can go for an oral+iv contrast CT scan of him. Now, the diagnosis of a lax cardiac can be confirmed by an upper GI endoscopy with manometry (if available). Lax cardiac may cause acidity with reflux.',
49
- 'In Micronesia during the early 1900s, there was a unique culture known as the Yap. What sets the Yap apart from other cultures is their unusual form of currency - enormous limestone disks called Rai Stones. These massive stones are not easily movable, with the largest one weighing four tons and measuring 12 feet in diameter. Instead of physically exchanging the stones, the Yap people simply keep track of ownership through an intricate system. A fascinating tale revolves around some sailors who attempted to transport a Rai Stone across the ocean but encountered difficulties along the way, resulting in the stone falling into the sea.'],
50
  max_length=512,
51
  truncation=True,
52
  padding="max_length",
@@ -58,13 +59,28 @@ torch.softmax(
58
  ).detach().cpu()[:, 1].tolist()
59
  ```
60
 
61
- # Quality
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
62
 
63
- Quality on the delayed sample of competition organizers.
64
 
65
- | Main Score (F1 Macro) | Auxiliary Score (F1 Micro) |
66
- | --------------------- | -------------------------- |
67
- | 0.8307 | 0.8311 |
68
 
69
  # Citation
70
 
 
11
  - microsoft/deberta-v3-base
12
 
13
  ---
14
+ # 🏆 Winning model for the COLING 2025 Workshop on Detecting AI Generated Content (DAIGenC)
15
 
16
+ ## Model description
17
 
18
+ A **binary classification model** of machine-generated fragments that achieved **first place** on the monolingual subtask in the [COLING 2025 GenAI Detection Task](https://genai-content-detection.gitlab.io). The model is a fine-tuned version of DeBERTa-v3-base in multi-task mode with a shared encoder and three parallel heads for classification. Only head is used for inference.
19
+
20
+ ## Usage
21
 
22
  ```python
23
  import torch
 
47
 
48
  inputs = tokenizer(
49
  ['Hello, Thanks for sharing your health concern with us. I have gone through your query and here are your answers: 1. If you have regular cycles, there is no further need to use any medication to regulate cycles. 2. Establishment of regular ovulation and timing of intercourse properly is necessary. 3. If you want to conceive quickly, you have to get further evaluation and plan management. Hope this helps.',
50
+ 'He might have small intestinal TB rather than stomach TB. Amoebas also involves small intestine/some part of large intestine. If he has taken medicines for both diseases in form of a Complete Course, he should be fine. U can go for an oral+iv contrast CT scan of him. Now, the diagnosis of a lax cardiac can be confirmed by an upper GI endoscopy with manometry (if available). Lax cardiac may cause acidity with reflux.'],
 
51
  max_length=512,
52
  truncation=True,
53
  padding="max_length",
 
59
  ).detach().cpu()[:, 1].tolist()
60
  ```
61
 
62
+ ## Limitations and bias
63
+
64
+ This model is limited to a training dataset consisting of generated and human generated texts from different sources and domains over a period of time. It may not be a good fit for all use cases in different domains. In addition, the model may have false positives in some cases, which can be varied by the classification threshold.
65
+
66
+ ## Quality
67
+
68
+ Quality on the declaired test set in the competition.
69
+
70
+ Model | Main Score (F1 Macro) | Auxiliary Score (F1 Micro) |
71
+ --------------------- | --------------------- | -------------------------- |
72
+ | MTL DeBERTa-v3-base (*our*) | **0.8307** | **0.8311**
73
+ | Single-task DeBERTa-v30-base | 0.7852 | 0.7891
74
+ | *baseline* | 0.7342 | 0.7343
75
+
76
+
77
+ ## Training procedure
78
+
79
+ This model was fine-tuned on train part of English version of the competition data [MGT Detection Task 1](https://huggingface.co/datasets/Jinyan1/COLING_2025_MGT_en) dataset. Class `0 - human`, `1 - machine`. Model was fine-tuned with 2 stages on a single NVIDIA RTX 3090 GPU with hyperparameters described in [our paper](https://arxiv.org/abs/2411.11736).
80
 
81
+ ## Your Own Fine-Tune
82
 
83
+ If you would like to fine-tune this architecture on your data domains or base models, we offer you our learn and run code with all instructions, which we have posted on the [GitHub](https://github.com/Advacheck-OU/ai-detector-coling2025).
 
 
84
 
85
  # Citation
86