Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ base_model:
|
|
15 |
|
16 |
> **GitHub repository** for exploring the source code and additional resources: https://github.com/wangkevin02/USP
|
17 |
|
18 |
-
The **AI Detect Model** is a binary classification model designed to determine whether a given text is AI-generated (label=1) or written by a human (label=0). This model plays a crucial role in providing AI detection rewards, helping to prevent reward hacking during Reinforcement Learning with Cycle Consistency (RLCC). For more details, please refer to [our paper](https://
|
19 |
|
20 |
This model is built upon the [Longformer](https://huggingface.co/allenai/longformer-base-4096) architecture and trained using our proprietary [LMSYS-USP](https://huggingface.co/datasets/wangkevin02/LMSYS-USP) dataset. Specifically, in a dialogue context, texts generated by the assistant are labeled as AI-generated (label=1), while user-generated texts are assigned the opposite label (label=0).
|
21 |
|
@@ -78,5 +78,13 @@ if __name__ == "__main__":
|
|
78 |
If you find this model useful, please cite:
|
79 |
|
80 |
```plaintext
|
81 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
82 |
```
|
|
|
15 |
|
16 |
> **GitHub repository** for exploring the source code and additional resources: https://github.com/wangkevin02/USP
|
17 |
|
18 |
+
The **AI Detect Model** is a binary classification model designed to determine whether a given text is AI-generated (label=1) or written by a human (label=0). This model plays a crucial role in providing AI detection rewards, helping to prevent reward hacking during Reinforcement Learning with Cycle Consistency (RLCC). For more details, please refer to [our paper](https://arxiv.org/pdf/2502.18968).
|
19 |
|
20 |
This model is built upon the [Longformer](https://huggingface.co/allenai/longformer-base-4096) architecture and trained using our proprietary [LMSYS-USP](https://huggingface.co/datasets/wangkevin02/LMSYS-USP) dataset. Specifically, in a dialogue context, texts generated by the assistant are labeled as AI-generated (label=1), while user-generated texts are assigned the opposite label (label=0).
|
21 |
|
|
|
78 |
If you find this model useful, please cite:
|
79 |
|
80 |
```plaintext
|
81 |
+
@misc{wang2025knowbettermodelinghumanlike,
|
82 |
+
title={Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles},
|
83 |
+
author={Kuang Wang and Xianfei Li and Shenghao Yang and Li Zhou and Feng Jiang and Haizhou Li},
|
84 |
+
year={2025},
|
85 |
+
eprint={2502.18968},
|
86 |
+
archivePrefix={arXiv},
|
87 |
+
primaryClass={cs.CL},
|
88 |
+
url={https://arxiv.org/abs/2502.18968},
|
89 |
+
}
|
90 |
```
|