datasets: | |
- Hello-SimpleAI/HC3 | |
language: | |
- en | |
pipeline_tag: text-classification | |
tags: | |
- chatgpt | |
# Model Card for `Hello-SimpleAI/chatgpt-detector-roberta` | |
This model is trained on **the mix of full-text and splitted sentences** of `answer`s from [Hello-SimpleAI/HC3](https://huggingface.co/datasets/Hello-SimpleAI/HC3). | |
More details refer to [arxiv: 2301.07597](https://arxiv.org/abs/2301.07597) and Gtihub project [Hello-SimpleAI/chatgpt-comparison-detection](https://github.com/Hello-SimpleAI/chatgpt-comparison-detection). | |
The base checkpoint is [roberta-base](https://huggingface.co/roberta-base). | |
We train it with all [Hello-SimpleAI/HC3](https://huggingface.co/datasets/Hello-SimpleAI/HC3) data (without held-out) for 1 epoch. | |
(1-epoch is consistent with the experiments in [our paper](https://arxiv.org/abs/2301.07597).) | |
## Citation | |
Checkout this papaer [arxiv: 2301.07597](https://arxiv.org/abs/2301.07597) | |
``` | |
@article{guo-etal-2023-hc3, | |
title = "How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection", | |
author = "Guo, Biyang and | |
Zhang, Xin and | |
Wang, Ziyuan and | |
Jiang, Minqi and | |
Nie, Jinran and | |
Ding, Yuxuan and | |
Yue, Jianwei and | |
Wu, Yupeng", | |
journal={arXiv preprint arxiv:2301.07597} | |
year = "2023", | |
} | |
``` | |