XLNet-LMGC-M for Machine-Paraphrased Plagiarism Detection

This checkpoint is LMGC based on XLNet-base, trained on the Machine-Paraphrased Plagiarism Dataset: DOI

Additional information about this model:

The model can be loaded to perform plagiarism detection like so:

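from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Placeholder: replace with this checkpoint's Hub ID or a local path (not stated in this card).
checkpoint = "path/to/this/checkpoint"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint)

text = 'Copyright infringement is viewed as an infringement of scholarly uprightness and a penetrate of editorial morals.'

# Calling the tokenizer (rather than tokenizer.tokenize) returns tensors the model accepts.
inputs = tokenizer(text, return_tensors="pt")

logits = model(**inputs).logits
answer = model.config.id2label[logits.argmax(dim=-1).item()]
# e.g. "plagiarism" (the exact label string depends on the checkpoint's id2label mapping)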
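
A sequence-classification model returns raw logits, not a label string, so turning its output into an answer such as "plagiarism" takes one more step. The sketch below shows that step in plain Python, without loading the model. The function name `logits_to_prediction`, the label mapping, and the example logits are all hypothetical and chosen for illustration; the real label names come from the checkpoint's configuration.

```python
import math


def logits_to_prediction(logits, id2label):
    """Convert raw classifier logits to a (label, probability) pair via softmax."""
    # Subtract the largest logit before exponentiating for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Pick the index with the highest probability (first index wins on ties).
    best = max(range(len(probs)), key=probs.__getitem__)
    return id2label[best], probs[best]


# Hypothetical label mapping and logits, for illustration only.
ID2LABEL = {0: "original", 1: "plagiarism"}
label, prob = logits_to_prediction([-1.2, 2.3], ID2LABEL)
print(label, round(prob, 3))
```

Reporting the probability alongside the label makes it possible to apply a stricter decision threshold than a plain argmax, which may be useful when false accusations of plagiarism are costly.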
