Longformer-base for Word Sense Disambiguation

This is the checkpoint for Longformer-base after being trained on the Machine-Paraphrased Plagiarism Dataset

Additional information about this model:

The model can be loaded to perform Plagiarism like so:

from transformers import AutoModelForSequenceClassification, AutoTokenizer

AutoModelForSequenceClassification("jpelhaw/longformer-base-plagiarism-detection")
AutoTokenizer.from_pretrained("jpelhaw/longformer-base-plagiarism-detection")

input = 'Plagiarism is the representation of another author's writing, thoughts, ideas, or expressions as one's own work.'


example = tokenizer.tokenize(input, add_special_tokens=True)

answer = model(**example)
                                
# "plagiarised"
Downloads last month
80
Hosted inference API
Text Classification
Examples
Examples
Mask token: <mask>
This model can be loaded on the Inference API on-demand.