File size: 1,226 Bytes
e560b81
 
 
 
 
 
 
 
 
 
e50d55f
 
 
 
6edd4df
e50d55f
36135f7
e50d55f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e560b81
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
---
datasets:
- bigscience/P3
language:
- en
metrics:
- accuracy
pipeline_tag: sentence-similarity
---

# Model Card: Paraphrase Identification

## Model Details

- **Model Name**: ParaBERT
- **Description**: A fine-tuned paraphrase identification model based on BERT
- **Author**: Lucie Gabagnou, Armand L'Huillier, Yanis Rehoune, Ghiles Idris
- **Language**: Pytorch

## Intended Use

- **Primary intended uses**: This model is designed to identify whether two questions are paraphrases of each other.
- **Primary intended users**: This model is intended for use by NLP researchers and developers who are working on tasks related to paraphrase identification.
- **Out-of-scope use cases**: This model should not be used for tasks outside of paraphrase identification, or in situations where the input data may contain sensitive or confidential information.

## Model Architecture and Training Data

- **Model Architecture**: BERT
- **Training Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/train (Only questions)

## Evaluation Data and Results

- **Evaluation Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/test
- **Metrics**: Accuracy
- **Results**: 0.95