|
--- |
|
datasets: |
|
- bigscience/P3 |
|
language: |
|
- en |
|
metrics: |
|
- accuracy |
|
pipeline_tag: sentence-similarity |
|
--- |
|
|
|
# Model Card: Paraphrase Identification |
|
|
|
## Model Details |
|
|
|
- **Model Name**: ParaBERT |
|
- **Description**: A fine-tuned paraphrase identification model based on BERT |
|
- **Author**: Lucie Gabagnou, Armand L'Huillier, Yanis Rehoune, Ghiles Idris |
|
- **Language**: Pytorch |
|
|
|
## Intended Use |
|
|
|
- **Primary intended uses**: This model is designed to identify whether two questions are paraphrases of each other. |
|
- **Primary intended users**: This model is intended for use by NLP researchers and developers who are working on tasks related to paraphrase identification. |
|
- **Out-of-scope use cases**: This model should not be used for tasks outside of paraphrase identification, or in situations where the input data may contain sensitive or confidential information. |
|
|
|
## Model Architecture and Training Data |
|
|
|
- **Model Architecture**: BERT |
|
- **Training Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/train (Only questions) |
|
|
|
## Evaluation Data and Results |
|
|
|
- **Evaluation Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/test |
|
- **Metrics**: Accuracy |
|
- **Results**: 0.95 |