File size: 1,226 Bytes
e560b81 e50d55f 6edd4df e50d55f 36135f7 e50d55f e560b81 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 |
---
datasets:
- bigscience/P3
language:
- en
metrics:
- accuracy
pipeline_tag: sentence-similarity
---
# Model Card: Paraphrase Identification
## Model Details
- **Model Name**: ParaBERT
- **Description**: A fine-tuned paraphrase identification model based on BERT
- **Author**: Lucie Gabagnou, Armand L'Huillier, Yanis Rehoune, Ghiles Idris
- **Language**: Pytorch
## Intended Use
- **Primary intended uses**: This model is designed to identify whether two questions are paraphrases of each other.
- **Primary intended users**: This model is intended for use by NLP researchers and developers who are working on tasks related to paraphrase identification.
- **Out-of-scope use cases**: This model should not be used for tasks outside of paraphrase identification, or in situations where the input data may contain sensitive or confidential information.
## Model Architecture and Training Data
- **Model Architecture**: BERT
- **Training Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/train (Only questions)
## Evaluation Data and Results
- **Evaluation Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/test
- **Metrics**: Accuracy
- **Results**: 0.95 |