--- datasets: - bigscience/P3 language: - en metrics: - accuracy pipeline_tag: sentence-similarity --- # Model Card: Paraphrase Identification ## Model Details - **Model Name**: ParaBERT - **Description**: A fine-tuned paraphrase identification model based on BERT - **Author**: Lucie Gabagnou, Armand L'Huillier, Yanis Rehoune, Ghiles Idris - **Language**: Pytorch ## Intended Use - **Primary intended uses**: This model is designed to identify whether two questions are paraphrases of each other. - **Primary intended users**: This model is intended for use by NLP researchers and developers who are working on tasks related to paraphrase identification. - **Out-of-scope use cases**: This model should not be used for tasks outside of paraphrase identification, or in situations where the input data may contain sensitive or confidential information. ## Model Architecture and Training Data - **Model Architecture**: BERT - **Training Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/train (Only questions) ## Evaluation Data and Results - **Evaluation Data**: https://huggingface.co/datasets/bigscience/P3/viewer/glue_qqp_same_thing/test - **Metrics**: Accuracy - **Results**: 0.95