A fine-tuned model based on the **DeBERTaV3** model of Microsoft and fine-tuned on **Glue QQP**, which detects the linguistical similarities between two questions and whether they are similar questions or dissimilar. | |
## Model Hyperparameters | |
```python | |
epoch=4 | |
per_device_train_batch_size=32 | |
per_device_eval_batch_size=16 | |
lr=2e-5 | |
weight_decay=1e-2 | |
gradient_checkpointing=True | |
gradient_accumulation_steps=8 | |
``` | |
## Model Performance | |
```JSON | |
{"Training Loss": 0.132400, | |
"Validation Loss": 0.217410, | |
"Validation Accuracy": 0.917969 | |
} | |
``` | |
## Model Dependencies | |
```JSON | |
{"Main Model": "microsoft/deberta-v3-base", | |
"Dataset": "SetFit/qqp" | |
} | |
``` | |