Model save
README.md CHANGED
@@ -2,13 +2,7 @@
 license: apache-2.0
 base_model: distilroberta-base
 tags:
-- text-classification
 - generated_from_trainer
-widget:
-- text: "Yucaipa owned Dominick 's before selling the chain to Safeway in 1998 for $ 2.5 billion., Yucaipa bought Dominick's in 1995 for $ 693 million and sold it to Safeway for $ 1.8 billion in 1998."
-  example_title: Not Equivalent
-- text: "Revenue in the first quarter of the year dropped 15 percent from the same period a year earlier., With the scandal hanging over Stewart's company revenue the first quarter of the year dropped 15 percent from the same period a year earlier."
-  example_title: Equivalent
 metrics:
 - accuracy
 - f1
@@ -22,11 +16,11 @@ should probably proofread and complete it, then remove this comment. -->
 
 # platzi-distilroberta-base-mrpc-wgcv
 
-This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on
+This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Accuracy: 0.
-- F1: 0.
+- Loss: 0.4002
+- Accuracy: 0.8456
+- F1: 0.8835
 
 ## Model description
 
@@ -46,8 +40,8 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size:
-- eval_batch_size:
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -55,6 +49,9 @@ The following hyperparameters were used during training:
 
 ### Training results
 
+| Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     |
+|:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
+| 0.409         | 2.1739 | 500  | 0.4002          | 0.8456   | 0.8835 |
 
 
 ### Framework versions
@@ -62,4 +59,4 @@ The following hyperparameters were used during training:
 - Transformers 4.41.2
 - Pytorch 2.3.0+cu121
 - Datasets 2.20.0
-- Tokenizers 0.19.1
+- Tokenizers 0.19.1
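The widget examples removed from the front matter above still show what the model is for: MRPC-style paraphrase detection, where each input is a pair of sentences labeled Equivalent or Not Equivalent. A minimal inference sketch, assuming the checkpoint is available on the Hub; the owning namespace is not visible in this diff, so the model id below is a placeholder.

```python
from transformers import pipeline

# Placeholder repo id -- substitute the actual namespace hosting this model.
clf = pipeline("text-classification",
               model="<owner>/platzi-distilroberta-base-mrpc-wgcv")

# The removed widget examples pass both sentences as one comma-joined string;
# the same format works here.
pair = ("Revenue in the first quarter of the year dropped 15 percent from the "
        "same period a year earlier., With the scandal hanging over Stewart's "
        "company revenue the first quarter of the year dropped 15 percent from "
        "the same period a year earlier.")
print(clf(pair))  # e.g. [{'label': 'LABEL_1', 'score': ...}]
# Label names depend on the id2label mapping saved with the checkpoint;
# without one you get the generic LABEL_0 / LABEL_1.
```

Joining the two sentences with a comma mirrors the widget format, but tokenizing them as separate segments (sentence1, sentence2) matches how MRPC models are normally trained and is likely more faithful at inference time.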
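The card never names the dataset ("unknown dataset"), but the evidence points at GLUE MRPC: the repo name, the widget sentence pairs (classic MRPC examples), and the eval cadence in the results table. MRPC has 3,668 training pairs, which at batch size 16 is 230 steps per epoch, and 500 / 230 = 2.1739, exactly the epoch logged at step 500. A reproduction sketch under that assumption; the epoch count is not shown in this diff, so the value below is a guess.

```python
import numpy as np
import evaluate
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

raw = load_dataset("glue", "mrpc")  # assumption: the "unknown dataset" is MRPC
tok = AutoTokenizer.from_pretrained("distilroberta-base")

def preprocess(batch):
    # MRPC is a sentence-pair task; tokenize both segments together.
    return tok(batch["sentence1"], batch["sentence2"], truncation=True)

data = raw.map(preprocess, batched=True)

accuracy, f1 = evaluate.load("accuracy"), evaluate.load("f1")

def compute_metrics(eval_pred):
    # Mirrors the two metrics declared in the card's front matter.
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    return {**accuracy.compute(predictions=preds, references=labels),
            **f1.compute(predictions=preds, references=labels)}

args = TrainingArguments(
    output_dir="platzi-distilroberta-base-mrpc-wgcv",
    learning_rate=5e-5,              # from the card
    per_device_train_batch_size=16,  # from the card
    per_device_eval_batch_size=16,   # from the card
    seed=42,                         # from the card
    lr_scheduler_type="linear",      # from the card; the Adam betas/epsilon
                                     # listed there are the optimizer defaults
    eval_strategy="steps",           # the table logs an eval at step 500
    eval_steps=500,
    num_train_epochs=3,              # assumption: not shown in this diff
)

trainer = Trainer(
    model=AutoModelForSequenceClassification.from_pretrained(
        "distilroberta-base", num_labels=2),
    args=args,
    train_dataset=data["train"],
    eval_dataset=data["validation"],
    tokenizer=tok,                   # enables default padding collation
    compute_metrics=compute_metrics,
)
# trainer.train()
```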
runs/Jun23_22-38-15_c261aaf0d6a8/events.out.tfevents.1719182301.c261aaf0d6a8.177.1 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:d05ea7b78efce42c7b64549e9dff70758ac6361c7f9f3a077df34dfd4a3828bb
+size 6049
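The second changed file is a TensorBoard event log tracked with Git LFS, so the commit only rewrites the three-line pointer: the spec version, the sha256 oid of the payload, and its size in bytes (6049 here). A sketch of checking a downloaded payload against the pointer, assuming a local copy of the file in the current directory:

```python
import hashlib

def lfs_sha256(path: str, chunk_size: int = 1 << 20) -> str:
    # Hash the file in chunks so large LFS objects don't need to fit in memory.
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

oid = lfs_sha256("events.out.tfevents.1719182301.c261aaf0d6a8.177.1")
assert oid == "d05ea7b78efce42c7b64549e9dff70758ac6361c7f9f3a077df34dfd4a3828bb"
```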