yochen
/

distilroberta-base-wiki-mark

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

yochen commited on Jul 25, 2022

Commit

b5be272

•

1 Parent(s): bcc8c40

update model card README.md

Files changed (1) hide show

README.md +10 -16

README.md CHANGED Viewed

@@ -12,9 +12,14 @@ should probably proofread and complete it, then remove this comment. -->
 # distilroberta-base-wiki-mark
-This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.2066
 ## Model description
@@ -39,22 +44,11 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
-### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 2.2545        | 1.0   | 2028  | 2.3260          |
-| 2.2803        | 2.0   | 4056  | 2.2899          |
-| 2.24          | 3.0   | 6084  | 2.1889          |
-| 2.2414        | 4.0   | 8112  | 2.2387          |
-| 2.1412        | 5.0   | 10140 | 2.1748          |
 ### Framework versions
-- Transformers 4.18.0
-- Pytorch 1.6.0
 - Datasets 2.3.2
 - Tokenizers 0.12.1

 # distilroberta-base-wiki-mark
+This model is a fine-tuned version of [yochen/distilroberta-base-wiki-mark](https://huggingface.co/yochen/distilroberta-base-wiki-mark) on the None dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 2.2695
+- eval_runtime: 4.3489
+- eval_samples_per_second: 431.836
+- eval_steps_per_second: 54.037
+- epoch: 10.1
+- step: 20489
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5000
 ### Framework versions
+- Transformers 4.20.1
+- Pytorch 1.12.0+cu102
 - Datasets 2.3.2
 - Tokenizers 0.12.1