Thenghuy
/

cropwiz_qa_model_2

Question Answering

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Thenghuy commited on Jul 15

Commit

845f908

•

1 Parent(s): a5426b4

End of training

Files changed (2) hide show

README.md +7 -28
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.1802
 ## Model description
@@ -40,33 +45,7 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 20
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 19   | 2.8276          |
-| No log        | 2.0   | 38   | 2.6534          |
-| No log        | 3.0   | 57   | 3.0085          |
-| No log        | 4.0   | 76   | 3.1586          |
-| No log        | 5.0   | 95   | 3.3162          |
-| No log        | 6.0   | 114  | 3.4105          |
-| No log        | 7.0   | 133  | 3.6466          |
-| No log        | 8.0   | 152  | 3.6437          |
-| No log        | 9.0   | 171  | 4.3458          |
-| No log        | 10.0  | 190  | 4.2937          |
-| No log        | 11.0  | 209  | 4.5459          |
-| No log        | 12.0  | 228  | 4.1286          |
-| No log        | 13.0  | 247  | 4.3959          |
-| No log        | 14.0  | 266  | 3.9187          |
-| No log        | 15.0  | 285  | 3.8182          |
-| No log        | 16.0  | 304  | 4.3889          |
-| No log        | 17.0  | 323  | 4.2887          |
-| No log        | 18.0  | 342  | 4.1675          |
-| No log        | 19.0  | 361  | 4.1868          |
-| No log        | 20.0  | 380  | 4.1802          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 5.3137
+- eval_runtime: 11.9086
+- eval_samples_per_second: 83.973
+- eval_steps_per_second: 14.023
+- epoch: 10.0
+- step: 6670
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 100
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cec9ad26bf66398af74107b430d3e7d93b73ac35bfcf0866565c85c5191e28b2
 size 265470032

 version https://git-lfs.github.com/spec/v1
+oid sha256:1a28767c9adc220cb2ebd46e047ff75ba3b01f250fbe4da0e5e493c493c394c6
 size 265470032