End of training

Files changed (5)

README.md +70 -10
model.safetensors +1 -1
runs/Mar13_21-22-02_4931855db211/events.out.tfevents.1710364950.4931855db211.166.0 +3 -0
runs/Mar13_21-22-02_4931855db211/events.out.tfevents.1710371000.4931855db211.166.1 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -3,6 +3,11 @@ license: apache-2.0
 base_model: distilbert-base-cased
 tags:
 - generated_from_trainer
 model-index:
 - name: trainer10
   results: []
@@ -15,15 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 1.9534
-- eval_precision: 0.0212
-- eval_recall: 0.1429
-- eval_f1: 0.0369
-- eval_accuracy: 0.1429
-- eval_runtime: 5.0122
-- eval_samples_per_second: 16.759
-- eval_steps_per_second: 2.195
-- step: 0
 ## Model description
@@ -50,9 +51,68 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 30
 ### Framework versions
 - Transformers 4.38.2
-- Pytorch 2.1.0+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2

 base_model: distilbert-base-cased
 tags:
 - generated_from_trainer
+metrics:
+- precision
+- recall
+- f1
+- accuracy
 model-index:
 - name: trainer10
   results: []
 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.8784
+- Precision: 0.6554
+- Recall: 0.6310
+- F1: 0.6226
+- Accuracy: 0.6310
 ## Model description
 - lr_scheduler_type: linear
 - num_epochs: 30
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 1.9269        | 0.57  | 30   | 1.8819          | 0.1351    | 0.2619 | 0.1571 | 0.2619   |
+| 1.8323        | 1.13  | 60   | 1.7729          | 0.2460    | 0.3571 | 0.2779 | 0.3571   |
+| 1.6414        | 1.7   | 90   | 1.5696          | 0.5043    | 0.4524 | 0.4074 | 0.4524   |
+| 1.3837        | 2.26  | 120  | 1.3431          | 0.3511    | 0.4762 | 0.3854 | 0.4762   |
+| 1.0816        | 2.83  | 150  | 1.1919          | 0.4576    | 0.5119 | 0.4623 | 0.5119   |
+| 0.7562        | 3.4   | 180  | 1.1171          | 0.5228    | 0.5357 | 0.4846 | 0.5357   |
+| 0.5875        | 3.96  | 210  | 1.0354          | 0.5617    | 0.5238 | 0.5181 | 0.5238   |
+| 0.3318        | 4.53  | 240  | 1.0221          | 0.6131    | 0.5833 | 0.5768 | 0.5833   |
+| 0.2607        | 5.09  | 270  | 1.0985          | 0.5913    | 0.5714 | 0.5584 | 0.5714   |
+| 0.1437        | 5.66  | 300  | 1.0669          | 0.5715    | 0.5476 | 0.5467 | 0.5476   |
+| 0.0846        | 6.23  | 330  | 1.1989          | 0.6190    | 0.5952 | 0.5936 | 0.5952   |
+| 0.0525        | 6.79  | 360  | 1.2240          | 0.6118    | 0.5952 | 0.5882 | 0.5952   |
+| 0.0494        | 7.36  | 390  | 1.3884          | 0.6320    | 0.5833 | 0.5797 | 0.5833   |
+| 0.0174        | 7.92  | 420  | 1.3464          | 0.6261    | 0.5952 | 0.5972 | 0.5952   |
+| 0.0124        | 8.49  | 450  | 1.4157          | 0.6466    | 0.6190 | 0.6059 | 0.6190   |
+| 0.0091        | 9.06  | 480  | 1.5045          | 0.6425    | 0.6071 | 0.6035 | 0.6071   |
+| 0.0081        | 9.62  | 510  | 1.5286          | 0.6489    | 0.6190 | 0.6131 | 0.6190   |
+| 0.0064        | 10.19 | 540  | 1.5502          | 0.6477    | 0.6190 | 0.6126 | 0.6190   |
+| 0.0057        | 10.75 | 570  | 1.5541          | 0.6394    | 0.6071 | 0.6030 | 0.6071   |
+| 0.0057        | 11.32 | 600  | 1.6535          | 0.6596    | 0.6310 | 0.6232 | 0.6310   |
+| 0.0047        | 11.89 | 630  | 1.6403          | 0.6436    | 0.6190 | 0.6053 | 0.6190   |
+| 0.0043        | 12.45 | 660  | 1.6434          | 0.6489    | 0.6190 | 0.6131 | 0.6190   |
+| 0.0039        | 13.02 | 690  | 1.6630          | 0.6507    | 0.6190 | 0.6188 | 0.6190   |
+| 0.0036        | 13.58 | 720  | 1.6792          | 0.6482    | 0.6190 | 0.6133 | 0.6190   |
+| 0.003         | 14.15 | 750  | 1.7049          | 0.6482    | 0.6190 | 0.6133 | 0.6190   |
+| 0.0031        | 14.72 | 780  | 1.7068          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0031        | 15.28 | 810  | 1.7272          | 0.6482    | 0.6190 | 0.6133 | 0.6190   |
+| 0.0028        | 15.85 | 840  | 1.7443          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0026        | 16.42 | 870  | 1.7604          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0024        | 16.98 | 900  | 1.7654          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0023        | 17.55 | 930  | 1.7699          | 0.6482    | 0.6190 | 0.6133 | 0.6190   |
+| 0.0023        | 18.11 | 960  | 1.7929          | 0.6394    | 0.6071 | 0.6030 | 0.6071   |
+| 0.0022        | 18.68 | 990  | 1.7985          | 0.6482    | 0.6190 | 0.6133 | 0.6190   |
+| 0.0021        | 19.25 | 1020 | 1.8119          | 0.6482    | 0.6190 | 0.6133 | 0.6190   |
+| 0.002         | 19.81 | 1050 | 1.8096          | 0.6394    | 0.6071 | 0.6030 | 0.6071   |
+| 0.0019        | 20.38 | 1080 | 1.8147          | 0.6496    | 0.6190 | 0.6128 | 0.6190   |
+| 0.0019        | 20.94 | 1110 | 1.8211          | 0.6496    | 0.6190 | 0.6128 | 0.6190   |
+| 0.0018        | 21.51 | 1140 | 1.8306          | 0.6394    | 0.6071 | 0.6030 | 0.6071   |
+| 0.0019        | 22.08 | 1170 | 1.8381          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0017        | 22.64 | 1200 | 1.8399          | 0.6394    | 0.6071 | 0.6030 | 0.6071   |
+| 0.0017        | 23.21 | 1230 | 1.8466          | 0.6394    | 0.6071 | 0.6030 | 0.6071   |
+| 0.0016        | 23.77 | 1260 | 1.8503          | 0.6470    | 0.6190 | 0.6128 | 0.6190   |
+| 0.0016        | 24.34 | 1290 | 1.8566          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0016        | 24.91 | 1320 | 1.8693          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0016        | 25.47 | 1350 | 1.8760          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0016        | 26.04 | 1380 | 1.8769          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0015        | 26.6  | 1410 | 1.8804          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0014        | 27.17 | 1440 | 1.8800          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0014        | 27.74 | 1470 | 1.8793          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0014        | 28.3  | 1500 | 1.8792          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0015        | 28.87 | 1530 | 1.8774          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0014        | 29.43 | 1560 | 1.8780          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
+| 0.0014        | 30.0  | 1590 | 1.8784          | 0.6554    | 0.6310 | 0.6226 | 0.6310   |
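
In the table above, validation loss reaches its minimum of 1.0221 at step 240 and then climbs to 1.8784 by step 1590, while training loss keeps falling to 0.0014. That pattern is consistent with overfitting, and the metrics reported at the top of the card match the last row rather than the lowest-loss row. The sketch below uses a few rows copied from the table to show how the best checkpoint differs depending on the selection metric. The helper name `best_by` is hypothetical and is not part of the training code.

```python
# A few rows copied from the training results table: (step, validation_loss, f1)
rows = [
    (30, 1.8819, 0.1571),
    (240, 1.0221, 0.5768),
    (600, 1.6535, 0.6232),
    (1590, 1.8784, 0.6226),
]


def best_by(rows, index, lowest):
    """Return the row with the lowest (or highest) value in column `index`."""
    pick = min if lowest else max
    return pick(rows, key=lambda row: row[index])


best_loss_row = best_by(rows, 1, lowest=True)   # lowest validation loss
best_f1_row = best_by(rows, 2, lowest=False)    # highest F1

print("best by validation loss:", best_loss_row)
print("best by F1:", best_f1_row)
```

In the Transformers `Trainer`, the `load_best_model_at_end` and `metric_for_best_model` options of `TrainingArguments` control this kind of selection. Whether this run set them is not shown in the diff.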
 ### Framework versions
 - Transformers 4.38.2
+- Pytorch 2.2.1+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2
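
Recall and Accuracy are identical in every row of the training results table. One explanation, which is an assumption because the `compute_metrics` function is not part of this commit, is that precision, recall and F1 were computed with support-weighted averaging. Weighted-average recall is always equal to accuracy, as this minimal pure-Python sketch with made-up labels illustrates:

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def weighted_recall(y_true, y_pred):
    """Per-class recall, averaged with weights proportional to class support."""
    n = len(y_true)
    support = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    # recall_c = correct_c / support_c, weight_c = support_c / n
    return sum((correct[c] / support[c]) * (support[c] / n) for c in support)


# Illustrative labels only; these are not taken from the model's dataset.
y_true = [0, 0, 1, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 2, 0]

print(accuracy(y_true, y_pred), weighted_recall(y_true, y_pred))
```

The support terms cancel, so the weighted sum reduces to the number of correct predictions divided by the number of samples, which is the definition of accuracy.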

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:22bdc71c920db1ef3504293720845a0e2bb17e39c32758321a5066f62750805b
 size 263160068

 version https://git-lfs.github.com/spec/v1
+oid sha256:32cad832e219c34967f486658f24af56a4cb78c87580c0fecd4f716d0c324c16
 size 263160068

runs/Mar13_21-22-02_4931855db211/events.out.tfevents.1710364950.4931855db211.166.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fa40aa4e83a1ee45716ab00277ce37639d899b3943de4f108652ef7a8506589a
+size 41696

runs/Mar13_21-22-02_4931855db211/events.out.tfevents.1710371000.4931855db211.166.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:47b2fd7bf4864a1eb5715426b4d4030e2292b676ce6b5604b427e359fabf055a
+size 560

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:46a4d31f26ed7214d9ee6d51c5227be085728a2c89805acc322e898f2f6dcecd
 size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:534689f8ccc67caec432757849475fb416b5d97f17e0b2dc3372c9b540504a0d
 size 4856
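
The binary files in this commit are stored as Git LFS pointers, so each diff shows only a changed `oid` while `size` stays the same. As a minimal sketch, a pointer in the three-line form shown above can be parsed as follows. The function name `parse_lfs_pointer` is hypothetical, and the sample text is the new `training_args.bin` pointer with the diff markers removed.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:534689f8ccc67caec432757849475fb416b5d97f17e0b2dc3372c9b540504a0d
size 4856
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```

Comparing the `digest` values of two pointers is enough to tell whether the underlying file changed, which is how the `model.safetensors` diff shows new weights at an unchanged size of 263160068 bytes.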