End of training

Browse files

Files changed (14) hide show

README.md +15 -15
config.json +2 -3
model.safetensors +2 -2
runs/Mar19_18-00-21_f2dbfe237d11/events.out.tfevents.1710871228.f2dbfe237d11.373.0 +3 -0
runs/Mar19_18-00-21_f2dbfe237d11/events.out.tfevents.1710872573.f2dbfe237d11.373.1 +3 -0
runs/Mar19_18-00-21_f2dbfe237d11/events.out.tfevents.1710874676.f2dbfe237d11.373.2 +3 -0
runs/Mar19_19-02-58_f2dbfe237d11/events.out.tfevents.1710874990.f2dbfe237d11.373.3 +3 -0
runs/Mar19_19-09-41_f2dbfe237d11/events.out.tfevents.1710875388.f2dbfe237d11.373.4 +3 -0
runs/Mar19_19-24-05_f2dbfe237d11/events.out.tfevents.1710876253.f2dbfe237d11.373.5 +3 -0
runs/Mar19_19-40-19_f2dbfe237d11/events.out.tfevents.1710877228.f2dbfe237d11.373.6 +3 -0
runs/Mar19_19-43-03_f2dbfe237d11/events.out.tfevents.1710877389.f2dbfe237d11.373.7 +3 -0
tokenizer_config.json +1 -1
training_args.bin +1 -1
vocab.txt +0 -0

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 license: apache-2.0
-base_model: distilbert-base-cased
 tags:
 - generated_from_trainer
 metrics:
@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
 # trainer
-This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.4915
-- Precision: 0.6065
-- Recall: 0.5119
-- F1: 0.5215
-- Accuracy: 0.5119
 ## Model description
@@ -55,14 +55,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.0           | 0.57  | 30   | 3.8135          | 0.6655    | 0.6071 | 0.6173 | 0.6071   |
-| 0.1391        | 1.13  | 60   | 4.2173          | 0.6016    | 0.5119 | 0.5199 | 0.5119   |
-| 0.0399        | 1.7   | 90   | 3.7604          | 0.6025    | 0.5714 | 0.5660 | 0.5714   |
-| 0.0004        | 2.26  | 120  | 4.6200          | 0.5873    | 0.4881 | 0.4964 | 0.4881   |
-| 0.0           | 2.83  | 150  | 4.6164          | 0.6065    | 0.5119 | 0.5215 | 0.5119   |
-| 0.0           | 3.4   | 180  | 4.5299          | 0.6147    | 0.5238 | 0.5328 | 0.5238   |
-| 0.0           | 3.96  | 210  | 4.4992          | 0.6065    | 0.5119 | 0.5215 | 0.5119   |
-| 0.0           | 4.53  | 240  | 4.5022          | 0.6065    | 0.5119 | 0.5215 | 0.5119   |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: distilbert-base-uncased
 tags:
 - generated_from_trainer
 metrics:
 # trainer
+This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.5429
+- Precision: 0.6049
+- Recall: 0.5714
+- F1: 0.5559
+- Accuracy: 0.5714
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.0126        | 0.57  | 30   | 3.5750          | 0.6388    | 0.6190 | 0.6167 | 0.6190   |
+| 0.0144        | 1.13  | 60   | 3.7088          | 0.6787    | 0.6548 | 0.6479 | 0.6548   |
+| 0.0364        | 1.7   | 90   | 3.5580          | 0.6614    | 0.6548 | 0.6488 | 0.6548   |
+| 0.0991        | 2.26  | 120  | 3.8208          | 0.6775    | 0.6429 | 0.6407 | 0.6429   |
+| 0.0           | 2.83  | 150  | 4.5110          | 0.6127    | 0.5833 | 0.5646 | 0.5833   |
+| 0.0           | 3.4   | 180  | 4.5298          | 0.6127    | 0.5833 | 0.5646 | 0.5833   |
+| 0.0           | 3.96  | 210  | 4.5318          | 0.6127    | 0.5833 | 0.5646 | 0.5833   |
+| 0.0           | 4.53  | 240  | 4.5429          | 0.6049    | 0.5714 | 0.5559 | 0.5714   |
 ### Framework versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "distilbert-base-cased",
   "activation": "gelu",
   "architectures": [
     "DistilBertForSequenceClassification"
@@ -31,7 +31,6 @@
   "model_type": "distilbert",
   "n_heads": 12,
   "n_layers": 6,
-  "output_past": true,
   "pad_token_id": 0,
   "problem_type": "single_label_classification",
   "qa_dropout": 0.1,
@@ -40,5 +39,5 @@
   "tie_weights_": true,
   "torch_dtype": "float32",
   "transformers_version": "4.38.2",
-  "vocab_size": 28996
 }

 {
+  "_name_or_path": "distilbert-base-uncased",
   "activation": "gelu",
   "architectures": [
     "DistilBertForSequenceClassification"
   "model_type": "distilbert",
   "n_heads": 12,
   "n_layers": 6,
   "pad_token_id": 0,
   "problem_type": "single_label_classification",
   "qa_dropout": 0.1,
   "tie_weights_": true,
   "torch_dtype": "float32",
   "transformers_version": "4.38.2",
+  "vocab_size": 30522
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7e6e00bbe39b22051688cf197beb83986fe882f6deb93778a84aad3001277b8a
-size 263160068

 version https://git-lfs.github.com/spec/v1
+oid sha256:e26700f82c8e469c99655b2ce484621837d170b68067f2313596e36af97eb889
+size 267847948

runs/Mar19_18-00-21_f2dbfe237d11/events.out.tfevents.1710871228.f2dbfe237d11.373.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8bd954040ba4c9efb68182b05c78fcc14d7e2a60a8498c343b9417564263db5a
+size 10486

runs/Mar19_18-00-21_f2dbfe237d11/events.out.tfevents.1710872573.f2dbfe237d11.373.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:42676bb6cdf8282830a9d6c4e7b6a2ba76eda162811f391f9015d3e354c60d03
+size 10535

runs/Mar19_18-00-21_f2dbfe237d11/events.out.tfevents.1710874676.f2dbfe237d11.373.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5b1e63b4a4ea590bc12252e2665b037e243b915906542e58cf582d06b5502769
+size 6109

runs/Mar19_19-02-58_f2dbfe237d11/events.out.tfevents.1710874990.f2dbfe237d11.373.3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5bda49810f01f0242bd652b6ebd5421cc09216b9bb8ebef119433b540e8a42d7
+size 6779

runs/Mar19_19-09-41_f2dbfe237d11/events.out.tfevents.1710875388.f2dbfe237d11.373.4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1c82f3aec37f23aef28aa148e611bb3c44e7dcf0cd3b75b78b5a397a50ea0b00
+size 9498

runs/Mar19_19-24-05_f2dbfe237d11/events.out.tfevents.1710876253.f2dbfe237d11.373.5 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8c381847a8a0cfc7db60dce8a00a41aece6297191bec2303cf93367dff86c1c2
+size 10535

runs/Mar19_19-40-19_f2dbfe237d11/events.out.tfevents.1710877228.f2dbfe237d11.373.6 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:24c751ad751e97dae27d3ddb450acfbf818c392e13b330488febfd2c2ece647f
+size 5439

runs/Mar19_19-43-03_f2dbfe237d11/events.out.tfevents.1710877389.f2dbfe237d11.373.7 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fb22fdcb29b880a8295fa5eee26ce7eea385fddd4dcd930ac45de647e3b3f323
+size 10535

tokenizer_config.json CHANGED Viewed

@@ -44,7 +44,7 @@
   "clean_up_tokenization_spaces": true,
   "cls_token": "[CLS]",
   "do_basic_tokenize": true,
-  "do_lower_case": false,
   "mask_token": "[MASK]",
   "model_max_length": 512,
   "never_split": null,

   "clean_up_tokenization_spaces": true,
   "cls_token": "[CLS]",
   "do_basic_tokenize": true,
+  "do_lower_case": true,
   "mask_token": "[MASK]",
   "model_max_length": 512,
   "never_split": null,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:63f3fb18d28911259afac9a1e786ba72f7099eddeaedcdd571769b593b6d183f
 size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:ce3382def35726fe9f8dd02e330d086b6ee5535ebde116b86c24b332099b1bd5
 size 4856

vocab.txt CHANGED Viewed

The diff for this file is too large to render. See raw diff