gc394 committed on
Commit
2a4d062
·
verified ·
1 Parent(s): f8ed993

End of training

Browse files
Files changed (7) hide show
  1. README.md +68 -0
  2. config.json +1 -2
  3. model.safetensors +2 -2
  4. tokenizer.json +0 -0
  5. tokenizer_config.json +2 -2
  6. training_args.bin +2 -2
  7. vocab.txt +0 -0
README.md ADDED
@@ -0,0 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: gc394/da_distilbert
4
+ tags:
5
+ - generated_from_trainer
6
+ model-index:
7
+ - name: ft_da_distilbert_effective_rate
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ # ft_da_distilbert_effective_rate
15
+
16
+ This model is a fine-tuned version of [gc394/da_distilbert](https://huggingface.co/gc394/da_distilbert) on the None dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.0625
19
+ - Mape: 21050161102848.0
20
+ - Rmse: 0.2500
21
+
22
+ ## Model description
23
+
24
+ More information needed
25
+
26
+ ## Intended uses & limitations
27
+
28
+ More information needed
29
+
30
+ ## Training and evaluation data
31
+
32
+ More information needed
33
+
34
+ ## Training procedure
35
+
36
+ ### Training hyperparameters
37
+
38
+ The following hyperparameters were used during training:
39
+ - learning_rate: 2e-05
40
+ - train_batch_size: 16
41
+ - eval_batch_size: 16
42
+ - seed: 42
43
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
+ - lr_scheduler_type: linear
45
+ - num_epochs: 10
46
+
47
+ ### Training results
48
+
49
+ | Training Loss | Epoch | Step | Validation Loss | Mape | Rmse |
50
+ |:-------------:|:-----:|:----:|:---------------:|:----------------:|:------:|
51
+ | No log | 1.0 | 105 | 0.0634 | 7549680615424.0 | 0.2519 |
52
+ | No log | 2.0 | 210 | 0.0625 | 21050161102848.0 | 0.2500 |
53
+ | No log | 3.0 | 315 | 0.0651 | 15955784630272.0 | 0.2552 |
54
+ | No log | 4.0 | 420 | 0.0676 | 16507671150592.0 | 0.2599 |
55
+ | 0.0129 | 5.0 | 525 | 0.0729 | 35525666799616.0 | 0.2700 |
56
+ | 0.0129 | 6.0 | 630 | 0.0669 | 30705371316224.0 | 0.2586 |
57
+ | 0.0129 | 7.0 | 735 | 0.0686 | 32481740849152.0 | 0.2619 |
58
+ | 0.0129 | 8.0 | 840 | 0.0703 | 40486999949312.0 | 0.2652 |
59
+ | 0.0129 | 9.0 | 945 | 0.0708 | 35813152784384.0 | 0.2661 |
60
+ | 0.005 | 10.0 | 1050 | 0.0704 | 38553111232512.0 | 0.2653 |
61
+
62
+
63
+ ### Framework versions
64
+
65
+ - Transformers 4.40.1
66
+ - Pytorch 2.4.0.dev20240502
67
+ - Datasets 2.19.0
68
+ - Tokenizers 0.19.1
config.json CHANGED
@@ -19,7 +19,6 @@
19
  "model_type": "distilbert",
20
  "n_heads": 12,
21
  "n_layers": 6,
22
- "output_past": true,
23
  "pad_token_id": 0,
24
  "problem_type": "regression",
25
  "qa_dropout": 0.1,
@@ -28,5 +27,5 @@
28
  "tie_weights_": true,
29
  "torch_dtype": "float32",
30
  "transformers_version": "4.40.1",
31
- "vocab_size": 28996
32
  }
 
19
  "model_type": "distilbert",
20
  "n_heads": 12,
21
  "n_layers": 6,
 
22
  "pad_token_id": 0,
23
  "problem_type": "regression",
24
  "qa_dropout": 0.1,
 
27
  "tie_weights_": true,
28
  "torch_dtype": "float32",
29
  "transformers_version": "4.40.1",
30
+ "vocab_size": 30522
31
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3147c7f98b8a793903d30e2a0d61001219ca9457021a09b61e9944679e143202
3
- size 263141604
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:edb810cb38427f8ceecb5f3c5a5341c7bab2ff2b4b67c9a71ce23d76f6377566
3
+ size 267829484
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json CHANGED
@@ -43,9 +43,9 @@
43
  },
44
  "clean_up_tokenization_spaces": true,
45
  "cls_token": "[CLS]",
46
- "do_lower_case": false,
47
  "mask_token": "[MASK]",
48
- "model_max_length": 1000000000000000019884624838656,
49
  "pad_token": "[PAD]",
50
  "sep_token": "[SEP]",
51
  "strip_accents": null,
 
43
  },
44
  "clean_up_tokenization_spaces": true,
45
  "cls_token": "[CLS]",
46
+ "do_lower_case": true,
47
  "mask_token": "[MASK]",
48
+ "model_max_length": 512,
49
  "pad_token": "[PAD]",
50
  "sep_token": "[SEP]",
51
  "strip_accents": null,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1792e4ea57dcf08b3341c14edb699c0644361483dfe48c04f66d036ba3fa3432
3
- size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3df69f12f2527db3804d7bb60dd132ac1194afc54f61d6f735eae248d23427ad
3
+ size 4984
vocab.txt CHANGED
The diff for this file is too large to render. See raw diff