End of training

Browse files

Files changed (4) hide show

README.md +73 -0
generation_config.json +7 -0
model.safetensors +1 -1
runs/May29_17-03-47_mike-ws/events.out.tfevents.1716973429.mike-ws.2220.0 +2 -2

README.md ADDED Viewed

	@@ -0,0 +1,73 @@

+---
+license: mit
+base_model: microsoft/git-base
+tags:
+- generated_from_trainer
+model-index:
+- name: git-base-naruto
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# git-base-naruto
+This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0429
+- Wer Score: 0.4194
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 2
+- eval_batch_size: 2
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 4
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 50
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer Score |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|
+| 7.3552        | 3.7   | 50   | 4.5162          | 22.5645   |
+| 2.3387        | 7.41  | 100  | 0.4370          | 0.3871    |
+| 0.1271        | 11.11 | 150  | 0.0361          | 0.3871    |
+| 0.0162        | 14.81 | 200  | 0.0361          | 0.4194    |
+| 0.0112        | 18.52 | 250  | 0.0381          | 0.4355    |
+| 0.0098        | 22.22 | 300  | 0.0393          | 0.4355    |
+| 0.0088        | 25.93 | 350  | 0.0399          | 0.4516    |
+| 0.0085        | 29.63 | 400  | 0.0424          | 0.4516    |
+| 0.0076        | 33.33 | 450  | 0.0399          | 0.4194    |
+| 0.007         | 37.04 | 500  | 0.0422          | 0.4516    |
+| 0.0064        | 40.74 | 550  | 0.0421          | 0.4355    |
+| 0.0059        | 44.44 | 600  | 0.0434          | 0.4355    |
+| 0.0056        | 48.15 | 650  | 0.0429          | 0.4194    |
+### Framework versions
+- Transformers 4.38.1
+- Pytorch 2.1.2+cu121
+- Datasets 2.17.1
+- Tokenizers 0.15.2

generation_config.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "_from_model_config": true,
+  "bos_token_id": 101,
+  "eos_token_id": 102,
+  "pad_token_id": 0,
+  "transformers_version": "4.38.1"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8d9915e5f5e3e0f4d9eb55057c0d4a8b011a4312a0ae977609c41508b8789e5a
 size 706516040

 version https://git-lfs.github.com/spec/v1
+oid sha256:0e18086c70bfbc614c08c02960a8cb9e28cb937d9beddbd71bd5548ce2374551
 size 706516040

runs/May29_17-03-47_mike-ws/events.out.tfevents.1716973429.mike-ws.2220.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2a2d2e1a4616f7cd6bb1937b338dacc3ee7e912887e593abacb15a4f0f426b41
-size 11177

 version https://git-lfs.github.com/spec/v1
+oid sha256:e250cc8d323d883d4c99cc5af3e081adb74f9e203a72613e5fd833c8f36f846b
+size 12066