End of training

Files changed (5)

README.md CHANGED

@@ -15,12 +15,9 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/stojchets/huggingface/runs/jkto10k1-jsft8)
 # jkto10k1-jsft8
 This model is a fine-tuned version of [stojchet/jkto10k1](https://huggingface.co/stojchet/jkto10k1) on the generator dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.1941
 ## Model description
@@ -50,13 +47,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 200
 - num_epochs: 3
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 1.0632        | 2.56  | 100  | 1.1941          |
 ### Framework versions
 - Transformers 4.43.0.dev0

config.json CHANGED

@@ -27,6 +27,6 @@
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
   "transformers_version": "4.43.0.dev0",
-  "use_cache": false,
+  "use_cache": true,
   "vocab_size": 32256
 }

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d556dfbf64a68b3d571a29baff05d95c82dd1761dabcc2135b7efb9039c12210
+oid sha256:362f9afc9604b5f74c3bd52b4495c2a47f345c92e2219b5f1447674922215801
 size 4986380064

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6d6b9dd74ebef05fec968c2c0deba5b49c0a9cdeab5a89df705880c6bcc97242
+oid sha256:49fa8e5cddd375e70a0293e1c942b5d778c36ac693c3d27a0d3774ec9950a88c
 size 399532808

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3bc7182e0a9a54dfa7490a3660c9fe2d308db71691ee1ae5b9da97c0c23dea92
-size 5240
+oid sha256:3de009c249cbaf9baa955a4e2e9714910d63d383071ad8024a4b28d858509906
+size 5176
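
The binary files in this commit are stored as Git LFS pointers, so the diff only shows an `oid sha256:` digest and a `size` in bytes. As a rough sketch of how a downloaded file could be checked against such a pointer, the Python below parses the three pointer fields and compares them with a blob held in memory. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Hugging Face or Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict.

    Hypothetical helper: assumes the three-line layout shown in the diff
    (version, oid, size) and does no further validation.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(pointer, data):
    """Return True if `data` has the size and SHA-256 digest named by the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError("unsupported hash algorithm: " + pointer["algo"])
    if len(data) != pointer["size"]:
        return False
    return hashlib.sha256(data).hexdigest() == pointer["digest"]


# Pointer text copied from the new side of the training_args.bin diff.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:3de009c249cbaf9baa955a4e2e9714910d63d383071ad8024a4b28d858509906\n"
    "size 5176\n"
)
```

For multi-gigabyte shards such as `model-00001-of-00002.safetensors`, reading the whole file into memory is impractical; feeding fixed-size chunks to `hashlib.sha256().update()` would be the more sensible variant of the same check.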