MHGanainy/gpt2-xl-lora-multi

Files changed (5) hide show

README.md CHANGED Viewed

@@ -16,11 +16,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai-community/gpt2-xl](https://huggingface.co/openai-community/gpt2-xl) on the None dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 2.7722
-- eval_model_preparation_time: 0.0285
-- eval_runtime: 146.0022
-- eval_samples_per_second: 6.219
-- eval_steps_per_second: 3.11
 - step: 0
 ## Model description
@@ -42,7 +42,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 2
-- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine

 This model is a fine-tuned version of [openai-community/gpt2-xl](https://huggingface.co/openai-community/gpt2-xl) on the None dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 2.8119
+- eval_model_preparation_time: 0.0158
+- eval_runtime: 12012.7429
+- eval_samples_per_second: 8.646
+- eval_steps_per_second: 0.27
 - step: 0
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 2
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6163a50c587ae9ce9515d097d1d3d88e45fbefca9307d409a09422a3f00717b1
 size 216306688

 version https://git-lfs.github.com/spec/v1
+oid sha256:06bac3ad3bde62a9c10f7b7a80b8d420d792b03b925227e52a6c2c1f87c4e1c5
 size 216306688

all_results.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
-    "eval_loss": 2.772221088409424,
-    "eval_model_preparation_time": 0.0285,
-    "eval_runtime": 146.0022,
-    "eval_samples_per_second": 6.219,
-    "eval_steps_per_second": 3.11,
-    "perplexity": 15.99411893981886
 }

 {
+    "eval_loss": 2.811858654022217,
+    "eval_model_preparation_time": 0.0158,
+    "eval_runtime": 12012.7429,
+    "eval_samples_per_second": 8.646,
+    "eval_steps_per_second": 0.27,
+    "perplexity": 16.640819018144438
 }

eval_results.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
-    "eval_loss": 2.772221088409424,
-    "eval_model_preparation_time": 0.0285,
-    "eval_runtime": 146.0022,
-    "eval_samples_per_second": 6.219,
-    "eval_steps_per_second": 3.11,
-    "perplexity": 15.99411893981886
 }

 {
+    "eval_loss": 2.811858654022217,
+    "eval_model_preparation_time": 0.0158,
+    "eval_runtime": 12012.7429,
+    "eval_samples_per_second": 8.646,
+    "eval_steps_per_second": 0.27,
+    "perplexity": 16.640819018144438
 }

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fee5cd2b6767dcd833952946ea04c731eed1e8d42483991ad2247ab70679692f
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:defe3c13567a5ffe0b6b528185a1c4a462686f20458bb0a685eca4c6931f5510
 size 5240