Upload 7 files

Browse files

Files changed (3) hide show

README.md +19 -24
adapter_model.safetensors +2 -2
training_args.bin +0 -0

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.1279
 ## Model description
@@ -46,33 +46,28 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
-- num_epochs: 70
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss |
-|:-------------:|:-------:|:----:|:---------------:|
-| 1.3621        | 3.4964  | 420  | 1.4379          |
-| 1.1954        | 6.9927  | 840  | 1.4208          |
-| 0.8507        | 10.4891 | 1260 | 1.5712          |
-| 0.789         | 13.9854 | 1680 | 1.6759          |
-| 0.5388        | 17.4818 | 2100 | 1.9153          |
-| 0.4013        | 20.9781 | 2520 | 2.0319          |
-| 0.2933        | 24.4745 | 2940 | 2.2094          |
-| 0.207         | 27.9709 | 3360 | 2.3547          |
-| 0.1604        | 31.4672 | 3780 | 2.5483          |
-| 0.1154        | 34.9636 | 4200 | 2.5953          |
-| 0.0982        | 38.4599 | 4620 | 2.7355          |
-| 0.0954        | 41.9563 | 5040 | 2.8220          |
-| 0.0677        | 45.4527 | 5460 | 2.8909          |
-| 0.0613        | 48.9490 | 5880 | 2.9654          |
-| 0.0482        | 52.4454 | 6300 | 3.0125          |
-| 0.0415        | 55.9417 | 6720 | 3.0390          |
-| 0.0477        | 59.4381 | 7140 | 3.0992          |
-| 0.0412        | 62.9344 | 7560 | 3.1126          |
-| 0.0327        | 66.4308 | 7980 | 3.1262          |
-| 0.0391        | 69.9272 | 8400 | 3.1279          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6596
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 1.7073        | 0.4162 | 50   | 1.5993          |
+| 1.4004        | 0.8325 | 100  | 1.4527          |
+| 1.3051        | 1.2487 | 150  | 1.4122          |
+| 1.2396        | 1.6649 | 200  | 1.3871          |
+| 1.2044        | 2.0812 | 250  | 1.3906          |
+| 1.1019        | 2.4974 | 300  | 1.3775          |
+| 1.2682        | 2.9136 | 350  | 1.3649          |
+| 1.1681        | 3.3299 | 400  | 1.4233          |
+| 1.1343        | 3.7461 | 450  | 1.4160          |
+| 0.7987        | 4.1623 | 500  | 1.4964          |
+| 0.8663        | 4.5786 | 550  | 1.5011          |
+| 0.7473        | 4.9948 | 600  | 1.4845          |
+| 0.7386        | 5.4110 | 650  | 1.5706          |
+| 0.61          | 5.8273 | 700  | 1.5695          |
+| 0.4689        | 6.2435 | 750  | 1.6596          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c8f5f623b5b1c86000a3a2a6f3b8bc4c7b09b9f4a1d783d2a0e5f90af49fffc9
-size 37774528

 version https://git-lfs.github.com/spec/v1
+oid sha256:4b76654e3600fee393722e7c47549703279d66c347d34487b9ed9141ea1f4ada
+size 75514264

training_args.bin CHANGED Viewed

Binary files a/training_args.bin and b/training_args.bin differ