Ousso1117 committed on
Commit bf63c00
1 Parent(s): 165089c

Model save

Files changed (2)
  1. README.md +9 -14
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -1,25 +1,25 @@
 ---
 library_name: peft
-license: llama3.1
-base_model: unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
+license: llama3
+base_model: meta-llama/Meta-Llama-3-8B-Instruct
 tags:
 - trl
 - sft
 - unsloth
 - generated_from_trainer
 model-index:
-- name: SFT-unsloth-Llama-3-8B-Instruct
+- name: SFT-base-Llama-3-8B-Instruct
   results: []
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-# SFT-unsloth-Llama-3-8B-Instruct
+# SFT-base-Llama-3-8B-Instruct
 
-This model is a fine-tuned version of [unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit](https://huggingface.co/unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit) on an unknown dataset.
+This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2950
+- Loss: 0.7187
 
 ## Model description
 
@@ -47,20 +47,15 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
-- num_epochs: 3
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.053         | 0.4129 | 20   | 0.8088          |
-| 0.757         | 0.8258 | 40   | 0.7216          |
-| 0.6764        | 1.2387 | 60   | 0.6206          |
-| 0.5476        | 1.6516 | 80   | 0.4872          |
-| 0.4231        | 2.0645 | 100  | 0.3801          |
-| 0.3276        | 2.4774 | 120  | 0.3222          |
-| 0.2932        | 2.8903 | 140  | 0.2950          |
+| 1.0531        | 0.4129 | 20   | 0.8121          |
+| 0.7559        | 0.8258 | 40   | 0.7187          |
 
 
 ### Framework versions
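
The card now declares `library_name: peft` with `base_model: meta-llama/Meta-Llama-3-8B-Instruct`, so the repository ships a PEFT adapter meant to be loaded on top of that base. Below is a minimal sketch using the standard `transformers`/`peft` API; the adapter repo id `Ousso1117/SFT-base-Llama-3-8B-Instruct` is an assumption pieced together from the committer name and the `model-index` entry, not something the diff confirms.

```python
# Minimal sketch: load this PEFT adapter on top of the new base model.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Meta-Llama-3-8B-Instruct"        # base_model from this commit
adapter_id = "Ousso1117/SFT-base-Llama-3-8B-Instruct"  # assumed repo id; adjust to the real one

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto", device_map="auto")

# PeftModel reads adapter_config.json and adapter_model.safetensors from the adapter repo
# and attaches the adapter weights to the base model.
model = PeftModel.from_pretrained(base, adapter_id)
```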
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:49d11e586b2fbf0a9c58d30d390814d3ec64966bf5e90c63846452f16ecc2e55
+oid sha256:f0e69ff931a13045dad59bd43a613f951b03b8863fa28eeb21a55597c45412aa
 size 167832240
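
Both sides of this hunk are Git LFS pointer files rather than the weights themselves: `oid` is the SHA-256 digest of the actual file and `size` its byte count, so only the hash changed while the adapter stayed 167832240 bytes. A minimal sketch for checking a downloaded copy against the new pointer follows; the local path is an assumption.

```python
# Minimal sketch: verify a downloaded file against the Git LFS pointer above.
import hashlib
import os

path = "adapter_model.safetensors"  # assumed local path to the downloaded file

sha256 = hashlib.sha256()
with open(path, "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
        sha256.update(chunk)

print("oid sha256:" + sha256.hexdigest())  # should match the "+" oid line above
print("size", os.path.getsize(path))       # should print 167832240
```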