training after 10 epochs

Browse files

Files changed (2) hide show

README.md +22 -6
runs/Nov09_04-32-01_209704c86495/events.out.tfevents.1699504339.209704c86495.97855.9 +2 -2

README.md CHANGED Viewed

@@ -5,6 +5,9 @@ tags:
 - generated_from_trainer
 datasets:
 - emotion
 model-index:
 - name: llama-2-7B-Guanaco-QLoRA-AWQ
   results: []
@@ -16,6 +19,10 @@ should probably proofread and complete it, then remove this comment. -->
 # llama-2-7B-Guanaco-QLoRA-AWQ
 This model is a fine-tuned version of [TheBloke/llama-2-7B-Guanaco-QLoRA-AWQ](https://huggingface.co/TheBloke/llama-2-7B-Guanaco-QLoRA-AWQ) on the emotion dataset.
 ## Model description
@@ -35,19 +42,28 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 256
-- eval_batch_size: 256
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 63   | 1.6517          |
 ### Framework versions

 - generated_from_trainer
 datasets:
 - emotion
+metrics:
+- accuracy
+- f1
 model-index:
 - name: llama-2-7B-Guanaco-QLoRA-AWQ
   results: []
 # llama-2-7B-Guanaco-QLoRA-AWQ
 This model is a fine-tuned version of [TheBloke/llama-2-7B-Guanaco-QLoRA-AWQ](https://huggingface.co/TheBloke/llama-2-7B-Guanaco-QLoRA-AWQ) on the emotion dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.7119
+- Accuracy: 0.778
+- F1: 0.7718
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1     |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
+| 1.5456        | 1.0   | 2000  | 1.5658          | 0.397    | 0.2952 |
+| 1.3418        | 2.0   | 4000  | 1.4285          | 0.483    | 0.4464 |
+| 1.1199        | 3.0   | 6000  | 1.3052          | 0.5285   | 0.4825 |
+| 0.9157        | 4.0   | 8000  | 1.1448          | 0.5925   | 0.5616 |
+| 0.695         | 5.0   | 10000 | 0.9214          | 0.6745   | 0.6638 |
+| 0.5373        | 6.0   | 12000 | 0.8784          | 0.6925   | 0.6931 |
+| 0.405         | 7.0   | 14000 | 0.7437          | 0.745    | 0.7362 |
+| 0.2908        | 8.0   | 16000 | 0.7283          | 0.7625   | 0.7538 |
+| 0.2407        | 9.0   | 18000 | 0.6977          | 0.7775   | 0.7745 |
+| 0.1836        | 10.0  | 20000 | 0.7119          | 0.778    | 0.7718 |
 ### Framework versions

runs/Nov09_04-32-01_209704c86495/events.out.tfevents.1699504339.209704c86495.97855.9 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:38111aa82374e2dfc76223e8b0557a4ef36df7c60351f53ee5f4bd2ffecbad9e
-size 14739

 version https://git-lfs.github.com/spec/v1
+oid sha256:a47f1c13bc05434ba5fe64a1f36a15cd0db90a6c500c7db998284c9ce866958a
+size 15475