sedrickkeh commited on
Commit
2ef920f
1 Parent(s): cf0de63

End of training

Browse files
README.md CHANGED
@@ -7,18 +7,18 @@ tags:
7
  - full
8
  - generated_from_trainer
9
  model-index:
10
- - name: mistral_alpaca_sft_sample
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
- # mistral_alpaca_sft_sample
18
 
19
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the llamafactory/alpaca_en dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 1.7878
22
 
23
  ## Model description
24
 
@@ -55,7 +55,7 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss |
57
  |:-------------:|:------:|:----:|:---------------:|
58
- | No log | 0.0870 | 2 | 1.7878 |
59
 
60
 
61
  ### Framework versions
 
7
  - full
8
  - generated_from_trainer
9
  model-index:
10
+ - name: sft
11
  results: []
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ # sft
18
 
19
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the llamafactory/alpaca_en dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.7879
22
 
23
  ## Model description
24
 
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss |
57
  |:-------------:|:------:|:----:|:---------------:|
58
+ | No log | 0.0870 | 2 | 1.7879 |
59
 
60
 
61
  ### Framework versions
all_results.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
  "epoch": 0.08695652173913043,
3
- "eval_loss": 1.787781000137329,
4
- "eval_runtime": 3.2321,
5
- "eval_samples_per_second": 190.897,
6
- "eval_steps_per_second": 3.094,
7
  "total_flos": 2.572343380583383e+17,
8
  "train_loss": 1.2197923452957817,
9
  "train_runtime": 1476.3308,
 
1
  {
2
  "epoch": 0.08695652173913043,
3
+ "eval_loss": 1.7879343032836914,
4
+ "eval_runtime": 3.2212,
5
+ "eval_samples_per_second": 191.545,
6
+ "eval_steps_per_second": 3.104,
7
  "total_flos": 2.572343380583383e+17,
8
  "train_loss": 1.2197923452957817,
9
  "train_runtime": 1476.3308,
eval_results.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "epoch": 0.08695652173913043,
3
- "eval_loss": 1.787781000137329,
4
- "eval_runtime": 3.2321,
5
- "eval_samples_per_second": 190.897,
6
- "eval_steps_per_second": 3.094
7
  }
 
1
  {
2
  "epoch": 0.08695652173913043,
3
+ "eval_loss": 1.7879343032836914,
4
+ "eval_runtime": 3.2212,
5
+ "eval_samples_per_second": 191.545,
6
+ "eval_steps_per_second": 3.104
7
  }
model-00001-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bff9fb7c73ff200da39dfe63df53443d3dd78583a153c6d76280b548d4d64514
3
  size 4943162336
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7bb0f357425754d29b8d32ca3f0afe2b7028699c79e881d69bcb7b9ca5bdfc0c
3
  size 4943162336
model-00002-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d69f0b41dd039acb447a0ce06c3ef5e8562641d453d37b6c617815dac34def43
3
  size 4999819336
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8b90d09ad9b65d38791058eb31d2ee4a0bdb66f74c69e38ec579d34a81b6600e
3
  size 4999819336
model-00003-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1299202c730511cc9e6f4a8bcb58128345b9b6f3a74e529bb805b35f1aa36b13
3
  size 4540516344
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b743639beeeab228d14fdbdb0fa63afaa5995c076eabc594c13ae8d7f5069e44
3
  size 4540516344
trainer_log.jsonl CHANGED
@@ -1,2 +1,2 @@
1
- {"current_steps": 2, "total_steps": 2, "eval_loss": 1.787781000137329, "epoch": 0.08695652173913043, "percentage": 100.0, "elapsed_time": "0:00:25", "remaining_time": "0:00:00"}
2
- {"current_steps": 2, "total_steps": 2, "epoch": 0.08695652173913043, "percentage": 100.0, "elapsed_time": "0:00:25", "remaining_time": "0:00:00"}
 
1
+ {"current_steps": 2, "total_steps": 2, "eval_loss": 1.7879343032836914, "epoch": 0.08695652173913043, "percentage": 100.0, "elapsed_time": "0:00:26", "remaining_time": "0:00:00"}
2
+ {"current_steps": 2, "total_steps": 2, "epoch": 0.08695652173913043, "percentage": 100.0, "elapsed_time": "0:00:26", "remaining_time": "0:00:00"}
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1df4121cadf448c3cbdf8399ff40c028927f2341580f4538f9c5eae528f358c5
3
  size 6584
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b24404067eb693034baea63af0d79ca1d95f9d7c8d7cf830bdf689c38a861c70
3
  size 6584