vdavidr committed
Commit 37f09c0
1 Parent(s): b5956a3

End of training

README.md ADDED
@@ -0,0 +1,84 @@
---
license: llama2
library_name: peft
tags:
- generated_from_trainer
base_model: codellama/CodeLlama-7b-Instruct-hf
metrics:
- accuracy
- bleu
- sacrebleu
- rouge
model-index:
- name: CodeLlama-7b-Instruct-hf_Fi__CMP_TR_size_304_epochs_10_2024-06-22_21-11-23_3558625
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# CodeLlama-7b-Instruct-hf_Fi__CMP_TR_size_304_epochs_10_2024-06-22_21-11-23_3558625

This model is a fine-tuned version of [codellama/CodeLlama-7b-Instruct-hf](https://huggingface.co/codellama/CodeLlama-7b-Instruct-hf) on an unspecified dataset.
It achieves the following results on the evaluation set:
- Loss: 1.9442
- Accuracy: 0.464
- Chrf: 0.282
- Bleu: 0.212
- Sacrebleu: 0.2
- Rouge1: 0.473
- Rouge2: 0.304
- Rougel: 0.447
- Rougelsum: 0.467
- Meteor: 0.474

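No usage example ships with this card, so the following is a minimal, illustrative sketch of loading a PEFT adapter such as this one on top of the base model for inference. The adapter repository id is inferred from the model name above and is an assumption, as is the prompt.

```python
# Sketch only (not from the original card): attach the trained PEFT adapter to the
# base CodeLlama model and generate. The adapter repo id below is an assumption.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "codellama/CodeLlama-7b-Instruct-hf"
adapter_id = "vdavidr/CodeLlama-7b-Instruct-hf_Fi__CMP_TR_size_304_epochs_10_2024-06-22_21-11-23_3558625"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.float16, device_map="auto"
)
model = PeftModel.from_pretrained(base_model, adapter_id)  # loads the adapter weights

prompt = "[INST] Write a Python function that reverses a string. [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
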
## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 0.001
- train_batch_size: 1
- eval_batch_size: 1
- seed: 3407
- distributed_type: multi-GPU
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 304
- training_steps: 3040

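For orientation, the listed values map roughly onto `transformers.TrainingArguments` as sketched below. This is a reconstruction, not the card's actual training script; the output directory is a placeholder, and any LoRA or quantization settings used for the adapter are not stated in this card, so they are omitted.

```python
# Illustrative mapping of the hyperparameters above onto TrainingArguments.
# Assumption/sketch only; output_dir is a placeholder not taken from the card.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="codellama-7b-instruct-fi-cmp",  # placeholder
    learning_rate=1e-3,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    seed=3407,
    lr_scheduler_type="linear",
    warmup_steps=304,
    max_steps=3040,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-6,
)
```
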
### Training results

| Training Loss | Epoch | Step | Validation Loss | Accuracy | Chrf | Bleu | Sacrebleu | Rouge1 | Rouge2 | Rougel | Rougelsum | Meteor |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:-----:|:-----:|:---------:|:------:|:------:|:------:|:---------:|:------:|
| 0.7293 | 1.0 | 304 | 2.8400 | 0.471 | 0.109 | 0.089 | 0.1 | 0.318 | 0.168 | 0.304 | 0.297 | 0.274 |
| 0.043 | 2.0 | 608 | 3.2408 | 0.498 | 0.051 | 0.019 | 0.0 | 0.162 | 0.063 | 0.136 | 0.142 | 0.216 |
| 0.0514 | 3.0 | 912 | 2.8322 | 0.478 | 0.156 | 0.059 | 0.1 | 0.3 | 0.145 | 0.284 | 0.289 | 0.289 |
| 0.0145 | 4.0 | 1216 | 2.5898 | 0.478 | 0.101 | 0.064 | 0.1 | 0.263 | 0.167 | 0.258 | 0.258 | 0.32 |
| 0.8203 | 5.0 | 1520 | 2.7395 | 0.478 | 0.16 | 0.049 | 0.0 | 0.306 | 0.114 | 0.284 | 0.298 | 0.27 |
| 0.0546 | 6.0 | 1824 | 2.8379 | 0.458 | 0.052 | 0.022 | 0.0 | 0.068 | 0.0 | 0.056 | 0.057 | 0.21 |
| 0.0352 | 7.0 | 2128 | 2.6987 | 0.481 | 0.165 | 0.133 | 0.1 | 0.356 | 0.246 | 0.352 | 0.355 | 0.33 |
| 0.042 | 8.0 | 2432 | 2.0781 | 0.481 | 0.264 | 0.169 | 0.2 | 0.421 | 0.261 | 0.403 | 0.421 | 0.431 |
| 0.0124 | 9.0 | 2736 | 1.9029 | 0.464 | 0.293 | 0.222 | 0.2 | 0.466 | 0.304 | 0.445 | 0.465 | 0.473 |
| 0.0382 | 10.0 | 3040 | 1.9442 | 0.464 | 0.282 | 0.212 | 0.2 | 0.473 | 0.304 | 0.447 | 0.467 | 0.474 |

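The Chrf, Bleu/Sacrebleu, Rouge, and Meteor columns are text-similarity scores between generated and reference outputs. As a minimal sketch with placeholder strings (not the evaluation pipeline actually used for this card), such metrics can be computed with the Hugging Face `evaluate` library:

```python
# Sketch with placeholder data; not the card's actual evaluation code.
import evaluate

predictions = ["def add(a, b):\n    return a + b"]    # hypothetical model output
references = [["def add(x, y):\n    return x + y"]]   # hypothetical reference

sacrebleu = evaluate.load("sacrebleu")
chrf = evaluate.load("chrf")
rouge = evaluate.load("rouge")
meteor = evaluate.load("meteor")

print(sacrebleu.compute(predictions=predictions, references=references)["score"])
print(chrf.compute(predictions=predictions, references=references)["score"])
print(rouge.compute(predictions=predictions, references=[r[0] for r in references]))
print(meteor.compute(predictions=predictions, references=[r[0] for r in references]))
```
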
### Framework versions

- PEFT 0.7.1
- Transformers 4.37.0
- Pytorch 2.2.1+cu121
- Datasets 2.20.0
- Tokenizers 0.15.2
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d81a356cd16a7fadb158cf9cfd23206558464a2c92fa8d67167c7c294ab44757
+oid sha256:4c64d99a4847f2ac80c972cacb39233895cc4999dfaea53ef167c24d30e263bd
 size 604565408
runs/Jun22_21-11-23_g5201.mahti.csc.fi/events.out.tfevents.1719079921.g5201.mahti.csc.fi.596164.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6c76a2d3ceb1f3fdcdacf7f95191f86aee0d1f5402fa1ae4e5dbc0e01a7e80d4
-size 440747
+oid sha256:8f35b6a9e450add1cae5334b0dd8e87ecb1db94e9d52f28991dd37f9ffeb91bc
+size 489554