vdavidr's picture
End of training
37f09c0 verified
|
raw
history blame
No virus
3.26 kB
---
license: llama2
library_name: peft
tags:
- generated_from_trainer
base_model: codellama/CodeLlama-7b-Instruct-hf
metrics:
- accuracy
- bleu
- sacrebleu
- rouge
model-index:
- name: CodeLlama-7b-Instruct-hf_Fi__CMP_TR_size_304_epochs_10_2024-06-22_21-11-23_3558625
results: []
---
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->
# CodeLlama-7b-Instruct-hf_Fi__CMP_TR_size_304_epochs_10_2024-06-22_21-11-23_3558625
This model is a fine-tuned version of [codellama/CodeLlama-7b-Instruct-hf](https://huggingface.co/codellama/CodeLlama-7b-Instruct-hf) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 1.9442
- Accuracy: 0.464
- Chrf: 0.282
- Bleu: 0.212
- Sacrebleu: 0.2
- Rouge1: 0.473
- Rouge2: 0.304
- Rougel: 0.447
- Rougelsum: 0.467
- Meteor: 0.474
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.001
- train_batch_size: 1
- eval_batch_size: 1
- seed: 3407
- distributed_type: multi-GPU
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 304
- training_steps: 3040
### Training results
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Chrf | Bleu | Sacrebleu | Rouge1 | Rouge2 | Rougel | Rougelsum | Meteor |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:-----:|:-----:|:---------:|:------:|:------:|:------:|:---------:|:------:|
| 0.7293 | 1.0 | 304 | 2.8400 | 0.471 | 0.109 | 0.089 | 0.1 | 0.318 | 0.168 | 0.304 | 0.297 | 0.274 |
| 0.043 | 2.0 | 608 | 3.2408 | 0.498 | 0.051 | 0.019 | 0.0 | 0.162 | 0.063 | 0.136 | 0.142 | 0.216 |
| 0.0514 | 3.0 | 912 | 2.8322 | 0.478 | 0.156 | 0.059 | 0.1 | 0.3 | 0.145 | 0.284 | 0.289 | 0.289 |
| 0.0145 | 4.0 | 1216 | 2.5898 | 0.478 | 0.101 | 0.064 | 0.1 | 0.263 | 0.167 | 0.258 | 0.258 | 0.32 |
| 0.8203 | 5.0 | 1520 | 2.7395 | 0.478 | 0.16 | 0.049 | 0.0 | 0.306 | 0.114 | 0.284 | 0.298 | 0.27 |
| 0.0546 | 6.0 | 1824 | 2.8379 | 0.458 | 0.052 | 0.022 | 0.0 | 0.068 | 0.0 | 0.056 | 0.057 | 0.21 |
| 0.0352 | 7.0 | 2128 | 2.6987 | 0.481 | 0.165 | 0.133 | 0.1 | 0.356 | 0.246 | 0.352 | 0.355 | 0.33 |
| 0.042 | 8.0 | 2432 | 2.0781 | 0.481 | 0.264 | 0.169 | 0.2 | 0.421 | 0.261 | 0.403 | 0.421 | 0.431 |
| 0.0124 | 9.0 | 2736 | 1.9029 | 0.464 | 0.293 | 0.222 | 0.2 | 0.466 | 0.304 | 0.445 | 0.465 | 0.473 |
| 0.0382 | 10.0 | 3040 | 1.9442 | 0.464 | 0.282 | 0.212 | 0.2 | 0.473 | 0.304 | 0.447 | 0.467 | 0.474 |
### Framework versions
- PEFT 0.7.1
- Transformers 4.37.0
- Pytorch 2.2.1+cu121
- Datasets 2.20.0
- Tokenizers 0.15.2