---
license: llama2
library_name: peft
tags:
- generated_from_trainer
base_model: meta-llama/Llama-2-7b-chat-hf
model-index:
- name: Llama-2-7b-chat-hf-finetune_90_10_SY_gold
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# Llama-2-7b-chat-hf-finetune_90_10_SY_gold

This model is a fine-tuned version of [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 1.1470

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 0.0001
- train_batch_size: 3
- eval_batch_size: 3
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: reduce_lr_on_plateau
- num_epochs: 50
- mixed_precision_training: Native AMP

### Training results

| Training Loss | Epoch   | Step | Validation Loss |
|:-------------:|:-------:|:----:|:---------------:|
| 0.5446        | 0.9937  | 157  | 0.7173          |
| 0.0863        | 1.9873  | 314  | 0.8235          |
| 0.1229        | 2.9810  | 471  | 0.8690          |
| 0.0914        | 3.9747  | 628  | 0.9089          |
| 0.0766        | 4.9684  | 785  | 0.9324          |
| 0.0842        | 5.9620  | 942  | 0.9564          |
| 0.0725        | 6.9557  | 1099 | 0.9903          |
| 0.0584        | 7.9494  | 1256 | 1.0348          |
| 0.0653        | 8.9430  | 1413 | 1.0468          |
| 0.0601        | 9.9367  | 1570 | 1.0397          |
| 0.0601        | 10.9304 | 1727 | 1.0622          |
| 0.0595        | 11.9241 | 1884 | 1.0757          |
| 0.0676        | 12.9177 | 2041 | 1.0911          |
| 0.0513        | 13.9114 | 2198 | 1.0942          |
| 0.0492        | 14.9051 | 2355 | 1.0980          |
| 0.0489        | 15.8987 | 2512 | 1.1041          |
| 0.05          | 16.8924 | 2669 | 1.1095          |
| 0.0498        | 17.8861 | 2826 | 1.1136          |
| 0.0494        | 18.8797 | 2983 | 1.1174          |
| 0.0483        | 19.8734 | 3140 | 1.1220          |
| 0.0476        | 20.8671 | 3297 | 1.1226          |
| 0.0487        | 21.8608 | 3454 | 1.1250          |
| 0.048         | 22.8544 | 3611 | 1.1297          |
| 0.0445        | 23.8481 | 3768 | 1.1307          |
| 0.0461        | 24.8418 | 3925 | 1.1321          |
| 0.0456        | 25.8354 | 4082 | 1.1338          |
| 0.0455        | 26.8291 | 4239 | 1.1355          |
| 0.0437        | 27.8228 | 4396 | 1.1372          |
| 0.0434        | 28.8165 | 4553 | 1.1390          |
| 0.0442        | 29.8101 | 4710 | 1.1400          |
| 0.0432        | 30.8038 | 4867 | 1.1417          |
| 0.0433        | 31.7975 | 5024 | 1.1426          |
| 0.0424        | 32.7911 | 5181 | 1.1437          |
| 0.0415        | 33.7848 | 5338 | 1.1453          |
| 0.0424        | 34.7785 | 5495 | 1.1453          |
| 0.0399        | 35.7722 | 5652 | 1.1455          |
| 0.04          | 36.7658 | 5809 | 1.1457          |
| 0.0401        | 37.7595 | 5966 | 1.1458          |
| 0.0394        | 38.7532 | 6123 | 1.1461          |
| 0.0369        | 39.7468 | 6280 | 1.1462          |
| 0.0515        | 40.7405 | 6437 | 1.1464          |
| 0.0511        | 41.7342 | 6594 | 1.1465          |
| 0.0483        | 42.7278 | 6751 | 1.1465          |
| 0.0478        | 43.7215 | 6908 | 1.1468          |
| 0.0492        | 44.7152 | 7065 | 1.1469          |
| 0.0492        | 45.7089 | 7222 | 1.1469          |
| 0.0473        | 46.7025 | 7379 | 1.1469          |
| 0.0483        | 47.6962 | 7536 | 1.1470          |
| 0.0474        | 48.6899 | 7693 | 1.1470          |
| 0.0473        | 49.6835 | 7850 | 1.1470          |


### Framework versions

- PEFT 0.11.1
- Transformers 4.40.2
- Pytorch 2.3.0+cu121
- Datasets 2.20.0
- Tokenizers 0.19.1