---
license: apache-2.0
library_name: peft
tags:
- generated_from_trainer
base_model: ahmedabdelwahed/Mojiz-sft
model-index:
- name: test
  results: []
---

# test

This model is a fine-tuned version of [ahmedabdelwahed/Mojiz-sft](https://huggingface.co/ahmedabdelwahed/Mojiz-sft) on an unspecified dataset.
It achieves the following results on the evaluation set:
- Loss: 0.0000
- Rewards/chosen: 10.9834
- Rewards/rejected: -4.3197
- Rewards/accuracies: 1.0
- Rewards/margins: 15.3031
- Logps/rejected: -79.6788
- Logps/chosen: -304.3463
- Logits/rejected: -11.4276
- Logits/chosen: -12.4846

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 1e-05
- train_batch_size: 4
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 150
- training_steps: 1000

### Training results

| Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
|:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
| 1.0153        | 0.41  | 100  | 0.2835          | 0.9457         | -0.2302          | 1.0                | 1.1759          | -71.4998       | -324.4215    | -11.7052        | -12.9375      |
| 0.5417        | 0.82  | 200  | 0.0117          | 4.5050         | -1.4875          | 1.0                | 5.9925          | -74.0144       | -317.3029    | -11.5990        | -12.7637      |
| 0.0938        | 1.22  | 300  | 0.0006          | 7.3787         | -2.7778          | 1.0                | 10.1565         | -76.5951       | -311.5557    | -11.5086        | -12.6163      |
| 0.0091        | 1.63  | 400  | 0.0001          | 8.4468         | -3.1612          | 1.0                | 11.6080         | -77.3619       | -309.4195    | -11.4851        | -12.5777      |
| 0.0598        | 2.04  | 500  | 0.0000          | 9.5528         | -3.6756          | 1.0                | 13.2285         | -78.3907       | -307.2074    | -11.4570        | -12.5323      |
| 0.0032        | 2.45  | 600  | 0.0000          | 10.0884        | -3.9009          | 1.0                | 13.9892         | -78.8412       | -306.1363    | -11.4508        | -12.5212      |
| 0.017         | 2.86  | 700  | 0.0000          | 10.4984        | -4.0850          | 1.0                | 14.5834         | -79.2095       | -305.3162    | -11.4376        | -12.4997      |
| 0.002         | 3.27  | 800  | 0.0000          | 10.8150        | -4.2275          | 1.0                | 15.0425         | -79.4944       | -304.6830    | -11.4327        | -12.4924      |
| 0.0001        | 3.67  | 900  | 0.0000          | 10.9336        | -4.2971          | 1.0                | 15.2308         | -79.6337       | -304.4458    | -11.4287        | -12.4862      |
| 0.0002        | 4.08  | 1000 | 0.0000          | 10.9834        | -4.3197          | 1.0                | 15.3031         | -79.6788       | -304.3463    | -11.4276        | -12.4846      |

### Framework versions

- PEFT 0.7.1
- Transformers 4.36.0
- PyTorch 2.0.0
- Datasets 2.1.0
- Tokenizers 0.15.0
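
### Training configuration sketch

The reward, margin, and log-probability columns above are the metrics typically logged by preference-optimization trainers such as TRL's `DPOTrainer`, which suggests (but does not confirm) a DPO-style run. As a minimal, non-authoritative sketch, the hyperparameters listed above map onto `transformers.TrainingArguments` roughly as follows; `output_dir` is a placeholder, and any preference-specific settings (such as the DPO `beta`) are not recorded in this card:

```python
from transformers import TrainingArguments

# Illustrative mapping of the card's hyperparameters onto TrainingArguments.
# The original training script is not included in this repository, so
# output_dir and the per-device interpretation of the batch sizes are
# assumptions, not recorded facts.
training_args = TrainingArguments(
    output_dir="test",                  # placeholder: the model name on this card
    learning_rate=1e-5,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=8,
    seed=42,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    warmup_steps=150,
    max_steps=1000,
)
```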
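
## How to use

This repository contains a PEFT adapter rather than a full model checkpoint, so the base model is loaded first and the adapter is applied on top. The sketch below assumes the base checkpoint is a seq2seq (T5-family) model loadable with `AutoModelForSeq2SeqLM`, and uses a placeholder adapter id; both are assumptions rather than facts recorded in this card.

```python
from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Load the base checkpoint the adapter was trained against.
# AutoModelForSeq2SeqLM is an assumption; swap in AutoModelForCausalLM
# if the base model turns out to be a decoder-only checkpoint.
base_model = AutoModelForSeq2SeqLM.from_pretrained("ahmedabdelwahed/Mojiz-sft")
tokenizer = AutoTokenizer.from_pretrained("ahmedabdelwahed/Mojiz-sft")

# Apply this PEFT adapter on top of the base weights.
# "ahmedabdelwahed/test" is a placeholder: substitute this repository's id.
model = PeftModel.from_pretrained(base_model, "ahmedabdelwahed/test")
model.eval()

# Standard generate() call through the adapted model.
inputs = tokenizer("Example input text", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```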