mkschulz9's picture
Update README.md
7ab40a7 verified
---
base_model: codellama/CodeLlama-7b-hf
library_name: peft
license: llama2
tags:
- trl
- sft
- generated_from_trainer
model-index:
- name: code-llama-7b-chatbot
results: []
pipeline_tag: text-generation
---
# code-llama-7b-chatbot
This model, 'mkschulz9/code-llama-7b-chatbot,' is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf), fine-tuned on a custom, QA-style dataset. It utilizes advanced techniques like LoRA and precision optimization with BitsAndBytes for efficient performance. The model is designed to answer questions about Matthew Schulz's educational background, work experience, and hobbies/interests.
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0002
- train_batch_size: 1
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 8
- total_train_batch_size: 8
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: constant
- lr_scheduler_warmup_ratio: 0.03
- num_epochs: 9
### Training results
### Framework versions
- PEFT 0.11.1
- Transformers 4.41.2
- Pytorch 2.3.0+cu121
- Datasets 2.20.0
- Tokenizers 0.19.1