mkschulz9
/

code-llama-7b-chatbot

Text Generation

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

code-llama-7b-chatbot / README.md

mkschulz9's picture

Update README.md

7ab40a7 verified 3 months ago

|

history blame contribute delete

No virus

1.37 kB

	---
	base_model: codellama/CodeLlama-7b-hf
	library_name: peft
	license: llama2
	tags:
	- trl
	- sft
	- generated_from_trainer
	model-index:
	- name: code-llama-7b-chatbot
	results: []
	pipeline_tag: text-generation
	---

	# code-llama-7b-chatbot

	This model, 'mkschulz9/code-llama-7b-chatbot,' is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf), fine-tuned on a custom, QA-style dataset. It utilizes advanced techniques like LoRA and precision optimization with BitsAndBytes for efficient performance. The model is designed to answer questions about Matthew Schulz's educational background, work experience, and hobbies/interests.
	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 0.0002
	- train_batch_size: 1
	- eval_batch_size: 8
	- seed: 42
	- gradient_accumulation_steps: 8
	- total_train_batch_size: 8
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: constant
	- lr_scheduler_warmup_ratio: 0.03
	- num_epochs: 9

	### Training results



	### Framework versions

	- PEFT 0.11.1
	- Transformers 4.41.2
	- Pytorch 2.3.0+cu121
	- Datasets 2.20.0
	- Tokenizers 0.19.1