Edit model card

DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Datasets

Trained on bagel style DPO datasets

Prompt Template

Uses chatml style prompt template

Downloads last month
1,421
Safetensors
Model size
1.1B params
Tensor type
FP16
·
Inference API
Input a message to start chatting with vihangd/DopeyTinyLlama-1.1B-v1.
This model can be loaded on Inference API (serverless).