PEFT Methods📚
Collection
A collection of difference fine tunes based on Google's state-of-the-art model -- Gemma 2 9B -- on the dataset Alpaca-cleaned by Yahma.
•
3 items
•
Updated
This gemma2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
This model is fine tuned from unsloth/gemma-2-9b-bnb-4bit on the alpaca-cleaned dataset using the rsLoRA method.
This model achieved a loss of 0.829200 on the alpaca-cleaned dataset after step 120.
This model follows the alpaca prompt:
Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
### Instruction:
{}
### Input:
{}
### Response:
{}
This model is trained on a single Tesla T4 GPU.
Base model
unsloth/gemma-2-9b-bnb-4bit