Instructions to use ewald1976/gemma4-12b-it-styletuned with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Local Apps Settings
- Unsloth Studio
How to use ewald1976/gemma4-12b-it-styletuned with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ewald1976/gemma4-12b-it-styletuned to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ewald1976/gemma4-12b-it-styletuned to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for ewald1976/gemma4-12b-it-styletuned to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="ewald1976/gemma4-12b-it-styletuned", max_seq_length=2048, )
Gemma4-12b-it-styletuned
I'm still fascinated by the topic of style tuning using the Gryphe method. I have now started calling it SSFT (Supervised Style Fine-Tuning). It's not about making a model "better" at all, but rather about the countless possibilities that arise from this kind of fine-tuning. That's why I also did another run with Gemma4-12b-it and my dataset.
With Gemma, I had to be a bit more aggressive during the training.
The style change was also significant, and Gemma4-12b-it shows that I haven't changed much, if anything, about Gemma's intelligence and approach. However, the tone and expression have completely changed, in my opinion. Perhaps I'm just so caught up in a loop that I'm imagining it.
Training Setup
training:
max_seq_length: 4096
num_epochs: 3
learning_rate: 0.0004
batch_size: 2
gradient_accumulation_steps: 4
warmup_steps: 10
max_steps: 0
save_steps: 200
eval_steps: 0
weight_decay: 0.001
random_seed: 3407
packing: false
train_on_completions: true
gradient_checkpointing: unsloth
optim: adamw_8bit
lr_scheduler_type: linear
vision_image_size: null
lora:
lora_r: 128
lora_alpha: 256
lora_dropout: 0
target_modules:
- lm_head
use_rslora: false
use_loftq: false
finetune_vision_layers: false
finetune_language_layers: false
finetune_attention_modules: false
finetune_mlp_modules: false
An example comparison of vanilla and ssft is here: https://huggingface.co/ewald1976/gemma4-12b-it-styletuned/blob/main/examples.md
- Downloads last month
- 85