gemma-1b-pruned-th

Depth-pruned (layer dropping) + healing SFT of google/gemma-3-4b (unsloth mirror) for Thai.

Spec

  • Base: google/gemma-3-4b (unsloth mirror)
  • Params: 2.70B (kept 17/34 decoder layers; drop middle, keep head+tail)
  • Healing: SFT on SEA-PILE v2 Thai (~8k docs), bf16
  • Requires: transformers>=4.50, accelerate

Usage

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
m = "Chokun00032/gemma-1b-pruned-th"
tok = AutoTokenizer.from_pretrained(m)
model = AutoModelForCausalLM.from_pretrained(m, torch_dtype=torch.bfloat16, device_map="cuda")
ids = tok("ปัญญาประดิษฐ์ คือ", return_tensors="pt").to(model.device)
out = model.generate(**ids, max_new_tokens=120, do_sample=True,
                     temperature=0.7, top_p=0.9, repetition_penalty=1.3)
print(tok.decode(out[0], skip_special_tokens=True))

Notes

  • Pruned base healed on raw corpus: Thai grammar is fluent, but factual/arithmetic ability is weak.
  • Use repetition_penalty>=1.2 to avoid loops.
  • Best used as a base for further instruction fine-tuning.
Downloads last month
43
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support