SOMGPT

somgpt-base is a Somali causal language model continued from google/gemma-3-270m and trained on maanka2/somali-web-corpus.

Model Details

  • Developer: maanka2
  • Architecture: Gemma 3 (270M)
  • Model Type: Causal Language Model
  • Language: Somali
  • Base Model: google/gemma-3-270m
  • Dataset: maanka2/somali-web-corpus
  • License: gemma

Overview

This model was further pre-trained on Somali web text to improve its understanding of Somali vocabulary, grammar, spelling, and writing patterns.

somgpt is a base language model designed for text continuation and language modeling. It is not instruction-tuned and is not optimized for chat, question answering, or assistant-style interactions.

For conversational AI or task-specific applications, additional supervised fine-tuning (SFT) or instruction tuning is recommended.

Training Data

Training was performed using maanka2/somali-web-corpus, a collection of cleaned Somali-language web content gathered from various online sources.

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "maanka2/somgpt"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "Soomaaliya waa dal ku yaal"
inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.1,
    top_p=0.95
)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Downloads last month
99
Safetensors
Model size
0.3B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for maanka2/somgpt

Finetuned
(146)
this model

Dataset used to train maanka2/somgpt

Space using maanka2/somgpt 1