Instructions to use AhiskaAI/AhiskaAI-25m-Base-v0.1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use AhiskaAI/AhiskaAI-25m-Base-v0.1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="AhiskaAI/AhiskaAI-25m-Base-v0.1")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("AhiskaAI/AhiskaAI-25m-Base-v0.1") model = AutoModelForCausalLM.from_pretrained("AhiskaAI/AhiskaAI-25m-Base-v0.1") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use AhiskaAI/AhiskaAI-25m-Base-v0.1 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "AhiskaAI/AhiskaAI-25m-Base-v0.1" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "AhiskaAI/AhiskaAI-25m-Base-v0.1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/AhiskaAI/AhiskaAI-25m-Base-v0.1
- SGLang
How to use AhiskaAI/AhiskaAI-25m-Base-v0.1 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "AhiskaAI/AhiskaAI-25m-Base-v0.1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "AhiskaAI/AhiskaAI-25m-Base-v0.1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "AhiskaAI/AhiskaAI-25m-Base-v0.1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "AhiskaAI/AhiskaAI-25m-Base-v0.1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use AhiskaAI/AhiskaAI-25m-Base-v0.1 with Docker Model Runner:
docker model run hf.co/AhiskaAI/AhiskaAI-25m-Base-v0.1
AhiskaAI 25M Base v0.1 (Pre-trained From Scratch)
AhiskaAI 25M Base v0.1 is an ultra-lightweight, foundational causal language model trained entirely from scratch for the Turkish language ecosystem.
With only ~25 Million active parameters, this model was initialized from absolute zero to explore the lower limits of grammatical compression, syntax mapping, and localized semantic density within highly constrained computational environments. It serves as the raw, unaligned backbone for the AhıskaAI research pipeline.
🧠 Training Profile & Dataset
Despite its miniature footprint, the model captures core Turkish morphology and factual token paths due to a highly strategic, clean data mixture of a 5.3 GB base corpus:
- CulturaX (Turkish Split): Utilized for deep web-scale text distributions, teaching the network core token connectivity, basic sentence boundaries, and general Turkish vocabulary.
- Custom Filtered Wikipedia (75 MB): A highly curated, hand-filtered subset of Turkish Wikipedia explicitly processed by AhıskaAI to target rich historical timelines, cultural identity milestones, and factual knowledge. This dense sub-matrix is the primary driver behind the model's factual recall capabilities despite its size.
💻 Hardware & Infrastructure
- Hardware: NVIDIA GeForce RTX 4050 Laptop GPU (6GB VRAM)
- Training Depth: Trained under strict local VRAM constraints with an indie "Build in Public" ethos.
🛠️ Quickstart & Inference
You can easily load and run text generation using the Hugging Face transformers library.
from transformers import GPT2LMHeadModel, AutoTokenizer
import torch
model_name = "AhiskaAI/AhiskaAI-25m-Base-v0.1"
# Load model and custom Turkish tokenizer
model = GPT2LMHeadModel.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)
# Configure pad token for GPT-2 architecture
tokenizer.pad_token = tokenizer.eos_token
# Sample generation
prompt = "Ahıska Türkleri,"
inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
outputs = model.generate(
**inputs,
max_length=100,
do_sample=True,
top_k=50,
top_p=0.95,
temperature=0.7,
no_repeat_ngram_size=2
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
⚠️ Model Limitations & Intended Use Pure Base Model: This is a raw text completer and has not been aligned using Instruction Fine-Tuning (SFT) or RLHF. It will not act as a conversational chatbot out of the box and might loop sequences if not sampled correctly.
Intended Use: This model is highly receptive to immediate downstream SFT fine-tuning, vocabulary adaptations, or continuation of pre-training checkpoints.
For the aligned chat version, please check out: AhiskaAI-25m-Chat-v0.1-Experimental
- Downloads last month
- 113