Averroes-Q-Instruct-human55k

Arabic instruction-tuned model on 55K human-curated data.

Base: Qwen2.5-7B-Instruct Data: 55,174 human-curated Arabic instruction-conversation pairs LoRA: rank=8, scale=0.5, lr=1e-5, iters=5000 Train Loss: 1.434 (final) Training: ~45 min on M2 Ultra

Why "human55k"?

Previous versions (v1-v4) failed due to synthetic data (6M) and wrong configs. This version uses correct data + correct base + correct LoRA config.

Usage

from mlx_lm import load, generate
model, tokenizer = load("BinSaqban/Averroes-Q-Instruct-human55k")
prompt = "<|im_start|>user\nWhat is the capital of Saudi Arabia?<|im_end|>\n<|im_start|>assistant\n"
resp = generate(model, tokenizer, prompt=prompt, max_tokens=200)
print(resp)

License

Apache 2.0

Built by Hayula AI — Arabic-first, open-source, local-first.

Downloads last month
11
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support