AI & ML interests

None defined yet.

Recent Activity

azaiatsΒ  updated a model 1 day ago
aipster/DevRouter-1.5B
azaiatsΒ  updated a Space 1 day ago
aipster/README
azaiatsΒ  updated a model 1 day ago
aipster/DevRouter-1.5B-GGUF
View all activity

Organization Card

AIpster

An independent think tank on artificial intelligence, society, and the future of thought.

We're a collective of computer science friends from the late '90s who turned a WhatsApp group into a laboratory for exploring what AI is doing to how we work, build, and think.

🌐 aipster.com


What we do here

This Hugging Face organization is where we publish the artifacts of our exploration β€” models, datasets, and tools that come out of the experiments we write about on our blog.

We're not a company. We don't sell anything. We build things to understand them, then share what we learned.


Focus areas

  • πŸ”¬ Small specialist models β€” distillation, fine-tuning, and the art of making tiny models punch above their weight
  • 🧭 Prompt engineering & routing β€” how prompts become infrastructure, not just text
  • πŸ› οΈ Local LLM workflows β€” what 96 GB of VRAM can (and can't) do
  • πŸ€– Coding agents & automation β€” how AI is reshaping software development from the inside out
  • πŸ“– AI & society β€” the uncomfortable conversations the industry would rather skip

What you'll find here

Models

DevRouter-1.5B β€” our first release. A tiny prompt router that reads a raw developer prompt and returns a single JSON decision: a cleaned-up rewrite, an intent / complexity classification, a suggested model-tier route, and the context the prompt forgot to include. Built on Qwen2.5-Coder-1.5B (Apache 2.0) and distilled from a stronger teacher, it holds ~96% valid-JSON and runs at ~280 tokens/s on a single RTX 3090 β€” small enough to sit in front of your real models and triage every prompt in 1–3 seconds.

And one honest caveat, because we ship those too: Q6 and below quantizations break its JSON. A small model doing strict structured output is far more fragile than the "Q4 is fine" rule of thumb suggests β€” ship Q8_0 or F16.

Datasets

Coming soon β€” curated and synthetic datasets from our distillation experiments, released alongside the models that use them.

Spaces

Coming soon β€” interactive demos of our experiments.


Read our work


Philosophy

We build to understand. We share to learn together.

Everything we publish here is open. Code, weights, datasets, methodology β€” including the failures. Especially the failures.


Get in touch


Independent. Curious. Slightly skeptical.

datasets 0

None public yet