BatiAI

company

https://flow.bati.ai/

rparo20

batiai

Activity Feed

AI & ML interests

On-device AI, GGUF quantization, Apple Silicon, macOS automation

Recent Activity

hero775 updated a model about 12 hours ago

batiai/DeepSeek-V4-Pro-GGUF

hero775 updated a collection 3 days ago

🚀 Frontier MoE — 128B–1T

hero775 updated a model 4 days ago

batiai/Llama-4-Scout-17B-16E-Instruct-GGUF

View all activity

batiai 's collections 6

🧠 NVIDIA Nemotron 3 — Hybrid Mamba+Attention

NVIDIA Nemotron 3 family — NemotronH architecture combining Mamba state-space + standard attention. Mac-runnable, BatiAI-quantized + signed.

batiai/Nemotron-3-Super-120B-A12B-GGUF

Text Generation • 121B • Updated 22 days ago • 814
batiai/Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated 17 days ago • 1.54k
batiai/Nemotron-3-Nano-Omni-30B-Text-Only-GGUF

Text Generation • 32B • Updated 12 days ago • 2.03k

⚡ Qwen 3.6 — Tools, Thinking, Vision

Latest Qwen 3.6 series with native tool calling, thinking mode, and Vision-Language. Best balance for 48-128GB Macs.

batiai/Qwen3.6-27B-GGUF

Text Generation • 27B • Updated 17 days ago • 28.3k • 2
batiai/Qwen3.6-35B-A3B-GGUF

Text Generation • 35B • Updated 17 days ago • 9.99k • 2

🐉 Qwen 3.5 — Alibaba Stable

Qwen 3.5 dense and MoE quantizations. Reliable tool calling and JSON generation.

batiai/Qwen3.5-9B-GGUF

Text Generation • 9B • Updated 17 days ago • 564
batiai/Qwen3.5-27B-GGUF

Text Generation • 27B • Updated 18 days ago • 396
batiai/Qwen3.5-35B-A3B-GGUF

Text Generation • 35B • Updated 18 days ago • 489

🚀 Frontier MoE — 128B–1T

Largest open-weight LLMs, BatiAI-quantized. Mac-runnable from M4 Max 128GB to Mac Studio M3 Ultra 512GB.

batiai/DeepSeek-V4-Pro-GGUF

Text Generation • 1.6T • Updated about 12 hours ago • 1.51k • 1
batiai/Kimi-K2.6-GGUF

Text Generation • 1T • Updated 21 days ago • 7.41k
batiai/GLM-5.1-GGUF

Text Generation • 754B • Updated 14 days ago • 1.46k
batiai/DeepSeek-V4-Flash-GGUF

Text Generation • 284B • Updated 18 days ago • 4.52k • 4

🍎 Gemma 4 — Google's Latest

Gemma 4 quantizations from Google's official weights. Best entry for 16GB Mac mini M4 (E4B Q4 = 57 t/s).

batiai/gemma-4-E2B-it-GGUF

Text Generation • 5B • Updated Apr 18 • 801 • 1
batiai/gemma-4-E4B-it-GGUF

Text Generation • 8B • Updated Apr 18 • 1.06k • 4
batiai/Gemma-4-26B-A4B-it-GGUF

Text Generation • 25B • Updated 17 days ago • 3.8k • 2
batiai/gemma-4-31B-it-GGUF

Text Generation • 31B • Updated Apr 18 • 516

BatiAI RAG Stack

Complete Mac-first on-device RAG stack — chat LLM + reranker + text/VL embedder, direct from BF16, BatiAI-signed. For BatiFlow.

batiai/Qwen3-Embedding-4B-GGUF

Sentence Similarity • 4B • Updated Apr 19 • 95
batiai/Qwen3-Embedding-0.6B-GGUF

Sentence Similarity • 0.6B • Updated Apr 19 • 171 • 1
batiai/Qwen3-VL-Embedding-8B-GGUF

Sentence Similarity • 8B • Updated Apr 18 • 1.83k • 6
batiai/Qwen3-VL-Embedding-2B-GGUF

Sentence Similarity • 2B • Updated Apr 17 • 248

🧠 NVIDIA Nemotron 3 — Hybrid Mamba+Attention

NVIDIA Nemotron 3 family — NemotronH architecture combining Mamba state-space + standard attention. Mac-runnable, BatiAI-quantized + signed.

batiai/Nemotron-3-Super-120B-A12B-GGUF

Text Generation • 121B • Updated 22 days ago • 814
batiai/Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated 17 days ago • 1.54k
batiai/Nemotron-3-Nano-Omni-30B-Text-Only-GGUF

Text Generation • 32B • Updated 12 days ago • 2.03k

🚀 Frontier MoE — 128B–1T

Largest open-weight LLMs, BatiAI-quantized. Mac-runnable from M4 Max 128GB to Mac Studio M3 Ultra 512GB.

batiai/DeepSeek-V4-Pro-GGUF

Text Generation • 1.6T • Updated about 12 hours ago • 1.51k • 1
batiai/Kimi-K2.6-GGUF

Text Generation • 1T • Updated 21 days ago • 7.41k
batiai/GLM-5.1-GGUF

Text Generation • 754B • Updated 14 days ago • 1.46k
batiai/DeepSeek-V4-Flash-GGUF

Text Generation • 284B • Updated 18 days ago • 4.52k • 4

⚡ Qwen 3.6 — Tools, Thinking, Vision

Latest Qwen 3.6 series with native tool calling, thinking mode, and Vision-Language. Best balance for 48-128GB Macs.

batiai/Qwen3.6-27B-GGUF

Text Generation • 27B • Updated 17 days ago • 28.3k • 2
batiai/Qwen3.6-35B-A3B-GGUF

Text Generation • 35B • Updated 17 days ago • 9.99k • 2

🍎 Gemma 4 — Google's Latest

Gemma 4 quantizations from Google's official weights. Best entry for 16GB Mac mini M4 (E4B Q4 = 57 t/s).

batiai/gemma-4-E2B-it-GGUF

Text Generation • 5B • Updated Apr 18 • 801 • 1
batiai/gemma-4-E4B-it-GGUF

Text Generation • 8B • Updated Apr 18 • 1.06k • 4
batiai/Gemma-4-26B-A4B-it-GGUF

Text Generation • 25B • Updated 17 days ago • 3.8k • 2
batiai/gemma-4-31B-it-GGUF

Text Generation • 31B • Updated Apr 18 • 516

🐉 Qwen 3.5 — Alibaba Stable

Qwen 3.5 dense and MoE quantizations. Reliable tool calling and JSON generation.

batiai/Qwen3.5-9B-GGUF

Text Generation • 9B • Updated 17 days ago • 564
batiai/Qwen3.5-27B-GGUF

Text Generation • 27B • Updated 18 days ago • 396
batiai/Qwen3.5-35B-A3B-GGUF

Text Generation • 35B • Updated 18 days ago • 489

BatiAI RAG Stack

Complete Mac-first on-device RAG stack — chat LLM + reranker + text/VL embedder, direct from BF16, BatiAI-signed. For BatiFlow.

batiai/Qwen3-Embedding-4B-GGUF

Sentence Similarity • 4B • Updated Apr 19 • 95
batiai/Qwen3-Embedding-0.6B-GGUF

Sentence Similarity • 0.6B • Updated Apr 19 • 171 • 1
batiai/Qwen3-VL-Embedding-8B-GGUF

Sentence Similarity • 8B • Updated Apr 18 • 1.83k • 6
batiai/Qwen3-VL-Embedding-2B-GGUF

Sentence Similarity • 2B • Updated Apr 17 • 248

AI & ML interests

Recent Activity

Team members 1

batiai 's collections 6