Instructions to use Jenil05/Aether-1.5B-Agentic-core with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Local Apps Settings
- Unsloth Studio
How to use Jenil05/Aether-1.5B-Agentic-core with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Jenil05/Aether-1.5B-Agentic-core to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Jenil05/Aether-1.5B-Agentic-core to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Jenil05/Aether-1.5B-Agentic-core to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="Jenil05/Aether-1.5B-Agentic-core", max_seq_length=2048, )
Aether-1.5B-Agentic-core
Aether-1.5B-Agentic-core is an elite, edge-native language model explicitly optimized to serve as the core routing hub and tool-execution engine for autonomous multi-agent orchestration frameworks (such as CrewAI, LangChain, and AutoGen).
By optimizing the attention heads specifically for API schemas, this model bridges a massive architectural gap: enabling localized, low-latency deployment without sacrificing structural integrity during code output steps.
⚡ Technical Specifications
| Feature | System Configuration |
|---|---|
| Model Blueprint | Qwen-2.5 (Instruct variant backplane) |
| Parameter Volume | 1.54 Billion |
| Context Window | 4,096 Tokens |
| Quantization Format | Un-quantized; natively merged back into 16-bit Float (fp16) |
| Inference VRAM Profile | ~3.5 GB (Highly accessible for consumer hardware) |
| Primary Specialty | Deterministic JSON-Schema Parsing & Argument Extraction |
🛠️ The Fine-Tuning Pipeline
Standard small language models (<3B parameters) are notoriously fragile when handling code parameters—they frequently introduce syntax hallucinations, drop trailing brackets, or introduce verbose conversational fluff ("Sure, let me call that function for you!") that instantly breaks automated software parsers.
🧠 Training Strategy & Hyperparameters
Aether-1.5B-Agentic-core was constructed via Parameter-Efficient Fine-Tuning (PEFT) using Unsloth:
- Quantization Method: 4-bit QLoRA target module injection to maximize gradient headroom.
- Target Modules: Complete attention mechanism coverage (
q_proj,k_proj,v_proj,o_proj,gate_proj,up_proj,down_proj). - Dataset Alignment: Conditioned exclusively on high-fidelity multi-turn function invocation layouts from
NousResearch/hermes-function-calling-v1. - Baking Protocol: Merged permanently back into structural base layers (
merged_16bit), eliminating adapter latency overhead entirely.
⚙️ System Infrastructure & Stack Badges
🚀 Behavioral Traits & Core Abilities
- Deterministic Structured Layouts: Hardened against schema syntax decay, guaranteeing valid, clean, parseable JSON payload extractions.
- Zero-Dialogue Overhead: Stripped of non-operational text arrays. The model targets raw arguments instantly, cutting down execution latency and compute token costs.
- Strict Data-Type Preservation: Natively correlates natural text variables into explicit system-level parameters (e.g., matching raw strings directly to accurate
int,boolean, orarrayproperties).
💻 Implementation & Inference Sample
To trigger the tool-calling pathway natively, structure your system prompt with clear tool boundaries.
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
model_id = "Jenil05/Aether-1.5B-Agentic-core"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16, device_map="auto")
messages =
- Downloads last month
- 68
Model tree for Jenil05/Aether-1.5B-Agentic-core
Base model
Qwen/Qwen2.5-1.5B