Text Generation
PEFT
Safetensors
English
Korean
code
frontend
agentic
tool-calling
react
nextjs
typescript
tailwind-css
conversational
Instructions to use fnruha0921/VibeThinker-3B-Fable5-UI with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use fnruha0921/VibeThinker-3B-Fable5-UI with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("/app/previous_checkpoint") model = PeftModel.from_pretrained(base_model, "fnruha0921/VibeThinker-3B-Fable5-UI") - Notebooks
- Google Colab
- Kaggle
VibeThinker-3B: Agentic Frontend Specialist
VibeThinker-3B is a fine-tuned version of Qwen2.5-3B, specifically engineered for Agentic Frontend Development. Unlike the base model, VibeThinker-3B is trained to handle complex UI/UX tasks, follow agentic reasoning traces (Claude-style), and perform Tool Calling for automated coding workflows.
Model Details
Model Description
VibeThinker-3B is designed to bridge the gap between simple code generation and autonomous agentic behavior. It excels at creating production-ready frontend components using modern stacks (React, Next.js, Tailwind CSS, shadcn/ui) while maintaining a high level of reasoning for complex logic.
- Developed by: WeiboAI (Finetuned for Fable Frontend Project)
- Model type: Causal Language Model (LoRA Finetune)
- Language(s): English, Korean
- Finetuned from model: Qwen/Qwen2.5-3B
- Key Focus: Tool Calling, Agentic Reasoning, Frontend Orchestration
Model Sources
- Repository: HuggingFace: WeiboAI/VibeThinker-3B
Uses
Direct Use
- Agentic Coding: Generating step-by-step reasoning for complex features before implementation.
- Frontend Generation: Creating high-quality React/Next.js components with best practices.
- Tool Calling: Interacting with external APIs or local functions to automate development tasks.
- UI/UX Prototyping: Rapidly building functional UI from natural language descriptions.
Out-of-Scope Use
- Competitive programming (LeetCode style) is supported but not the primary focus.
- Heavy backend-only logic without frontend context.
Bias, Risks, and Limitations
- Tool Calling Precision: While trained for tool use, users should verify the generated JSON arguments for safety.
- Context Window: Optimized for 8,192 sequence length. Performance may degrade for extremely large codebases without RAG.
How to Get Started with the Model
from transformers import AutoModelForCausalLM, AutoTokenizer
model_id = "WeiboAI/VibeThinker-3B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
messages = [
{"role": "user", "content": "Create a responsive dashboard using Next.js and shadcn/ui."}
]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to("cuda")
outputs = model.generate(inputs, max_new_tokens=2048)
print(tokenizer.decode(outputs[0]))
Training Details
Training Data
The model was trained on a curated mix of ~11,000 high-quality samples:
- Stage 1: Agentic Reasoning (40%): Claude-style reasoning traces, CodeAlpaca, and Magicoder-Evol datasets.
- Stage 2: Frontend Specialization (40%): Specialized Next.js, React, and shadcn/ui component datasets.
- Stage 3: Tool Calling & Agentic (20%): Glaive Function Calling v2 and Hermes Function Calling datasets.
Training Procedure
- Method: LoRA (Low-Rank Adaptation)
- Sequence Length: 8,192
- Rank (R): 64
- Alpha: 128
- Learning Rate: 1e-4
- Epochs: 3
- Optimizer: AdamW with cosine schedule
Technical Specifications
- Hardware: Trained on Together AI Cloud Infrastructure.
- Framework: PEFT 0.15.1, Transformers.
Citation
BibTeX:
@misc{vibethinker2026,
author = {WeiboAI Team},
title = {VibeThinker-3B: Agentic Frontend Specialist},
year = {2026},
publisher = {HuggingFace},
journal = {HuggingFace Model Hub}
}
- Downloads last month
- 27
Model tree for fnruha0921/VibeThinker-3B-Fable5-UI
Base model
Qwen/Qwen2.5-3B