Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
16 days ago
google/gemma-3n-E4B-it
liked
a model
29 days ago
ai21labs/AI21-Jamba-Mini-1.6
liked
a model
3 months ago
sand-ai/MAGI-1