Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 10 days ago • 33
A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published 11 days ago • 47
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Paper • 2503.18878 • Published 7 days ago • 110
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper • 2503.16660 • Published 11 days ago • 70
Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks Paper • 2503.11514 • Published 19 days ago • 15
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 13 days ago • 131
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 20 days ago • 360
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching Paper • 2503.05179 • Published 25 days ago • 43
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper • 2503.05132 • Published 25 days ago • 52
CoSTAast: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing Paper • 2503.10613 • Published 18 days ago • 75
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published 21 days ago • 83
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 26 days ago • 221
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published Feb 20 • 97