LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published 19 days ago • 161
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published 19 days ago • 92
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published 20 days ago • 177
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 19 days ago • 95
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles Paper • 2502.09082 • Published 27 days ago • 27
Improving Transformer World Models for Data-Efficient RL Paper • 2502.01591 • Published Feb 3 • 9
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper • 2501.17433 • Published Jan 29 • 9
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published Jan 28 • 26
Evolution and The Knightian Blindspot of Machine Learning Paper • 2501.13075 • Published Jan 22 • 6
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published Jan 22 • 68
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering Paper • 2501.05131 • Published Jan 9 • 34
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 275
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published Jan 14 • 57
YuLan-Mini: An Open Data-efficient Language Model Paper • 2412.17743 • Published Dec 23, 2024 • 65
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks Paper • 2412.18072 • Published Dec 24, 2024 • 18
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation Paper • 2412.18176 • Published Dec 24, 2024 • 15
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published Dec 23, 2024 • 30