LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published 21 days ago • 162
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published 22 days ago • 93
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published 22 days ago • 179
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 22 days ago • 97
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles Paper • 2502.09082 • Published 29 days ago • 28
Improving Transformer World Models for Data-Efficient RL Paper • 2502.01591 • Published Feb 3 • 9
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper • 2501.17433 • Published Jan 29 • 9
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published Jan 28 • 26