Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward Paper • 2504.03206 • Published 27 days ago • 1
Interpreting Emergent Planning in Model-Free Reinforcement Learning Paper • 2504.01871 • Published 28 days ago • 12
Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light Paper • 2504.16922 • Published 7 days ago • 1
Representation Learning with Contrastive Predictive Coding Paper • 1807.03748 • Published Jul 10, 2018 • 1
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 12 days ago • 116
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models Paper • 2403.19647 • Published Mar 28, 2024 • 4
SelfCP: Compressing Long Prompt to 1/12 Using the Frozen Large Language Model Itself Paper • 2405.17052 • Published May 27, 2024 • 2
Learning-Order Autoregressive Models with Application to Molecular Graph Generation Paper • 2503.05979 • Published Mar 7 • 2
Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure Paper • 2504.01928 • Published 28 days ago • 1
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test Paper • 2503.01840 • Published Mar 3 • 5
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published 22 days ago • 107
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 22 days ago • 156
Value Residual Learning For Alleviating Attention Concentration In Transformers Paper • 2410.17897 • Published Oct 23, 2024 • 9
Flex Attention: A Programming Model for Generating Optimized Attention Kernels Paper • 2412.05496 • Published Dec 7, 2024 • 1
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published 29 days ago • 21
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning Paper • 2504.00254 • Published 30 days ago • 1
Representation & Optimization Collection • Understanding representation sheds light on optimization • 21 items • Updated 2 days ago • 1