Understanding Transformer Reasoning Capabilities via Graph Algorithms Paper • 2405.18512 • Published 6 days ago • 1
Contextual Position Encoding: Learning to Count What's Important Paper • 2405.18719 • Published 6 days ago • 3
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct Paper • 2405.14906 • Published 12 days ago • 18
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models Paper • 2405.12939 • Published 13 days ago • 1
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published 14 days ago • 42
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Paper • 2405.05904 • Published 25 days ago • 5
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models Paper • 2405.09220 • Published 19 days ago • 23
The Consensus Game: Language Model Generation via Equilibrium Search Paper • 2310.09139 • Published Oct 13, 2023 • 12
Chain of Thoughtlessness: An Analysis of CoT in Planning Paper • 2405.04776 • Published 27 days ago • 1
Aligning LLM Agents by Learning Latent Preference from User Edits Paper • 2404.15269 • Published Apr 23 • 1
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs Paper • 2310.01801 • Published Oct 3, 2023 • 3
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations Paper • 2303.02536 • Published Mar 5, 2023 • 1
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 103
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 58
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30 • 64
NExT: Teaching Large Language Models to Reason about Code Execution Paper • 2404.14662 • Published Apr 23 • 4
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25 • 56
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Perfect Reasoners Paper • 2404.14963 • Published Apr 23 • 1
SnapKV: LLM Knows What You are Looking for Before Generation Paper • 2404.14469 • Published Apr 22 • 23
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Paper • 2404.14507 • Published Apr 22 • 21
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Paper • 2404.12318 • Published Apr 18 • 14
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding Paper • 2404.11912 • Published Apr 18 • 16
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Paper • 2404.12253 • Published Apr 18 • 51
From r to Q^*: Your Language Model is Secretly a Q-Function Paper • 2404.12358 • Published Apr 18 • 2
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19 • 50
Symbol tuning improves in-context learning in language models Paper • 2305.08298 • Published May 15, 2023 • 2
THOUGHTSCULPT: Reasoning with Intermediate Revision and Search Paper • 2404.05966 • Published Apr 9 • 1
Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction Paper • 2402.02416 • Published Feb 4 • 3
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies Paper • 2404.06395 • Published Apr 9 • 18
Best Practices and Lessons Learned on Synthetic Data for Language Models Paper • 2404.07503 • Published Apr 11 • 24
WILBUR: Adaptive In-Context Learning for Robust and Accurate Web Agents Paper • 2404.05902 • Published Apr 8 • 20
RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models Paper • 2402.10038 • Published Feb 15 • 6
CodecLM: Aligning Language Models with Tailored Synthetic Data Paper • 2404.05875 • Published Apr 8 • 15
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence Paper • 2404.05892 • Published Apr 8 • 28
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders Paper • 2404.05961 • Published Apr 9 • 62
Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model Paper • 2403.11621 • Published Mar 18 • 2
Model Editing Can Hurt General Abilities of Large Language Models Paper • 2401.04700 • Published Jan 9 • 3
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4 • 58
Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models Paper • 2404.03622 • Published Apr 4 • 4