EfficientQAT: Efficient Quantization-Aware Training for Large Language Models Paper • 2407.11062 • Published 12 days ago • 3
RegMix: Data Mixture as Regression for Language Model Pre-training Paper • 2407.01492 • Published 21 days ago • 30
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Paper • 2407.06189 • Published 14 days ago • 24
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models Paper • 2406.09403 • Published Jun 13 • 18
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published 25 days ago • 29
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models Paper • 2405.14831 • Published May 23 • 2
Unlocking Continual Learning Abilities in Language Models Paper • 2406.17245 • Published 27 days ago • 28
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published 24 days ago • 84
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation Paper • 2406.19215 • Published 25 days ago • 28
Simulating Classroom Education with LLM-Empowered Agents Paper • 2406.19226 • Published 25 days ago • 28
Octo-planner: On-device Language Model for Planner-Action Agents Paper • 2406.18082 • Published 26 days ago • 47
Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network Paper • 2406.15109 • Published about 1 month ago • 1
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published Jun 19 • 16
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20 • 76
HARE: HumAn pRiors, a key to small language model Efficiency Paper • 2406.11410 • Published Jun 17 • 38
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper • 2406.11896 • Published Jun 14 • 18
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning Paper • 2406.14283 • Published Jun 20 • 2
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning Paper • 2406.06469 • Published Jun 10 • 22
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published Jun 17 • 12
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5 • 17
In-Context Editing: Learning Knowledge from Self-Induced Distributions Paper • 2406.11194 • Published Jun 17 • 15
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Paper • 2405.20541 • Published May 30 • 19
How Do Large Language Models Acquire Factual Knowledge During Pretraining? Paper • 2406.11813 • Published Jun 17 • 29
Show, Don't Tell: Aligning Language Models with Demonstrated Feedback Paper • 2406.00888 • Published Jun 2 • 29
What's the Magic Word? A Control Theory of LLM Prompting Paper • 2310.04444 • Published Oct 2, 2023 • 1
Understanding Transformer Reasoning Capabilities via Graph Algorithms Paper • 2405.18512 • Published May 28 • 1
Contextual Position Encoding: Learning to Count What's Important Paper • 2405.18719 • Published May 29 • 4
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct Paper • 2405.14906 • Published May 23 • 21
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models Paper • 2405.12939 • Published May 21 • 1
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20 • 44
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Paper • 2405.05904 • Published May 9 • 5
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models Paper • 2405.09220 • Published May 15 • 23
The Consensus Game: Language Model Generation via Equilibrium Search Paper • 2310.09139 • Published Oct 13, 2023 • 12
Aligning LLM Agents by Learning Latent Preference from User Edits Paper • 2404.15269 • Published Apr 23 • 1
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs Paper • 2310.01801 • Published Oct 3, 2023 • 3
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations Paper • 2303.02536 • Published Mar 5, 2023 • 1
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 109
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 59