-
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 117 -
Exponentially Faster Language Modelling
Paper • 2311.10770 • Published • 117 -
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 38 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 38
Collections
Collections including paper arxiv:2311.11829
-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 38 -
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
Paper • 2311.10775 • Published • 7 -
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning
Paper • 2311.11077 • Published • 24
-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 38 -
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
Paper • 2311.10642 • Published • 23 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 69
-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 38 -
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Paper • 2311.11315 • Published • 6 -
Alignment for Honesty
Paper • 2312.07000 • Published • 11 -
Steering Llama 2 via Contrastive Activation Addition
Paper • 2312.06681 • Published • 9
-
Contrastive Chain-of-Thought Prompting
Paper • 2311.09277 • Published • 31 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 8 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 69 -
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 38
-
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
Paper • 2311.08692 • Published • 12 -
DiLoCo: Distributed Low-Communication Training of Language Models
Paper • 2311.08105 • Published • 13 -
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 38 -
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Paper • 2312.06134 • Published • 2
-
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision
Paper • 2311.02077 • Published • 14 -
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 38 -
Large Language Models for Mathematicians
Paper • 2312.04556 • Published • 11 -
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Paper • 2403.00522 • Published • 40