- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 135
- ReFT: Reasoning with Reinforced Fine-Tuning
  Paper • 2401.08967 • Published • 26
- Tuning Language Models by Proxy
  Paper • 2401.08565 • Published • 19
- TrustLLM: Trustworthiness in Large Language Models
  Paper • 2401.05561 • Published • 62
Collections including paper arxiv:2311.07989
- A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 20
- SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
  Paper • 2310.06770 • Published • 3
- CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
  Paper • 2401.03065 • Published • 10
- Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
  Paper • 2402.14261 • Published • 10

- Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
  Paper • 2312.03818 • Published • 31
- Scaling Laws of Synthetic Images for Model Training ... for Now
  Paper • 2312.04567 • Published • 7
- Large Language Models for Mathematicians
  Paper • 2312.04556 • Published • 11
- LooseControl: Lifting ControlNet for Generalized Depth Conditioning
  Paper • 2312.03079 • Published • 12

- Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
  Paper • 2311.07590 • Published • 15
- A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 20
- Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
  Paper • 2311.08877 • Published • 5
- A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise
  Paper • 2312.12436 • Published • 12

- ChatAnything: Facetime Chat with LLM-Enhanced Personas
  Paper • 2311.06772 • Published • 33
- Fine-tuning Language Models for Factuality
  Paper • 2311.08401 • Published • 26
- A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 20
- Instruction-Following Evaluation for Large Language Models
  Paper • 2311.07911 • Published • 17

- A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 20
- The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
  Paper • 2311.07361 • Published • 11
- Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
  Paper • 2311.07590 • Published • 15
- Model Cards for Model Reporting
  Paper • 1810.03993 • Published • 3

- Levels of AGI: Operationalizing Progress on the Path to AGI
  Paper • 2311.02462 • Published • 30
- Ultra-Long Sequence Distributed Transformer
  Paper • 2311.02382 • Published • 2
- A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 20
- GRIM: GRaph-based Interactive narrative visualization for gaMes
  Paper • 2311.09213 • Published • 11