-
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • 2402.12354 • Published • 5 -
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper • 2402.12659 • Published • 13 -
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper • 2402.13249 • Published • 10 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 62
Collections
Discover the best community collections!
Collections including paper arxiv:2403.07691
-
Iterative Reasoning Preference Optimization
Paper • 2404.19733 • Published • 41 -
Better & Faster Large Language Models via Multi-token Prediction
Paper • 2404.19737 • Published • 61 -
ORPO: Monolithic Preference Optimization without Reference Model
Paper • 2403.07691 • Published • 58 -
KAN: Kolmogorov-Arnold Networks
Paper • 2404.19756 • Published • 96
-
ORPO: Monolithic Preference Optimization without Reference Model
Paper • 2403.07691 • Published • 58 -
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
Paper • 2404.07738 • Published • 2 -
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Paper • 2405.01535 • Published • 102
-
A General Theoretical Paradigm to Understand Learning from Human Preferences
Paper • 2310.12036 • Published • 11 -
ORPO: Monolithic Preference Optimization without Reference Model
Paper • 2403.07691 • Published • 58 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 37