-
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Paper • 2310.04484 • Published • 4 -
Diversity of Thought Improves Reasoning Abilities of Large Language Models
Paper • 2310.07088 • Published • 4 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 69 -
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Paper • 2310.13332 • Published • 14
Collections
Discover the best community collections!
Collections including paper arxiv:2311.07911
-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 135 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper • 2401.08967 • Published • 26 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 19 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 62
-
Instruction-Following Evaluation for Large Language Models
Paper • 2311.07911 • Published • 17 -
HuggingFaceH4/mt_bench_prompts
Viewer • Updated • 3.94k • 7 -
vectara/hallucination_evaluation_model
Text Classification • Updated • 15.4k • 163 -
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 170
-
Holistic Evaluation of Text-To-Image Models
Paper • 2311.04287 • Published • 10 -
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Paper • 2311.07463 • Published • 13 -
Trusted Source Alignment in Large Language Models
Paper • 2311.06697 • Published • 9 -
DiLoCo: Distributed Low-Communication Training of Language Models
Paper • 2311.08105 • Published • 13
-
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
Paper • 2303.16634 • Published • 1 -
miracl/miracl-corpus
Viewer • Updated • 6.25k • 39 -
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Paper • 2306.05685 • Published • 20 -
How is ChatGPT's behavior changing over time?
Paper • 2307.09009 • Published • 22
-
ChatAnything: Facetime Chat with LLM-Enhanced Personas
Paper • 2311.06772 • Published • 33 -
Fine-tuning Language Models for Factuality
Paper • 2311.08401 • Published • 26 -
A Survey on Language Models for Code
Paper • 2311.07989 • Published • 20 -
Instruction-Following Evaluation for Large Language Models
Paper • 2311.07911 • Published • 17