-
Detecting Pretraining Data from Large Language Models
Paper • 2310.16789 • Published • 9 -
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
Paper • 2310.13671 • Published • 17 -
AutoMix: Automatically Mixing Language Models
Paper • 2310.12963 • Published • 14 -
An Emulator for Fine-Tuning Large Language Models using Small Language Models
Paper • 2310.12962 • Published • 13
Collections
Discover the best community collections!
Collections including paper arxiv:2309.11495
-
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Paper • 2309.16414 • Published • 19 -
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Paper • 2309.13018 • Published • 9 -
Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 12 -
Language models in molecular discovery
Paper • 2309.16235 • Published • 10
-
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 37 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 61 -
Aligning Large Multimodal Models with Factually Augmented RLHF
Paper • 2309.14525 • Published • 29 -
SCREWS: A Modular Framework for Reasoning with Revisions
Paper • 2309.13075 • Published • 15
-
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 37 -
CodePlan: Repository-level Coding using LLMs and Planning
Paper • 2309.12499 • Published • 69 -
SCREWS: A Modular Framework for Reasoning with Revisions
Paper • 2309.13075 • Published • 15 -
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 50
-
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper • 2309.12307 • Published • 83 -
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 37 -
Knowledge-Driven CoT: Exploring Faithful Reasoning in LLMs for Knowledge-intensive Question Answering
Paper • 2308.13259 • Published • 2 -
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Paper • 2309.06794 • Published • 1
-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 80 -
Baichuan 2: Open Large-scale Language Models
Paper • 2309.10305 • Published • 16 -
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 37 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 61
-
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 37 -
EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Paper • 2310.08185 • Published • 6 -
The Consensus Game: Language Model Generation via Equilibrium Search
Paper • 2310.09139 • Published • 12 -
In-Context Pretraining: Language Modeling Beyond Document Boundaries
Paper • 2310.10638 • Published • 26