Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming Paper • 2402.14261 • Published Feb 22, 2024 • 10
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Paper • 2402.13249 • Published Feb 20, 2024 • 10
Self-Discover: Large Language Models Self-Compose Reasoning Structures Paper • 2402.03620 • Published Feb 6, 2024 • 109
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Paper • 2310.11511 • Published Oct 17, 2023 • 74
Generate rather than Retrieve: Large Language Models are Strong Context Generators Paper • 2209.10063 • Published Sep 21, 2022 • 1
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation Paper • 2312.14187 • Published Dec 20, 2023 • 49
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM Paper • 2401.02994 • Published Jan 4, 2024 • 47
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 179
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 8
Time is Encoded in the Weights of Finetuned Language Models Paper • 2312.13401 • Published Dec 20, 2023 • 19
CodePlan: Repository-level Coding using LLMs and Planning Paper • 2309.12499 • Published Sep 21, 2023 • 73
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization Paper • 2308.09716 • Published Aug 18, 2023 • 2
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback Paper • 2309.00267 • Published Sep 1, 2023 • 47
YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 65
RoFormer: Enhanced Transformer with Rotary Position Embedding Paper • 2104.09864 • Published Apr 20, 2021 • 10
SoTaNa: The Open-Source Software Development Assistant Paper • 2308.13416 • Published Aug 25, 2023 • 11
Nougat: Neural Optical Understanding for Academic Documents Paper • 2308.13418 • Published Aug 25, 2023 • 35
Grammar Prompting for Domain-Specific Language Generation with Large Language Models Paper • 2305.19234 • Published May 30, 2023 • 3
Flamingo: a Visual Language Model for Few-Shot Learning Paper • 2204.14198 • Published Apr 29, 2022 • 14
Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis Paper • 2101.04775 • Published Jan 12, 2021 • 1
RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models Paper • 2308.07922 • Published Aug 15, 2023 • 17
Clickbait Classification and Spoiling Using Natural Language Processing Paper • 2306.14907 • Published Jun 16, 2023 • 1
Platypus: Quick, Cheap, and Powerful Refinement of LLMs Paper • 2308.07317 • Published Aug 14, 2023 • 23
Efficient Guided Generation for Large Language Models Paper • 2307.09702 • Published Jul 19, 2023 • 8
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes Paper • 2302.06587 • Published Feb 13, 2023 • 2
SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval Paper • 2109.10086 • Published Sep 21, 2021 • 1
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment Paper • 2308.05374 • Published Aug 10, 2023 • 27
Training Diffusion Models with Reinforcement Learning Paper • 2305.13301 • Published May 22, 2023 • 3
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Paper • 2308.05734 • Published Aug 10, 2023 • 36