FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance Paper • 2305.05176 • Published May 9, 2023 • 2 • 3
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published 22 days ago • 59 • 17
SaulLM-7B: A pioneering Large Language Model for Law Paper • 2403.03883 • Published 22 days ago • 62 • 4
Full Parameter Fine-tuning for Large Language Models with Limited Resources Paper • 2306.09782 • Published Jun 16, 2023 • 28 • 4
Evaluating Very Long-Term Conversational Memory of LLM Agents Paper • 2402.17753 • Published 30 days ago • 15 • 3
Defending LLMs against Jailbreaking Attacks via Backtranslation Paper • 2402.16459 • Published Feb 26 • 2 • 1
Improving Classification Performance With Human Feedback: Label a few, we label the rest Paper • 2401.09555 • Published Jan 17 • 6 • 2
Scaling Laws for Downstream Task Performance of Large Language Models Paper • 2402.04177 • Published Feb 6 • 15 • 4
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning Paper • 2401.06532 • Published Jan 12 • 9 • 7
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models Paper • 2308.16149 • Published Aug 30, 2023 • 24 • 6
Generalist embedding models are better at short-context clinical semantic search than specialized embedding models Paper • 2401.01943 • Published Jan 3 • 6 • 4
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Paper • 2401.08417 • Published Jan 16 • 25 • 2
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning Paper • 2312.01552 • Published Dec 4, 2023 • 24 • 4
PromptBench: A Unified Library for Evaluation of Large Language Models Paper • 2312.07910 • Published Dec 13, 2023 • 14 • 3
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 69 • 14
Understanding Retrieval Augmentation for Long-Form Question Answering Paper • 2310.12150 • Published Oct 18, 2023 • 1 • 1
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 27 • 7
Effective Long-Context Scaling of Foundation Models Paper • 2309.16039 • Published Sep 27, 2023 • 28 • 2
PDFTriage: Question Answering over Long, Structured Documents Paper • 2309.08872 • Published Sep 16, 2023 • 50 • 4