adamsecada's Collections
Favorites
Bootstrapping Language Models with DPO Implicit Rewards
Paper
•
2406.09760
•
Published
•
38
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Paper
•
2406.11931
•
Published
•
58
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs
Paper
•
2406.14544
•
Published
•
34
Instruction Pre-Training: Language Models are Supervised Multitask Learners
Paper
•
2406.14491
•
Published
•
86
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
•
2406.04692
•
Published
•
55
CRAG -- Comprehensive RAG Benchmark
Paper
•
2406.04744
•
Published
•
44
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models
Paper
•
2406.12644
•
Published
•
4
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper
•
2406.08464
•
Published
•
65
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
Paper
•
2404.16873
•
Published
•
28
LLM Agents can Autonomously Hack Websites
Paper
•
2402.06664
•
Published
•
3
Negotiating with LLMS: Prompt Hacks, Skill Gaps, and Reasoning Deficits
Paper
•
2312.03720
•
Published
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Paper
•
2311.16119
•
Published
•
2
On the Exploitability of Instruction Tuning
Paper
•
2306.17194
•
Published
•
9
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities
Paper
•
2406.01637
•
Published
•
1
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Paper
•
2407.01370
•
Published
•
86
Imagine yourself: Tuning-Free Personalized Image Generation
Paper
•
2409.13346
•
Published
•
68
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
135
LLMs + Persona-Plug = Personalized LLMs
Paper
•
2409.11901
•
Published
•
31
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Paper
•
2409.09214
•
Published
•
49