SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe Paper • 2410.05248 • Published Oct 7 • 8
Multi-hop Evidence Retrieval for Cross-document Relation Extraction Paper • 2212.10786 • Published Dec 21, 2022
Summarization as Indirect Supervision for Relation Extraction Paper • 2205.09837 • Published May 19, 2022
CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions Paper • 2406.09923 • Published Jun 14 • 1
mDPO: Conditional Preference Optimization for Multimodal Large Language Models Paper • 2406.11839 • Published Jun 17 • 37
Mitigating Bias for Question Answering Models by Tracking Bias Influence Paper • 2310.08795 • Published Oct 13, 2023
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Paper • 2406.07546 • Published Jun 11 • 8
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13 • 18
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13 • 18
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models Paper • 2406.09403 • Published Jun 13 • 19
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13 • 18
Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models Paper • 2305.14710 • Published May 24, 2023 • 2
Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction? Paper • 2212.10784 • Published Dec 21, 2022
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models Paper • 2310.04451 • Published Oct 3, 2023
JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks Paper • 2404.03027 • Published Apr 3 • 3
ImagenHub: Standardizing the evaluation of conditional image generation models Paper • 2310.01596 • Published Oct 2, 2023 • 18
BLINK: Multimodal Large Language Models Can See but Not Perceive Paper • 2404.12390 • Published Apr 18 • 24