RMIMG (rmimg)

Fiaa

authored a paper about 2 months ago

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Paper • 2501.05452 • Published Jan 9 • 15

wzhouad

authored a paper 5 months ago

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

Paper • 2410.05248 • Published Oct 7, 2024 • 8

derekma

authored a paper 8 months ago

MIRAI: Evaluating LLM Agents for Event Forecasting

Paper • 2407.01231 • Published Jul 1, 2024 • 18

derekma

authored 3 papers 9 months ago

Multi-hop Evidence Retrieval for Cross-document Relation Extraction

Paper • 2212.10786 • Published Dec 21, 2022

Summarization as Indirect Supervision for Relation Extraction

Paper • 2205.09837 • Published May 19, 2022

CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions

Paper • 2406.09923 • Published Jun 14, 2024 • 1

wzhouad

authored 2 papers 9 months ago

mDPO: Conditional Preference Optimization for Multimodal Large Language Models

Paper • 2406.11839 • Published Jun 17, 2024 • 38

WPO: Enhancing RLHF with Weighted Preference Optimization

Paper • 2406.11827 • Published Jun 17, 2024 • 15

derekma

authored a paper 9 months ago

Mitigating Bias for Question Answering Models by Tracking Bias Influence

Paper • 2310.08795 • Published Oct 13, 2023

Fiaa

authored a paper 9 months ago

Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?

Paper • 2406.07546 • Published Jun 11, 2024 • 9

wzhouad

authored a paper 9 months ago

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 20

Fiaa

authored 2 papers 9 months ago

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 20

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Paper • 2406.09403 • Published Jun 13, 2024 • 21

derekma

authored 4 papers 9 months ago

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 20

Instructional Fingerprinting of Large Language Models

Paper • 2401.12255 • Published Jan 21, 2024 • 1

Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models

Paper • 2305.14710 • Published May 24, 2023 • 2

Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction?

Paper • 2212.10784 • Published Dec 21, 2022

Xiaogeng-SheltonLiu

authored 2 papers 11 months ago

AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models

Paper • 2310.04451 • Published Oct 3, 2023

JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks

Paper • 2404.03027 • Published Apr 3, 2024 • 3

Fiaa

authored a paper 11 months ago

ImagenHub: Standardizing the evaluation of conditional image generation models

Paper • 2310.01596 • Published Oct 2, 2023 • 19


AI & ML interests

RMIMG's activity

Team members 8
