57 18 44

Felix Friedrich

felfri

https://ml-research.github.io/people/ffriedrich/

AI & ML interests

Multimodal GenAI; AI Alignment; AI steerability

Recent Activity

liked a dataset 7 days ago

AIML-TUDA/t2i-diversity-gender-neutral-captions

updated a dataset 7 days ago

AIML-TUDA/t2i-diversity-gender-neutral-captions

new activity 8 days ago

AIML-TUDA/t2i-diversity-gender-neutral-captions:[bot] Conversion to Parquet

View all activity

Organizations

Posts 1

Post

2362

🚀 Excited to announce the release of our new research paper, "LLAVAGUARD: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment"!
In this work, we introduce LLAVAGUARD, a family of cutting-edge Vision-Language Model (VLM) judges designed to enhance the safety and integrity of vision datasets and generative models. Our approach leverages flexible policies for assessing safety in diverse settings. This context awareness ensures robust data curation and model safeguarding alongside comprehensive safety assessments, setting a new standard for vision datasets and models. We provide three versions (7B, 13B, and 34B) and our data, see below. This achievement wouldn't have been possible without the incredible teamwork and dedication of my great colleagues @LukasHug , @PSaiml , @mbrack . 🙏 Together, we've pushed the boundaries of what’s possible at the intersection of large generative models and safety.
🔍 Dive into our paper to explore:
Innovative methodologies for dataset curation and model safeguarding.
State-of-the-art safety assessments.
Practical implications for AI development and deployment.
Find more at AIML-TUDA/llavaguard-665b42e89803408ee8ec1086 and https://ml-research.github.io/human-centered-genai/projects/llavaguard/index.html