LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper β’ 2412.15035 β’ Published 7 days ago β’ 4
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper β’ 2404.12241 β’ Published Apr 18 β’ 10
occiglot-eu5-7b-v0.1 Collection First release of 7B LLMs models for the 5 biggest European languages. All models initialised from mistral-7b-v0.1. β’ 10 items β’ Updated Mar 7 β’ 21
LEDITS++: Limitless Image Editing using Text-to-Image Models Paper β’ 2311.16711 β’ Published Nov 28, 2023 β’ 22
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models Paper β’ 2211.05105 β’ Published Nov 9, 2022
Speaking Multiple Languages Affects the Moral Bias of Language Models Paper β’ 2211.07733 β’ Published Nov 14, 2022 β’ 1
Revision Transformers: Instructing Language Models to Change their Values Paper β’ 2210.10332 β’ Published Oct 19, 2022
Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations Paper β’ 2303.09289 β’ Published Mar 16, 2023 β’ 1
The Stable Artist: Steering Semantics in Diffusion Latent Space Paper β’ 2212.06013 β’ Published Dec 12, 2022