AI & ML interests

Text Style Transfer, Text Detoxification, Toxic Speech Detection and Mitigation, Multilingualism

Recent Activity

dardemĀ  updated a Space 1 day ago
textdetox/README
dardemĀ  updated a collection 3 days ago
TextDetox 2025 Starter Kit
View all activity

Multilingual Text Detoxification with Parallel Data

Text Detoxification, toxicity detection and explanation for diverse languages: English, Spanish, German, French, Italian, Chinese, Japanese, Arabic, Hebrew, Hindi, Ukrainian, Russian, Tatar, Amharic. By many researchers from all over the world šŸŒ

Support for better, safe, and multicultural online spaces.

šŸ“° Read about the project in press šŸ“¹ PyData&CPyConf Berlin 2023 talk

[2025] !!!NOW OPEN!!! TextDetox CLEF2025 shared task website šŸ¤—Starter Kit

[2025] COLNG2025: Daryna Dementieva, Nikolay Babakov, Amit Ronen, Abinew Ali Ayele, Naquee Rizwan, Florian Schneider, Xintong Wang, Seid Muhie Yimam, Daniil Alekhseevich Moskovskiy, Elisei Stakovskii, Eran Kaufman, Ashraf Elnagar, Animesh Mukherjee, and Alexander Panchenko. 2025. Multilingual and Explainable Text Detoxification with Parallel Corpora. In Proceedings of the 31st International Conference on Computational Linguistics, pages 7998ā€“8025, Abu Dhabi, UAE. Association for Computational Linguistics. pdf

[2024] TextDetox2024 Report: Daryna Dementieva, Daniil Moskovskiy, Nikolay Babakov, Abinew Ali Ayele, Naquee Rizwan, Florian Schneider, Xintong Wang, Seid Muhie Yimam, Dmitry Ustalov, Elisei Stakovskii, Alisa Smirnova, Ashraf Elnagar, Animesh Mukherjee, and Alexander Panchenko "Overview of the multilingual text detoxification task at pan 2024" Working Notes of CLEF (2024). pdf

[2024] MultiParaDetox @ NAACL2024: Daryna Dementieva, Nikolay Babakov, and Alexander Panchenko. "MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages." Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers). 2024. pdf

[2024] TextDetox CLEF2024 shared task website

[2022] The first Parall Text Detoxification datasets: English ParaDetox and Russian ParaDetox

Contact

We are happy to extend our research to more languages, cultures, and dimensions šŸ˜‰

Please, contact: Daryna Dementieva