--- license: cc language: - en library_name: transformers tags: - social media - contrastive learning --- # The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages

Chiyu Zhang, Khai Duy Doan, Qisheng Liao, Muhammad Abdul-Mageed

The University of British Columbia, Mohamed bin Zayed University of Artificial Intelligence

Publish at Main Conference of EMNLP 2023

Paper

[![Code License](https://img.shields.io/badge/Code%20License-Apache_2.0-green.svg)]() [![Data License](https://img.shields.io/badge/Data%20License-CC%20By%20NC%204.0-red.svg)]() ## Checkpoints of Models Pre-Trained with InfoDCL We further pretrained XLMR/RoBERTa with InfoDCL framework by ([Zhang et al. 2023](https://aclanthology.org/2023.findings-acl.152/)) Multilingual Model: * InfoDCL-XLMR trained with multilingual TweetEmoji-multi: https://huggingface.co/UBC-NLP/InfoDCL-Emoji-XLMR-Base English Models: * InfoDCL-RoBERTa trained with TweetEmoji-EN: https://huggingface.co/UBC-NLP/InfoDCL-emoji * InfoDCL-RoBERTa trained with TweetHashtag-EN: https://huggingface.co/UBC-NLP/InfoDCL-hashtag ## Citation Please cite us if you find our data or models useful. ```bibtex @inproceedings{zhang-etal-2023-skipped, title = "The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages", author = "Zhang, Chiyu and Khai Duy Doan and, Qisheng Liao and, Abdul-Mageed, Muhammad", booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP)", year = "2023", publisher = "Association for Computational Linguistics", } ```