3 PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts · 11 authors
2 Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks · 16 authors
1 Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions · 3 authors
1 Triggering Multi-Hop Reasoning for Question Answering in Language Models using Soft Prompts and Random Walks · 3 authors