STIV: Scalable Text and Image Conditioned Video Generation Paper • 2412.07730 • Published 13 days ago • 68
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory Paper • 2410.10813 • Published Oct 14 • 9
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models Paper • 2410.05269 • Published Oct 7 • 3
Better Automatic Evaluation of Open-Domain Dialogue Systems with Contextualized Embeddings Paper • 1904.10635 • Published Apr 24, 2019
The Woman Worked as a Babysitter: On Biases in Language Generation Paper • 1909.01326 • Published Sep 3, 2019
Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems Paper • 2310.05280 • Published Oct 8, 2023 • 1
ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems Paper • 2305.07797 • Published May 12, 2023
Mitigating Bias for Question Answering Models by Tracking Bias Influence Paper • 2310.08795 • Published Oct 13, 2023
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters Paper • 2310.09219 • Published Oct 13, 2023
Evaluating Large Language Models on Controlled Generation Tasks Paper • 2310.14542 • Published Oct 23, 2023
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos Paper • 2311.01620 • Published Nov 2, 2023
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model Paper • 2305.16734 • Published May 26, 2023
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks Paper • 2311.00288 • Published Nov 1, 2023
DesCo: Learning Object Recognition with Rich Language Descriptions Paper • 2306.14060 • Published Jun 24, 2023 • 1