See or Guess: Counterfactually Regularized Image Captioning Paper • 2408.16809 • Published Aug 29, 2024 • 1
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering Paper • 2503.16867 • Published Mar 21 • 11