OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Paper • 2503.22952 • Published 8 days ago • 18
openai/whisper-large-v3-turbo Automatic Speech Recognition • Updated Oct 4, 2024 • 3.57M • • 2.23k
OpenAssistant/reward-model-deberta-v3-large-v2 Text Classification • Updated Feb 1, 2023 • 12.1k • • 217