Running 540 540 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects
Cosmos Tokenizer Collection A suite of image and video tokenizers β’ 13 items β’ Updated 21 days ago β’ 37
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use Paper β’ 2410.24218 β’ Published Oct 31, 2024 β’ 5
Training Language Models to Self-Correct via Reinforcement Learning Paper β’ 2409.12917 β’ Published Sep 19, 2024 β’ 136
The Imperative of Conversation Analysis in the Era of LLMs: A Survey of Tasks, Techniques, and Trends Paper β’ 2409.14195 β’ Published Sep 21, 2024 β’ 13
GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection Paper β’ 2111.14592 β’ Published Nov 29, 2021 β’ 1
A Survey on Dialog Management: Recent Advances and Challenges Paper β’ 2005.02233 β’ Published May 5, 2020
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents Paper β’ 2305.13040 β’ Published May 22, 2023 β’ 2
GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection Paper β’ 2111.14592 β’ Published Nov 29, 2021 β’ 1
Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation Paper β’ 2310.07968 β’ Published Oct 12, 2023
Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking Paper β’ 2106.00291 β’ Published Jun 1, 2021