Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
prasadt2 's Collections
RAG
Memory
Generative UI
Voice agents
Screen agents
Reasoning
LAMs
Agents
Trained models
Datasets

Voice agents

updated 15 days ago
Upvote
-

  • Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

    Paper • 2410.15316 • Published Oct 20, 2024 • 11

  • MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

    Paper • 2410.19168 • Published Oct 24, 2024 • 20

  • LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

    Paper • 2504.19838 • Published 16 days ago • 21
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs