Multimodal agents (robotics) Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15 • 20 HuggingFaceM4/idefics2-8b Image-Text-to-Text • Updated Jul 30 • 39k • 573 VIMA/VIMA Updated Jun 20, 2023 • 13 rail-berkeley/octo-base Robotics • Updated Dec 14, 2023 • 88 • 19
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15 • 20
Robotics stack openai/whisper-base Automatic Speech Recognition • Updated Feb 29 • 493k • 184 HuggingFaceM4/idefics2-8b-AWQ Image-Text-to-Text • Updated May 6 • 578 • 26 parler-tts/parler_tts_mini_v0.1 Text-to-Speech • Updated Apr 30 • 26.4k • 345 dora-rs/dora-idefics2 Updated May 5 • 2 • 5