MM LLM Papers OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Paper • 2401.12202 • Published Jan 22 • 9 Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Paper • 2401.11708 • Published Jan 22 • 27
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Paper • 2401.12202 • Published Jan 22 • 9
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Paper • 2401.11708 • Published Jan 22 • 27
Interesting Papers to Read StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Paper • 2401.11053 • Published Jan 19 • 8
StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Paper • 2401.11053 • Published Jan 19 • 8