Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Paper โข 2403.18814 โข Published Mar 27 โข 37
Snap-it, Tap-it, Splat-it: Tactile-Informed 3D Gaussian Splatting for Reconstructing Challenging Surfaces Paper โข 2403.20275 โข Published Mar 29 โข 8
Gemma release Collection Groups the Gemma models released by the Google team. โข 40 items โข Updated 25 days ago โข 291
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community โข 15 items โข Updated 5 days ago โข 82
AppAgent: Multimodal Agents as Smartphone Users Paper โข 2312.13771 โข Published Dec 21, 2023 โข 49