Article: π0 and π0-FAST: Vision-Language-Action Models for General Robot Control (Feb 4)
Post: Realtime Whisper Large v3 Turbo Demo: It transcribes audio in about 0.3 seconds. KingNish/Realtime-whisper-large-v3-turbo
Post: Mistral Nemo is better than many models at first-grade-level reasoning.
Post: Generative 3D demos often produce vertex-colored meshes, without UVs or textures, so I made a minimal library that converts vertex-colored meshes to UV-mapped, textured meshes. Library: https://github.com/dylanebert/InstantTexture Demo: dylanebert/InstantTexture
Post: A very good and fast image inpainting demo is here. It's super cool and realistic. Demo by @OzzyGT (must try): OzzyGT/diffusers-fast-inpaint
Post: Introducing OpenCHAT mini: a lightweight, fast, and unlimited version of OpenGPT 4o. KingNish/OpenCHAT-mini2 It has unlimited web search, vision, and image generation. Please take a look and share your review. Thank you! 🤗
Post: The first multimodal language model dedicated to chemistry. Demo: https://v.chemllm.org/ Fine-tuned from ChemLLM-20B and InterViT-6B on the MMChemExam and ChemOCR datasets (coming soon...). AI4Chem/ChemVLM-26B ChemLLM: A Chemical Large Language Model (2402.06852)