view article Article How to run Gemini Nano locally in your browser By Xenova • about 7 hours ago • 28
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems Paper • 2407.01370 • Published 10 days ago • 76
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 100
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper • 2407.03320 • Published 8 days ago • 84
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published 10 days ago • 38
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated 14 days ago • 139
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22 • 75
Transformers.js demos Collection A collection of my favorite WebML demos, built with Transformers.js! • 30 items • Updated about 8 hours ago • 59
4M Models Collection Multimodal models from https://4m.epfl.ch/ • 14 items • Updated 27 days ago • 29
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Paper • 2406.03184 • Published Jun 5 • 18
Learning Temporally Consistent Video Depth from Video Diffusion Priors Paper • 2406.01493 • Published Jun 3 • 17
CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner Paper • 2405.14979 • Published May 23 • 14
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 23 items • Updated about 2 hours ago • 366
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 86
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 148
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation Apr 29 • 70
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 • 59
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 22 items • Updated 2 days ago • 145
view article Article 🧑⚖️ "Replacing Judges with Juries" using distilabel By alvarobartt • May 3 • 17
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 27 days ago • 37
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 84
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 64 items • Updated about 1 month ago • 69
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 176
ControlRoom3D: Room Generation using Semantic Proxy Rooms Paper • 2312.05208 • Published Dec 8, 2023 • 8
Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis Paper • 2312.08782 • Published Dec 14, 2023 • 5
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 132
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer Paper • 2308.06873 • Published Aug 14, 2023 • 25
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft Paper • 2306.00937 • Published Jun 1, 2023 • 8