view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others β’ May 23 β’ 151
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others β’ May 12 β’ 490
view post Post 5079 A ton of impactful models and datasets in open AI past week, let's summarize the best π€© merve/releases-apr-21-and-may-2-6819dcc84da4190620f448a3π¬ Qwen made it rain! They released Qwen3: new dense and MoE models ranging from 0.6B to 235B π€― as well as Qwen2.5-Omni, any-to-any model in 3B and 7B!> Microsoft AI released Phi4 reasoning models (that also come in mini and plus sizes)> NVIDIA released new CoT reasoning datasetsπΌοΈ > ByteDance released UI-TARS-1.5, native multimodal UI parsing agentic model> Meta released EdgeTAM, an on-device object tracking model (SAM2 variant)π£οΈ NVIDIA released parakeet-tdt-0.6b-v2, a smol 600M automatic speech recognition model> Nari released Dia, a 1.6B text-to-speech model> Moonshot AI released Kimi Audio, a new audio understanding, generation, conversation modelπ©π»βπ» JetBrains released Melium models in base and SFT for coding> Tesslate released UIGEN-T2-7B, a new text-to-frontend-code model π€© See translation π₯ 10 10 + Reply
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others β’ Mar 12 β’ 447