view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 6 days ago • 290
view article Article A simple implementation of the attention mechanism in JAX By rishiraj • 14 days ago • 2
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published 21 days ago • 73
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 27 days ago • 65
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 25 days ago • 130
Temporal Preference Optimization Collection Temporal Preference Optimization for Long-form Video Understanding • 3 items • Updated Jan 19 • 5
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Paper • 2402.14207 • Published Feb 22, 2024 • 8
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 118
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 64