OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 14 days ago • 148
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues Paper • 2310.13650 • Published Oct 20, 2023
LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement Paper • 2407.18595 • Published Jul 26, 2024
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D Paper • 2411.02336 • Published Nov 4, 2024 • 25
Open VLM Video Leaderboard 🌎 VLMEvalKit Eval Results in video understanding benchmark • Running • 106
POINTS: Improving Your Vision-language Model with Affordable Strategies Paper • 2409.04828 • Published Sep 7, 2024 • 25
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline Paper • 2408.15079 • Published Aug 27, 2024 • 55
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Paper • 2403.05135 • Published Mar 8, 2024 • 46