SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Paper • 2407.15841 • Published 1 day ago • 21
PlacidDreamer: Advancing Harmony in Text-to-3D Generation Paper • 2407.13976 • Published 5 days ago • 5
Efficient Audio Captioning with Encoder-Level Knowledge Distillation Paper • 2407.14329 • Published 5 days ago • 2
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Paper • 2407.12077 • Published 7 days ago • 46
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians Paper • 2407.11793 • Published 8 days ago • 3
Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Paper • 2407.10969 • Published 8 days ago • 16