Loong: Generating Minute-level Long Videos with Autoregressive Language Models Paper • 2410.02757 • Published Oct 3 • 36
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper • 2410.02073 • Published Oct 2 • 40
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5 • 24
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 15 items • Updated 2 days ago • 72
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16 • 97
SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization Paper • 2407.14257 • Published Jul 19 • 5
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28 • 94