TEXGen: a Generative Diffusion Model for Mesh Textures Paper • 2411.14740 • Published 7 days ago • 12
Learning 3D Representations from Procedural 3D Programs Paper • 2411.17467 • Published 4 days ago • 8
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images Paper • 2411.13127 • Published 9 days ago • 4
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published 9 days ago • 37
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published 8 days ago • 25
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published 14 days ago • 41
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published 11 days ago • 16
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 17 days ago • 59
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM Paper • 2411.04954 • Published 22 days ago • 8
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 22 days ago • 109
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks Paper • 2410.20650 • Published Oct 28 • 16
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated about 1 hour ago • 181
LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated 8 days ago • 43
DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes Paper • 2410.18084 • Published Oct 23 • 12
Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant Paper • 2410.13360 • Published Oct 17 • 8
MobA: A Two-Level Agent System for Efficient Mobile Task Automation Paper • 2410.13757 • Published Oct 17 • 31
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1 • 144
SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs Paper • 2410.00337 • Published Oct 1 • 10
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion Paper • 2409.11406 • Published Sep 17 • 25