SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models Paper • 2503.07605 • Published 2 days ago • 61
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 27 days ago • 183
Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling Paper • 2402.10211 • Published Feb 15, 2024 • 14
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization Paper • 2402.09812 • Published Feb 15, 2024 • 16
GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering Paper • 2402.10128 • Published Feb 15, 2024 • 18
Data Engineering for Scaling Language Models to 128K Context Paper • 2402.10171 • Published Feb 15, 2024 • 25
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion Paper • 2402.10009 • Published Feb 15, 2024 • 22
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Paper • 2402.10210 • Published Feb 15, 2024 • 35
MPIrigen: MPI Code Generation through Domain-Specific Language Models Paper • 2402.09126 • Published Feb 14, 2024 • 15
Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers Paper • 2402.08958 • Published Feb 14, 2024 • 6
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models Paper • 2402.08714 • Published Feb 13, 2024 • 14
GhostWriter: Augmenting Collaborative Human-AI Writing Experiences Through Personalization and Agency Paper • 2402.08855 • Published Feb 13, 2024 • 14
Computing Power and the Governance of Artificial Intelligence Paper • 2402.08797 • Published Feb 13, 2024 • 15
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs Paper • 2402.08622 • Published Feb 13, 2024 • 6
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Paper • 2402.08682 • Published Feb 13, 2024 • 14
ChatCell: Facilitating Single-Cell Analysis with Natural Language Paper • 2402.08303 • Published Feb 13, 2024 • 13
Learning Continuous 3D Words for Text-to-Image Generation Paper • 2402.08654 • Published Feb 13, 2024 • 12