- Video Creation by Demonstration
  Paper • 2412.09551 • Published • 8
- DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
  Paper • 2412.07589 • Published • 45
- Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
  Paper • 2412.06531 • Published • 71
- APOLLO: SGD-like Memory, AdamW-level Performance
  Paper • 2412.05270 • Published • 38
Collections
Collections including paper arxiv:2403.05185
- User-LLM: Efficient LLM Contextualization with User Embeddings
  Paper • 2402.13598 • Published • 19
- Personalized Audiobook Recommendations at Spotify Through Graph Neural Networks
  Paper • 2403.05185 • Published • 22
- SPAR: Personalized Content-Based Recommendation via Long Engagement Attention
  Paper • 2402.10555 • Published • 34

- VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
  Paper • 2312.02087 • Published • 20
- FaceStudio: Put Your Face Everywhere in Seconds
  Paper • 2312.02663 • Published • 30
- Orthogonal Adaptation for Modular Customization of Diffusion Models
  Paper • 2312.02432 • Published • 12
- ReconFusion: 3D Reconstruction with Diffusion Priors
  Paper • 2312.02981 • Published • 8

- NExT-GPT: Any-to-Any Multimodal LLM
  Paper • 2309.05519 • Published • 78
- Large Language Model for Science: A Study on P vs. NP
  Paper • 2309.05689 • Published • 20
- AstroLLaMA: Towards Specialized Foundation Models in Astronomy
  Paper • 2309.06126 • Published • 16
- Large Language Models for Compiler Optimization
  Paper • 2309.07062 • Published • 23