FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published Dec 12, 2024 • 20
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28, 2024 • 35
Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction Paper • 2310.18770 • Published Oct 28, 2023
Discovering Spatio-Temporal Rationales for Video Question Answering Paper • 2307.12058 • Published Jul 22, 2023
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models Paper • 2410.07133 • Published Oct 9, 2024 • 19
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28, 2024 • 35
DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning Paper • 2407.04078 • Published Jul 4, 2024 • 18
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter Paper • 2310.12798 • Published Oct 19, 2023 • 4
Uni-SMART: Universal Science Multimodal Analysis and Research Transformer Paper • 2403.10301 • Published Mar 15, 2024 • 52
Towards 3D Molecule-Text Interpretation in Language Models Paper • 2401.13923 • Published Jan 25, 2024 • 9