Fixed-Budget Differentially Private Best Arm Identification Paper • 2401.09073 • Published Jan 17, 2024
Words or Vision: Do Vision-Language Models Have Blind Faith in Text? Paper • 2503.02199 • Published 7 days ago • 1
InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback Paper • 2502.15027 • Published 18 days ago • 7
WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation Paper • 2502.08047 • Published 27 days ago • 26
A Comprehensive Guide to Explainable AI: From Classical Models to LLMs Paper • 2412.00800 • Published Dec 1, 2024
Deep Learning Model Security: Threats and Defenses Paper • 2412.08969 • Published Dec 12, 2024 • 1
Deep Learning, Machine Learning, Advancing Big Data Analytics and Management Paper • 2412.02187 • Published Dec 3, 2024
OminiControl: Minimal and Universal Control for Diffusion Transformer Paper • 2411.15098 • Published Nov 22, 2024 • 55
Investigating Copyright Issues of Diffusion Models under Practical Scenarios Paper • 2311.12803 • Published Sep 15, 2023
Subclass-balancing Contrastive Learning for Long-tailed Recognition Paper • 2306.15925 • Published Jun 28, 2023
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Paper • 2411.13476 • Published Nov 20, 2024 • 16
Genixer: Empowering Multimodal Large Language Models as a Powerful Data Generator Paper • 2312.06731 • Published Dec 11, 2023 • 1
LOVA3: Learning to Visual Question Answering, Asking and Assessment Paper • 2405.14974 • Published May 23, 2024 • 1
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning Paper • 2408.07931 • Published Aug 15, 2024 • 21
AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning Paper • 2311.00257 • Published Nov 1, 2023 • 10