Exploring the Potential of Encoder-free Architectures in 3D LMMs Paper • 2502.09620 • Published Feb 13 • 25
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Paper • 2502.18041 • Published Feb 25 • 1
How Far Can Cantonese NLP Go? Benchmarking Cantonese Capabilities of Large Language Models Paper • 2408.16756 • Published Aug 29, 2024
AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots Paper • 2409.11905 • Published Sep 18, 2024
MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards Paper • 2410.00938 • Published Oct 1, 2024
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Paper • 2502.18041 • Published Feb 25 • 1
FreeGaussian: Annotation-free Controllable 3D Gaussian Splats with Flow Derivatives Paper • 2410.22070 • Published Oct 29, 2024
Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Paper • 2503.08120 • Published Mar 11 • 31
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Paper • 2502.18041 • Published Feb 25 • 1