Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping Paper • 2306.12981 • Published Jun 22, 2023
Towards Language-Driven Video Inpainting via Multimodal Large Language Models Paper • 2401.10226 • Published Jan 18, 2024 • 1
OMG-Seg: Is One Model Good Enough For All Segmentation? Paper • 2401.10229 • Published Jan 18, 2024 • 1
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD Paper • 2404.06512 • Published Apr 9, 2024 • 31
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning Paper • 2406.17770 • Published Jun 25, 2024 • 19
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper • 2407.03320 • Published Jul 3, 2024 • 96
InternLM-Law: An Open Source Chinese Legal Large Language Model Paper • 2406.14887 • Published Jun 21, 2024
RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation Paper • 2407.08634 • Published Jul 11, 2024
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection Paper • 2401.02361 • Published Jan 4, 2024
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose Paper • 2303.07399 • Published Mar 13, 2023
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space Paper • 2504.13835 • Published 4 days ago • 33
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space Paper • 2504.13835 • Published 4 days ago • 33
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space Paper • 2504.13835 • Published 4 days ago • 33 • 3
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? Paper • 2407.11963 • Published Jul 16, 2024 • 45