OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning Paper • 2306.11249 • Published Jun 20, 2023 • 1
OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning Paper • 2209.04851 • Published Sep 11, 2022 • 2
SemiReward: A General Reward Model for Semi-supervised Learning Paper • 2310.03013 • Published Oct 4, 2023 • 2
AutoMix: Unveiling the Power of Mixup for Stronger Classifiers Paper • 2103.13027 • Published Mar 24, 2021 • 1
Improved Visual-Spatial Reasoning via R1-Zero-Like Training Paper • 2504.00883 • Published 6 days ago • 56
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Paper • 2409.12191 • Published Sep 18, 2024 • 77
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Paper • 2504.00999 • Published 6 days ago • 73 • 7
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing Paper • 2411.11916 • Published Nov 18, 2024 • 2
Boosting Discriminative Visual Representation Learning with Scenario-Agnostic Mixup Paper • 2111.15454 • Published Nov 30, 2021
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN Paper • 2205.13943 • Published May 27, 2022 • 1
Switch EMA: A Free Lunch for Better Flatness and Sharpness Paper • 2402.09240 • Published Feb 14, 2024 • 3