SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap Paper • 2309.12382 • Published Sep 21, 2023
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis Paper • 1904.01906 • Published Apr 3, 2019
Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models Paper • 2305.15080 • Published May 24, 2023
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation Paper • 2401.06591 • Published Jan 12, 2024 • 4
On Web-based Visual Corpus Construction for Visual Document Understanding Paper • 2211.03256 • Published Nov 7, 2022 • 1
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning Paper • 2406.11823 • Published Jun 17, 2024
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models? Paper • 2410.07571 • Published Oct 10, 2024 • 2
Evaluating Multimodal Generative AI with Korean Educational Standards Paper • 2502.15422 • Published 10 days ago • 9
Magma: A Foundation Model for Multimodal AI Agents Paper • 2502.13130 • Published 13 days ago • 52
Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations Paper • 2404.13948 • Published Apr 22, 2024 • 1
Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding Paper • 2502.05609 • Published 23 days ago • 16
LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation Paper • 2412.10424 • Published Dec 10, 2024 • 2
EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation Paper • 2412.12559 • Published Dec 17, 2024 • 1
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 9
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 48
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 48