university of science and technology of china

university

https://ustc.edu.cn/

AI & ML interests

None defined yet.

Recent Activity

lovesnowbest authored a paper 2 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

lovesnowbest authored a paper about 1 month ago

ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

MakiseKurisu authored a paper 6 months ago

Can MLLMs Understand the Deep Implication Behind Chinese Images?

View all activity

ustc's activity

lovesnowbest

authored a paper 2 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published 3 days ago • 37

lovesnowbest

authored a paper about 1 month ago

ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Paper • 2502.18017 • Published Feb 25 • 19

lovesnowbest

authored a paper 3 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 105

tomorrowdawn

authored a paper 5 months ago

Top-$nσ$: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 22

lovesnowbest

authored a paper 7 months ago

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

Paper • 2409.12959 • Published Sep 19, 2024 • 38

lovesnowbest

authored a paper 9 months ago

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Paper • 2407.20183 • Published Jul 29, 2024 • 44

lovesnowbest

authored a paper 10 months ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 76

SiruiZhao

authored a paper 10 months ago

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Paper • 2405.21075 • Published May 31, 2024 • 24

lovesnowbest

authored 2 papers about 1 year ago

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 33

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19, 2024 • 17

SiruiZhao

authored a paper over 1 year ago

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Paper • 2312.12436 • Published Dec 19, 2023 • 14