wenwenh

wenhfaver

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

upvoted a paper 12 days ago

Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

upvoted a paper 12 days ago

Retrieval-augmented Large Language Models for Financial Time Series Forecasting

View all activity

Organizations

None yet

wenhfaver's activity

upvoted 4 papers 12 days ago

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Paper • 2502.09614 • Published 13 days ago • 12

Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

Paper • 2502.05415 • Published 19 days ago • 21

Retrieval-augmented Large Language Models for Financial Time Series Forecasting

Paper • 2502.05878 • Published 18 days ago • 38

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 14 days ago • 181

upvoted 3 papers 23 days ago

Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

Paper • 2501.14334 • Published Jan 24 • 20

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 27 days ago • 56

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Paper • 2501.18636 • Published 30 days ago • 28

upvoted 3 papers 30 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 24

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published Jan 24 • 30

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26 • 61

upvoted 9 papers about 1 month ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 75

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3 • 42

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published Jan 2 • 51

Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 26

Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models

Paper • 2501.06751 • Published Jan 12 • 31

liked a model about 1 month ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated 21 days ago • 1.47M • • 1.14k