InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 239
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published Jan 29 • 14
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Paper • 2409.07314 • Published Sep 11, 2024 • 57
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 101
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 124
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7, 2024 • 60
BLINK: Multimodal Large Language Models Can See but Not Perceive Paper • 2404.12390 • Published Apr 18, 2024 • 27
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published Apr 18, 2024 • 40
Scaling Instructable Agents Across Many Simulated Worlds Paper • 2404.10179 • Published Mar 13, 2024 • 29
MoAI: Mixture of All Intelligence for Large Language and Vision Models Paper • 2403.07508 • Published Mar 12, 2024 • 77
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Paper • 2403.05530 • Published Mar 8, 2024 • 65
Wukong: Towards a Scaling Law for Large-Scale Recommendation Paper • 2403.02545 • Published Mar 4, 2024 • 17
Learning and Leveraging World Models in Visual Representation Learning Paper • 2403.00504 • Published Mar 1, 2024 • 34
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 615
PromptBench: A Unified Library for Evaluation of Large Language Models Paper • 2312.07910 • Published Dec 13, 2023 • 19
How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation Paper • 2312.07424 • Published Dec 12, 2023 • 11