Tianyi Zhou

zhoutianyi

https://tianyizhou.github.io/

AI & ML interests

ML, NLP, RL, Multi-modality

Recent Activity

authored a paper about 2 months ago

GUI Agents: A Survey

authored a paper 2 months ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

zhoutianyi's activity

upvoted a paper about 2 months ago

GUI Agents: A Survey

Paper • 2412.13501 • Published Dec 18, 2024 • 25

upvoted a paper 2 months ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 59

upvoted 2 papers 3 months ago

DynaSaur: Large Language Agents Beyond Predefined Actions

Paper • 2411.01747 • Published Nov 4, 2024 • 26

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31, 2024 • 60

upvoted 5 papers 4 months ago

Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

Paper • 2410.13674 • Published Oct 17, 2024 • 16

Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA

Paper • 2410.06524 • Published Oct 9, 2024 • 4

upvoted a paper 7 months ago

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Paper • 2406.10900 • Published Jun 16, 2024 • 11

upvoted a paper 12 months ago

ODIN: Disentangled Reward Mitigates Hacking in RLHF

Paper • 2402.07319 • Published Feb 11, 2024 • 14

upvoted 2 papers about 1 year ago

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 67

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Paper • 2311.16714 • Published Nov 28, 2023 • 1

upvoted 5 papers over 1 year ago

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

Paper • 2310.11716 • Published Oct 18, 2023 • 5

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Paper • 2310.14566 • Published Oct 23, 2023 • 26

Diffusion Models Beat GANs on Image Classification

Paper • 2307.08702 • Published Jul 17, 2023 • 18

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 23

InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models

Paper • 2306.03082 • Published Jun 5, 2023 • 5