Andrew Zhao

andrewzh

https://andrewzh112.github.io/

AI & ML interests

Reinforcement Learning, Agents

Organizations

None yet

andrewzh's activity

authored 3 papers about 13 hours ago

LLM-based Optimization of Compound AI Systems: A Survey

Paper • 2410.16392 • Published Oct 21, 2024 • 15

DMotion: Robotic Visuomotor Control with Unsupervised Forward Model Learned from Videos

Paper • 2103.04301 • Published Mar 7, 2021

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 4 days ago • 84

upvoted a paper 2 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 4 days ago • 84

upvoted a paper about 2 months ago

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Paper • 2502.18364 • Published Feb 25 • 36

upvoted a paper 2 months ago

Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarsity

Paper • 2502.11901 • Published Feb 17 • 6

liked a model 3 months ago

xwen-team/Xwen-72B-Chat

Text Generation • Updated Feb 4 • 37 • 32

liked a dataset 4 months ago

TAUR-Lab/MuSR

Viewer • Updated May 21, 2024 • 756 • 9.81k • 18

upvoted 3 papers 6 months ago

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Paper • 2411.02359 • Published Nov 4, 2024 • 13

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published Nov 4, 2024 • 36

LLM-based Optimization of Compound AI Systems: A Survey

Paper • 2410.16392 • Published Oct 21, 2024 • 15

liked 2 models 9 months ago

shenzhi-wang/Llama3.1-70B-Chinese-Chat

Text Generation • Updated Jul 29, 2024 • 389 • 44

shenzhi-wang/Llama3.1-8B-Chinese-Chat

Text Generation • Updated Jul 29, 2024 • 7.3k • 261

authored a paper 9 months ago

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Paper • 2407.08770 • Published Jul 11, 2024 • 21

upvoted a paper 9 months ago

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Paper • 2407.08770 • Published Jul 11, 2024 • 21

liked a model 10 months ago

shenzhi-wang/Gemma-2-27B-Chinese-Chat

Text Generation • Updated Jul 4, 2024 • 1.55k • 63

authored a paper 10 months ago

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

Paper • 2405.19026 • Published May 29, 2024 • 7

upvoted a paper 10 months ago

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

Paper • 2406.11230 • Published Jun 17, 2024 • 35

upvoted a paper 11 months ago

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

Paper • 2405.19026 • Published May 29, 2024 • 7

liked a model about 1 year ago

CohereLabs/c4ai-command-r-plus

Text Generation • Updated 7 days ago • 7.92k • 1.72k