14 21

Zhao Zihao

xishze

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

upvoted a paper 10 days ago

Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

upvoted a paper 11 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

View all activity

Organizations

None yet

upvoted a paper 9 days ago

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Paper • 2605.30888 • Published 16 days ago • 10

upvoted a paper 10 days ago

Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

Paper • 2605.29861 • Published 17 days ago • 16

upvoted a paper 11 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 16 days ago • 57

liked a dataset 12 days ago

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 169k • 1.25k

liked a model 13 days ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 1.68M • 3.28k

upvoted a paper 15 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 18 days ago • 423

liked a dataset 17 days ago

world-igr-plum/regions

Updated Jun 17, 2025 • 377k • 23

liked a model 20 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 18 days ago • 22.1k • • 1.11k

liked a dataset 22 days ago

HuggingFaceFW/finephrase

Viewer • Updated Mar 31 • 1.02B • 517k • 125

liked a model 22 days ago

tencent/Hy-MT2-30B-A3B

Translation • 30B • Updated 18 days ago • 5.76k • 457

upvoted a paper 22 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 195

liked a model 23 days ago

seraphimzzzz/824092

Updated 23 days ago • 1

upvoted 2 papers 23 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 26 days ago • 189

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Paper • 2605.15055 • Published about 1 month ago • 19

liked a model 26 days ago

nataliaaolmo/distilhubert-urbansound8k-finetuned1

23.7M • Updated 26 days ago • 22 • 1

liked a dataset about 1 month ago

maifoundations/VideoOdyssey

Viewer • Updated 17 days ago • 100 • 855 • 7

liked 2 models about 1 month ago

rafathasan/temp

Updated 6 days ago • 1

Theogott/spr-qwen3_5-9b-dora-vramsafe-gguf

Text Generation • 9B • Updated May 1 • 38 • 1

liked a dataset about 2 months ago

wegrthj/yzbw0u-akrw-raw

Preview • Updated Apr 28 • 2.69k • 1

upvoted a paper 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Zhao Zihao

AI & ML interests

Recent Activity

Organizations

xishze's activity