hangyu guo

Rosiness

https://github.com/pygh0er

pygh0er

AI & ML interests

Natural Language Processing

Rosiness's activity

updated a model about 4 hours ago

Rosiness/Qwen2.5-VL-7B-Instruct-Mulberry-HY

Updated about 4 hours ago

published a model about 5 hours ago

Rosiness/Qwen2.5-VL-7B-Instruct-Mulberry-HY

Updated about 4 hours ago

upvoted a paper 6 days ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 8 days ago • 41

upvoted 2 papers 14 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 15 days ago • 61

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 18 days ago • 43

upvoted 3 papers 21 days ago

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models

Paper • 2503.18923 • Published 22 days ago • 12

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published 22 days ago • 29

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published 26 days ago • 49

upvoted 2 papers about 1 month ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published Mar 10 • 35

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 62

upvoted 2 papers about 2 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

upvoted 2 papers 2 months ago

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 47

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 104

upvoted a paper 3 months ago

Taming Teacher Forcing for Masked Autoregressive Video Generation

Paper • 2501.12389 • Published Jan 21 • 10

authored a paper 4 months ago

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Paper • 2412.01800 • Published Dec 2, 2024 • 6

upvoted a paper 5 months ago

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 35

authored a paper 6 months ago

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Paper • 2410.06555 • Published Oct 9, 2024 • 8

upvoted a paper 6 months ago

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Paper • 2410.06555 • Published Oct 9, 2024 • 8

commented on a paper 6 months ago

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Paper • 2410.06555 • Published Oct 9, 2024 • 8