Yaowei Zheng

hiyouga

https://github.com/hiyouga

AI & ML interests

LLM Knowledge Management

Recent Activity

liked a model 8 days ago

moonshotai/Kimi-VL-A3B-Instruct

updated a dataset 8 days ago

hiyouga/journeybench-multi-image-vqa

updated a dataset 8 days ago

hiyouga/math12k

hiyouga's activity

liked a model 8 days ago

moonshotai/Kimi-VL-A3B-Instruct

Image-Text-to-Text • Updated 2 days ago • 25.4k • 179

updated 3 datasets 8 days ago

published a dataset 8 days ago

hiyouga/journeybench-multi-image-vqa

Viewer • Updated 8 days ago • 313 • 144

upvoted a paper 13 days ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 15 days ago • 44

liked 2 models 15 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • Updated 13 days ago • 737k • 811

open-thoughts/OpenThinker2-32B

Text Generation • Updated 19 days ago • 902 • 46

New activity in Qwen/Qwen2.5-Omni-7B 16 days ago

Open-source Fine-tuning script of Qwen2.5-Omni 7B 🚀

#29 opened 21 days ago by hiyouga

updated a model 16 days ago

llamafactory/tiny-random-Llama-4

Image-Text-to-Text • Updated 16 days ago • 2.07k

published a model 16 days ago

llamafactory/tiny-random-Llama-4

Image-Text-to-Text • Updated 16 days ago • 2.07k

liked a model 22 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 7 days ago • 179k • 1.46k

upvoted a paper 22 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 25 days ago • 43

liked a dataset 25 days ago

m-a-p/neo_sft_phase2

Viewer • Updated Jun 12, 2024 • 109k • 148 • 52

liked a model 27 days ago

manycore-research/SpatialLM-Llama-1B

Text Generation • Updated Mar 21 • 18.6k • 955

New activity in hiyouga/gsm8k about 1 month ago

[bot] Conversion to Parquet

#1 opened about 1 month ago by parquet-converter

updated a dataset about 1 month ago

hiyouga/gsm8k

Viewer • Updated Mar 17 • 8.79k • 67

published a dataset about 1 month ago

hiyouga/gsm8k

Viewer • Updated Mar 17 • 8.79k • 67

upvoted a paper about 1 month ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 27

liked a model about 1 month ago

google/gemma-3-4b-it

Image-Text-to-Text • Updated Mar 21 • 611k • 455