7 9 3

yuzhe gu

vanilla1116

https://guyuzhe.site/

Liqu1d-G

AI & ML interests

LLM; Hallucination; Self-Improvement

Recent Activity

new activity 3 days ago

opencompass/anah-7b:Add missing metadata and clarify license

new activity 3 days ago

opencompass/anah-20b:Add missing metadata: `pipeline_tag`, `library_name`, and `license`

new activity 3 days ago

opencompass/anah-v2:Improve model card with library_name and pipeline_tag

View all activity

Organizations

vanilla1116's activity

New activity in opencompass/anah-7b 3 days ago

Add missing metadata and clarify license

#1 opened 3 days ago by

nielsr

New activity in opencompass/anah-20b 3 days ago

Add missing metadata: `pipeline_tag`, `library_name`, and `license`

#1 opened 3 days ago by

nielsr

New activity in opencompass/anah-v2 3 days ago

Improve model card with library_name and pipeline_tag

#1 opened 3 days ago by

nielsr

authored a paper 6 days ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published 6 days ago • 18

upvoted a paper 6 days ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published 6 days ago • 18

commented a paper 6 days ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published 6 days ago • 18 •

upvoted a paper 7 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 7 days ago • 62

authored a paper 20 days ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 28 days ago • 60

upvoted a paper 28 days ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 28 days ago • 60

commented a paper 28 days ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 28 days ago • 60 •

upvoted a paper about 2 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 92

updated a model 3 months ago

opencompass/anah-v2

Text Classification • Updated 3 days ago • 114 • 3

liked a Space 5 months ago

101

Open VLM Video Leaderboard

🌎

VLMEvalKit Eval Results in video understanding benchmark

upvoted a collection 7 months ago

InternLM2-Reward

Collection

InternLM2 Reward Models • 3 items • Updated 28 days ago • 4

upvoted a paper 7 months ago

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Paper • 2407.20183 • Published Jul 29, 2024 • 42

liked a model 8 months ago

internlm/Agent-FLAN-7b

Text Generation • Updated Mar 20, 2024 • 50 • 18

liked a dataset 8 months ago

internlm/Agent-FLAN

Preview • Updated Mar 20, 2024 • 140 • 72

New activity in opencompass/anah 8 months ago

[bot] Conversion to Parquet

#1 opened 8 months ago by

parquet-converter

authored a paper 8 months ago

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 31

upvoted a paper 8 months ago

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 31