Yuhang Wang's picture

2 5

Yuhang Wang

Rykeryh

AI & ML interests

None yet

Recent Activity

liked a dataset about 10 hours ago

basicv8vc/SimpleQA

liked a dataset 14 days ago

walledai/XSTest

liked a dataset 14 days ago

walledai/StrongREJECT

View all activity

Organizations

None yet

Rykeryh's activity

liked a dataset about 10 hours ago

basicv8vc/SimpleQA

Viewer • Updated Nov 5, 2024 • 4.33k • 2.11k • 11

liked 3 datasets 14 days ago

walledai/XSTest

Viewer • Updated Jul 4, 2024 • 450 • 535 • 7

walledai/StrongREJECT

Viewer • Updated Oct 18, 2024 • 313 • 528 • 8

walledai/AdvBench

Viewer • Updated Jul 4, 2024 • 520 • 3.52k • 24

liked a dataset about 1 month ago

Rykeryuhang/CDEval

Viewer • Updated May 27, 2024 • 2.95k • 132 • 3

upvoted a paper about 1 month ago

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

Paper • 2503.06580 • Published Mar 9 • 16

authored a paper 3 months ago

Don't Command, Cultivate: An Exploratory Study of System-2 Alignment

Paper • 2411.17075 • Published Nov 26, 2024 • 1

upvoted a paper 3 months ago

Don't Command, Cultivate: An Exploratory Study of System-2 Alignment

Paper • 2411.17075 • Published Nov 26, 2024 • 1

authored 3 papers 3 months ago

CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models

Paper • 2311.16421 • Published Nov 28, 2023

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Paper • 2412.16849 • Published Dec 22, 2024 • 9

AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation

Paper • 2311.07397 • Published Nov 13, 2023