Kevin Wei
kevinlwei
·
AI & ML interests
Science of AI evaluations, AI evaluations, AI governance, legal evaluations
Recent Activity
updated
a Space
2 days ago
evaleval/README
published
a Space
2 days ago
evaleval/README
authored
a paper
3 months ago
Recommendations and Reporting Checklist for Rigorous & Transparent Human
Baselines in Model Evaluations