arxiv:2310.11865
Xiaoyuan Liu
littleRound
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
JudgeBench: A Benchmark for Evaluating LLM-based Judges
Organizations
Papers
1
Models
None public yet
Datasets
None public yet