Gregor Betz PRO

ggbetz

AI & ML interests

Reasoning, AGI, AI Safety, AI Reliability

Articles

Organizations

Posts 1

view post
Post
1382
🥇Open CoT Leaderboard

We're delighted to announce the [Open CoT Leaderboard]( logikon/open_cot_leaderboard) on 🤗 Spaces.

Unlike other LLM performance leaderboards, the Open CoT Leaderboard is not tracking absolute benchmark accuracies, but relative **accuracy gains** due to **chain-of-thought**.

Eval datasets that underpin the leaderboard are hosted [here](https://huggingface.co/cot-leaderboard).

Feedback and suggestions more than welcome.

@clefourrier

models

None public yet

datasets

None public yet