Commit History

Likert-5 by default with Prometheus prompt template
65ba9f3
verified

kaikaidai commited on

Updated policy
0b4911a
verified

kaikaidai commited on

Update common.py
47102a0
verified

kaikaidai commited on

Update common.py
68c49ca
verified

kaikaidai commited on

Update gen_api_answer.py
40a124e
verified

kaikaidai commited on

Update requirements.txt
7c6d478
verified

kaikaidai commited on

Update gen_api_answer.py
acbea0e
verified

kaikaidai commited on

Cohere models
d1f5b6e
verified

kaikaidai commited on

Update data/models.jsonl
31f4c9b
verified

kaikaidai commited on

Create random_sample_generation.py
7d0e577
verified

kaikaidai commited on

Update gen_api_answer.py
6e812c0
verified

kaikaidai commited on

Update common.py
e098d1e
verified

kaikaidai commited on

Update app.py
8bba8de
verified

kaikaidai commited on

Renamed accordion to "Edit Judge Prompt" & added messaging around "Turbo" models
f2d7524
verified

kaikaidai commited on

Improved random sample gen
d31079e
verified

kaikaidai commited on

Update app.py
82ae430
verified

kaikaidai commited on

Update common.py
f3cb34b
verified

kaikaidai commited on

Update app.py
47e4bdb
verified

kaikaidai commited on

Added 3.5 Haiku / Sonnet
4bc1049
verified

kaikaidai commited on

Update common.py
14ad351
verified

kaikaidai commited on

Update common.py
c873398
verified

kaikaidai commited on

Update app.py
9fb17ef
verified

kaikaidai commited on

Update gen_api_answer.py
5407740
verified

kaikaidai commited on

Update common.py
9897c61
verified

kaikaidai commited on

Update gen_api_answer.py
9b05683
verified

kaikaidai commited on

Create leaderboard.py
5267683
verified

kaikaidai commited on

Update gen_api_answer.py
ab62ff3
verified

kaikaidai commited on

13-14 Nov changes
d4256bf
verified

kaikaidai commited on

Improve JSON parsing
44387c3
verified

kaikaidai commited on

Update app.py
23f2441
verified

kaikaidai commited on

Updated prompt
6ec66e7
verified

kaikaidai commited on

Update app.py
b342f89
verified

kaikaidai commited on

Simplified
af1f413
verified

kaikaidai commited on

Simplified
4f951e1
verified

kaikaidai commited on

Simplified
36bdd78
verified

kaikaidai commited on

Update app.py
8c60083
verified

kaikaidai commited on

Update example_metrics.py
e78f2d8
verified

kaikaidai commited on

UI changes 11 Nov
58f5f61
verified

kaikaidai commited on

UI changes 11 Nov
dcdb545
verified

kaikaidai commited on

Leaderboard, decimal places
ced5a34
verified

kaikaidai commited on

Update app.py
8863707
verified

kaikaidai commited on

Update app.py
be3c6a3
verified

kaikaidai commited on

Update app.py
06545f5
verified

kaikaidai commited on

User_id fix
3201182
verified

kaikaidai commited on

Sort leaderboard auto
15a404c
verified

kaikaidai commited on

Updated language
c514098
verified

kaikaidai commited on

Added Qwen 2.5 and Mistral
ebdee62
verified

kaikaidai commited on

Update common.py
3f0d906
verified

kaikaidai commited on

Update app.py
b58e0f1
verified

kaikaidai commited on