Spaces:

instructkr
/

ko-chatbot-arena-leaderboard

Running

Where is the benchmarking dataset coming from?

by zhiminy - opened Mar 18, 2024

Mar 18, 2024

This comment has been hidden

인스트럭트.한국 org Mar 18, 2024

There’s no such dataset being used.

It’s elo arena

Mar 18, 2024

This comment has been hidden

인스트럭트.한국 org Mar 18, 2024

People prompt 2 model and choose better model. (blinded test)
Then we use elo algorithm to change elo of model.

Mar 18, 2024

•

People prompt 2 model and choose better model. (blinded test)
Then we use elo algorithm to change elo of model.

My mistake, I found it is actually the users who propose prompts directly...

zhiminy changed discussion status to closed Mar 18, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment