How are the n-shot prompts selected?

#39

by SamG123 - opened Jun 2, 2023

Discussion

SamG123

Jun 2, 2023

How are the n shots identified and selected? Are they the same for all models?

SamG123

Jun 5, 2023

The llm_leaderboard says that 25 few-shot examples are used for the ARC dataset. How are these 25 examples chosen? And which n-shot prompts are used for the other benchmarks on this leaderboard?

kmfoda

Jun 5, 2023 (edited Jun 5, 2023)

Hey, I had the same question and then read in the HELM repo that you can write out the prompts that were used with:

python write_out.py \
    --tasks all_tasks \
    --num_fewshot 5 \
    --num_examples 10 \
    --output_base_path /path/to/output/folder

If you run that with the ARC task in place of all_tasks and set --num_fewshot to 25, the output might be the prompts that were used here.
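For intuition about what such a script writes out, here is a minimal sketch of how an n-shot prompt could be assembled. It is an illustration only, not the leaderboard's or the harness's actual code: the function name `build_fewshot_prompt`, the `Question:`/`Answer:` template, the seed value, and the toy data are all assumptions. The point it shows is that if the shots are drawn from a training split with a fixed seed, every model sees the same prompt.

```python
import random


def build_fewshot_prompt(train_docs, test_doc, num_fewshot, seed=1234):
    """Sample `num_fewshot` solved examples and prepend them to the test question.

    Hypothetical helper: the template and the seed are assumptions.
    """
    rng = random.Random(seed)  # fixed seed, so the same shots are drawn on every call
    shots = rng.sample(train_docs, num_fewshot)
    parts = [f"Question: {d['question']}\nAnswer: {d['answer']}" for d in shots]
    # The test question goes last, with the answer left open for the model.
    parts.append(f"Question: {test_doc['question']}\nAnswer:")
    return "\n\n".join(parts)


# Toy data standing in for a training split (not real ARC items).
train_docs = [{"question": f"q{i}", "answer": f"a{i}"} for i in range(100)]
test_doc = {"question": "held-out question"}

prompt_a = build_fewshot_prompt(train_docs, test_doc, num_fewshot=25)
prompt_b = build_fewshot_prompt(train_docs, test_doc, num_fewshot=25)
```

Under these assumptions `prompt_a` and `prompt_b` are identical, so the selection does not depend on which model is being evaluated. Whether the real setup samples once per task or once per test question is something the written-out prompts would show.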

SamG123 changed discussion status to closed Jun 26, 2023
