leaderboard / requirements.txt
pminervini's picture
update
79ad88b
raw
history blame
411 Bytes
torch
colorama
APScheduler
black
click
datasets
gradio
gradio_client
huggingface-hub
matplotlib
numpy
pandas
plotly
python-dateutil
requests
semantic-version
tqdm
wandb
transformers>=4.36.0
tokenizers>=0.15.0
lm_eval[ifeval] @ git+https://github.com/EleutherAI/lm-evaluation-harness.git
accelerate
sentencepiece
langdetect
sacrebleu
cchardet
rouge_score
bert-score
evaluate
# spacy
# selfcheckgpt
immutabledict