- Judging LLM-as-a-judge with MT-Bench and Chatbot Arena (Paper • 2306.05685)
- prometheus-eval/Feedback-Collection (Dataset)
- prometheus-eval/prometheus-13b-v1.0 (Model • Text2Text Generation)
- HuggingFaceH4/ultrafeedback_binarized (Dataset)
Krzysztof Sopyla
ksopyla
AI & ML interests
NLP, knowledge extraction, knowledge graphs, semantic similarity, model factuality
Organizations
Collections: 2
Models: none public yet
Datasets: none public yet