fine-tuned-judge / README.md

lmzheng

Create README.md

04c6e60 about 1 year ago

preview code

raw

history blame

No virus

243 Bytes

This is a 3-way classifier judge model fine-tuned on the Chatbot Arena human preference dataset. The base model is llama 13B. More details can be found in the Appendix. F of this [paper](Judging LLM-as-a-judge with MT-Bench and Chatbot Arena).