Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
m-ric 's Collections
GUI Agents
Could be useful one day
Scaling Laws πŸ“
πŸš€ Spinning Up in LLMs
πŸ§‘β€βš–οΈ LLM-as-a-judge
πŸ”Žβ‡’πŸ’¬ RAG
πŸ€– Agents
πŸ‘οΈ Vision
πŸ›£οΈ Grammar
πŸ’‘ Interpretability - understanding LLMs
LLM foundations
πŸ”§ Optimization Mechanics πŸ”§
🌍 Earth
Open-source AI Releases - August '24
Mother of all Training Clusters

πŸ§‘β€βš–οΈ LLM-as-a-judge

updated Nov 21, 2024
Upvote
1

  • Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

    Paper β€’ 2306.05685 β€’ Published Jun 9, 2023 β€’ 40

  • ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

    Paper β€’ 2312.10003 β€’ Published Dec 15, 2023 β€’ 44

  • Leveraging Large Language Models for NLG Evaluation: A Survey

    Paper β€’ 2401.07103 β€’ Published Jan 13, 2024 β€’ 4

  • Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

    Paper β€’ 2310.08491 β€’ Published Oct 12, 2023 β€’ 57

  • Running
    109

    Judge Arena

    πŸ’»
    109

    View and compare open‑source AI model rankings with ELO scores

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs