Spaces: llm-council / README

justinxzhao committed on Jun 12, 2024

Commit 33cc960 (verified) · 1 Parent(s): b286409

Update README.md

Files changed (1)

README.md

@@ -7,14 +7,10 @@ sdk: static
 pinned: false
 ---
-<p align="center">
-  <img src="https://cdn-uploads.huggingface.co/production/uploads/6462ac71514ee1645bd1f7f7/6MkoY412i9IqvISWSS4qs.png">
-</p>
 The rapid advancement of Large Language Models (LLMs) necessitates robust
 and challenging benchmarks.
-To address the challenge of ranking LLMs on *highly subjective* tasks such as emotional intelligence, creative writing, or persuasiveness,
+To address the challenge of ranking LLMs on highly subjective tasks such as emotional intelligence, creative writing, or persuasiveness,
 the **Language Model Council (LMC)** operates through a democratic process to: 1) formulate a test set through
 equal participation, 2) administer the test among council members, and 3) evaluate
 responses as a collective jury.
@@ -24,5 +20,6 @@ and less biased than those from any individual LLM judge, and is more consistent
 Roadmap:
+- Use the Council to benchmark evaluative characteristics of LLM-as-a-Judge/Jury like bias, affinity, and agreement.
 - Expand to more domains, use cases, and sophisticated agentic interactions.
 - Produce a generalized user interface for Council-as-a-Service.