Update README.md
Browse files
README.md
CHANGED
@@ -10,11 +10,11 @@ tags:
|
|
10 |
|
11 |
A robust production grade and scalable SOTA (4 Benchmarks) model for Relative and Absolute grading of LLM (as well human) responses.
|
12 |
|
13 |
-
Given a question and it's response, it can judge the quality of response from a scale of 1-5. It is trained to be used in Absolute (1 Question - 1 Answer)
|
14 |
|
15 |
-
Question + Response to evaluate
|
16 |
-
Question + Response to evaluate + Custom Rubric (scoring criteria for your business use case)
|
17 |
-
Question + Response to evaluate + Custom Rubric + Reference Answer (A high Quality Answer which serves as the base)
|
18 |
|
19 |
Model adapted from https://github.com/deshwalmahesh/PHUDGE to make it compatible with HuggingFace Hub.
|
20 |
|
|
|
10 |
|
11 |
A robust production grade and scalable SOTA (4 Benchmarks) model for Relative and Absolute grading of LLM (as well human) responses.
|
12 |
|
13 |
+
Given a question and it's response, it can judge the quality of response from a scale of 1-5. It is trained to be used in Absolute (1 Question - 1 Answer) but can be used as Relative task too. It is supposed to work on Reference free settings too. So you can use it as following:
|
14 |
|
15 |
+
* Question + Response to evaluate
|
16 |
+
* Question + Response to evaluate + Custom Rubric (scoring criteria for your business use case)
|
17 |
+
* Question + Response to evaluate + Custom Rubric + Reference Answer (A high Quality Answer which serves as the base)
|
18 |
|
19 |
Model adapted from https://github.com/deshwalmahesh/PHUDGE to make it compatible with HuggingFace Hub.
|
20 |
|