patricia-rocha committed 81a87ac (parent: 6e46ae4): Update README.md

README.md CHANGED
@@ -113,13 +113,14 @@ We then conducted their [automatic evaluation](https://github.com/FreedomIntelli
 This prompt was designed to elicit assessments of answers in terms of helpfulness, relevance, accuracy, and level of detail.
 [Additional prompts](https://github.com/FreedomIntelligence/LLMZoo/blob/main/llmzoo/eval/prompts/order/prompt_all.json) are provided for assessing overall performance on different perspectives.

-Follows the results against GPT-3.5
+Follows the results against GPT-3.5, two of the highest performing open-source models at the moment, Vicuna (13B) and Falcon (7B):

 * Automatic Evaluation **in Portuguese**:

 | | **Lose** | **Tie** | **Win** |
 |------------------------|----------|---------|---------|
 | QUOKKA vs. **GPT-3.5** | 63.8% | 10.1% | 26.1% |
+| QUOKKA vs. **Vicuna** | 66.2% | 8.8% | 25.0% |
 | QUOKKA vs. **Falcon** | 17.4% | 1.4% | 81.2% |

 ## Environmental impact
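The Lose/Tie/Win percentages in the table come from aggregating per-question pairwise judge verdicts. A minimal sketch of that aggregation, assuming a simple list of verdict strings (the function name and the illustrative counts are hypothetical, not part of the LLMZoo evaluation API; the counts here are merely chosen so the output matches the GPT-3.5 row above):

```python
from collections import Counter

def win_tie_lose_rates(verdicts):
    """Aggregate pairwise verdicts ('lose', 'tie', 'win' from QUOKKA's
    perspective) into percentages rounded to one decimal place."""
    counts = Counter(verdicts)
    total = len(verdicts)
    return {outcome: round(100 * counts[outcome] / total, 1)
            for outcome in ("lose", "tie", "win")}

# Illustrative: 69 pairwise comparisons reproducing the GPT-3.5 row.
verdicts = ["lose"] * 44 + ["tie"] * 7 + ["win"] * 18
print(win_tie_lose_rates(verdicts))  # {'lose': 63.8, 'tie': 10.1, 'win': 26.1}
```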