Commit bcc175a
Parent: 7eb7a89
Update README.md

README.md CHANGED
@@ -35,7 +35,7 @@ According to the leaderboard description, here are the benchmarks used for the e
 ## Leaderboard Highlights (as of August 17, 2023)
 - Godzilla 2 70B ranks 4th, worldwide, in the Open LLM Leaderboard.
 - Godzilla 2 70B ranks #3 in the ARC challenge.
-- Godzilla 2 70B ranks #
+- Godzilla 2 70B ranks #5 in the TruthfulQA benchmark.
 - *Godzilla 2 70B beats GPT-3.5 (ChatGPT) in terms of average performance and the HellaSwag benchmark (87.53 > 85.5).
 - *Godzilla 2 70B outperforms GPT-3.5 (ChatGPT) and GPT-4 on the TruthfulQA benchmark (61.54 for G2-70B, 47 for GPT-3.5, 59 for GPT-4).
 - *Godzilla 2 70B is on par with GPT-3.5 (ChatGPT) on the MMLU benchmark (<0.12%).