HuggingFaceH4/zephyr-7b-gemma-v0.1

lewtun (HF staff) committed on Mar 1

Commit 5a6c0cd • 1 Parent(s): 9c6d73c

Add tables

Files changed (1)

README.md +12 -9

README.md CHANGED

@@ -49,15 +49,18 @@ Zephyr is a series of language models that are trained to act as helpful assista
 ## Performance
-At the time of release, Zephyr 7B Gemma is the highest ranked 7B chat model on the [MT-Bench](https://huggingface.co/spaces/lmsys/mt-bench) and [AlpacaEval](https://tatsu-lab.github.io/alpaca_eval/) benchmarks:
-In particular, on several categories of MT-Bench, Zephyr 7B Gemma has strong performance compared to larger open models like Llama2-Chat-70B:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6200d0a443eb0913fa2df7cc/raxvt5ma16d7T23my34WC.png)
-However, on more complex tasks like coding and mathematics, Zephyr 7B Gemma lags behind proprietary models and more research is needed to close the gap.
+|                                 Model                                 |MT Bench|IFEval|
+|-----------------------------------------------------------------------|------:|------:|
+|[zephyr-7b-gemma](https://huggingface.co/HuggingFaceH4/zephyr-7b-gemma)|  7.81 |  28.76|
+|[zephyr-7b-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta)  |  7.34 |  43.81|
+|[gemma-7b-it](https://huggingface.co/google/gemma-7b-it)               |  6.38 |  38.01|
+|                                 Model                                 |AGIEval|GPT4All|TruthfulQA|BigBench|Average|
+|-----------------------------------------------------------------------|------:|------:|---------:|-------:|------:|
+|[zephyr-7b-gemma](https://huggingface.co/HuggingFaceH4/zephyr-7b-gemma)|  34.22|  66.37|     52.19|   37.10|  47.47|
+|[zephyr-7b-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta)  |  37.52|  71.77|     55.26|   39.77|  51.08|
+|[gemma-7b-it](https://huggingface.co/google/gemma-7b-it)               |  21.33|  40.84|     41.70|   30.25|  33.53|
 ## Intended uses & limitations
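
As a quick sanity check on the added tables, the `Average` column appears to be the plain arithmetic mean of the four benchmark columns (AGIEval, GPT4All, TruthfulQA, BigBench). The sketch below recomputes it under that assumption. The scores are copied from the table; the names `SCORES`, `REPORTED_AVERAGE`, and `recomputed_averages` are hypothetical and not part of the repository.

```python
from statistics import mean

# Scores copied from the second table, in column order:
# AGIEval, GPT4All, TruthfulQA, BigBench.
SCORES = {
    "zephyr-7b-gemma": [34.22, 66.37, 52.19, 37.10],
    "zephyr-7b-beta": [37.52, 71.77, 55.26, 39.77],
    "gemma-7b-it": [21.33, 40.84, 41.70, 30.25],
}

# "Average" column as reported in the table.
REPORTED_AVERAGE = {
    "zephyr-7b-gemma": 47.47,
    "zephyr-7b-beta": 51.08,
    "gemma-7b-it": 33.53,
}


def recomputed_averages(scores):
    """Return the unweighted mean of each model's scores, rounded to 2 decimals.

    Assumption: the table's Average is an unweighted mean of the four columns.
    """
    return {model: round(mean(values), 2) for model, values in scores.items()}


if __name__ == "__main__":
    for model, avg in recomputed_averages(SCORES).items():
        # A small tolerance absorbs floating-point noise after rounding.
        matches = abs(avg - REPORTED_AVERAGE[model]) < 0.005
        print(f"{model}: recomputed={avg:.2f} reported={REPORTED_AVERAGE[model]:.2f} match={matches}")
```

If a recomputed value ever disagreed with the reported one, that would point to either a transcription error in the table or a different averaging scheme than the one assumed here.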