Update README.md
README.md CHANGED
@@ -221,7 +221,7 @@ Also, we evaluate our model on the benchmarks of the first leaderboard using `li
 | `FalconMamba-7B`<sup>*</sup> |62.03 | 80.82 | 62.11 | 73.64 | 53.42 | 52.54 | **64.09** |
 | `TRI-ML/mamba-7b-rw`<sup>*</sup> | 51.25 | 80.85 | 33.41 | 71.11 | 23.13 | 4.70 | 44.03 |
 |***Hybrid SSM-attention models***| | | | | | | |
-| `recurrentgemma-9b
+| `recurrentgemma-9b`<sup>**</sup> | 52.00 | 80.40 | 60.50 | 73.60 | 38.60 | 42.60 | 57.95 |
 | `Zyphra/Zamba-7B-v1`<sup>*</sup> | 56.14 | 82.23 | 58.11 | 79.87 | 36.23 | 30.78 | 57.23 |
 |***Transformer models*** | | | | | | | |
 | `Falcon2-11B` | 59.73 | 82.91 | 58.37 | 78.30 | 52.56 | 53.83 | **64.28** |
@@ -229,7 +229,7 @@ Also, we evaluate our model on the benchmarks of the first leaderboard using `li
 | `Mistral-7B-v0.1` | 59.98 | 83.31 | 64.16 | 78.37 | 42.15 | 37.83 | 60.97 |
 | `gemma-7B` | 61.09 | 82.20 | 64.56 | 79.01 | 44.79 | 50.87 | 63.75 |
 
-
+Most evaluation results are taken from the two leaderboards. For models marked with one *star* we evaluated the tasks internally, while for models marked with two *stars* the results were taken from the corresponding paper or model card.
 
 ## Throughput
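For reference, the Average column in the table above matches the arithmetic mean of the six benchmark scores in each row, up to rounding (e.g. `TRI-ML/mamba-7b-rw` recomputes to 44.08 vs. the reported 44.03, presumably because the reported value was averaged before rounding to two decimals). A minimal Python sketch re-checking the means, with the scores copied from the table:

```python
# Re-compute the Average column from the six per-benchmark scores in the table.
# Scores and reported averages are copied verbatim from the rows above.
rows = {
    "FalconMamba-7B":     ([62.03, 80.82, 62.11, 73.64, 53.42, 52.54], 64.09),
    "TRI-ML/mamba-7b-rw": ([51.25, 80.85, 33.41, 71.11, 23.13, 4.70], 44.03),
    "recurrentgemma-9b":  ([52.00, 80.40, 60.50, 73.60, 38.60, 42.60], 57.95),
    "Zyphra/Zamba-7B-v1": ([56.14, 82.23, 58.11, 79.87, 36.23, 30.78], 57.23),
    "Falcon2-11B":        ([59.73, 82.91, 58.37, 78.30, 52.56, 53.83], 64.28),
    "Mistral-7B-v0.1":    ([59.98, 83.31, 64.16, 78.37, 42.15, 37.83], 60.97),
    "gemma-7B":           ([61.09, 82.20, 64.56, 79.01, 44.79, 50.87], 63.75),
}

for name, (scores, reported) in rows.items():
    mean = sum(scores) / len(scores)
    # Small tolerance: reported averages were presumably computed from
    # unrounded per-task results before rounding to two decimals.
    flag = "" if abs(mean - reported) < 0.06 else "  <-- mismatch"
    print(f"{name:22s} mean={mean:6.2f} reported={reported:6.2f}{flag}")
```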