SakanaAI/EvoLLM-JP-v1-7B

@@ -11,37 +11,21 @@ language:
 <!-- Provide a quick summary of what the model is/does. -->
-**EvoLLM-JP-v1-7B** is a Japanese Math LLM by Evolutionary Model Merge.
-## Model Details
-### Model Description
-<!-- Provide a longer summary of what this model is. -->
-**EvoLLM-JP-v1-7B** is a Japanese Math LLM created by merging the following source models in the Parameter Space (PS) using Evolutionary Model Merge.
-- **Developed by:** [Sakana AI](https://sakana.ai/)
-- **Model type:** Autoregressive Language Model
-- **Language(s):** Japanese
-- **License:** [MICROSOFT RESEARCH LICENSE TERMS](./LICENSE)
-- **Source models:**
-  - [Shisa Gamma 7B v1](https://huggingface.co/augmxnt/shisa-gamma-7b-v1)
-  - [WizardMath 7B V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1)
-  - [Abel 7B 002](https://huggingface.co/GAIR/Abel-7B-002)
-### Model Sources
-<!-- Provide the basic links for the model. -->
-- **Repository:** [SakanaAI/evolutionary-model-merge](https://github.com/SakanaAI/evolutionary-model-merge)
-- **Paper:** TODO
-- **Blog:** TODO
 ## Usage
 Use the code below to get started with the model.
 ```python
 import torch
@@ -70,21 +54,23 @@ generated_text = tokenizer.batch_decode(output_ids, skip_special_tokens=True)[0]
 print(generated_text)
 ```
-## Evaluation
-We present results on the [MGSM-JA](https://huggingface.co/datasets/juletxara/mgsm) test set, comparing the performance of our evolved LLMs with that of the source LLMs.
-For details on the evaluation, please refer to Section 4.1 of the paper.
-If you want to reproduce the results, please see [our GitHub repository](https://github.com/SakanaAI/evolutionary-model-merge).
-| Id. | Model | Type | Params | MGSM-JA (acc &uarr; ) |
-| :--: | :-- | :-- | --: | --: |
-| 1 | [Shisa Gamma 7B v1](https://huggingface.co/augmxnt/shisa-gamma-7b-v1) | JA general | 7B | 9.6 |
-| 2 | [WizardMath 7B V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1) | EN math | 7B | 18.4 |
-| 3 | [Abel 7B 002](https://huggingface.co/GAIR/Abel-7B-002) | EN math | 7B | 30.0 |
-| 4 | [Arithmo2 Mistral 7B](https://huggingface.co/upaya07/Arithmo2-Mistral-7B) | EN math | 7B | 24.0 |
-| 5 | [EvoLLM-JP-v1-7B](https://huggingface.co/SakanaAI/EvoLLM-JP-v1-7B) | 1+2+3 | 7B | **52.0** |
-| 6 | [EvoLLM-JP-A-v1-7B](https://huggingface.co/SakanaAI/EvoLLM-JP-A-v1-7B) | 1+3+4 | 7B | **52.4** |
-| 7 | [EvoLLM-JP-v1-10B](https://huggingface.co/SakanaAI/EvoLLM-JP-v1-10B) | 1 + 5 | 10B | **55.6** |
 ## Acknowledgement

 <!-- Provide a quick summary of what the model is/does. -->
+**EvoLLM-JP-v1-7B** is an experimental general-purpose Japanese LLM. This model was created using the Evolutionary Model Merge method. Please refer to our [report](TODO) and [blog](TODO) for more details. This model was produced by merging the following models. We are grateful to the developers of the source models.
+- [Shisa Gamma 7B v1](https://huggingface.co/augmxnt/shisa-gamma-7b-v1)
+- [WizardMath 7B V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1)
+- [Abel 7B 002](https://huggingface.co/GAIR/Abel-7B-002)
 ## Usage
 Use the code below to get started with the model.
+<details>
+<summary> Click to expand </summary>
 ```python
 import torch
 print(generated_text)
 ```
+</details>
+## Model Details
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [Sakana AI](https://sakana.ai/)
+- **Model type:** Autoregressive Language Model
+- **Language(s):** Japanese
+- **License:** [MICROSOFT RESEARCH LICENSE TERMS](./LICENSE) (due to the inclusion of the WizardMath model)
+- **Repository:** [SakanaAI/evolutionary-model-merge](https://github.com/SakanaAI/evolutionary-model-merge)
+- **Paper:** TODO
+- **Blog:** TODO
 ## Acknowledgement