metadata
license: llama2

Model merge based on lmsys/vicuna-7b-v1.5 and meta-math/MetaMath-Llemma-7B
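
This card does not state how the two checkpoints were combined. Purely as an illustration, one common way to merge models that share an architecture is linear interpolation of their weights. The sketch below shows that idea on toy parameter dictionaries; the function name `merge_linear`, the `alpha` mixing weight, and the toy values are all hypothetical and are not taken from this model's actual recipe. It assumes both checkpoints use identical parameter names and shapes.

```python
def merge_linear(state_a, state_b, alpha=0.5):
    """Interpolate two parameter dicts: alpha * a + (1 - alpha) * b.

    Hypothetical helper for illustration; parameters are plain lists of
    floats here, standing in for real weight tensors.
    """
    if state_a.keys() != state_b.keys():
        raise ValueError("checkpoints must have the same parameter names")
    merged = {}
    for name, a in state_a.items():
        b = state_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise weighted average of the two parameter vectors.
        merged[name] = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
    return merged


# Toy stand-ins for the two checkpoints (not real model weights).
toy_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
toy_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
merged = merge_linear(toy_a, toy_b, alpha=0.5)
```

With real checkpoints the same loop would run over the tensors in each model's state dict, and `alpha` would control how much each parent contributes. Whether this card's merge used equal weights, or linear interpolation at all, is not documented here.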

  1. Vicuna

    Model Details

    Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.

    • Developed by: LMSYS
    • Model type: An auto-regressive language model based on the transformer architecture
    • License: Llama 2 Community License Agreement
    • Finetuned from model: Llama 2

    Model Sources

  2. MetaMath Llemma

    Model Details

    MetaMath-Llemma-7B is fully fine-tuned on the MetaMathQA datasets, using the Llemma-7B model as its base. Training on the MetaMathQA datasets and changing the base model from Llama-2-7B to Llemma-7B raises MATH performance from 19.8 to 30.0.