metadata
base_model: mistralai/Mixtral-8x7B-v0.1
inference: false
license: apache-2.0
language:
- zh
- en
Chinese-Mixtral-LoRA
Chinese Mixtral GitHub repository: https://github.com/ymcui/Chinese-Mixtral
This repository contains Chinese-Mixtral-LoRA, which is further pre-trained on Mixtral-8x7B-v0.1.
Note: You must combine LoRA with the original Mixtral-8x7B-v0.1 to obtain full weight.
Others
For full model, please see: https://huggingface.co/hfl/chinese-mixtral
For GGUF model (llama.cpp compatible), please see: https://huggingface.co/hfl/chinese-mixtral-gguf
If you have questions/issues regarding this model, please submit an issue through https://github.com/ymcui/Chinese-Mixtral/.
Citation
Please consider cite our paper if you use the resource of this repository. Paper link: https://arxiv.org/abs/2403.01851
@article{chinese-mixtral,
title={Rethinking LLM Language Adaptation: A Case Study on Chinese Mixtral},
author={Cui, Yiming and Yao, Xin},
journal={arXiv preprint arXiv:2403.01851},
url={https://arxiv.org/abs/2403.01851},
year={2024}
}