metadata
license: apache-2.0
language:
  - en
pipeline_tag: text-generation
tags:
  - serialization


Model Card for Neural-Zephyr Mistral 14B

Intel and Hugging Face developed two of the most prominent Mistral-type models released: Neural-Chat and Zephyr.

Neural-Zephyr is a hybrid transfer-learning model that joins the weights of the Neural-Chat and Zephyr Mistral-type models.
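This card does not state how the two sets of weights are joined. As a rough illustration only, the toy sketch below assumes one possible approach, stacking the layers of one model after the other, to show why joining two models of equal size would roughly double the parameter count. The helper names (`make_toy_model`, `count_params`, `stack_layers`) and the tiny layer sizes are hypothetical and are not taken from the actual Neural-Zephyr build.

```python
# Toy illustration only. Layer stacking is an ASSUMPTION about how two
# checkpoints could be joined; the model card does not state the real method.
# All names and sizes below are hypothetical.

def make_toy_model(name, num_layers, params_per_layer):
    """Build a stand-in model: a list of layers, each mapping a name to dummy weights."""
    return [
        {f"{name}.layer{i}.weight": [0.0] * params_per_layer}
        for i in range(num_layers)
    ]

def count_params(model):
    """Count every scalar weight across all layers."""
    return sum(len(weights) for layer in model for weights in layer.values())

def stack_layers(model_a, model_b):
    """Join two models by placing model_b's layers after model_a's."""
    return model_a + model_b

neural_chat = make_toy_model("neural_chat", num_layers=4, params_per_layer=10)
zephyr = make_toy_model("zephyr", num_layers=4, params_per_layer=10)
merged = stack_layers(neural_chat, zephyr)

# The merged stack has as many parameters as both inputs combined.
print(count_params(neural_chat), count_params(zephyr), count_params(merged))
```

In a real transformer the embedding and output layers would not be duplicated, so the merged total would be somewhat less than an exact doubling.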

Zephyr is a series of language models trained to act as helpful assistants. Zephyr-7B-β is the second model in the series, a fine-tuned version of mistralai/Mistral-7B-v0.1 trained on a mix of publicly available, synthetic datasets using Direct Preference Optimization (DPO). According to the Zephyr model card, removing the in-built alignment of these datasets made the model more helpful. However, this means that the model is likely to generate problematic text when prompted to do so. You can find more details in the technical report.

Model description

  • Model type: A 14B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets.
  • Language(s) (NLP): Primarily English
  • License: MIT
  • Finetuned from model: mistralai/Mistral-7B-v0.1