AlessandroW's picture
Update README.md
1c9667c verified
|
raw
history blame
3 kB
metadata
license: mit
license_link: https://huggingface.co/microsoft/Phi-3-mini-128k-instruct/resolve/main/LICENSE
language:
  - en
pipeline_tag: text-generation
tags:
  - nlp
  - code

Model Summary

This repo provides the GGUF format for the Phi-3-Mini-128K-Instruct.

For more details check out the original model at microsoft/Phi-3-mini-128k-instruct.

TThe Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets. This dataset includes both synthetic data and filtered publicly available website data, with an emphasis on high-quality and reasoning-dense properties. The model belongs to the Phi-3 family with the Mini version in two variants 4K and 128K which is the context length (in tokens) that it can support.

After initial training, the model underwent a post-training process that involved supervised fine-tuning and direct preference optimization to enhance its ability to follow instructions and adhere to safety measures. When evaluated against benchmarks that test common sense, language understanding, mathematics, coding, long-term context, and logical reasoning, the Phi-3 Mini-128K-Instruct demonstrated robust and state-of-the-art performance among models with fewer than 13 billion parameters. Resources and Technical Documentation:

Resources and Technical Documentation:

This repo provides GGUF files for the Phi-3 Mini-128K-Instruct model.

Name Quant method Bits Size Use case
Phi-3-mini-128k-instruct-Q4_K_M.gguf Q4_K_M 4 2.39 GB medium, balanced quality - recommended
Phi-3-mini-128k-instruct-f16.gguf None 16 7.2 GB minimal quality loss

License

The model is licensed under the MIT license.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft’s Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.