Fix grammatical errors
README.md
CHANGED
@@ -12,12 +12,12 @@ tags:
 
 ## Model Summary
 
-The Phi-3-Mini-128K-Instruct is a 3.
+The Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets.
+This dataset includes both synthetic data and filtered publicly available website data, with an emphasis on high-quality and reasoning-dense properties.
 The model belongs to the Phi-3 family with the Mini version in two variants [4K](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) and [128K](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) which is the context length (in tokens) that it can support.
 
-
-When
-
+After initial training, the model underwent a post-training process that involved supervised fine-tuning and direct preference optimization to enhance its ability to follow instructions and adhere to safety measures.
+When evaluated against benchmarks that test common sense, language understanding, mathematics, coding, long-term context, and logical reasoning, the Phi-3 Mini-128K-Instruct demonstrated robust and state-of-the-art performance among models with fewer than 13 billion parameters.
 Resources and Technical Documentation:
 
 + [Phi-3 Microsoft Blog](https://aka.ms/phi3blog-april)