nguyenbh committed on
Commit 94d2ad2 · verified · 1 Parent(s): eda34e1

Fix grammatical errors

Files changed (1)
  1. README.md +4 -4
README.md CHANGED
@@ -12,12 +12,12 @@ tags:
 
  ## Model Summary
 
- The Phi-3-Mini-128K-Instruct is a 3.8B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties.
+ The Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets.
+ This dataset includes both synthetic data and filtered publicly available website data, with an emphasis on high-quality and reasoning-dense properties.
  The model belongs to the Phi-3 family with the Mini version in two variants [4K](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) and [128K](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) which is the context length (in tokens) that it can support.
 
- The model has underwent a post-training process that incorporates both supervised fine-tuning and direct preference optimization for the instruction following and safety measures.
- When assessed against benchmarks testing common sense, language understanding, math, code, long context and logical reasoning, Phi-3 Mini-4K-Instruct showcased a robust and state-of-the-art performance among models with less than 13 billion parameters.
-
+ After initial training, the model underwent a post-training process that involved supervised fine-tuning and direct preference optimization to enhance its ability to follow instructions and adhere to safety measures.
+ When evaluated against benchmarks that test common sense, language understanding, mathematics, coding, long-term context, and logical reasoning, the Phi-3 Mini-128K-Instruct demonstrated robust and state-of-the-art performance among models with fewer than 13 billion parameters.
  Resources and Technical Documentation:
 
  + [Phi-3 Microsoft Blog](https://aka.ms/phi3blog-april)