microsoft/Phi-3-mini-4k-instruct


gargamit committed on Jul 1, 2024

Commit 43660c7 · verified · 1 Parent(s): ba3e2e8

updated headers

Files changed (2)

README.md +3 -3
config.json +1 -0

README.md CHANGED

@@ -3,14 +3,14 @@ license: mit
 license_link: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct/resolve/main/LICENSE
 language:
-- multilingual
+- en
 pipeline_tag: text-generation
 tags:
 - nlp
 - code
 inference:
   parameters:
-    temperature: 0.7
+    temperature: 0.0
 widget:
   - messages:
       - role: user
@@ -81,7 +81,7 @@ The table below highlights improvements on instruction following, structure outp
 | MMLU	| 68.8	| 70.9 |
 | **Average**	| **21.9**	| **36.7** |
-Notes: if users would like to check out the previous version, use the git commit id **ff07dc01615f8113924aed013115ab2abd32115b**.
+Notes: if users would like to check out the previous version, use the git commit id **ff07dc01615f8113924aed013115ab2abd32115b**. For the model conversion, e.g. GGUF and other formats, we invite the community to experiment with various approaches and share your valuable feedback. Let's innovate together!
 ## How to Use
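
The README note points readers at commit ff07dc01615f8113924aed013115ab2abd32115b for the previous model version. The following is a minimal sketch of how that commit id could be used to address a file from that version. It assumes the Hub `resolve` URL pattern seen in `license_link` also accepts a commit id in place of `main`. The helper name `resolve_url` is hypothetical and is not part of any library; nothing is downloaded.

```python
from urllib.parse import quote

# Commit id quoted in the README note for the previous model version.
PREVIOUS_REVISION = "ff07dc01615f8113924aed013115ab2abd32115b"
REPO_ID = "microsoft/Phi-3-mini-4k-instruct"


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a Hub file URL following the pattern used by license_link.

    Hypothetical helper: it assumes a commit id can replace the branch name.
    """
    return (
        "https://huggingface.co/"
        f"{quote(repo_id)}/resolve/{quote(revision, safe='')}/{quote(filename)}"
    )


if __name__ == "__main__":
    # URL for the current LICENSE, matching the license_link value in the README.
    print(resolve_url(REPO_ID, "LICENSE"))
    # URL for config.json as it was at the previous version's commit.
    print(resolve_url(REPO_ID, "config.json", PREVIOUS_REVISION))
```

When loading the model through `transformers`, the same commit id is usually passed as the `revision` argument of `from_pretrained` rather than built into a URL by hand.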

config.json CHANGED

@@ -31,5 +31,6 @@
   "torch_dtype": "bfloat16",
   "transformers_version": "4.40.2",
   "use_cache": true,
+  "attention_bias": false,
   "vocab_size": 32064
 }
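
The config.json change adds a single key, `attention_bias`, set to `false`. As a rough illustration, the sketch below applies that change to the fields visible in the hunk and reports which keys differ. The dictionary holds only the tail of the file shown in the diff, not the full config, and the helper names `apply_commit` and `added_keys` are hypothetical.

```python
import json

# The last fields of config.json as shown on the old side of the diff.
OLD_TAIL = {
    "torch_dtype": "bfloat16",
    "transformers_version": "4.40.2",
    "use_cache": True,
    "vocab_size": 32064,
}


def apply_commit(config: dict) -> dict:
    """Return a copy with the key this commit adds (hypothetical helper)."""
    updated = dict(config)
    updated["attention_bias"] = False
    return updated


def added_keys(old: dict, new: dict) -> list:
    """List keys present in new but not in old, in sorted order."""
    return sorted(set(new) - set(old))


if __name__ == "__main__":
    new_tail = apply_commit(OLD_TAIL)
    print(added_keys(OLD_TAIL, new_tail))
    print(json.dumps(new_tail, indent=2))
```

The same comparison could be run on two full config files loaded with `json.load` to confirm that a commit touched only the keys expected.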