huu-ontocord committed • 15cc43e • Parent(s): bc7eca0
Update README.md

README.md CHANGED
@@ -7,8 +7,6 @@ license: mit
 The Phi-3-22b is a depth upsampled version of the 14b [Phi-3-medium-128k-instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct). We removed the bottom 8 layers of one copy of the 14b model and the top 8 layers of another copy, then stacked them. We plan to do continued pretraining to improve performance.
 Since this model has not undergone continued pretraining, quality may vary.
 
-Some tests of the model in [colab](https://colab.research.google.com/drive/1eLoQXhysnBmN7DNNB6yElpELOSe6DHHH?usp=sharing).
-
 ```
 !pip install flash-attn --no-build-isolation
 !pip install peft bitsandbytes accelerate transformers
@@ -61,4 +59,9 @@ Will produce:
 ```
 <|user|> Explain why it is surprising that one can build a language model small enough to fit on a phone, yet almost as powerful as ChatGPT. Just use one funny sentence.<|end|><|assistant|> "Who knew that fitting a ChatGPT rival in your pocket would be easier than fitting a penguin in a pocket-sized suit!"<|end|>
 ```
+
+
+Some more tests of the model in [colab](https://colab.research.google.com/drive/1eLoQXhysnBmN7DNNB6yElpELOSe6DHHH?usp=sharing).
+
+
 See the [Phi-3-medium-128k-instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) model card for more details.
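The depth-upsampling recipe the README describes (drop the bottom 8 layers of one copy of the model and the top 8 layers of another copy, then stack the remainders) can be sketched in plain PyTorch. This is a minimal illustration on a toy layer stack, not the actual merge script used for Phi-3-22b; `depth_upsample` is a hypothetical helper name, and the `nn.Linear` modules stand in for transformer decoder blocks.

```python
import copy

import torch.nn as nn


def depth_upsample(layers: nn.ModuleList, n_drop: int) -> nn.ModuleList:
    """Stack two copies of a decoder-layer stack, per the recipe above:
    copy A keeps everything but its top n_drop layers, copy B keeps
    everything but its bottom n_drop layers, and A is placed under B."""
    copy_a = copy.deepcopy(layers[: len(layers) - n_drop])  # top n_drop removed
    copy_b = copy.deepcopy(layers[n_drop:])                 # bottom n_drop removed
    return nn.ModuleList(list(copy_a) + list(copy_b))


# Toy stand-in for the 14b model's 40 decoder blocks.
toy_layers = nn.ModuleList(nn.Linear(8, 8) for _ in range(40))
merged = depth_upsample(toy_layers, n_drop=8)
print(len(merged))  # (40 - 8) + (40 - 8) = 64 layers
```

Dropping 8 layers from each copy means the middle layers appear twice in the merged stack, which is why continued pretraining is planned to smooth the seam.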
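The sample transcript uses Phi-3's chat markup (`<|user|>`, `<|end|>`, `<|assistant|>`). As a rough sketch, a single-turn prompt in the format shown above can be assembled by hand like this; `build_phi3_prompt` is an illustrative helper, and in practice the tokenizer's `apply_chat_template` produces the canonical string.

```python
def build_phi3_prompt(user_message: str) -> str:
    """Assemble a single-turn prompt matching the chat markup shown in
    the sample output above (illustrative; prefer the tokenizer's own
    chat template in real use)."""
    return f"<|user|> {user_message}<|end|><|assistant|>"


prompt = build_phi3_prompt("Just use one funny sentence.")
print(prompt)  # <|user|> Just use one funny sentence.<|end|><|assistant|>
```

Generation is then run on this string, and the model's reply is read up to the next `<|end|>` token.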