ontocord
/

phi-3-22b-128k

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

huu-ontocord commited on May 23, 2024

Commit

bc7eca0

·

verified ·

1 Parent(s): 805a077

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -6,6 +6,9 @@ license: mit
 The Phi-3-22b is a depth upsampled version of the 14b  [Phi-3-medium-128k-instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct). We removed the bottom 8 layers of one copy of the 14b and the top 8 layers of another copy of the 14b model and stacked them. We plan to do continued pretraining to improve performance.
 Since this model has not been continued pretrained, the quality may vary.
 ```
 !pip install flash-attn --no-build-isolation
 !pip install peft bitsandbytes accelerate transformers

 The Phi-3-22b is a depth upsampled version of the 14b  [Phi-3-medium-128k-instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct). We removed the bottom 8 layers of one copy of the 14b and the top 8 layers of another copy of the 14b model and stacked them. We plan to do continued pretraining to improve performance.
 Since this model has not been continued pretrained, the quality may vary.
+Some tests of the model in [colab](https://colab.research.google.com/drive/1eLoQXhysnBmN7DNNB6yElpELOSe6DHHH?usp=sharing).
 ```
 !pip install flash-attn --no-build-isolation
 !pip install peft bitsandbytes accelerate transformers