Vezora commited on
Commit
0d8e63b
·
verified ·
1 Parent(s): 6847cbb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -3,7 +3,7 @@ license: apache-2.0
3
  ---
4
  <img src="https://huggingface.co/Vezora/Mistral-22B-v0.1/resolve/main/unsloth.png" width="100" height="150" />
5
 
6
- ### Mistral-22b-V.02 Release Announcement 🚀
7
 
8
  ## This model is not an moe, it is infact a 22B parameter dense model!
9
 
@@ -11,7 +11,7 @@ license: apache-2.0
11
  **Creator** [Nicolas Mejia-Petit](https://twitter.com/mejia_petit)
12
 
13
  ### Overview
14
- - Just two days after our release of **Mistral-22b-v0.1**, we are excited to introduce our handcrafted experimental model, **Mistral-22b-V.02**. This model is a culmination of equal knowledge distilled from all experts into a single, dense 22b model. This model is not a single trained expert, rather its a compressed MOE model, turning it into a dense 22b mode. This is the first working MOE to Dense model conversion.
15
  - v0.2 has trained on 8x more data than v0.1!
16
 
17
  ### Capabilities
 
3
  ---
4
  <img src="https://huggingface.co/Vezora/Mistral-22B-v0.1/resolve/main/unsloth.png" width="100" height="150" />
5
 
6
+ ### Mistral-22b-v.02 Release Announcement 🚀
7
 
8
  ## This model is not an moe, it is infact a 22B parameter dense model!
9
 
 
11
  **Creator** [Nicolas Mejia-Petit](https://twitter.com/mejia_petit)
12
 
13
  ### Overview
14
+ - Just two days after our release of **Mistral-22b-v0.1**, we are excited to introduce our handcrafted experimental model, **Mistral-22b-v.02**. This model is a culmination of equal knowledge distilled from all experts into a single, dense 22b model. This model is not a single trained expert, rather its a compressed MOE model, turning it into a dense 22b mode. This is the first working MOE to Dense model conversion.
15
  - v0.2 has trained on 8x more data than v0.1!
16
 
17
  ### Capabilities