Commit e268af1
1 Parent(s): d265440
Update README.md
README.md
CHANGED
@@ -11,7 +11,7 @@ tags: []
 
 ## Model Details
 
-This my attemp (probably too naive) to reproduce the upcycling process used to initialize [Qwen1.5-MoE-A2.7B](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B) using [Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B).
+This is my attemp (probably too naive) to reproduce the upcycling process used to initialize [Qwen1.5-MoE-A2.7B](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B) using [Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B).
 
 ## Upcycling script
 