Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ widget:
|
|
12 |
content: What is your favorite condiment?
|
13 |
---
|
14 |
|
15 |
-
This model is compressed from the Mixtral-8x7B
|
16 |
|
17 |
|
18 |
The model still retains its core performance:
|
|
|
12 |
content: What is your favorite condiment?
|
13 |
---
|
14 |
|
15 |
+
This model is compressed from the Mixtral-8x7B. Using Low-Rank Approximation, I removed 10 billion parameters from the MLP experts' matrices, enough to run the model on a single A100 80GB GPU using half precision.
|
16 |
|
17 |
|
18 |
The model still retains its core performance:
|