Update README.md
Browse files
README.md
CHANGED
@@ -5,8 +5,22 @@ library_name: transformers
|
|
5 |
tags:
|
6 |
- mergekit
|
7 |
- merge
|
8 |
-
|
|
|
|
|
|
|
9 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
# model
|
11 |
|
12 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
|
|
5 |
tags:
|
6 |
- mergekit
|
7 |
- merge
|
8 |
+
- llama3
|
9 |
+
license: llama3
|
10 |
+
language:
|
11 |
+
- en
|
12 |
---
|
13 |
+
|
14 |
+
Meta's Llama 3 8B pruned to 7B parameters(w/ 29 layers). Layers to prune selected using PruneMe repo on Github.
|
15 |
+
|
16 |
+
- layers_to_skip = 3
|
17 |
+
- Layer 24 to 27 has the minimum average distance of 0.15680849609375.
|
18 |
+
|
19 |
+
- [ ] To Do : Post pruning training.
|
20 |
+
|
21 |
+

|
22 |
+
|
23 |
+
|
24 |
# model
|
25 |
|
26 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|