Update README.md
README.md
CHANGED
@@ -29,6 +29,7 @@ The following models were included in the merge:
 
 The following YAML configuration was used to produce this model:
 
+
 ```yaml
 
 slices:
@@ -44,7 +45,7 @@ dtype: bfloat16
 
 [PruneMe](https://github.com/arcee-ai/PruneMe) has been utilized using the wikimedia/wikipedia Norwegian (no) subset by investigating layer similarity with 2000 samples. The layer ranges for pruning were determined based on this analysis to maintain performance while reducing model size.
 
-![Layer Similarity Plot](https://cdn-uploads.huggingface.co/production/uploads/660c0a02cf274b3ab77dd6b7/
+![Layer Similarity Plot](https://cdn-uploads.huggingface.co/production/uploads/660c0a02cf274b3ab77dd6b7/qO1JQv2NN-dKXYTwU_lTl.png)
 
 ## Direct Use
 Research on large language models; as a foundation for further specialization and finetuning for specific usecases (e.g., summarization, text generation, chatbot, etc.)
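
The README text in this diff says the pruned layer ranges were chosen by investigating layer similarity over 2000 samples. The sketch below illustrates the general idea only and is not PruneMe's code. It assumes similarity is measured as the angular distance between the representation entering a block of layers and the representation leaving it, averaged over samples. The function names (`angular_distance`, `block_distances`, `most_redundant_block`) and the synthetic data are hypothetical. A real analysis would use hidden states collected from the model on the Norwegian Wikipedia samples.

```python
import math
import random


def angular_distance(u, v):
    """Angular distance between two vectors, scaled to the range [0, 1]."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    cos = max(-1.0, min(1.0, dot / norm))  # clamp against rounding error
    return math.acos(cos) / math.pi


def block_distances(states, block_size):
    """Mean distance across samples between the input and output of each block.

    states[l][s] is the representation of sample s at layer boundary l.
    Entry i of the result compares states[i] with states[i + block_size].
    """
    result = []
    for start in range(len(states) - block_size):
        per_sample = [
            angular_distance(a, b)
            for a, b in zip(states[start], states[start + block_size])
        ]
        result.append(sum(per_sample) / len(per_sample))
    return result


def most_redundant_block(states, block_size):
    """Return (start, distance) for the block that changes representations least."""
    distances = block_distances(states, block_size)
    start = min(range(len(distances)), key=distances.__getitem__)
    return start, distances[start]


# Synthetic stand-in for real hidden states (assumption): layers 3 and 4
# barely change their input, so the block starting at boundary 3 should win.
rng = random.Random(0)
num_samples, dim = 50, 16
step_scale = [1.0, 1.0, 1.0, 0.01, 0.01, 1.0, 1.0, 1.0]

states = [[[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_samples)]]
for scale in step_scale:
    previous = states[-1]
    states.append([[x + scale * rng.gauss(0, 1) for x in vec] for vec in previous])

start, dist = most_redundant_block(states, block_size=2)
print(f"least-changing block starts at layer {start}, mean distance {dist:.4f}")
```

A block with a small mean distance is a candidate for removal because its output stays close to its input. The block size here is an illustrative parameter, not a value taken from this model's configuration.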