kuotient
/

Llama-3-13B-Instruct-attenuated

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

kuotient commited on Apr 20, 2024

Commit

36d1e89

·

verified ·

1 Parent(s): ee7f534

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -7,9 +7,11 @@ tags:
 - merge
 ---
-# meta-llama-15b-2
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
 ### Merge Method

 - merge
 ---
+# Llama-3-11.5B-Instruct
+The core idea came from @jukofyork, see this [issue;](https://github.com/arcee-ai/mergekit/issues/198)
+As I understand, The concept of the idea is to make model think twice but leap same distances like original. but why 0.7071067812?
+> The scale factor to use, eg: solve x^2 = 1/2 --> x = 1/sqrt(2) ≈ 0.7071067812
 ## Merge Details
 ### Merge Method