perlthoughts committed
Commit 34bc68a
1 Parent(s): 9513477

Update README.md

Files changed (1):
  1. README.md +22 -22
README.md CHANGED
@@ -6,6 +6,28 @@ license: apache-2.0
 
 <p><img src="https://huggingface.co/perlthoughts/Chupacabra-7B/resolve/main/chupacabra.jpeg" width=320></p>
 
+ ### Model Description
+
+ The merged models are all based on Mistral.
+
+ All model weights were merged using the SLERP method. More information below.
+
+ Advantages of the SLERP method over weight averaging are as follows:
+
+ Spherical Linear Interpolation (SLERP)
+ Traditionally, model merging often resorts to weight averaging, which, although straightforward, might not capture the intricate features of the models being merged. SLERP addresses this limitation, producing a blended model whose characteristics are smoothly interpolated from both parent models.
+
+ Smooth Transitions
+ SLERP ensures smoother transitions between model parameters, which is especially significant when interpolating between high-dimensional vectors.
+
+ Better Preservation of Characteristics
+ Unlike weight averaging, which can dilute distinct features, SLERP preserves the curvature and characteristics of both models in high-dimensional space.
+
+ Nuanced Blending
+ SLERP takes the geometric and rotational properties of the models in the vector space into account, resulting in a blend that better reflects both parent models' characteristics.
+
+ A list of all merged models and the merging path is coming soon.
+
 ## Purpose
 
 Merging the "thick"est model weights from Mistral models using training methods such as direct preference optimization (DPO) and reinforcement learning.
@@ -28,28 +50,6 @@ Here is my contribution.
 GPT4 User: {prompt}<|end_of_turn|>GPT4 Assistant:
 ```
 
- ### Model Description
-
- The merged models are all based on Mistral.
-
- All model weights were merged using the SLERP method. More information below.
-
- Advantages of the SLERP method over weight averaging are as follows:
-
- Spherical Linear Interpolation (SLERP)
- Traditionally, model merging often resorts to weight averaging, which, although straightforward, might not capture the intricate features of the models being merged. SLERP addresses this limitation, producing a blended model whose characteristics are smoothly interpolated from both parent models.
-
- Smooth Transitions
- SLERP ensures smoother transitions between model parameters, which is especially significant when interpolating between high-dimensional vectors.
-
- Better Preservation of Characteristics
- Unlike weight averaging, which can dilute distinct features, SLERP preserves the curvature and characteristics of both models in high-dimensional space.
-
- Nuanced Blending
- SLERP takes the geometric and rotational properties of the models in the vector space into account, resulting in a blend that better reflects both parent models' characteristics.
-
- A list of merged models is coming soon, as well as more information on merging techniques and methods.
-
 ### Bug fixes
 
 - Fixed an issue with generation caused by incorrect model weights. The model weights have been corrected and generation works again. Re-uploading the GGUF files to the GGUF repository, as well as the AWQ versions.
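
For readers who want to see what the SLERP merge described in the diff looks like in practice, below is a minimal PyTorch sketch. It assumes two Mistral-architecture state dicts with identical keys and shapes; the helper names (`slerp`, `merge_state_dicts`) and the interpolation factor `t = 0.5` are illustrative assumptions, not the exact configuration used for this model.

```python
import torch

def slerp(t: float, v0: torch.Tensor, v1: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Spherical linear interpolation between two weight tensors.

    Falls back to plain linear interpolation when the tensors are
    (nearly) colinear, where SLERP is numerically unstable.
    """
    v0_f = v0.flatten().float()
    v1_f = v1.flatten().float()

    # Angle between the two weight vectors, computed on unit-norm copies.
    v0_u = v0_f / (v0_f.norm() + eps)
    v1_u = v1_f / (v1_f.norm() + eps)
    dot = torch.clamp(torch.dot(v0_u, v1_u), -1.0, 1.0)

    if 1.0 - dot.abs() < eps:
        # Nearly parallel vectors: plain linear interpolation.
        merged = (1.0 - t) * v0_f + t * v1_f
    else:
        theta = torch.arccos(dot)
        sin_theta = torch.sin(theta)
        merged = (torch.sin((1.0 - t) * theta) / sin_theta) * v0_f \
               + (torch.sin(t * theta) / sin_theta) * v1_f

    return merged.reshape(v0.shape).to(v0.dtype)

def merge_state_dicts(sd_a: dict, sd_b: dict, t: float = 0.5) -> dict:
    """SLERP-merge two state dicts with identical keys and shapes (hypothetical helper)."""
    return {name: slerp(t, sd_a[name], sd_b[name]) for name in sd_a}

# Example usage (hypothetical models):
# merged_sd = merge_state_dicts(model_a.state_dict(), model_b.state_dict(), t=0.5)
# model_a.load_state_dict(merged_sd)
```

Per-tensor SLERP like this preserves the direction of each parent's weight vector rather than averaging magnitudes, which is the property the README's "Better Preservation of Characteristics" paragraph refers to.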
 
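
As a usage note on the prompt format quoted in the diff (the `GPT4 User:` / `GPT4 Assistant:` turn format used by OpenChat-style models), here is a small sketch of filling that template in Python. The function and variable names are hypothetical and not part of the repository.

```python
# Build a single-turn prompt using the template shown in the README diff.
PROMPT_TEMPLATE = "GPT4 User: {prompt}<|end_of_turn|>GPT4 Assistant:"

def build_prompt(user_message: str) -> str:
    """Fill the single-turn template with the user's message."""
    return PROMPT_TEMPLATE.format(prompt=user_message)

if __name__ == "__main__":
    print(build_prompt("Tell me about the chupacabra."))
    # GPT4 User: Tell me about the chupacabra.<|end_of_turn|>GPT4 Assistant:
```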