---
license: apache-2.0
---

Slerp Merge of shadowml/Marcoro14-7B-slerp and rishiraj/CatPPT

I've been meaning to mix in EmbeddedLLM/Mistral-7B-Merge-14-v0.1 but have had issues with it, so thanks to shadowml for merging it with AIDC-ai-business/Marcoroni-7B-v3.

Also, I've been hearing talk of AIDC-ai-business/Marcoroni-7B-v3 being contaminated. I don't know if this is true, but if you have evidence, please make a post on HuggingFaceH4/open_llm_leaderboard so we can keep the board clean.
.yaml file for mergekit:

```yaml
slices:
  - sources:
      - model: shadowml/Marcoro14-7B-slerp
        layer_range: [0, 32]
      - model: rishiraj/CatPPT
        layer_range: [0, 32]
merge_method: slerp
base_model: shadowml/Marcoro14-7B-slerp
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5 # fallback for rest of tensors
dtype: bfloat16
```