spow12
/

MK_Nemo_12B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

spow12 commited on Nov 6, 2024

Commit

cc2ae0a

•

1 Parent(s): c870478

Update README.md

Files changed (1) hide show

README.md +20 -0

README.md CHANGED Viewed

@@ -1,6 +1,14 @@
 ---
 library_name: transformers
 license: cc-by-nc-4.0
 language:
 - ko
 - en
@@ -14,6 +22,18 @@ language:
 This model is a Supervised fine-tuned version of [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) with DeepSpeed and trl for korean.
 ### Trained Data
 - Trained with public, private data (about 130K)

 ---
 library_name: transformers
 license: cc-by-nc-4.0
+base_model:
+- anthracite-org/magnum-v4-12b
+- mistralai/Mistral-Nemo-Instruct-2407
+- werty1248/Mistral-Nemo-NT-Ko-12B-dpo
+tags:
+- mergekit
+- merge
 language:
 - ko
 - en
 This model is a Supervised fine-tuned version of [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) with DeepSpeed and trl for korean.
+Merge methods.
+```yaml
+models:
+    - model: anthracite-org/magnum-v4-12b
+    - model: mistralai/Mistral-Nemo-Instruct-2407
+    - model: spow12/Mistral-Nemo-Instruct-2407_sft_ver_4.4(private)
+    - model: werty1248/Mistral-Nemo-NT-Ko-12B-dpo
+merge_method: model_stock
+base_model: spow12/Mistral-Nemo-Instruct-2407_sft_ver_4.4(private)
+dtype: bfloat16
+```
 ### Trained Data
 - Trained with public, private data (about 130K)