munish0838 commited on
Commit
1105a76
1 Parent(s): 9ca5060

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +63 -0
README.md ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ base_model:
5
+ - Qwen/Qwen2.5-Coder-7B-Instruct
6
+ - Qwen/Qwen2.5-7B-Instruct
7
+ - Qwen/Qwen2.5-Math-7B-Instruct
8
+ library_name: transformers
9
+ tags:
10
+ - mergekit
11
+ - merge
12
+
13
+
14
+ ---
15
+
16
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
17
+
18
+
19
+ # QuantFactory/Qwen2.5-7B-Instruct-MathCoder-GGUF
20
+ This is quantized version of [DeepMount00/Qwen2.5-7B-Instruct-MathCoder](https://huggingface.co/DeepMount00/Qwen2.5-7B-Instruct-MathCoder) created using llama.cpp
21
+
22
+ # Original Model Card
23
+
24
+ # merge
25
+
26
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
27
+
28
+ ## Merge Details
29
+ ### Merge Method
30
+
31
+ This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) as a base.
32
+
33
+ ### Models Merged
34
+
35
+ The following models were included in the merge:
36
+ * [Qwen/Qwen2.5-Coder-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct)
37
+ * [Qwen/Qwen2.5-Math-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-7B-Instruct)
38
+
39
+ ### Configuration
40
+
41
+ The following YAML configuration was used to produce this model:
42
+
43
+ ```yaml
44
+ models:
45
+ - model: Qwen/Qwen2.5-7B-Instruct
46
+ #no parameters necessary for base model
47
+ - model: Qwen/Qwen2.5-Math-7B-Instruct
48
+ parameters:
49
+ density: 0.5
50
+ weight: 0.5
51
+ - model: Qwen/Qwen2.5-Coder-7B-Instruct
52
+ parameters:
53
+ density: 0.5
54
+ weight: 0.5
55
+
56
+ merge_method: ties
57
+ base_model: Qwen/Qwen2.5-7B-Instruct
58
+ parameters:
59
+ normalize: false
60
+ int8_mask: true
61
+ dtype: float16
62
+ ```
63
+