GGUF
TensorBlock
GGUF
Inference Endpoints
conversational
morriszms commited on
Commit
fe7bdb5
1 Parent(s): 0829fba

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ X-ALMA-13B-Pretrain-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ X-ALMA-13B-Pretrain-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ X-ALMA-13B-Pretrain-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ X-ALMA-13B-Pretrain-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ X-ALMA-13B-Pretrain-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ X-ALMA-13B-Pretrain-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ X-ALMA-13B-Pretrain-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ X-ALMA-13B-Pretrain-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ X-ALMA-13B-Pretrain-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ X-ALMA-13B-Pretrain-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ X-ALMA-13B-Pretrain-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ X-ALMA-13B-Pretrain-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,129 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - oscar-corpus/OSCAR-2301
5
+ - allenai/nllb
6
+ - Helsinki-NLP/opus-100
7
+ language:
8
+ - en
9
+ - da
10
+ - nl
11
+ - de
12
+ - is
13
+ - 'no'
14
+ - sc
15
+ - af
16
+ - ca
17
+ - ro
18
+ - gl
19
+ - it
20
+ - pt
21
+ - es
22
+ - bg
23
+ - mk
24
+ - sr
25
+ - uk
26
+ - ru
27
+ - id
28
+ - ms
29
+ - th
30
+ - vi
31
+ - mg
32
+ - fr
33
+ - hu
34
+ - el
35
+ - cs
36
+ - pl
37
+ - lt
38
+ - lv
39
+ - ka
40
+ - zh
41
+ - ja
42
+ - ko
43
+ - fi
44
+ - et
45
+ - gu
46
+ - hi
47
+ - mr
48
+ - ne
49
+ - ur
50
+ - az
51
+ - kk
52
+ - ky
53
+ - tr
54
+ - uz
55
+ - ar
56
+ - he
57
+ - fa
58
+ base_model: haoranxu/X-ALMA-13B-Pretrain
59
+ tags:
60
+ - TensorBlock
61
+ - GGUF
62
+ ---
63
+
64
+ <div style="width: auto; margin-left: auto; margin-right: auto">
65
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
66
+ </div>
67
+ <div style="display: flex; justify-content: space-between; width: 100%;">
68
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
69
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
70
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
71
+ </p>
72
+ </div>
73
+ </div>
74
+
75
+ ## haoranxu/X-ALMA-13B-Pretrain - GGUF
76
+
77
+ This repo contains GGUF format model files for [haoranxu/X-ALMA-13B-Pretrain](https://huggingface.co/haoranxu/X-ALMA-13B-Pretrain).
78
+
79
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
80
+
81
+ ## Prompt template
82
+
83
+ ```
84
+ <s>[INST] <<SYS>>
85
+ {system_prompt}
86
+ <</SYS>>
87
+
88
+ {prompt} [/INST]
89
+ ```
90
+
91
+ ## Model file specification
92
+
93
+ | Filename | Quant type | File Size | Description |
94
+ | -------- | ---------- | --------- | ----------- |
95
+ | [X-ALMA-13B-Pretrain-Q2_K.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q2_K.gguf) | Q2_K | 4.521 GB | smallest, significant quality loss - not recommended for most purposes |
96
+ | [X-ALMA-13B-Pretrain-Q3_K_S.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q3_K_S.gguf) | Q3_K_S | 5.270 GB | very small, high quality loss |
97
+ | [X-ALMA-13B-Pretrain-Q3_K_M.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q3_K_M.gguf) | Q3_K_M | 5.903 GB | very small, high quality loss |
98
+ | [X-ALMA-13B-Pretrain-Q3_K_L.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q3_K_L.gguf) | Q3_K_L | 6.454 GB | small, substantial quality loss |
99
+ | [X-ALMA-13B-Pretrain-Q4_0.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q4_0.gguf) | Q4_0 | 6.860 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
100
+ | [X-ALMA-13B-Pretrain-Q4_K_S.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q4_K_S.gguf) | Q4_K_S | 6.913 GB | small, greater quality loss |
101
+ | [X-ALMA-13B-Pretrain-Q4_K_M.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q4_K_M.gguf) | Q4_K_M | 7.326 GB | medium, balanced quality - recommended |
102
+ | [X-ALMA-13B-Pretrain-Q5_0.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q5_0.gguf) | Q5_0 | 8.356 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
103
+ | [X-ALMA-13B-Pretrain-Q5_K_S.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q5_K_S.gguf) | Q5_K_S | 8.356 GB | large, low quality loss - recommended |
104
+ | [X-ALMA-13B-Pretrain-Q5_K_M.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q5_K_M.gguf) | Q5_K_M | 8.596 GB | large, very low quality loss - recommended |
105
+ | [X-ALMA-13B-Pretrain-Q6_K.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q6_K.gguf) | Q6_K | 9.946 GB | very large, extremely low quality loss |
106
+ | [X-ALMA-13B-Pretrain-Q8_0.gguf](https://huggingface.co/tensorblock/X-ALMA-13B-Pretrain-GGUF/tree/main/X-ALMA-13B-Pretrain-Q8_0.gguf) | Q8_0 | 12.881 GB | very large, extremely low quality loss - not recommended |
107
+
108
+
109
+ ## Downloading instruction
110
+
111
+ ### Command line
112
+
113
+ Firstly, install Huggingface Client
114
+
115
+ ```shell
116
+ pip install -U "huggingface_hub[cli]"
117
+ ```
118
+
119
+ Then, downoad the individual model file the a local directory
120
+
121
+ ```shell
122
+ huggingface-cli download tensorblock/X-ALMA-13B-Pretrain-GGUF --include "X-ALMA-13B-Pretrain-Q2_K.gguf" --local-dir MY_LOCAL_DIR
123
+ ```
124
+
125
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
126
+
127
+ ```shell
128
+ huggingface-cli download tensorblock/X-ALMA-13B-Pretrain-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
129
+ ```
X-ALMA-13B-Pretrain-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b40b7997e1c6192c15d24a2378de2ff8b557ff83e9b2ef717382da6a790deef5
3
+ size 4854272160
X-ALMA-13B-Pretrain-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a76faeb9028a504a7c88c2943196095cc8c01e90a9d14b3c0575b3c39ca7b9a4
3
+ size 6929561760
X-ALMA-13B-Pretrain-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ea9f13e2f2db54fb324e990b6b3b20afa7844f4f671fa623f5a306ae493b9788
3
+ size 6337771680
X-ALMA-13B-Pretrain-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c6e3c424b425a535c5cd061f730f9be25079b0f074d2d6dba044c60fac61166b
3
+ size 5658982560
X-ALMA-13B-Pretrain-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e1ecdf09aca772a83e3cb18f6d7087312bc67618710411654c921689712069ae
3
+ size 7365836960
X-ALMA-13B-Pretrain-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7d05cd739fd0b0f7674545edea4de3cc26c355d127d0cae82a8f3d8fb64af009
3
+ size 7865958560
X-ALMA-13B-Pretrain-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a24eb5619d0d5842598107f4ce6eb04cc0771a6fe9dbf4443377f829794f7c92
3
+ size 7423180960
X-ALMA-13B-Pretrain-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91003133f4f7e203163c8d92b9e1168719bda6c0715bba62f8aa82c939d22b49
3
+ size 8972288160
X-ALMA-13B-Pretrain-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7b29acb29841f4887d8d62286fd10b4ff30743a46cc552c315f435d7f5ce5e50
3
+ size 9229926560
X-ALMA-13B-Pretrain-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f260397275463dc25f6ff81a7364455759b1814427dd7a5e31394fea88ab9019
3
+ size 8972288160
X-ALMA-13B-Pretrain-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e9d2294a70a7258211a1af90b321a44cdab4de6dd9f33e38bb414139f8444d72
3
+ size 10679142560
X-ALMA-13B-Pretrain-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a780eb4be9d5d51dad27552e390cdd1fb58d8dd4d63e7d0bf4001c0eeab309ad
3
+ size 13831321760