s3nh commited on
Commit
7f38fd0
1 Parent(s): 581ffc8

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -40,3 +40,8 @@ AlexWortega-Vikhr-7b-0.1.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
  AlexWortega-Vikhr-7b-0.1.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
41
  AlexWortega-Vikhr-7b-0.1.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
42
  AlexWortega-Vikhr-7b-0.1.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
40
  AlexWortega-Vikhr-7b-0.1.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
41
  AlexWortega-Vikhr-7b-0.1.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
42
  AlexWortega-Vikhr-7b-0.1.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ vikhr-7b-0.1.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
44
+ vikhr-7b-0.1.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ vikhr-7b-0.1.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
46
+ vikhr-7b-0.1.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ vikhr-7b-0.1.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -31,17 +31,21 @@ The key difference between GGJT and GGUF is the use of a key-value structure for
31
  This allows for new metadata to be added without breaking compatibility with existing models, and to annotate the model with additional information that may be useful for
32
  inference or for identifying the model.
33
 
34
- ### Perplexity params
35
-
36
- Model Measure Q2_K Q3_K_S Q3_K_M Q3_K_L Q4_0 Q4_1 Q4_K_S Q4_K_M Q5_0 Q5_1 Q5_K_S Q5_K_M Q6_K Q8_0 F16
37
- 7B perplexity 6.7764 6.4571 6.1503 6.0869 6.1565 6.0912 6.0215 5.9601 5.9862 5.9481 5.9419 5.9208 5.9110 5.9070 5.9066
38
- 13B perplexity 5.8545 5.6033 5.4498 5.4063 5.3860 5.3608 5.3404 5.3002 5.2856 5.2706 5.2785 5.2638 5.2568 5.2548 5.2543
39
-
40
 
41
 
42
  ### inference
43
 
44
 
45
- TODO
 
 
 
 
 
 
 
 
 
 
46
 
47
  # Original model card
 
31
  This allows for new metadata to be added without breaking compatibility with existing models, and to annotate the model with additional information that may be useful for
32
  inference or for identifying the model.
33
 
 
 
 
 
 
 
34
 
35
 
36
  ### inference
37
 
38
 
39
+ User: Tell me story about what is an quantization and what do we need to build.
40
+ > What's a quatization?
41
+ >
42
+ > — I can't remember, let me look it up!
43
+ > What's the point of this article?
44
+ >
45
+ > — To make sure that you will remember everything next time!
46
+ ## Введение в математику
47
+ ### Координатные системы и векторное произведение.
48
+ Для начала давайте рассмотрим, что такое координатная система и как она работает, а потом пошагово построим функцию вектора из трех координт.
49
+ Зачем? Потому что так будет легко понять, какие
50
 
51
  # Original model card
vikhr-7b-0.1.Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f034c454f527eeb3c6254dcbba68dc2471f6e55e6d97f820080fad0e879cadc1
3
+ size 3205761824
vikhr-7b-0.1.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:261504f7b38e002fc505a1c64033267dac380e6b13b831b6c68a9b930666e903
3
+ size 4413985568
vikhr-7b-0.1.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2801e522ad74f12cfae000353befdc5c5d4fe8d2864595b4efc65def2b7d32f9
3
+ size 5181051680
vikhr-7b-0.1.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3652ecc30ec4d23a602d6c3d4e7f4cebeab0b88f2282ed0e09b0e4adcd5adc34
3
+ size 5996059424
vikhr-7b-0.1.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:69a9eb7241712fbe63cf8517df91e93040f77619690462f58150167f682099dc
3
+ size 7765723936
vikhr-7b-0.1.fp16.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d7f2699b5621a7ee94c7596dd29b948794cfc5ebc2b732ed3f5da669a680d9ed
3
+ size 14616038144