IlyaGusev/saiga_13b_ggml

Text Generation

Avoid printing incomplete bytes #1

by fikavec - opened May 16, 2023
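
The code change behind the title is not part of the files shown in this diff, so the following is only a sketch of one common way to avoid printing incomplete bytes when generated text arrives as raw UTF-8 chunks. A multi-byte character (every Cyrillic letter is two bytes) can be split across chunks; an incremental decoder holds back the partial bytes until the character is complete. The function name `stream_text` and the chunked input are assumptions made for illustration, not the script's actual code.

```python
import codecs


def stream_text(byte_chunks):
    """Yield only complete characters from an iterable of UTF-8 byte chunks.

    Hypothetical helper: bytes that end in the middle of a multi-byte
    character are buffered by the decoder until the rest arrives.
    """
    decoder = codecs.getincrementaldecoder("utf-8")(errors="replace")
    for chunk in byte_chunks:
        text = decoder.decode(chunk)
        if text:  # empty when the chunk only held a partial character
            yield text
    # Flush whatever is left; a truncated character becomes U+FFFD.
    tail = decoder.decode(b"", final=True)
    if tail:
        yield tail


if __name__ == "__main__":
    # "Пр" encoded as UTF-8 and split in the middle of both characters.
    for piece in stream_text([b"\xd0", b"\x9f\xd1", b"\x80"]):
        print(piece, end="", flush=True)
    print()
```

Decoding each chunk separately with `bytes.decode` would instead raise an error or emit replacement characters at every split point.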

base: refs/heads/main ← from: refs/pr/1

Files changed (4)

README.md +4 -4
ggml-model-q4_1.bin +2 -2
ggml-model-q5_1.bin +1 -1
ggml-model-q8_0.bin +2 -2

README.md CHANGED

@@ -6,22 +6,22 @@ datasets:
 language:
 - ru
 inference: false
-pipeline_tag: conversational
+pipeline_tag: text2text-generation
 ---
 Llama.cpp compatible versions of an original [13B model](https://huggingface.co/IlyaGusev/saiga_13b_lora).
 * Download one of the versions, for example `ggml-model-q4_1.bin`.
-* Download [interact_llamacpp.py](https://raw.githubusercontent.com/IlyaGusev/rulm/master/self_instruct/src/interact_llamacpp.py)
+* Download [interact_saiga_llamacpp.py](https://raw.githubusercontent.com/IlyaGusev/rulm/master/self_instruct/src/infer_saiga_llamacpp.py)
 How to run:
 ```
 sudo apt-get install git-lfs
 pip install llama-cpp-python fire
-python3 interact_llamacpp.py ggml-model-q4_1.bin
+python3 interact_saiga_llamacpp.py ggml-model-q4_1.bin
 ```
 System requirements:
 * 18GB RAM for q8_0
-* 13GB RAM for q4_1
+* 13GB RAM for q4_1

ggml-model-q4_1.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:61f1b160e769b3bc8971d8d02bde50e9b3db6fb18fa3f29dba023e9eb9848242
-size 8136770688
+oid sha256:c9fed0b0e45e8731d0e1a0c8ed5b36f83951887805147a2012b327fe9aa7a5b9
+size 9763701888

ggml-model-q5_1.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a44ecb301859ae987353eb1b0b4b67bc337a9b04c59563d9fe75942834d1c00a
+oid sha256:e0098b1bab55b796bb9a1d596241f266205220738f3a9a9064990e44b78b2453
 size 9763701888

ggml-model-q8_0.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e267bdfcc3218c9b06e0d1cf2e15475502f971eeaf6a1d80676e00b79dbf4d01
-size 13831029888
+oid sha256:5b2bc1ee6a05625f78af12f0011ead209a049f986395c1b08eadcb9abd00f992
+size 14644495488
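
The `.bin` entries above are Git LFS pointer files: each records the SHA-256 digest (`oid`) and the byte size of the real model file. A downloaded model can be checked against its pointer before loading it. The sketch below is one way to do that with the Python standard library; the helper names `parse_lfs_pointer` and `verify_download` are hypothetical and not part of this repository.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into {'oid': hex digest, 'size': int}."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    return {"oid": digest, "size": int(fields["size"])}


def verify_download(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and SHA-256."""
    # Cheap check first: a size mismatch means a truncated or wrong file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    sha = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in chunks so multi-gigabyte models are never fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            sha.update(chunk)
    return sha.hexdigest() == pointer["oid"]
```

For example, passing the new `ggml-model-q8_0.bin` pointer text to `parse_lfs_pointer` and the result to `verify_download("ggml-model-q8_0.bin", pointer)` would confirm that the local file is the 14644495488-byte version introduced here, assuming the file sits in the current directory.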