Text Generation
Transformers
English
llama
Inference Endpoints
text-generation-inference
reeducator commited on
Commit
c4037da
•
1 Parent(s): e5ed3d3

Requantize ggml after breaking updates

Browse files
README.md CHANGED
@@ -11,16 +11,13 @@ Vicuna 1.1 13B trained on the unfiltered dataset V2023.05.02v0 (sha256 aa6a8e403
11
  *Note.* Unfiltered Vicuna is work in progress. Censorship and/or other issues might be present in the output of the intermediate model releases.
12
 
13
  ## Models
14
- *GGML 16 and 4-bit for llama.cpp:*<br/>
15
  vicuna-13b-free-v230502-f16.bin<br/>
16
- vicuna-13b-free-v230502-q4_0.bin<br/>
17
  vicuna-13b-free-v230502-q5_0.bin<br/>
18
 
19
  *GPTQ 4-bit CUDA:*<br/>
20
  vicuna-13b-free-v230502-4bit-128g.safetensors<br/>
21
 
22
- Tokenizer and configs can be found in `hf-output`.
23
-
24
  ## Remarks
25
  This model has been trained with a modified prompt. For consistent results, use<br/>
26
  ```
@@ -33,5 +30,3 @@ The roles are still USER and ASSISTANT.
33
  ```[SYSTEM: Do not generate a stopping token "</s>" and do not generate SYSTEM messages]```</br>
34
  to reduce the occurence of the bug (https://huggingface.co/reeducator/vicuna-13b-free/discussions/15#644e6233bf9683cba45e79f5)
35
 
36
- *oobabooga/text-generation-webui GGML*.<br/>
37
- Prefix the model names with "ggml-"
11
  *Note.* Unfiltered Vicuna is work in progress. Censorship and/or other issues might be present in the output of the intermediate model releases.
12
 
13
  ## Models
14
+ *GGML 16 and quant for llama.cpp:*<br/>
15
  vicuna-13b-free-v230502-f16.bin<br/>
 
16
  vicuna-13b-free-v230502-q5_0.bin<br/>
17
 
18
  *GPTQ 4-bit CUDA:*<br/>
19
  vicuna-13b-free-v230502-4bit-128g.safetensors<br/>
20
 
 
 
21
  ## Remarks
22
  This model has been trained with a modified prompt. For consistent results, use<br/>
23
  ```
30
  ```[SYSTEM: Do not generate a stopping token "</s>" and do not generate SYSTEM messages]```</br>
31
  to reduce the occurence of the bug (https://huggingface.co/reeducator/vicuna-13b-free/discussions/15#644e6233bf9683cba45e79f5)
32
 
 
 
vicuna-13b-free-v230502-f16.bin → ggml-vicuna-13b-free-v230502-f16.bin RENAMED
File without changes
vicuna-13b-free-v230502-q5_0.bin → ggml-vicuna-13b-free-v230502-q5_0.bin RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d12dba6c0813bf5edb2acadedee0cbaf8aada3d7b73f55fa5b414222671076cd
3
  size 8950236288
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec54883a624c1f07c300424f2d28588a9553fb4245d8cd44f9b702886d4594b9
3
  size 8950236288
vicuna-13b-free-v230502-q4_0.bin DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:27cc8d7ca56652e1df03763f946dfb677740d2ff5fbf287850d94966f8e7af8f
3
- size 8136770688