TheBloke commited on
Commit
59371f8
1 Parent(s): 22cae5b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md CHANGED
@@ -1,3 +1,32 @@
1
  ---
2
  license: other
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: other
3
+ inference: false
4
  ---
5
+
6
+ # WizardLM: An Instruction-following LLM Using Evol-Instruct
7
+
8
+ These files are the result of merging the [delta weights](https://huggingface.co/victor123/WizardLM) with the original Llama7B model.
9
+
10
+ The code for merging is provided in the [WizardLM official Github repo](https://github.com/nlpxucan/WizardLM).
11
+
12
+ ## WizardLM-7B GGML
13
+
14
+ This repo contains GGML files for WizardLM-7B for CPU inference
15
+
16
+ ## Provided files
17
+ | Name | Quant method | Bits | Size | RAM required | Use case |
18
+ | ---- | ---- | ---- | ---- | ---- | ----- |
19
+ `WizardLM-7B.GGML.q4_0.bin` | q4_0 | 4bit | 39GB | 41GB | Superseded and not recommended |
20
+ `WizardLM-7B.GGML.q4_2.bin` | q4_2 | 4bit | 39GB | 41GB | Best compromise between resources, speed and quality |
21
+ `WizardLM-7B.GGML.q4_3.bin` | q4_3 | 4bit | 47GB | 49GB | Maximum quality, high RAM requirements and slow inference |
22
+
23
+ * The q4_0 file is provided for compatibility with older versions of llama.cpp. It has been superseded and is no longer recommended.
24
+ * The q4_2 file offers the best combination of performance and quality.
25
+ * The q4_3 file offers the highest quality, at the cost of increased RAM usage and slower inference speed.
26
+
27
+ # Original model info
28
+
29
+ Overview of Evol-Instruct
30
+ Evol-Instruct is a novel method using LLMs instead of humans to automatically mass-produce open-domain instructions of various difficulty levels and skills range, to improve the performance of LLMs.
31
+
32
+ ![info](https://github.com/nlpxucan/WizardLM/raw/main/imgs/git_running.png)