aehwanc commited on
Commit
3801535
1 Parent(s): 6165279

change readme

Browse files
Files changed (1) hide show
  1. README.md +43 -1
README.md CHANGED
@@ -9,4 +9,46 @@ app_file: server.py
9
  pinned: false
10
  license: agpl-3.0
11
  python_version: 3.10.9
12
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  pinned: false
10
  license: agpl-3.0
11
  python_version: 3.10.9
12
+ ---
13
+
14
+ <h1>Introduction</h1>
15
+
16
+ This is a domain-specific chatbot on whiskey, with the basic format / setting derived from [text-generation-webui](https://github.com/oobabooga/text-generation-webui). This chatbot is modified version of the existing one mentioned above. This document is to demonstrate the functions that are not explained thoroughly in the above repository and to explain several maintenance functions.
17
+
18
+ This chatbot is based on gradio UI, and it only implemented chat mode, unlike the original model with two more available modes.
19
+
20
+ <h1>Features</h1>
21
+
22
+ The features include: server configuration, language model, and chatbot application.
23
+
24
+ <h2>Server Configuration</h2>
25
+
26
+ The model runs in two different configurations without any bottleneck processes.
27
+
28
+ One is the local server. The key features include:
29
+ * GPU: GTX 1080 Ti, 12GB,
30
+ * CPU: Intel(R) Xeon(R) CPU E5-2690 v3 @ 2.60GHz, 24 cores,
31
+ * RAM: 64GB.
32
+
33
+ Another is the huggingface server. The space in huggingface is set to Nvidia T4 small. The key features include:
34
+ * GPU: Nvidia T4, 16GB,
35
+ * CPU: 4vCPU,
36
+ * RAM: 15GB.
37
+
38
+ These two servers are able to run the model effectively.
39
+
40
+ <h2>Language Model</h2>
41
+
42
+ The language model that is applied in this model is LLaMa-2-7b by Meta. This model is considered as one of the best performing language models as for now (August 2023). The original model can be found [here](https://huggingface.co/meta-llama/Llama-2-7b). The model is located at /models/llama-2-7b-hf.
43
+
44
+ For running the chatbot, the model is quantized into 4bits due to server constraints. Quantization allows number of users to access to the chatbot system simultaneously.
45
+
46
+ <h2>Chatbot Application</h2>
47
+
48
+ From the original application, there were different parameters that are set differently compared to the existing chatbot application.
49
+
50
+ * Character - For Domain-specific chatbot, the prompt is set up with a new character that has not been introduced,
51
+ * Model - The basis language model is set,
52
+ * Lora - LoRAs have been applied for training different texts for training the model for better accuracy.
53
+
54
+ The details of LoRA datasets will not be published, since it is the sole basis of the model for chatbot manipulation.