fractalego commited on
Commit
81e2151
1 Parent(s): 658a838

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # What is SamSum Bot?
2
+ This is a model fine-tuned on the [SamSum dataset](https://huggingface.co/datasets/samsum).
3
+ However, instead of training the system to summarize conversations, the model is trained to predict a conversation given a summary.
4
+ The prompt needs to be in the following form
5
+ ```python
6
+ A partial summary of the conversation is:
7
+ {summary}
8
+
9
+ With the dialogue being:
10
+ {dialogue}
11
+ ```
12
+
13
+ where *{summary}* is a text as in
14
+ ```python
15
+ John went out to buy groceries. He meets Jane on the way and they talk about the weather.
16
+ ```
17
+
18
+ and the *{dialogue}* needs to be structured with speaking lines preceded by the speaking character
19
+ ```python
20
+ John: Oh hi Jane.
21
+ Jane: Nice to see you?
22
+ John: The weather looks nice today
23
+ Jane: [PREDICTION]
24
+ ```
25
+
26
+ The system is based on the GPTJ-6B by EleutherAI, [quantized by Hivemind](https://huggingface.co/hivemind/gpt-j-6B-8bit). It has been fine-tuned according to the [LoRa method](https://arxiv.org/abs/2106.09685).
27
+ A simple back-end is available in [this repo](https://github.com/fractalego/samsum-bot), where the model is served using Torchserve.
28
+ A terminal-like front-end interface is available [here](https://github.com/fractalego/samsumbot_client).
29
+ This interface is the one used in my website [http://fractalego.io](http://fractalego.io).
30
+