deepparag commited on
Commit
42103d2
1 Parent(s): aecfda2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -31,6 +31,21 @@ contact: deep.p.sarda@gmail.com
31
  ## Training
32
  The Discord Messages Dataset simply dwarfs the other datasets, Hence the data sets are repeated.
33
  This leads to them covering each others issues!
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
34
  ## Usage
35
  Example:
36
  ```python
 
31
  ## Training
32
  The Discord Messages Dataset simply dwarfs the other datasets, Hence the data sets are repeated.
33
  This leads to them covering each others issues!
34
+
35
+ ## Evaluation
36
+
37
+ Below is a comparison of Aeona vs. other baselines on the mixed dataset given above using automatic evaluation metrics.
38
+
39
+ | Model | Perplexity
40
+ |---|---|---
41
+ | Seq2seq Baseline [3] | 29.8 |
42
+ | Wolf et al. [5] | 16.3 |
43
+ | GPT-2 baseline | 99.5 |
44
+ | DialoGPT baseline | 56.6 |
45
+ | DialoGPT finetuned | 11.4 |
46
+ | PersonaGPT | 10.2 |
47
+ | **Aeona** | **7.9** |
48
+
49
  ## Usage
50
  Example:
51
  ```python