yuyouyu commited on
Commit
b5f33c6
1 Parent(s): ae13226

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -4
README.md CHANGED
@@ -118,7 +118,7 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
118
  We use objective questions to assess eight dimensions: **Character, Style, Emotion, Relationship, Personality, Human-likeness, Coherence, and Role Consistency**. The metric design can be find in our [paper](https://arxiv.org/abs/2408.10903). The evaluation code can be found in [GitHub](https://github.com/yuyouyu32/BeyondDialogue/tree/main/AutoRPEval). The results are shown below:
119
 
120
  | **Model** | **Character ↑** | **Style ↑** | **Emotion ↓** | **Relationship ↓** | **Personality ↑** | **Avg. ↑** | **Human-likeness ↑** | **Role Choice ↑** | **Coherence ↑** |
121
- |------------------------------------------------------------|---------------|---------------|---------------|------------------|-----------------|----------------|----------------------|-------------------|-----------------|
122
  | **General Baselines(Proprietary)** |
123
  | GPT-4o | 74.32 ± 1.15 | **81.67 ± 1.51** | 16.31 ± 0.48 | **12.13 ± 0.66** | 66.58 ± 4.41 | 78.83 ± 1.64 | **67.33 ± 3.95** | **87.33 ± 3.86** | **99.67 ± 0.33**|
124
  | GPT-3.5-Turbo | 72.26 ± 1.27 | 73.66 ± 1.73 | 17.79 ± 0.56 | 14.17 ± 0.73 | 66.92 ± 4.85 | 76.18 ± 1.83 | 33.33 ± 4.43 | 83.00 ± 4.68 | 97.33 ± 1.17 |
@@ -148,9 +148,7 @@ We use objective questions to assess eight dimensions: **Character, Style, Emoti
148
  title = {BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model},
149
  author = {Yu, Yeyong and Yu, Runsheng and Wei, Haojie and Zhang, Zhanqiu and Qian, Quan},
150
  year = {2024},
151
- journal = {arXiv preprint arXiv:2408.10903
152
-
153
- ,
154
  }
155
  ```
156
 
 
118
  We use objective questions to assess eight dimensions: **Character, Style, Emotion, Relationship, Personality, Human-likeness, Coherence, and Role Consistency**. The metric design can be find in our [paper](https://arxiv.org/abs/2408.10903). The evaluation code can be found in [GitHub](https://github.com/yuyouyu32/BeyondDialogue/tree/main/AutoRPEval). The results are shown below:
119
 
120
  | **Model** | **Character ↑** | **Style ↑** | **Emotion ↓** | **Relationship ↓** | **Personality ↑** | **Avg. ↑** | **Human-likeness ↑** | **Role Choice ↑** | **Coherence ↑** |
121
+ |---------------|---------------|---------------|---------------|------------------|-----------------|----------------|----------------------|-------------------|-----------------|
122
  | **General Baselines(Proprietary)** |
123
  | GPT-4o | 74.32 ± 1.15 | **81.67 ± 1.51** | 16.31 ± 0.48 | **12.13 ± 0.66** | 66.58 ± 4.41 | 78.83 ± 1.64 | **67.33 ± 3.95** | **87.33 ± 3.86** | **99.67 ± 0.33**|
124
  | GPT-3.5-Turbo | 72.26 ± 1.27 | 73.66 ± 1.73 | 17.79 ± 0.56 | 14.17 ± 0.73 | 66.92 ± 4.85 | 76.18 ± 1.83 | 33.33 ± 4.43 | 83.00 ± 4.68 | 97.33 ± 1.17 |
 
148
  title = {BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model},
149
  author = {Yu, Yeyong and Yu, Runsheng and Wei, Haojie and Zhang, Zhanqiu and Qian, Quan},
150
  year = {2024},
151
+ journal = {arXiv preprint arXiv:2408.10903},
 
 
152
  }
153
  ```
154