DialogRPT-updown

Dialog Ranking Pretrained Transformers

How likely is a dialog response to be upvoted 👍 and/or get replies 💬?

This is what DialogRPT is trained to predict. It is a set of dialog response ranking models proposed by the Microsoft Research NLP Group and trained on more than 100 million human feedback data points. It can be used to improve an existing dialog generation model (e.g., DialoGPT) by re-ranking the generated response candidates.
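
The re-ranking step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: `rerank`, `lookup_score`, and `EXAMPLE_SCORES` are hypothetical names, and the stand-in scorer simply looks up the two updown scores listed in the Examples section of this README instead of calling a model.

```python
from typing import Callable, List, Tuple


def rerank(
    context: str,
    candidates: List[str],
    score_fn: Callable[[str, str], float],
) -> List[Tuple[str, float]]:
    """Return (response, score) pairs sorted from highest to lowest score."""
    scored = [(response, score_fn(context, response)) for response in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Stand-in scores for illustration only (hypothetical lookup table).
# The values are the updown scores from the Examples section of this README.
EXAMPLE_SCORES = {
    "Here's a free textbook (URL) in case anyone needs it.": 0.613,
    "Me too!": 0.111,
}


def lookup_score(context: str, response: str) -> float:
    """Toy scorer: ignores the context and looks the response up in the table."""
    return EXAMPLE_SCORES.get(response, 0.0)


ranked = rerank(
    "I love NLP!",
    ["Me too!", "Here's a free textbook (URL) in case anyone needs it."],
    lookup_score,
)
for response, score in ranked:
    print(f"{score:.3f}  {response}")
```

In practice, `score_fn` would wrap a DialogRPT model, and `candidates` would be the responses sampled from a generator such as DialoGPT.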

We consider the following tasks and provide the corresponding pretrained models.

| Model card | Description |
| :--- | :--- |
| | Given a context and its two human responses, predict... |
| microsoft/DialogRPT-updown | ... which gets more upvotes? |
| microsoft/DialogRPT-width | ... which gets more direct replies? |
| microsoft/DialogRPT-depth | ... which gets a longer follow-up thread? |
| | Given a context and one human response, distinguish it from... |
| microsoft/DialogRPT-human-vs-rand | ... a random human response |
| microsoft/DialogRPT-human-vs-machine | ... a machine-generated response |

Examples:

The updown score predicts how likely the response is to be upvoted.

| Context | Response | updown score |
| :--- | :--- | :--- |
| I love NLP! | Here’s a free textbook (URL) in case anyone needs it. | 0.613 |
| I love NLP! | Me too! | 0.111 |
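
A score like the ones above could be computed along the following lines. This is a sketch under stated assumptions rather than the official usage: it assumes the checkpoint loads through the `transformers` sequence-classification classes, that context and response are joined with the GPT-2 end-of-text token, and that the score is the sigmoid of the single output logit. `build_input` and `load_scorer` are hypothetical helper names.

```python
EOS_TOKEN = "<|endoftext|>"  # GPT-2 end-of-text token, assumed to separate context and response


def build_input(context: str, response: str) -> str:
    """Join context and response into one string for the ranking model."""
    return context + EOS_TOKEN + response


def load_scorer(model_card: str = "microsoft/DialogRPT-updown"):
    """Load the model once and return a score(context, response) function.

    Heavy imports live inside the function so that merely defining it
    does not require torch or transformers, and nothing is downloaded
    until the function is called.
    """
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_card)
    model = AutoModelForSequenceClassification.from_pretrained(model_card)
    model.eval()

    def score(context: str, response: str) -> float:
        input_ids = tokenizer.encode(build_input(context, response), return_tensors="pt")
        with torch.no_grad():
            logits = model(input_ids, return_dict=True).logits
        # Assumption: the model emits one logit; the sigmoid maps it to (0, 1).
        return torch.sigmoid(logits).item()

    return score
```

Calling `score = load_scorer()` followed by `score("I love NLP!", "Me too!")` downloads the checkpoint on first use. The exact values may differ slightly from the table depending on library versions.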

Contact:

Please create an issue on our repo.

Citation:

@inproceedings{gao2020dialogrpt,
    title={Dialogue Response Ranking Training with Large-Scale Human Feedback Data},
    author={Xiang Gao and Yizhe Zhang and Michel Galley and Chris Brockett and Bill Dolan},
    year={2020},
    booktitle={EMNLP}
}