microsoft
/

DialogRPT-updown

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

system HF staff commited on Oct 7, 2020

Commit

a529f65

•

1 Parent(s): 026f97d

Update README.md

Files changed (1) hide show

README.md +12 -10

README.md CHANGED Viewed

@@ -13,21 +13,23 @@ Quick Links:
 * [Dataset, training, and evaluation](https://github.com/golsun/DialogRPT)
 * [Colab Notebook Demo](https://colab.research.google.com/drive/1cAtfkbhqsRsT59y3imjR1APw3MHDMkuV?usp=sharing)
-We considered the following tasks and provided corresponding pretrained models.
-| Model card | Description  |
-| :-----------: | :----------- |
-|   | **Given a context and its two human responses, predict...** |
-| [`microsoft/DialogRPT-updown`](https://huggingface.co/microsoft/DialogRPT-updown) |  ... which gets more upvotes?  |
-| [`microsoft/DialogRPT-width`](https://huggingface.co/microsoft/DialogRPT-width) | ... which gets more direct replies?  |
-| [`microsoft/DialogRPT-depth`](https://huggingface.co/microsoft/DialogRPT-depth) |  ... which gets longer follow-up thread? |
-|  | **Given a context and one human response, distinguish it with...**  |
-| [`microsoft/DialogRPT-human-vs-rand`](https://huggingface.co/microsoft/DialogRPT-human-vs-rand) | ... a random human response  |
-| [`microsoft/DialogRPT-human-vs-machine`](https://huggingface.co/microsoft/DialogRPT-human-vs-machine) | ... a machine generated response  |
 ### Examples:
 The `updown` score predicts how likely the response is getting upvoted.
 | Context | Response | `updown` score |
 | :------ | :------- | :------------: |

 * [Dataset, training, and evaluation](https://github.com/golsun/DialogRPT)
 * [Colab Notebook Demo](https://colab.research.google.com/drive/1cAtfkbhqsRsT59y3imjR1APw3MHDMkuV?usp=sharing)
+We considered the following tasks and provided corresponding pretrained models. This page is for the `updown` task, and other model cards can be found in table below.
+|Task | Description  | Pretrained model |
+| :------------- | :----------- | :-----------: |
+|  **Human feedback**  |  **given a context and its two human responses, predict...**|
+| `updown` |  ... which gets more upvotes? | (this model) |
+| `width`| ... which gets more direct replies?  | [model card](https://huggingface.co/microsoft/DialogRPT-width) |
+| `depth`|  ... which gets longer follow-up thread?  | [model card](https://huggingface.co/microsoft/DialogRPT-width) |
+|  **Human-like** (human vs fake) | **given a context and one human response, distinguish it with...** |
+| `human_vs_rand`| ... a random human response  | [model card](https://huggingface.co/microsoft/DialogRPT-human-vs-rand) |
+| `human_vs_machine`| ... a machine generated response  | [model card](https://huggingface.co/microsoft/DialogRPT-human-vs-machine) |
 ### Examples:
 The `updown` score predicts how likely the response is getting upvoted.
+Examples below can be reproduced with this [Colab Notebook](https://colab.research.google.com/drive/1cAtfkbhqsRsT59y3imjR1APw3MHDMkuV?usp=sharing)
 | Context | Response | `updown` score |
 | :------ | :------- | :------------: |