Dongfu Jiang
commited on
Commit
•
31ff879
1
Parent(s):
c7eb635
Update README.md
Browse files
README.md
CHANGED
@@ -1,8 +1,26 @@
|
|
1 |
---
|
2 |
license: mit
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
4 |
|
5 |
-
PairRanker used in llm-blender, trained on deberta-v3-large.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
|
7 |
- Github: [https://github.com/yuchenlin/LLM-Blender](https://github.com/yuchenlin/LLM-Blender)
|
8 |
- Paper: [https://arxiv.org/abs/2306.02561](https://arxiv.org/abs/2306.02561)
|
@@ -37,6 +55,4 @@ Then you are good to use pairrankers with
|
|
37 |
- `blender.compare()` to compare 2 candiates.
|
38 |
See LLM-Blender Github [README.md](https://github.com/yuchenlin/LLM-Blender#rank-and-fusion)
|
39 |
and jupyter file [blender_usage.ipynb](https://github.com/yuchenlin/LLM-Blender/blob/main/blender_usage.ipynb)
|
40 |
-
for detailed usage examples.
|
41 |
-
|
42 |
-
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
datasets:
|
4 |
+
- llm-blender/mix-instruct
|
5 |
+
metrics:
|
6 |
+
- BERTScore
|
7 |
+
- BLEURT
|
8 |
+
- BARTScore
|
9 |
+
- Pairwise Rank
|
10 |
+
tags:
|
11 |
+
- pair_ranker
|
12 |
+
- reward_model
|
13 |
+
- RLHF
|
14 |
---
|
15 |
|
16 |
+
PairRanker used in llm-blender, trained on deberta-v3-large. This is the ranker model used in experiments in LLM-Blender paper,
|
17 |
+
which is trained on [mixinstruct](https://huggingface.co/datasets/llm-blender/mix-instruct) dataset for 5 epochs.
|
18 |
+
|
19 |
+
| PairRanker type | Source max length | Candidate max length | Total max length |
|
20 |
+
|:-----------------:|:-----------------:|----------------------|------------------|
|
21 |
+
| [pair-ranker](https://huggingface.co/llm-blender/pair-ranker) (This model) | 128 | 128 | 384 |
|
22 |
+
| [pair-reward-model](https://huggingface.co/llm-blender/pair-reward-model/) | 1224 | 412 | 2048 |
|
23 |
+
|
24 |
|
25 |
- Github: [https://github.com/yuchenlin/LLM-Blender](https://github.com/yuchenlin/LLM-Blender)
|
26 |
- Paper: [https://arxiv.org/abs/2306.02561](https://arxiv.org/abs/2306.02561)
|
|
|
55 |
- `blender.compare()` to compare 2 candiates.
|
56 |
See LLM-Blender Github [README.md](https://github.com/yuchenlin/LLM-Blender#rank-and-fusion)
|
57 |
and jupyter file [blender_usage.ipynb](https://github.com/yuchenlin/LLM-Blender/blob/main/blender_usage.ipynb)
|
58 |
+
for detailed usage examples.
|
|
|
|