Dongfu Jiang commited on
Commit
31ff879
1 Parent(s): c7eb635

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -4
README.md CHANGED
@@ -1,8 +1,26 @@
1
  ---
2
  license: mit
 
 
 
 
 
 
 
 
 
 
 
3
  ---
4
 
5
- PairRanker used in llm-blender, trained on deberta-v3-large.
 
 
 
 
 
 
 
6
 
7
  - Github: [https://github.com/yuchenlin/LLM-Blender](https://github.com/yuchenlin/LLM-Blender)
8
  - Paper: [https://arxiv.org/abs/2306.02561](https://arxiv.org/abs/2306.02561)
@@ -37,6 +55,4 @@ Then you are good to use pairrankers with
37
  - `blender.compare()` to compare 2 candiates.
38
  See LLM-Blender Github [README.md](https://github.com/yuchenlin/LLM-Blender#rank-and-fusion)
39
  and jupyter file [blender_usage.ipynb](https://github.com/yuchenlin/LLM-Blender/blob/main/blender_usage.ipynb)
40
- for detailed usage examples.
41
-
42
-
 
1
  ---
2
  license: mit
3
+ datasets:
4
+ - llm-blender/mix-instruct
5
+ metrics:
6
+ - BERTScore
7
+ - BLEURT
8
+ - BARTScore
9
+ - Pairwise Rank
10
+ tags:
11
+ - pair_ranker
12
+ - reward_model
13
+ - RLHF
14
  ---
15
 
16
+ PairRanker used in llm-blender, trained on deberta-v3-large. This is the ranker model used in experiments in LLM-Blender paper,
17
+ which is trained on [mixinstruct](https://huggingface.co/datasets/llm-blender/mix-instruct) dataset for 5 epochs.
18
+
19
+ | PairRanker type | Source max length | Candidate max length | Total max length |
20
+ |:-----------------:|:-----------------:|----------------------|------------------|
21
+ | [pair-ranker](https://huggingface.co/llm-blender/pair-ranker) (This model) | 128 | 128 | 384 |
22
+ | [pair-reward-model](https://huggingface.co/llm-blender/pair-reward-model/) | 1224 | 412 | 2048 |
23
+
24
 
25
  - Github: [https://github.com/yuchenlin/LLM-Blender](https://github.com/yuchenlin/LLM-Blender)
26
  - Paper: [https://arxiv.org/abs/2306.02561](https://arxiv.org/abs/2306.02561)
 
55
  - `blender.compare()` to compare 2 candiates.
56
  See LLM-Blender Github [README.md](https://github.com/yuchenlin/LLM-Blender#rank-and-fusion)
57
  and jupyter file [blender_usage.ipynb](https://github.com/yuchenlin/LLM-Blender/blob/main/blender_usage.ipynb)
58
+ for detailed usage examples.