PairRM / README.md

Commit History

Update README.md
504eb7b

yuchenlin committed on

Update README.md
a20b0a8

Dongfu Jiang committed on

Update README.md
9ea925c

yuchenlin committed on

Update README.md
2b7fceb

yuchenlin committed on

Update README.md
2cccc72

yuchenlin committed on

Update README.md
8b166f0

yuchenlin committed on

Update README.md
96cc13f

yuchenlin committed on

Update README.md
e066c87

yuchenlin committed on

Update README.md
90f9aa4

yuchenlin committed on

Update README.md
447053d

yuchenlin committed on

Update README.md
c845907

yuchenlin committed on

Update README.md
671d616

yuchenlin committed on

Update README.md
0305f74

yuchenlin committed on

Update README.md
94512ba

yuchenlin committed on

Update README.md
345b1ee

yuchenlin committed on

Update README.md
7333fb2

Dongfu Jiang committed on

Update README.md
14d4a72

Dongfu Jiang committed on

Update README.md
a2f8211

Dongfu Jiang committed on

Update README.md
edac579

Dongfu Jiang committed on

Update README.md
8d9ead8

Dongfu Jiang committed on

Update README.md
00b7e60

Dongfu Jiang committed on

Update README.md
80230fd

Dongfu Jiang committed on

Update README.md
0ef6e21

Dongfu Jiang committed on

Update README.md
bb45a4c

Dongfu Jiang committed on