Hugging Face
Models: 18
Active filters: reward_model
Sort: Trending
OpenAssistant/reward-model-deberta-v3-large • Text Classification • Updated Feb 17 • 94 • 16
OpenAssistant/reward-model-deberta-v3-large-v2 • Text Classification • Updated Feb 1 • 40.8k • 105
AlekseyKorshuk/test_reward_model • Updated Dec 22, 2022 • 17
OpenAssistant/reward-model-deberta-v3-base • Text Classification • Updated Jan 26 • 170 • 6
OpenAssistant/reward-model-electra-large-discriminator • Text Classification • Updated Jan 26 • 31 • 4
ChaiML/gpt2_base_retry_and_continue_12m_reward_model • Text Classification • Updated Mar 13 • 80 • 2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model • Text Classification • Updated Mar 13 • 3
ChaiML/gpt2_large_retry_and_continue_12m_reward_model • Text Classification • Updated Mar 13 • 8
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model • Text Classification • Updated Mar 13 • 4 • 1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model • Text Classification • Updated Mar 13 • 6 • 3
oliversssf2/distilbert-base-uncased-rm-helpful • Updated Apr 7
oliversssf2/distilbert-base-uncased-rm-harmless • Updated Apr 7 • 2
oliversssf2/gptneo-1.3B-rm-harmless • Updated Apr 7
oliversssf2/gptneo-1.3B-rm-helpful • Updated Apr 7
oliversssf2/gptneo-1.3B-rm-instructgpt • Updated Apr 16
tatsu-lab/alpaca-farm-reward-model-human-wdiff • Updated May 31 • 2
tatsu-lab/alpaca-farm-reward-model-sim-wdiff • Updated May 31 • 1
angie-chen55/af-rmh • Updated 2 days ago