Hugging Face
Models
Datasets
Spaces
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
RLHF
Has a Space
text-generation-inference
Other with no match
AutoTrain Compatible
Eval Results
custom_code
Carbon Emissions
8-bit precision
Apply filters
Models
16
new
Full-text search
Edit filters
Sort: Trending
Active filters:
RLHF
Clear all
fb700/chatglm-fitness-RLHF
Updated
about 1 month ago
•
236
OpenAssistant/reward-model-deberta-v3-large
Text Classification
•
Updated
Feb 17
•
94
•
16
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
•
Updated
Feb 1
•
40.8k
•
105
fb700/Bofan-chatglm-Best-lora
Updated
30 days ago
•
7
OpenAssistant/reward-model-deberta-v3-base
Text Classification
•
Updated
Jan 26
•
170
•
6
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
•
Updated
Jan 26
•
31
•
4
ChaiML/gpt2_base_retry_and_continue_12m_reward_model
Text Classification
•
Updated
Mar 13
•
80
•
2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model
Text Classification
•
Updated
Mar 13
•
3
ChaiML/gpt2_large_retry_and_continue_12m_reward_model
Text Classification
•
Updated
Mar 13
•
8
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model
Text Classification
•
Updated
Mar 13
•
4
•
1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model
Text Classification
•
Updated
Mar 13
•
6
•
3
nicholasKluge/RewardModelPT
Text Classification
•
Updated
22 days ago
•
30
nicholasKluge/RewardModel
Text Classification
•
Updated
23 days ago
•
30
nicholasKluge/Aira-RLHF-124M
Text Generation
•
Updated
Aug 6
•
1
Yu-Yang-Li/StarGLM
Updated
24 days ago
•
3
kubernetes-bad/Ligma-L2-13b
Updated
3 days ago
•
2
•
2