Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
mightbe
/
Better-PairRM
like
10
Transformers
Safetensors
openai/summarize_from_feedback
openai/webgpt_comparisons
berkeley-nest/Nectar
Dahoas/instruct-synthetic-prompt-responses
Anthropic/hh-rlhf
lmsys/chatbot_arena_conversations
openbmb/UltraFeedback
argilla/ultrafeedback-binarized-preferences-cleaned
English
deberta-v2
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (3)
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!