-
-
-
-
-
-
Active filters:
RLHF
Nexusflow/Starling-LM-7B-beta
Text Generation
•
Updated
•
27.5k
•
298
NousResearch/Hermes-2-Pro-Mistral-7B
Text Generation
•
Updated
•
77.5k
•
430
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
Text Generation
•
Updated
•
77.4k
•
349
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
Updated
•
34.9k
•
196
llm-blender/PairRM
Text Generation
•
Updated
•
1.34k
•
155
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
•
Updated
•
20.9k
•
175
tanamettpk/TC-instruct-DPO
Text Generation
•
Updated
•
260
•
6
Hemanth-thunder/Tamil-Mistral-7B-Instruct-v0.1
Text Generation
•
Updated
•
2.16k
•
9
MaziyarPanahi/Starling-LM-7B-beta-GPTQ
Text Generation
•
Updated
•
673
•
3
johnsnowlabs/JSL-MedMNX-7B
Text Generation
•
Updated
•
299
•
2
TheBloke/Starling-LM-7B-alpha-GGUF
Updated
•
2.92k
•
93
perlthoughts/Starling-LM-11B-alpha-GGUF
Updated
•
592
•
9
NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF
Updated
•
25.1k
•
45
solidrust/Nous-Hermes-2-Mistral-7B-DPO-AWQ
Text Generation
•
Updated
•
118
•
6
bartowski/Hermes-2-Pro-Mistral-7B-GGUF
Text Generation
•
Updated
•
431
•
2
bartowski/Starling-LM-7B-beta-GGUF
Text Generation
•
Updated
•
8.53k
•
19
LoneStriker/Starling-LM-7B-beta-8.0bpw-h8-exl2
Text Generation
•
Updated
•
380
•
3
mightbe/Better-PairRM
Updated
•
53
•
9
johnsnowlabs/JSL-MedMNX-7B-v2.0
Text Generation
•
Updated
•
1
OpenAssistant/reward-model-deberta-v3-base
Text Classification
•
Updated
•
4.6k
•
10
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
•
Updated
•
123
•
5
OpenAssistant/reward-model-deberta-v3-large
Text Classification
•
Updated
•
364
•
20
ChaiML/gpt2_base_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
7
•
2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
4
ChaiML/gpt2_large_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
4
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
3
•
1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model
Text Classification
•
Updated
•
2
•
3
llm-blender/pair-ranker
Updated
•
12
•
2
nicholasKluge/RewardModelPT
Text Classification
•
Updated
•
32
nicholasKluge/RewardModel
Text Classification
•
Updated
•
1