-
-
-
-
-
-
Active filters:
RLHF
NousResearch/Hermes-2-Pro-Llama-3-8B
Text Generation
•
Updated
•
63.5k
•
358
NousResearch/Hermes-2-Theta-Llama-3-8B
Text Generation
•
Updated
•
8.91k
•
118
aaditya/Llama3-OpenBioLLM-70B
Text Generation
•
Updated
•
6.79k
•
281
NousResearch/Hermes-2-Theta-Llama-3-8B-GGUF
Updated
•
14.4k
•
65
aaditya/Llama3-OpenBioLLM-8B
Text Generation
•
Updated
•
22.7k
•
114
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
Text Generation
•
Updated
•
22.5k
•
375
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
Updated
•
24k
•
214
llm-blender/PairRM
Text Generation
•
Updated
•
11.7k
•
161
NousResearch/Nous-Hermes-2-Mistral-7B-DPO
Text Generation
•
Updated
•
22.3k
•
149
NousResearch/Hermes-2-Pro-Mistral-7B
Text Generation
•
Updated
•
18.9k
•
469
ruslanmv/Medical-Llama3-8B
Text Generation
•
Updated
•
5.92k
•
27
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
•
Updated
•
63.5k
•
184
fb700/chatglm-fitness-RLHF
Nexusflow/Starling-LM-7B-beta
Text Generation
•
Updated
•
16.7k
•
319
mradermacher/OpenBioLLM-Llama3-70B-i1-GGUF
Updated
•
1.42k
•
7
OpenPipe/Hermes-2-Theta-Llama-3-8B-32k
Text Generation
•
Updated
•
435
•
1
OpenAssistant/reward-model-deberta-v3-base
Text Classification
•
Updated
•
156
•
10
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
•
Updated
•
19
•
5
OpenAssistant/reward-model-deberta-v3-large
Text Classification
•
Updated
•
268
•
20
ChaiML/gpt2_base_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
8
•
2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
10
ChaiML/gpt2_large_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
16
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
13
•
1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model
Text Classification
•
Updated
•
13
•
3
llm-blender/pair-ranker
Updated
•
28
•
2
nicholasKluge/RewardModelPT
Text Classification
•
Updated
•
8
nicholasKluge/RewardModel
Text Classification
•
Updated
•
7
fb700/Bofan-chatglm-Best-lora
Updated
•
1
•
9
kubernetes-bad/Ligma-L2-13b
Updated
•
3
•
3
berkeley-nest/Starling-LM-7B-alpha
Text Generation
•
Updated
•
35.3k
•
549