-
-
-
-
-
-
Active filters:
RLHF
NousResearch/Hermes-2-Theta-Llama-3-8B
Text Generation
•
Updated
•
7.47k
•
113
NousResearch/Hermes-2-Pro-Llama-3-8B
Text Generation
•
Updated
•
67.1k
•
352
aaditya/Llama3-OpenBioLLM-70B
Text Generation
•
Updated
•
46.7k
•
276
NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF
Updated
•
41.6k
•
136
NousResearch/Hermes-2-Theta-Llama-3-8B-GGUF
Updated
•
12.3k
•
61
NousResearch/Hermes-2-Pro-Mistral-7B
Text Generation
•
Updated
•
22.2k
•
467
Nexusflow/Starling-LM-7B-beta
Text Generation
•
Updated
•
18.2k
•
319
aaditya/Llama3-OpenBioLLM-8B
Text Generation
•
Updated
•
24.3k
•
110
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
Updated
•
25.7k
•
212
ruslanmv/Medical-Llama3-8B
Text Generation
•
Updated
•
4.08k
•
25
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
•
Updated
•
64.4k
•
183
llm-blender/PairRM
Text Generation
•
Updated
•
3.77k
•
159
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
Text Generation
•
Updated
•
26.4k
•
373
TheBloke/Nous-Hermes-2-Mixtral-8x7B-DPO-GPTQ
Text Generation
•
Updated
•
35.5k
•
25
mradermacher/OpenBioLLM-Llama3-70B-GGUF
Updated
•
684
•
3
LiteLLMs/Llama3-OpenBioLLM-70B-GGUF
Updated
•
411
•
2
bartowski/Hermes-2-Pro-Llama-3-8B-GGUF
Text Generation
•
Updated
•
62.4k
•
8
OpenAssistant/reward-model-deberta-v3-base
Text Classification
•
Updated
•
226
•
10
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
•
Updated
•
69
•
5
OpenAssistant/reward-model-deberta-v3-large
Text Classification
•
Updated
•
409
•
20
ChaiML/gpt2_base_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
7
•
2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
9
ChaiML/gpt2_large_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
15
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
12
•
1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model
Text Classification
•
Updated
•
12
•
3
llm-blender/pair-ranker
Updated
•
25
•
2
nicholasKluge/RewardModelPT
Text Classification
•
Updated
•
6
nicholasKluge/RewardModel
Text Classification
•
Updated
•
6
fb700/chatglm-fitness-RLHF
fb700/Bofan-chatglm-Best-lora
Updated
•
1
•
9