Active filters: RLHF
NousResearch/Hermes-2-Pro-Llama-3-70B • Text Generation • 1.55k • 28
mradermacher/NeonLLM-Base-GGUF • 12.9k • 1
ruslanmv/Medical-Llama3-v2 • Text Generation • 212 • 2
mradermacher/Hermes-2-Pro-Llama-3-70B-GGUF • 1.62k • 1
mradermacher/Athene-70B-GGUF • 344 • 1
mradermacher/Athene-70B-i1-GGUF • 735 • 1
legraphista/Athene-70B-IMat-GGUF • Text Generation • 824 • 1
OpenAssistant/reward-model-deberta-v3-base • Text Classification • 645 • 10
OpenAssistant/reward-model-electra-large-discriminator • Text Classification • 32 • 5
OpenAssistant/reward-model-deberta-v3-large • Text Classification • 631 • 20
OpenAssistant/reward-model-deberta-v3-large-v2 • Text Classification • 19.7k • 191
ChaiML/gpt2_base_retry_and_continue_12m_reward_model • Text Classification • 7 • 2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model • Text Classification • 5
ChaiML/gpt2_large_retry_and_continue_12m_reward_model • Text Classification • 5
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model • Text Classification • 1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model • Text Classification • 1 • 3
llm-blender/pair-ranker • 3 • 2
nicholasKluge/RewardModelPT • Text Classification • 26
nicholasKluge/RewardModel • Text Classification • 20
fb700/chatglm-fitness-RLHF
fb700/Bofan-chatglm-Best-lora • 3 • 9
kubernetes-bad/Ligma-L2-13b • 4 • 3
berkeley-nest/Starling-RM-7B-alpha
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2 • Text Generation • 2
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2 • Text Generation • 6 • 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2 • Text Generation • 6 • 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2 • Text Generation • 6 • 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2 • Text Generation • 7 • 2
TheBloke/Starling-LM-7B-alpha-GGUF • 942 • 94
TheBloke/Starling-LM-7B-alpha-AWQ • Text Generation • 44 • 9