Active filters: RLHF
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model • Text Classification • Updated • 13 • 1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model • Text Classification • Updated • 14 • 3
llm-blender/pair-ranker • Updated • 6 • 2
nicholasKluge/RewardModelPT • Text Classification • Updated • 7
nicholasKluge/RewardModel • Text Classification • Updated • 9
fb700/Bofan-chatglm-Best-lora • Updated • 1 • 9
kubernetes-bad/Ligma-L2-13b • Updated • 3 • 3
berkeley-nest/Starling-RM-7B-alpha
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2 • Text Generation • Updated • 1
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2 • Text Generation • Updated • 2 • 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2 • Text Generation • Updated • 2 • 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2 • Text Generation • Updated • 2 • 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2 • Text Generation • Updated • 1 • 2
TheBloke/Starling-LM-7B-alpha-GGUF • Updated • 1.39k • 94
TheBloke/Starling-LM-7B-alpha-AWQ • Text Generation • Updated • 53 • 9
second-state/Starling-LM-7B-alpha-GGUF • Text Generation • Updated • 471 • 3
TheBloke/Starling-LM-7B-alpha-GPTQ • Text Generation • Updated • 117 • 9
bartowski/Starling-LM-7B-alpha-old-exl2 • Text Generation • Updated
tastypear/chatglm-fitness-RLHF-GGML
CallComply/Starling-LM-11B-alpha • Text Generation • Updated • 2.38k • 9
perlthoughts/Starling-LM-11B-alpha-GGUF • Updated • 453 • 9
perlthoughts/Starling-LM-alpha-8x7B-MoE • Text Generation • Updated • 2.23k • 5
TheBloke/Starling-LM-alpha-8x7B-MoE-GGUF • Updated • 354 • 9
TheBloke/Starling-LM-alpha-8x7B-MoE-GPTQ • Text Generation • Updated • 24 • 2
bartowski/Starling-LM-7B-alpha-exl2 • Text Generation • Updated
llm-blender/PairRM-hf • Text Generation • Updated • 312 • 10
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO-adapter
gizmo-ai/Starling-LM-7B-alpha • Text Generation • Updated • 1
gizmo-ai/Starling-LM-7B-alpha-AWQ • Text Generation • Updated • 1
rAIfle/Nous-Hermes-2-Mixtral-8x7B-DPO-exl2-rpcal • Updated