-
-
-
-
-
-
Active filters:
rlhf
argilla/CapybaraHermes-2.5-Mistral-7B
sileod/deberta-v3-small-tasksource-nli
Zero-Shot Classification
•
Updated
•
387k
•
19
TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF
Updated
•
7.23k
•
75
TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ
Updated
•
1.55k
•
48
sileod/deberta-v3-base-tasksource-nli
Zero-Shot Classification
•
Updated
•
19.2k
•
114
sileod/deberta-v3-large-tasksource-nli
Zero-Shot Classification
•
Updated
•
12k
•
29
PKU-Alignment/beaver-7b-v1.0
Reinforcement Learning
•
Updated
•
484
•
9
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
•
Updated
•
17
•
12
PKU-Alignment/beaver-7b-v1.0-cost
Reinforcement Learning
•
Updated
•
19.5k
•
8
ContextualAI/archangel_sft-kto_llama30b
Text Generation
•
Updated
•
33
•
2
mlabonne/NeuralDaredevil-7B
Text Generation
•
Updated
•
1.42M
•
34
TheBloke/CapybaraHermes-2.5-Mistral-7B-AWQ
Updated
•
18.9k
•
17
PKU-Alignment/beaver-7b-unified-cost
Reinforcement Learning
•
Updated
•
34
•
1
line-corporation/sacpo
Reinforcement Learning
•
Updated
•
39
•
4
dfurman/Qwen2-72B-Orpo-v0.1
Text Generation
•
Updated
•
284
•
1
stanfordnlp/SteamSHP-flan-t5-xl
Text2Text Generation
•
Updated
•
281
•
43
stanfordnlp/SteamSHP-flan-t5-large
Text2Text Generation
•
Updated
•
49
•
33
trl-lib/llama-7b-se-peft
sileod/deberta-v3-large-tasksource-rlhf-reward-model
Text Classification
•
Updated
•
38
•
10
trl-lib/llama-7b-se-rl-peft
Updated
•
102
trl-lib/llama-7b-se-rm-peft
toloka/gpt2-large-rl-prompt-writing
Text Generation
•
Updated
•
4
•
3
AdamG012/chat-opt-1.3b-rlhf-actor-deepspeed
Text Generation
•
Updated
•
14
•
5
AdamG012/chat-opt-1.3b-rlhf-critic-deepspeed
Text Generation
•
Updated
•
10
•
3
AdamG012/chat-opt-1.3b-rlhf-actor-ema-deepspeed
Text Generation
•
Updated
•
7
•
8
sileod/mdeberta-v3-base-tasksource-nli
Zero-Shot Classification
•
Updated
•
70
•
14
agi-css/socially-good-lm
Text Generation
•
Updated
•
9
•
5
agi-css/hh-rlhf-sft
Text Generation
•
Updated
•
10
•
3
agi-css/better-base
Text Generation
•
Updated
•
6
•
5
argilla/roberta-base-reward-model-falcon-dolly
Text Classification
•
Updated
•
5
•
4