Models: 263

Active filters: rlhf

TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ

Updated Jan 31 • 10.3k • 39

mlabonne/OrpoLlama-3-8B

Text Generation • Updated 12 days ago • 901 • 47

dfurman/Llama-3-70B-Orpo-v0.1

Text Generation • Updated about 4 hours ago • 482 • 2

sileod/deberta-v3-base-tasksource-nli

Zero-Shot Classification • Updated 11 days ago • 51.2k • 107

sileod/deberta-v3-large-tasksource-nli

Zero-Shot Classification • Updated Feb 17 • 3.81k • 25

lyogavin/Anima33B-DPO-Belle-1k-merged

Text Generation • Updated Jul 2, 2023 • 5 • 10

TheBloke/NeuralBeagle14-7B-GGUF

Updated Jan 17 • 504 • 23

TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF

Updated Jan 31 • 14k • 62

TheBloke/CapybaraHermes-2.5-Mistral-7B-AWQ

Updated Jan 31 • 688 • 11

mlabonne/AlphaMonarch-7B-2bit-HQQ

Text Generation • Updated Mar 28 • 8 • 8

solidrust/OrpoLlama-3-8B-AWQ

Text Generation • Updated 13 days ago • 15 • 3

bartowski/OrpoLlama-3-8B-GGUF

Text Generation • Updated 14 days ago • 4.67k • 4

dfurman/Llama-3-8B-Orpo-v0.1

Text Generation • Updated 6 days ago • 400 • 1

stanfordnlp/SteamSHP-flan-t5-xl

Text2Text Generation • Updated Oct 10, 2023 • 269 • 43

stanfordnlp/SteamSHP-flan-t5-large

Text2Text Generation • Updated Oct 10, 2023 • 67 • 33

trl-lib/llama-7b-se-peft

Updated Apr 6, 2023 • 4

sileod/deberta-v3-large-tasksource-rlhf-reward-model

Text Classification • Updated Mar 28, 2023 • 308 • 10

trl-lib/llama-7b-se-rl-peft

Updated Apr 14, 2023 • 102

trl-lib/llama-7b-se-rm-peft

Updated Apr 6, 2023 • 7

toloka/gpt2-large-rl-prompt-writing

Text Generation • Updated Apr 21, 2023 • 6 • 3

AdamG012/chat-opt-1.3b-rlhf-actor-deepspeed

Text Generation • Updated Apr 25, 2023 • 9 • 5

AdamG012/chat-opt-1.3b-rlhf-critic-deepspeed

Text Generation • Updated Apr 25, 2023 • 6 • 3

AdamG012/chat-opt-1.3b-rlhf-actor-ema-deepspeed

Text Generation • Updated Apr 25, 2023 • 6 • 8

sileod/mdeberta-v3-base-tasksource-nli

Zero-Shot Classification • Updated Oct 19, 2023 • 183 • 13

agi-css/socially-good-lm

Text Generation • Updated May 29, 2023 • 6 • 5

agi-css/hh-rlhf-sft

Text Generation • Updated Jun 1, 2023 • 6 • 3

agi-css/better-base

Text Generation • Updated Jun 1, 2023 • 5 • 5

argilla/roberta-base-reward-model-falcon-dolly

Text Classification • Updated Jun 16, 2023 • 6 • 4

merve/peft-copy-test

Text Generation • Updated Jun 14, 2023 • 1

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • Updated 14 days ago • 295 • 7