Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

8-bit precision

Misc with no match

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

309

Full-text search

Active filters: rlhf

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • Updated May 9 • 166 • 10

PKU-Alignment/beaver-7b-v1.0-cost

Reinforcement Learning • Updated Apr 20 • 238 • 9

nvidia/NV-Llama2-70B-RLHF-Chat

Text Generation • Updated Mar 9 • 4

joey00072/ToxicHermes-2.5-Mistral-7B

Text Generation • Updated Dec 16, 2023 • 66 • 18

argilla/distilabeled-OpenHermes-2.5-Mistral-7B

Text Generation • Updated Jan 17 • 21 • 29

mlabonne/NeuralBeagle14-7B

Text Generation • Updated Mar 4 • 205 • 158

mlabonne/NeuralBeagle14-7B-GGUF

Updated Jan 28 • 451 • 46

mlx-community/NeuralBeagle14-7B-4bit-mlx

Updated Jan 17 • 7 • 4

TheBloke/NeuralBeagle14-7B-GGUF

Updated Jan 17 • 646 • 24

argilla/CapybaraHermes-2.5-Mistral-7B

Updated Mar 4 • 119 • 67

tasksource/deberta-small-long-nli

Zero-Shot Classification • Updated Aug 28 • 134k • 37

TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF

Updated Jan 31 • 11k • 100

TheBloke/CapybaraHermes-2.5-Mistral-7B-AWQ

Updated Jan 31 • 395 • 21

TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ

Updated Jan 31 • 990 • 55

mlabonne/AlphaMonarch-7B

Text Generation • Updated Mar 28 • 14.6k • 148

mlabonne/OrpoLlama-3-8B

Text Generation • Updated Jun 15 • 275 • 54

mradermacher/CapybaraHermes-2.5-Mistral-7B-GGUF

Updated 18 days ago • 233 • 1

mradermacher/distilabeled-Hermes-2.5-Mistral-7B-GGUF

Updated 19 days ago • 366 • 1

mradermacher/distilabeled-Hermes-2.5-Mistral-7B-i1-GGUF

Updated 18 days ago • 692 • 1

mradermacher/CapybaraHermes-2.5-Mistral-7B-i1-GGUF

Updated 18 days ago • 577 • 1

sileod/deberta-v3-base-tasksource-nli

Zero-Shot Classification • Updated Aug 13 • 13.3k • 118

stanfordnlp/SteamSHP-flan-t5-xl

Text2Text Generation • Updated Oct 10, 2023 • 270 • 43

stanfordnlp/SteamSHP-flan-t5-large

Text2Text Generation • Updated Oct 10, 2023 • 24 • 33

trl-lib/llama-7b-se-peft

Updated Apr 6, 2023 • 4

sileod/deberta-v3-large-tasksource-nli

Zero-Shot Classification • Updated Feb 17 • 2.74k • 31

sileod/deberta-v3-large-tasksource-rlhf-reward-model

Text Classification • Updated Mar 28, 2023 • 1.35k • 11

trl-lib/llama-7b-se-rl-peft

Updated Apr 14, 2023 • 103

trl-lib/llama-7b-se-rm-peft

Updated Apr 6, 2023 • 8

toloka/gpt2-large-rl-prompt-writing

Text Generation • Updated Apr 21, 2023 • 14 • 3

AdamG012/chat-opt-1.3b-rlhf-actor-deepspeed

Text Generation • Updated Apr 25, 2023 • 34 • 5