Models

267

Active filters: rlhf

LoneStriker/NeuralMonarch-7B-GPTQ

Text Generation • Updated Feb 19 • 8

LoneStriker/AlphaMonarch-7B-GPTQ

Text Generation • Updated Feb 19 • 21 • 3

mlx-community/AlphaMonarch-7B-mlx-4bit

Updated Feb 19 • 1 • 3

mlx-community/AlphaMonarch-7B-mlx

Updated Feb 19 • 1 • 4

sugatoray/mlx-neuralhermes-2.5-mistral-7b-q4bits

Updated Feb 25 • 1

sugatoray/mlx-alphamonarch-7b-q4bits

Updated Mar 4 • 1

ArchiveAI/AlphaMonarch-7B

Text Generation • Updated Mar 1 • 1

ContextualAI/Contextual_KTO_Mistral_PairRM

Text Generation • Updated Apr 26 • 2.57k • 25

solidrust/NeuralHermes-2.5-Mistral-7B-laser-AWQ

Text Generation • Updated Mar 12 • 6

solidrust/NeuralMonarch-7B-AWQ

Text Generation • Updated Mar 12 • 7

solidrust/AlphaMonarch-7B-AWQ

Text Generation • Updated Mar 12 • 6

abdullahalzubaer/NeuralHermes-2.5-Mistral-7B

Text Generation • Updated Mar 13 • 2 • 1

koesn/NeuralHermes-2.5-Mistral-7B-GGUF

Updated Mar 10 • 162

delayedkarma/NeuralHermes-2.5-Mistral-7B

Text Generation • Updated Mar 10 • 1.49k • 1

asedmammad/Contextual_KTO_Mistral_PairRM-GGUF

Updated Mar 11 • 447 • 1

danilopeixoto/pandora-7b-chat

Text Generation • Updated Mar 24 • 1

solidrust/NeuralBeagle14-7B-AWQ

Text Generation • Updated Mar 12 • 6

vibhorg/rl4llm_uofm_nlpo_super_t5_arxiv

Text2Text Generation • Updated Mar 20 • 2

umarigan/Trendyol-LLM-7b-chat-v1.0-RLHF

Question Answering • Updated Mar 16

vibhorg/rl4llm_uofm_nlpo_unsuper_t5_arxiv

Text2Text Generation • Updated Mar 20 • 1

mlabonne/AlphaMonarch-7B-GPTQ

Text Generation • Updated Mar 28 • 6

mlabonne/AlphaMonarch-7B-AWQ

Text Generation • Updated Mar 28 • 7 • 1

mlabonne/AlphaMonarch-7B-2bit-HQQ

Text Generation • Updated Mar 28 • 2 • 8

mlabonne/AlphaMonarch-7B-5.0bpw-exl2

Text Generation • Updated Mar 28 • 6

mlx-community/CapybaraHermes-2.5-Mistral-7B

Updated Apr 7 • 2

mlabonne/OrpoLlama-3-8B

Text Generation • Updated 14 days ago • 2.45k • 49

solidrust/OrpoLlama-3-8B-AWQ

Text Generation • Updated Apr 21 • 5 • 3

PKU-Alignment/beaver-7b-v2.0

Reinforcement Learning • Updated about 1 month ago • 260

PKU-Alignment/beaver-7b-v2.0-reward

Reinforcement Learning • Updated Apr 20 • 7

PKU-Alignment/beaver-7b-v2.0-cost

Reinforcement Learning • Updated Apr 20 • 5