Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
rlhf
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Merge
Eval Results
4-bit precision
8-bit precision
custom_code
Other with no match
Carbon Emissions
Mixture of Experts
Apply filters
Models
267
Full-text search
Edit filters
Sort: Trending
Active filters:
rlhf
Clear all
LoneStriker/NeuralMonarch-7B-GPTQ
Text Generation
•
Updated
Feb 19
•
8
LoneStriker/AlphaMonarch-7B-GPTQ
Text Generation
•
Updated
Feb 19
•
21
•
3
mlx-community/AlphaMonarch-7B-mlx-4bit
Updated
Feb 19
•
1
•
3
mlx-community/AlphaMonarch-7B-mlx
Updated
Feb 19
•
1
•
4
sugatoray/mlx-neuralhermes-2.5-mistral-7b-q4bits
Updated
Feb 25
•
1
sugatoray/mlx-alphamonarch-7b-q4bits
Updated
Mar 4
•
1
ArchiveAI/AlphaMonarch-7B
Text Generation
•
Updated
Mar 1
•
1
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
•
Updated
Apr 26
•
2.57k
•
25
solidrust/NeuralHermes-2.5-Mistral-7B-laser-AWQ
Text Generation
•
Updated
Mar 12
•
6
solidrust/NeuralMonarch-7B-AWQ
Text Generation
•
Updated
Mar 12
•
7
solidrust/AlphaMonarch-7B-AWQ
Text Generation
•
Updated
Mar 12
•
6
abdullahalzubaer/NeuralHermes-2.5-Mistral-7B
Text Generation
•
Updated
Mar 13
•
2
•
1
koesn/NeuralHermes-2.5-Mistral-7B-GGUF
Updated
Mar 10
•
162
delayedkarma/NeuralHermes-2.5-Mistral-7B
Text Generation
•
Updated
Mar 10
•
1.49k
•
1
asedmammad/Contextual_KTO_Mistral_PairRM-GGUF
Updated
Mar 11
•
447
•
1
danilopeixoto/pandora-7b-chat
Text Generation
•
Updated
Mar 24
•
1
solidrust/NeuralBeagle14-7B-AWQ
Text Generation
•
Updated
Mar 12
•
6
vibhorg/rl4llm_uofm_nlpo_super_t5_arxiv
Text2Text Generation
•
Updated
Mar 20
•
2
umarigan/Trendyol-LLM-7b-chat-v1.0-RLHF
Question Answering
•
Updated
Mar 16
vibhorg/rl4llm_uofm_nlpo_unsuper_t5_arxiv
Text2Text Generation
•
Updated
Mar 20
•
1
mlabonne/AlphaMonarch-7B-GPTQ
Text Generation
•
Updated
Mar 28
•
6
mlabonne/AlphaMonarch-7B-AWQ
Text Generation
•
Updated
Mar 28
•
7
•
1
mlabonne/AlphaMonarch-7B-2bit-HQQ
Text Generation
•
Updated
Mar 28
•
2
•
8
mlabonne/AlphaMonarch-7B-5.0bpw-exl2
Text Generation
•
Updated
Mar 28
•
6
mlx-community/CapybaraHermes-2.5-Mistral-7B
Updated
Apr 7
•
2
mlabonne/OrpoLlama-3-8B
Text Generation
•
Updated
14 days ago
•
2.45k
•
49
solidrust/OrpoLlama-3-8B-AWQ
Text Generation
•
Updated
Apr 21
•
5
•
3
PKU-Alignment/beaver-7b-v2.0
Reinforcement Learning
•
Updated
about 1 month ago
•
260
PKU-Alignment/beaver-7b-v2.0-reward
Reinforcement Learning
•
Updated
Apr 20
•
7
PKU-Alignment/beaver-7b-v2.0-cost
Reinforcement Learning
•
Updated
Apr 20
•
5
Previous
1
...
6
7
8
9
Next