Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
rlhf
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Merge
Eval Results
4-bit precision
8-bit precision
custom_code
Other with no match
Carbon Emissions
Mixture of Experts
Apply filters
Models
264
Full-text search
Edit filters
Sort: Trending
Active filters:
rlhf
Clear all
LoneStriker/NeuralMonarch-7B-AWQ
Text Generation
•
Updated
Feb 19
•
2
LoneStriker/AlphaMonarch-7B-AWQ
Text Generation
•
Updated
Feb 19
•
106
•
2
LoneStriker/NeuralMonarch-7B-GPTQ
Text Generation
•
Updated
Feb 19
•
4
LoneStriker/AlphaMonarch-7B-GPTQ
Text Generation
•
Updated
Feb 19
•
18
•
3
mlx-community/AlphaMonarch-7B-mlx-4bit
Updated
Feb 19
•
1
•
3
mlx-community/AlphaMonarch-7B-mlx
Updated
Feb 19
•
4
sugatoray/mlx-neuralhermes-2.5-mistral-7b-q4bits
Updated
Feb 25
sugatoray/mlx-alphamonarch-7b-q4bits
Updated
Mar 4
•
2
ArchiveAI/AlphaMonarch-7B
Text Generation
•
Updated
Mar 1
•
3
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
•
Updated
22 days ago
•
3.08k
•
24
solidrust/NeuralHermes-2.5-Mistral-7B-laser-AWQ
Text Generation
•
Updated
Mar 12
•
6
solidrust/NeuralMonarch-7B-AWQ
Text Generation
•
Updated
Mar 12
•
4
solidrust/AlphaMonarch-7B-AWQ
Text Generation
•
Updated
Mar 12
•
9
abdullahalzubaer/NeuralHermes-2.5-Mistral-7B
Text Generation
•
Updated
Mar 13
•
3
•
1
koesn/NeuralHermes-2.5-Mistral-7B-GGUF
Updated
Mar 10
•
86
delayedkarma/NeuralHermes-2.5-Mistral-7B
Text Generation
•
Updated
Mar 10
•
1.96k
•
1
asedmammad/Contextual_KTO_Mistral_PairRM-GGUF
Updated
Mar 11
•
235
•
1
danilopeixoto/pandora-7b-chat
Text Generation
•
Updated
Mar 24
•
3
solidrust/NeuralBeagle14-7B-AWQ
Text Generation
•
Updated
Mar 12
•
8
vibhorg/rl4llm_uofm_nlpo_super_t5_arxiv
Text2Text Generation
•
Updated
Mar 20
•
2
umarigan/Trendyol-LLM-7b-chat-v1.0-RLHF
Question Answering
•
Updated
Mar 16
vibhorg/rl4llm_uofm_nlpo_unsuper_t5_arxiv
Text2Text Generation
•
Updated
Mar 20
•
1
mlabonne/AlphaMonarch-7B-GPTQ
Text Generation
•
Updated
Mar 28
•
7
mlabonne/AlphaMonarch-7B-AWQ
Text Generation
•
Updated
Mar 28
•
8
•
1
mlabonne/AlphaMonarch-7B-2bit-HQQ
Text Generation
•
Updated
Mar 28
•
3
•
8
mlabonne/AlphaMonarch-7B-5.0bpw-exl2
Text Generation
•
Updated
Mar 28
•
8
mlx-community/CapybaraHermes-2.5-Mistral-7B
Updated
Apr 7
•
2
solidrust/OrpoLlama-3-8B-AWQ
Text Generation
•
Updated
27 days ago
•
17
•
3
PKU-Alignment/beaver-7b-v2.0
Reinforcement Learning
•
Updated
9 days ago
•
16
PKU-Alignment/beaver-7b-v2.0-reward
Reinforcement Learning
•
Updated
28 days ago
•
1
Previous
1
...
6
7
8
9
Next