Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
arxiv:
2312.11456
AutoTrain Compatible
Inference Endpoints
text-generation-inference
Other with no match
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
11
Full-text search
Edit filters
Sort: Trending
Active filters:
2312.11456
Clear all
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
Apr 24
•
18.2k
•
25
weqweasdas/RM-Mistral-7B
Text Classification
•
Updated
Mar 31
•
4.16k
•
19
RLHFlow/LLaMA3-iterative-DPO-final
Text Generation
•
Updated
9 days ago
•
1.26k
•
34
snorkelai/Snorkel-Mistral-PairRM-DPO
Text Generation
•
Updated
about 1 month ago
•
3.64k
•
103
sfairXC/FsfairX-Zephyr-Chat-v0.1
Text Generation
•
Updated
Apr 24
•
2.21k
•
7
qwp4w3hyb/SFR-Iterative-DPO-LLaMA-3-8B-R-iMat-GGUF
Text Generation
•
Updated
27 days ago
•
5.7k
•
1
TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R
Text Generation
•
Updated
18 days ago
•
19
sirovub/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF
Text Generation
•
Updated
17 days ago
•
132
Apel-sin/llama-3-8B-iterative-DPO-final-exl2
Updated
17 days ago
•
1
thesven/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF
Updated
18 days ago
•
1.86k
sirovub/LLaMA3-iterative-DPO-final-GGUF
Text Generation
•
Updated
17 days ago
•
100