Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
arxiv:
2405.00675
Inference Endpoints
AutoTrain Compatible
text-generation-inference
4-bit precision
Eval Results
8-bit precision
Merge
Misc with no match
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
57
Full-text search
Edit filters
Sort: Trending
Active filters:
2405.00675
Clear all
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
Text Generation
•
Updated
Jul 1
•
9.14k
•
117
general-preference/GPO-Llama-3-8B-Instruct-GPM-2B
Text Generation
•
Updated
Oct 11
•
24
•
2
UCLA-AGI/Mistral7B-PairRM-SPPO
Text Generation
•
Updated
May 7
•
3.54k
•
6
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3
Text Generation
•
Updated
May 7
•
5.72k
•
5
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2
Text Generation
•
Updated
May 6
•
5.73k
•
1
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter1
Text Generation
•
Updated
May 6
•
2.92k
•
1
LoneStriker/Mistral7B-PairRM-SPPO-GGUF
Text Generation
•
Updated
May 6
•
13
LoneStriker/Mistral7B-PairRM-SPPO-3.0bpw-h6-exl2
Text Generation
•
Updated
May 6
•
6
LoneStriker/Mistral7B-PairRM-SPPO-4.0bpw-h6-exl2
Text Generation
•
Updated
May 6
•
4
LoneStriker/Mistral7B-PairRM-SPPO-5.0bpw-h6-exl2
Text Generation
•
Updated
May 6
•
6
LoneStriker/Mistral7B-PairRM-SPPO-6.0bpw-h6-exl2
Text Generation
•
Updated
May 6
•
5
LoneStriker/Mistral7B-PairRM-SPPO-8.0bpw-h8-exl2
Text Generation
•
Updated
May 6
•
3
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1
Text Generation
•
Updated
Jun 25
•
4.91k
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2
Text Generation
•
Updated
Jun 25
•
7.3k
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
Text Generation
•
Updated
Jun 28
•
7k
•
77
bullerwins/Llama-3-Instruct-8B-SPPO-Iter3-exl2_4.0bpw
Text Generation
•
Updated
Jun 26
•
9
bullerwins/Llama-3-Instruct-8B-SPPO-Iter3-exl2_8.0bpw
Text Generation
•
Updated
Jun 26
•
10
bullerwins/Llama-3-Instruct-8B-SPPO-Iter3-exl2_5.0bpw
Text Generation
•
Updated
Jun 26
•
5
bullerwins/Llama-3-Instruct-8B-SPPO-Iter3-exl2_6.0bpw
Text Generation
•
Updated
Jun 26
•
6
QuantFactory/Llama-3-Instruct-8B-SPPO-Iter3-GGUF
Text Generation
•
Updated
Jun 28
•
304
•
1
blockblockblock/Llama-3-Instruct-8B-SPPO-Iter3-bpw6-exl2
Text Generation
•
Updated
Jun 28
•
5
blockblockblock/Llama-3-Instruct-8B-SPPO-Iter3-bpw5.5-exl2
Text Generation
•
Updated
Jun 28
blockblockblock/Llama-3-Instruct-8B-SPPO-Iter3-bpw4-exl2
Text Generation
•
Updated
Jun 29
•
4
blockblockblock/Llama-3-Instruct-8B-SPPO-Iter3-bpw4.6-exl2
Text Generation
•
Updated
Jun 28
•
4
blockblockblock/Llama-3-Instruct-8B-SPPO-Iter3-bpw5-exl2
Text Generation
•
Updated
Jun 28
•
4
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1
Text Generation
•
Updated
Jul 1
•
5.23k
•
3
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2
Text Generation
•
Updated
Jul 1
•
5.23k
•
2
qwp4w3hyb/Gemma-2-9B-It-SPPO-Iter3-iMat-GGUF
Text Generation
•
Updated
Jul 3
•
74
v000000/Llama-3-Instruct-15B-SPPO-Iter3-SH
Text Generation
•
Updated
Jul 13
•
7
v000000/Llama-3-Instruct-15B-SPPO-Iter3-SH-Q5_K_M-GGUF
Updated
Jul 13
•
3
•
1
Previous
1
2
Next