Models: 105

Active filters: argilla/ultrafeedback-binarized-preferences-cleaned

QuantFactory/Faro-Yi-9B-DPO-GGUF

Text Generation • Updated May 24 • 92 • 1

vikarti-anatra/Faro-Yi-9B-DPO-6bpw-exl2

Text Generation • Updated May 24 • 18 • 1

vikarti-anatra/Faro-Yi-9B-DPO-8bpw-exl2

Text Generation • Updated May 25 • 18 • 1

bartowski/Faro-Yi-9B-DPO-GGUF

Text Generation • Updated May 24 • 138 • 4

vikarti-anatra/Faro-Yi-9B-DPO-gguf

Text Generation • Updated May 25 • 9 • 1

ggalmeida0/Faro-Yi-9B-DPO-Q8_0-GGUF

Text Generation • Updated May 26 • 2

StefanKrsteski/Phi-3-mini-4k-instruct-DPO-EPFL

Updated Jun 4 • 5

AliE02/NaturalLanguagePioneersDPO

Text Generation • Updated May 30 • 21

allenai/llama-3-tulu-2-dpo-70b

Text Generation • Updated Aug 5 • 94

allenai/llama-3-tulu-2-dpo-8b

Text Generation • Updated Aug 9 • 353 • 1

MoxoffSpA/Moxoff-Phi3Mini-KTO

Text Generation • Updated Jun 27 • 4.2k

MoxoffSpA/Moxoff-Phi3Mini-DPO

Text Generation • Updated Jun 27 • 4.18k

MoxoffSpA/Moxoff-Phi3Mini-ORPO

Text Generation • Updated Jun 27 • 4.22k

MoxoffSpA/Moxoff-Phi3Mini-PPO

Text Generation • Updated Jun 27 • 4.21k

hamishivi/OLMo-1B-0724-Instruct-hf

Text Generation • Updated Jul 17 • 14

mradermacher/llama-3-tulu-2-dpo-70b-GGUF

Updated Jul 19 • 94

mradermacher/llama-3-tulu-2-dpo-8b-GGUF

Updated Jul 20 • 3

mradermacher/llama-3-tulu-2-dpo-70b-i1-GGUF

Updated Aug 2 • 193

mradermacher/llama-3-tulu-2-dpo-8b-i1-GGUF

Updated Aug 2 • 4

JW17/mistral-sft-simpo-cleaned-re

Text Generation • Updated Jul 28 • 19

allenai/llama-3.1-tulu-2-dpo-70b

Updated Aug 15 • 22

allenai/llama-3.1-tulu-2-dpo-8b

Updated Aug 15 • 27

depurator/llama-3.1-tulu-2-dpo-8b-Q4_K_M-GGUF

AINovice2005/ElEmperador

Text Generation • Updated Oct 16 • 282

mradermacher/ElEmperador-GGUF

Updated Oct 14 • 5

miulab/llama2-7b-ultrafeedback-rm

Text Classification • Updated Oct 3 • 109

mradermacher/DPO_mistral_v01_7b_ultra_0131_1k_1epoch-GGUF

Updated Nov 7 • 23

mradermacher/DPO_mistral_v01_7b_ultra_0130_1k-GGUF

Updated Nov 7 • 26

tensorblock/Faro-Yi-9B-DPO-GGUF

Text Generation • Updated Nov 16 • 41

tensorblock/Moxoff-Phi3Mini-PPO-GGUF

Updated Nov 16 • 17