Edit Models filters

arxiv: 2312.11456

AutoTrain Compatible

Inference Endpoints

text-generation-inference

Other with no match

4-bit precision

8-bit precision

Carbon Emissions

Mixture of Experts

Models

11

Full-text search

Active filters: 2312.11456

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • Updated Apr 24 • 18.2k • 25

weqweasdas/RM-Mistral-7B

Text Classification • Updated Mar 31 • 4.16k • 19

RLHFlow/LLaMA3-iterative-DPO-final

Text Generation • Updated 9 days ago • 1.26k • 34

snorkelai/Snorkel-Mistral-PairRM-DPO

Text Generation • Updated about 1 month ago • 3.64k • 103

sfairXC/FsfairX-Zephyr-Chat-v0.1

Text Generation • Updated Apr 24 • 2.21k • 7

qwp4w3hyb/SFR-Iterative-DPO-LLaMA-3-8B-R-iMat-GGUF

Text Generation • Updated 27 days ago • 5.7k • 1

TriAiExperiments/SFR-Iterative-DPO-LLaMA-3-8B-R

Text Generation • Updated 18 days ago • 19

sirovub/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF

Text Generation • Updated 17 days ago • 132

Apel-sin/llama-3-8B-iterative-DPO-final-exl2

Updated 17 days ago • 1

thesven/SFR-Iterative-DPO-LLaMA-3-8B-R-GGUF

Updated 18 days ago • 1.86k

sirovub/LLaMA3-iterative-DPO-final-GGUF

Text Generation • Updated 17 days ago • 100