Models: 267

Active filters: rlhf

PKU-Alignment/beaver-7b-v2.0-cost

Reinforcement Learning • Updated Apr 20 • 4

PKU-Alignment/beaver-7b-v3.0

Reinforcement Learning • Updated 25 days ago • 190

PKU-Alignment/beaver-7b-v3.0-reward

Reinforcement Learning • Updated Apr 20 • 1.76k

PKU-Alignment/beaver-7b-v3.0-cost

Reinforcement Learning • Updated Apr 20 • 1.22k

PKU-Alignment/beaver-7b-unified-reward

Reinforcement Learning • Updated Apr 20 • 1.09k

PKU-Alignment/beaver-7b-unified-cost

Reinforcement Learning • Updated Apr 20 • 321

Aditya685/UpshotLlama-3-8B

Text Generation • Updated Apr 20

bartowski/OrpoLlama-3-8B-GGUF

Text Generation • Updated Apr 20 • 610 • 4

QuantFactory/NeuralDaredevil-7B-GGUF

Text Generation • Updated 10 days ago • 2.62k

LoneStriker/OrpoLlama-3-8B-GGUF

Updated Apr 21 • 103 • 1

LoneStriker/OrpoLlama-3-8B-3.0bpw-h6-exl2

Text Generation • Updated Apr 21 • 5

LoneStriker/OrpoLlama-3-8B-4.0bpw-h6-exl2

Text Generation • Updated Apr 21 • 1

LoneStriker/OrpoLlama-3-8B-5.0bpw-h6-exl2

Text Generation • Updated Apr 21 • 4

LoneStriker/OrpoLlama-3-8B-6.0bpw-h6-exl2

Text Generation • Updated Apr 21 • 6

LoneStriker/OrpoLlama-3-8B-8.0bpw-h8-exl2

Text Generation • Updated Apr 21

jalaganapathy/jalaModelRepo

Text Generation • Updated Apr 21 • 7

mlx-community/OrpoLlama-3-8B-4bit

Text Generation • Updated Apr 21 • 2

mlx-community/OrpoLlama-3-8B-8bit

Text Generation • Updated Apr 21 • 1

bartowski/OrpoLlama-3-8B-exl2

Text Generation • Updated Apr 21 • 1

hus960/OrpoLlama-3-8B-Q4_K_M-GGUF

Updated Apr 23 • 49

DavidAU/AlphaMonarch-7B-Q6_K-GGUF

Updated Apr 24 • 22

QuantFactory/OrpoLlama-3-8B-GGUF

Text Generation • Updated Apr 24 • 288

DavidAU/OrpoLlama-3-8B-Q8_0-GGUF

Updated Apr 25 • 58

dfurman/Llama-3-8B-Orpo-v0.1

Text Generation • Updated Apr 29 • 2.61k • 1

dfurman/Llama-3-70B-Orpo-v0.1

Text Generation • Updated 29 days ago • 2.61k • 2

newsletter/CapybaraHermes-2.5-Mistral-7B-Q6_K-GGUF

Updated 20 days ago • 28 • 1

mradermacher/archangel_sft-kto_llama30b-i1-GGUF

Updated 1 day ago • 223