Models (263)

Active filters: rlhf

PKU-Alignment/beaver-7b-v2.0

Reinforcement Learning • Updated 14 days ago • 13

PKU-Alignment/beaver-7b-v2.0-reward

Reinforcement Learning • Updated 14 days ago • 1

PKU-Alignment/beaver-7b-v2.0-cost

Reinforcement Learning • Updated 14 days ago • 1

PKU-Alignment/beaver-7b-v3.0

Reinforcement Learning • Updated 14 days ago • 72

PKU-Alignment/beaver-7b-v3.0-reward

Reinforcement Learning • Updated 14 days ago • 1.27k

PKU-Alignment/beaver-7b-v3.0-cost

Reinforcement Learning • Updated 14 days ago • 1.16k

PKU-Alignment/beaver-7b-unified-reward

Reinforcement Learning • Updated 14 days ago • 251

PKU-Alignment/beaver-7b-unified-cost

Reinforcement Learning • Updated 14 days ago • 249

Aditya685/UpshotLlama-3-8B

Text Generation • Updated 14 days ago • 3

LoneStriker/OrpoLlama-3-8B-GGUF

Updated 13 days ago • 148 • 1

LoneStriker/OrpoLlama-3-8B-3.0bpw-h6-exl2

Text Generation • Updated 13 days ago • 3

LoneStriker/OrpoLlama-3-8B-4.0bpw-h6-exl2

Text Generation • Updated 13 days ago • 1

LoneStriker/OrpoLlama-3-8B-5.0bpw-h6-exl2

Text Generation • Updated 13 days ago • 7

LoneStriker/OrpoLlama-3-8B-6.0bpw-h6-exl2

Text Generation • Updated 13 days ago • 4

LoneStriker/OrpoLlama-3-8B-8.0bpw-h8-exl2

Text Generation • Updated 13 days ago • 1

jalaganapathy/jalaModelRepo

Text Generation • Updated 13 days ago • 1

mlx-community/OrpoLlama-3-8B-4bit

Text Generation • Updated 13 days ago • 8

mlx-community/OrpoLlama-3-8B-8bit

Text Generation • Updated 13 days ago • 8

bartowski/OrpoLlama-3-8B-exl2

Text Generation • Updated 13 days ago • 1 • 1

hus960/OrpoLlama-3-8B-Q4_K_M-GGUF

Updated 12 days ago • 20

DavidAU/AlphaMonarch-7B-Q6_K-GGUF

Updated 11 days ago • 7

QuantFactory/OrpoLlama-3-8B-GGUF

Text Generation • Updated 11 days ago • 1.32k

DavidAU/OrpoLlama-3-8B-Q8_0-GGUF

Updated 10 days ago • 12