Models: 522

Active filters: reward-trainer

chandrasekhar319/reward_model_tinyllama_sql

Updated Jun 19, 2024 • 2

mnoukhov/pythia410m-rm-tldr6.9b

Text Classification • Updated Jun 20, 2024 • 248

vwxyzjn/rm_1b

Text Classification • Updated Jun 20, 2024 • 105

SiMajid/value_reward_modeling

Text Classification • Updated Jun 21, 2024 • 107

SiMajid/deberta_value

Text Classification • Updated Jun 22, 2024 • 72

SiMajid/xlm-roberta-base

Text Classification • Updated Jun 21, 2024 • 129

SiMajid/opt-350-value

Text Classification • Updated Jun 22, 2024 • 110

smohammadi/tinyllama_rm_sentiment_1b

Text Classification • Updated Jun 28, 2024 • 119

prometheus04/tinystarcoder-rlhf-model

Text Generation • Updated Jun 29, 2024 • 92

Baidicoot/reward_modeling

Updated Jul 2, 2024 • 2

Baidicoot/gemma-2b-jailbreak-RM

Updated Jul 2, 2024 • 7 • 1

mnoukhov/pythia160m-rm-tldr6.9b

Text Classification • Updated Jul 4, 2024 • 34

mnoukhov/pythia1b-rm-tldr6.9b

Text Classification • Updated Jul 3, 2024 • 113

blai88/reward_modeling_anthropic_hh

Updated Jul 6, 2024 • 16

mnoukhov/pythia2.8b-rm-tldr6.9b

Text Classification • Updated Jul 7, 2024 • 270

steve-sli/0721_185958-google-gemma-2b

Updated Jul 21, 2024 • 2

steve-sli/0721_201833-google-gemma-2b

Updated Jul 21, 2024 • 5

steve-sli/0721_210648-google-gemma-2b

Updated Jul 21, 2024 • 2

steve-sli/0721_210856-google-gemma-2b

Updated Jul 21, 2024 • 2

steve-sli/0721_211205-google-gemma-2b

Updated Jul 21, 2024 • 3

steve-sli/0721_222324-google-gemma-2b

Updated Jul 21, 2024 • 2

SiMajid/value-reward-model-opt-350m-v3

Text Classification • Updated Jul 23, 2024 • 108

SiMajid/value-reward-model-opt-350m-v11

Text Classification • Updated Jul 25, 2024 • 104

SiMajid/value-reward-model-opt-350m-v12

Text Classification • Updated Jul 25, 2024 • 104

Penghaoo/workspace

Updated Jul 25, 2024 • 3

ChokeGM/train_dir

Text Classification • Updated Jul 26, 2024 • 106

SiMajid/value-reward-model-opt-350m-v15

Text Classification • Updated Jul 28, 2024 • 114

SiMajid/value-reward-model-opt-350m-v16

Text Classification • Updated Jul 28, 2024 • 105

nahed22/rm_checkpoint

Text Generation • Updated Jul 30, 2024 • 143

RylanSchaeffer/pythia-70m_tatsu-lab_alpaca_farm_sftsd0_policy_pythia-6.9b_gold_offsetbias-8b_noise0.25_rmsd0

Text Classification • Updated Jul 30, 2024 • 105