HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published Oct 2 • 21
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published Jun 12 • 16
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper • 2405.01481 • Published May 2 • 25
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues Paper • 2404.03820 • Published Apr 4 • 24