Jixuan Leng's picture

Jixuan Leng

Sean123321

AI & ML interests

None yet

Recent Activity

updated a collection about 21 hours ago

updated a collection 5 days ago

updated a collection 6 days ago

View all activity

Organizations

Sean123321's activity

updated a collection about 21 hours ago

VLM

6 items • Updated about 21 hours ago

updated a collection 5 days ago

VLM

6 items • Updated about 21 hours ago

updated a collection 6 days ago

VLM

6 items • Updated about 21 hours ago

updated a collection 12 days ago

VLM

6 items • Updated about 21 hours ago

updated a collection about 2 months ago

VLM

6 items • Updated about 21 hours ago

authored a paper 2 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Paper • 2410.09724 • Published Oct 13 • 2

updated 6 models 2 months ago

HINT-lab/llama3-8b-dpo-v0.2

Text Generation • Updated Oct 12 • 15

HINT-lab/llama3-8b-cdpo-v0.2

Text Generation • Updated Oct 12 • 15

HINT-lab/mistral-7b-ppo-hermes-v0.3

Text Generation • Updated Oct 12 • 9 • 1

HINT-lab/mistral-7b-ppo-clean-hermes

Text Generation • Updated Oct 12 • 13

HINT-lab/llama3-8b-final-ppo-v0.3

Text Generation • Updated Oct 12 • 14

HINT-lab/llama3-8b-final-ppo-clean-v0.1

Text Generation • Updated Oct 12 • 67

updated a dataset 2 months ago

HINT-lab/prompt-collections-final-v0.3

Viewer • Updated Oct 11 • 20.5k • 38

updated a model 2 months ago

HINT-lab/mistral-7b-hermes-rm-skywork

Updated Oct 11 • 2

updated a model 3 months ago

HINT-lab/mistral-7b-hermes-dpo-v0.2

Text Generation • Updated Oct 10 • 11

updated a dataset 3 months ago

HINT-lab/calibration_preference_mixture_final-v0.1

Viewer • Updated Oct 10 • 25.5k • 44

updated a model 3 months ago

HINT-lab/mistral-7b-hermes-cdpo-v0.2

Text Generation • Updated Oct 10 • 13