weishen's picture

8 8 33

weishen

fakerbaby

·

fakerbaby

AI & ML interests

NLP, alignment, LLM

Recent Activity

liked a model 1 day ago

Skywork/Skywork-VL-Reward-7B

liked a Space 13 days ago

opencompass/open_vlm_leaderboard

liked a dataset 14 days ago

Rapidata/700k_Human_Preference_Dataset_FLUX_SD3_MJ_DALLE3

View all activity

Organizations

fakerbaby's activity

upvoted an article 14 days ago

Article

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Dec 9, 2024

• 59

upvoted a collection 5 months ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22 • 35

upvoted 2 collections 7 months ago

Infinity Instruct

16 items • Updated Mar 9 • 9

DeepSeekCoder-V2

6 items • Updated Sep 5, 2024 • 93

upvoted a paper 10 months ago

Secrets of RLHF in Large Language Models Part I: PPO

Paper • 2307.04964 • Published Jul 11, 2023 • 29

upvoted 2 collections 11 months ago

MoEs papers reading list

60 items • Updated Nov 4, 2024 • 140

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 236

upvoted a paper over 1 year ago

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback

Paper • 2310.05199 • Published Oct 8, 2023 • 1