Shentao Yang
shentaoyang
https://scholar.google.com/citations?hl=en&user=jxxSLbkAAAAJ&view_op=list_works
Shentao-YANG
shentaoyang
AI & ML interests
Generative AI, Large Language Models, RLHF, RLAIF, Reinforcement Learning
Recent Activity
Authored a paper about 2 months ago: Preference-grounded Token-level Guidance for Language Model Fine-tuning
Authored a paper about 2 months ago: A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Authored a paper about 2 months ago: Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Organizations
None yet
Papers
3
arXiv:2501.02790
arXiv:2402.08265
arXiv:2306.00398
Models
None public yet
Datasets
None public yet