Hugging Face
Yuandong Tian
tydsh
10 followers · 2 following
https://yuandong-tian.com/
tydsh
yuandong-tian
AI & ML interests
Reinforcement Learning, Optimization, Representation Learning
Recent Activity
Authored a paper 9 days ago: Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Authored a paper 18 days ago: Towards General-Purpose Model-Free Reinforcement Learning
Authored a paper 22 days ago: Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
Organizations
None yet
Papers
21
arxiv:2502.03275
arxiv:2501.16142
arxiv:2501.10799
arxiv:2412.06769
models
None public yet
datasets
None public yet