Steven
yijunyang
1 follower · 2 following
stevenyangyj
AI & ML interests
None yet
Recent Activity
authored a paper 2 days ago
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
upvoted a paper 3 days ago
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
commented on a paper 3 days ago
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
Organizations
Papers
6
arxiv:2503.08525
arxiv:2502.00698
arxiv:2410.07484
arxiv:2311.16714
models
1
yijunyang/instructblip-sft-alfworld
Updated Mar 20, 2024
datasets
1
yijunyang/alfworld-sft-dataset
Updated Mar 14, 2024 • 45