Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
6
1
Steven
yijunyang
Follow
kanxue's profile picture
haonanzhang's profile picture
2 followers
·
2 following
stevenyangyj
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing
authored
a paper
about 1 month ago
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
upvoted
a
paper
about 1 month ago
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
View all activity
Organizations
Papers
6
arxiv:
2503.08525
arxiv:
2502.00698
arxiv:
2410.07484
arxiv:
2311.16714
Expand 6 papers
models
1
yijunyang/instructblip-sft-alfworld
Updated
Mar 20, 2024
datasets
1
yijunyang/alfworld-sft-dataset
Preview
•
Updated
Mar 14, 2024
•
16