Shaobai Jiang
shaobaij
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
19 days ago
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative
Textual Feedback
upvoted
a
paper
23 days ago
START: Self-taught Reasoner with Tools
upvoted
a
paper
23 days ago
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and
Beyond
Organizations
None yet
shaobaij's activity
If I trained a model on mistral already, do I need to start from scratch due to difficulties of fine-tuning?
2
#62 opened over 1 year ago
by
brando

Cant run the model with the most basic code
6
#7 opened over 1 year ago
by
masterchop