Harshit Sikchi's picture

1 2 1

Harshit Sikchi

hsikchi

·

AI & ML interests

None yet

Recent Activity

commented on a paper about 2 months ago

RL Zero: Zero-Shot Language to Behaviors without any Supervision

upvoted a paper 8 months ago

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

authored a paper 8 months ago

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

View all activity

Organizations

hsikchi's activity

upvoted a paper 8 months ago

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Paper • 2406.02900 • Published Jun 5, 2024 • 12

upvoted a paper over 1 year ago

Contrastive Prefence Learning: Learning from Human Feedback without RL

Paper • 2310.13639 • Published Oct 20, 2023 • 25