Shashwat Goel's picture

5 2 4

Shashwat Goel

shash42

·

https://www.shash42.github.io

AI & ML interests

Science of Deep Learning, Safe AI

Recent Activity

liked a dataset about 2 months ago

bethgelab/REFUTE

authored a paper about 2 months ago

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

commented on a paper about 2 months ago

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

View all activity

Organizations

shash42's activity

commented a paper about 2 months ago

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Paper • 2502.19414 • Published Feb 26 • 20 •

commented a paper 2 months ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published Feb 5 • 44 •

commented a paper 3 months ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6 • 34 •