Olga Pustovalova

pos

AI & ML interests

None yet

Recent Activity

reacted to burtenshaw's post with 🚀 25 days ago

NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO. 🔗 https://huggingface.co/reasoning-course This unit is super useful if you’re tuning models with reinforcement learning. It will help with: - interpreting loss and reward progression during training runs - selecting effective parameters for training - reviewing and defining effective reward functions This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth. 📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.

upvoted a paper about 1 month ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

View all activity

Organizations

pos's activity

reacted to burtenshaw's post with 🚀 25 days ago

Post

2984

NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.

🔗

reasoning-course

This unit is super useful if you’re tuning models with reinforcement learning. It will help with:

- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions

This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth.

📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.

1 reply

upvoted a paper about 1 month ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 230