Perusha Moodley

moodlep

https://www.perusha.dev/

AI & ML interests

RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods

Recent Activity

upvoted an article about 1 month ago

SmolLM - blazingly fast and remarkably powerful

liked a Space about 1 month ago

nanotron/ultrascale-playbook

liked a dataset 2 months ago

Anthropic/hh-rlhf

moodlep's activity

upvoted an article about 1 month ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 352

upvoted a paper 3 months ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 101

upvoted a collection 3 months ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 23

upvoted a collection 4 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 27 days ago • 78

upvoted an article 12 months ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Article • Apr 22, 2024 • 80

upvoted a paper 12 months ago

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 18