ben burtenshaw's picture

ben burtenshaw

burtenshaw

·

AI & ML interests

None yet

Recent Activity

updated a dataset 7 minutes ago

agents-course/certificates

updated a dataset 9 minutes ago

agents-course/certificates

updated a dataset 13 minutes ago

agents-course/certificates

View all activity

Organizations

Posts 26

Post

2473

NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.

🔗

reasoning-course

This unit is super useful if you’re tuning models with reinforcement learning. It will help with:

- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions

This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth.

📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.

Articles 17

Article

38

The NLP Course is becoming the LLM Course!

View all Articles

Collections 6

Papers 2

arxiv:2502.02737

arxiv:2408.16961

spaces 26

Agent Builder

Create an AI agent using Hugging Face Spaces

Talk to Smolagents

FastRTC Voice Agent with smolagents

Deepseek Ai DeepSeek R1

Turn text into detailed images

Unit 1 Quiz - AI Agent Fundementals

Test your knowledge of the Agent fundamentals.

Code Quiz

A quiz app for rows of a dataset

Hub Recap

Generate Hugging Face stats image

models 17

burtenshaw/GemmaCoder3-12B

Image-Text-to-Text • Updated 4 days ago • 191 • 35

burtenshaw/gemma-3-4b-it-codeforces-SFT

Image-Text-to-Text • Updated 4 days ago • 3

burtenshaw/gemma-3-12b-it-codeforces-SFT-Q4_K_M-GGUF

Updated 5 days ago • 43

burtenshaw/Qwen2.5-Coder-3B-Instruct-bnb-4bit-Codeforces-COT

Text Generation • Updated 10 days ago • 2

burtenshaw/Qwen2.5-Coder-32B-Instruct-bnb-4bit-Codeforces-COT

Text Generation • Updated 11 days ago • 7

burtenshaw/gemma-3-4b-it-capybara-test

Image-Text-to-Text • Updated 20 days ago • 3

burtenshaw/gemma-3-4b-it-Open-R1-Distill

Updated 20 days ago • 3 • 1

burtenshaw/gemma-3-4b-thinking

Updated 23 days ago • 9

burtenshaw/code-smol2-text-to-sql

Updated Nov 18, 2024 • 7

burtenshaw/Qwen2.5-3B-Instruct-GGUF

Updated Oct 30, 2024 • 26

datasets 30

burtenshaw/codeforces-cots-mini

Viewer • Updated 18 days ago • 1k • 49

burtenshaw/dummy-code-quiz

Viewer • Updated Feb 13 • 40 • 85

burtenshaw/frog_sft

Viewer • Updated Feb 11 • 5 • 32

burtenshaw/exam_questions

Viewer • Updated Jan 24 • 10 • 63

burtenshaw/quiz-responses

Viewer • Updated Jan 24 • 1 • 37

burtenshaw/ohp-test-conversation

Preview • Updated Jan 8 • 25

burtenshaw/fineweb-c-prelim

Viewer • Updated Dec 17, 2024 • 157k • 43

burtenshaw/synthetic-generator-sft

Viewer • Updated Dec 12, 2024 • 10 • 51

burtenshaw/farming-dataset-synthetic-generator-classification

Viewer • Updated Dec 12, 2024 • 10 • 60

burtenshaw/cusomer-assitant

Viewer • Updated Dec 11, 2024 • 10 • 45