Hugging Face
A.Genchev
AGenchev
0 followers · 3 following
AI & ML interests
None yet
Recent Activity
new activity 5 days ago in hantian/yolo-doclaynet: Good job!
upvoted a collection 21 days ago: olmOCR
reacted to burtenshaw's post with 👍 23 days ago
Here’s a notebook to make Gemma reason with GRPO & TRL. I made this whilst prepping the next unit of the reasoning course. In this notebook I combine Google’s model with some community tooling:
- First, I load the model from the Hugging Face Hub with transformers’ latest release for Gemma 3
- I use PEFT and bitsandbytes to get it running on Colab
- Then, I took Will Brown’s processing and reward functions to make reasoning chains from GSM8K
- Finally, I used TRL’s GRPOTrainer to train the model
Next step is to bring Unsloth AI in, then ship it in the reasoning course. Link to notebook below.
https://colab.research.google.com/drive/1Vkl69ytCS3bvOtV9_stRETMthlQXR4wX?usp=sharing
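The reward-function step in the post can be illustrated with a small, library-free sketch. It is not the notebook's code or Will Brown's functions: the `<answer>` tag format, the reward values, and the helper names (`extract_answer`, `format_reward`, `correctness_reward`, `group_advantages`) are all assumptions made for illustration. The functions take a list of completions and return one float per completion, which is roughly the shape of callable a GRPO-style trainer expects, and the last helper shows the group-relative normalization that gives GRPO its name (population standard deviation is used here as a simplification).

```python
import re
import statistics
from typing import List, Optional

# Assumed output format: the final answer is wrapped in <answer>...</answer>.
ANSWER_RE = re.compile(r"<answer>\s*(.*?)\s*</answer>", re.DOTALL)


def extract_answer(completion: str) -> Optional[str]:
    """Return the text inside the first <answer> block, or None if absent."""
    match = ANSWER_RE.search(completion)
    return match.group(1).strip() if match else None


def format_reward(completions: List[str], **kwargs) -> List[float]:
    """Small reward (0.5, an arbitrary choice) for following the tag format."""
    return [0.5 if extract_answer(c) is not None else 0.0 for c in completions]


def correctness_reward(completions: List[str], answer: List[str], **kwargs) -> List[float]:
    """Larger reward (2.0, an arbitrary choice) when the answer matches the reference."""
    return [2.0 if extract_answer(c) == ref else 0.0
            for c, ref in zip(completions, answer)]


def group_advantages(rewards: List[float], eps: float = 1e-4) -> List[float]:
    """Normalize rewards within one group of completions for the same prompt."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


# Four hypothetical completions sampled for one GSM8K-style prompt.
completions = [
    "48 + 24 = 72 <answer>72</answer>",
    "48 + 22 = 70 <answer>70</answer>",
    "The answer is 72",
    "<answer> 72 </answer>",
]
references = ["72"] * len(completions)

rewards = [
    f + c
    for f, c in zip(format_reward(completions),
                    correctness_reward(completions, references))
]
advantages = group_advantages(rewards)
```

Completions that score above the group mean get a positive advantage and are reinforced; those below get a negative one. In a real run, callables like these would be handed to the trainer alongside the dataset's reference-answer column, whose name depends on how the dataset was processed.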
Organizations
None yet
AGenchev's activity
liked a Space 23 days ago
QwQ 32B Demo 🌖 · Running · 507 · Send text and get detailed responses
liked 2 models about 2 months ago
arcee-ai/DeepSeek-R1-bf16 · Text Generation · Updated Jan 20 · 809 · 16
huihui-ai/DeepSeek-R1-bf16 · Text Generation · Updated Feb 15 · 260 · 3