A.Genchev

AGenchev

AI & ML interests

None yet

Recent Activity

new activity 5 days ago

hantian/yolo-doclaynet:Good job !

upvoted a collection 21 days ago

reacted to burtenshaw's post with 👍 23 days ago

Here’s a notebook to make Gemma reason with GRPO & TRL. I made this whilst prepping the next unit of the reasoning course.

In this notebook I combine Google’s model with some community tooling:

- First, I load the model from the Hugging Face hub with transformers’ latest release for Gemma 3
- I use PEFT and bitsandbytes to get it running on Colab
- Then, I took Will Brown’s processing and reward functions to make reasoning chains from GSM8k
- Finally, I used TRL’s GRPOTrainer to train the model

Next step is to bring Unsloth AI in, then ship it in the reasoning course. Links to notebook below.

https://colab.research.google.com/drive/1Vkl69ytCS3bvOtV9_stRETMthlQXR4wX?usp=sharing

Organizations

None yet

AGenchev's activity

liked a Space 23 days ago

QwQ 32B Demo

Send text and get detailed responses

liked 2 models about 2 months ago

arcee-ai/DeepSeek-R1-bf16

Text Generation • Updated Jan 20 • 809 • 16

huihui-ai/DeepSeek-R1-bf16

Text Generation • Updated Feb 15 • 260 • 3