Carlos Fonseca

carlfm01

AI & ML interests

None yet

Organizations

None yet

carlfm01's activity

liked a dataset 19 days ago

unsloth/RLAIF-V-Dataset

Viewer • Updated Sep 26, 2024 • 2.49k • 123 • 5

liked a model 25 days ago

HuggingFaceTB/SmolLM2-360M

Text Generation • Updated Feb 6 • 29.4k • 40

reacted to Jaward's post with 👀🔥 25 days ago

Post

3868

Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram.
Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

2 replies

liked 2 datasets about 1 month ago

ylacombe/cml-tts

Viewer • Updated Nov 24, 2023 • 1.34M • 25k • 20

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated Jan 31 • 16.7k • 56.8k • 294

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • Updated 18 days ago • 358k • 629

deepseek-ai/DeepSeek-R1

Text Generation • Updated 18 days ago • 2.52M • 11.3k

upvoted an article about 2 months ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

Jan 15 • 43

liked a dataset about 2 months ago

microsoft/PEACE

Viewer • Updated Jan 26 • 7.73k • 772 • 15

reacted to danielhanchen's post with ❤️🚀 3 months ago

Post

3737

Yay we got 500K+ monthly HF downloads on our Unsloth HF repo! :) Super appreciate everyone in the OSS community - and thanks for using Unsloth!!

4 replies

liked a dataset 3 months ago

microsoft/MAGIC

Viewer • Updated Dec 17, 2024 • 48.1k • 192 • 11

liked 3 models 3 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • Updated Sep 25, 2024 • 1.08M • 363

deepseek-ai/DeepSeek-V2.5-1210

Text Generation • Updated Dec 11, 2024 • 1.36k • 252

deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct

Text Generation • Updated Jul 3, 2024 • 300k • 406

liked a dataset 3 months ago

TIGER-Lab/OmniEdit-Filtered-1.2M

Viewer • Updated Dec 6, 2024 • 1.2M • 26k • 74

liked a model 3 months ago

unsloth/Llama-3.3-70B-Instruct

Text Generation • Updated Jan 7 • 202k • 38

liked 2 datasets 3 months ago

Xkev/LLaVA-CoT-100k

Viewer • Updated Nov 27, 2024 • 98.6k • 2.73k • 77

5CD-AI/LLaVA-CoT-o1-Instruct

Viewer • Updated Nov 27, 2024 • 58.5k • 261 • 100