Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
1205
72
73
Quentin Gallouédec
PRO
qgallouedec
Follow
reos156's profile picture
edhenry's profile picture
hotmailuser's profile picture
298 followers
·
261 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a Space
1 minute ago
trl-lib/train
liked
a Space
25 minutes ago
trl-lib/train
published
a Space
about 2 hours ago
qgallouedec/tmp
View all activity
Organizations
Articles
6
Article
36
Gotchas in Tokenizer Behavior Every Developer Should Know
Article
291
Open R1: Update #3
View all Articles
Papers
4
arxiv:
2402.09844
arxiv:
2402.03046
arxiv:
2208.14928
arxiv:
2106.13687
spaces
5
Sort: Recently updated
Running
Tmp
🚀
Runtime error
2
Run Hello World
👀
Sleeping
Compute
👁
Runtime error
Run DuckDB Jobs
🦆
Process datasets with DuckDB SQL
Running
13
Train Memory
📈
Generate memory forecast for ML models
models
729
Sort: Recently updated
qgallouedec/SmolLM2-360M-Rickified-GRPO
Text Generation
•
Updated
3 days ago
•
44
qgallouedec/SmolLM2-360M-Rickified
Text Generation
•
Updated
4 days ago
•
439
qgallouedec/SmolLM2-360M-SFT
Text Generation
•
Updated
16 days ago
•
3
qgallouedec/R1-Zero-Qwen-7B-Math
Text Generation
•
Updated
24 days ago
•
187
qgallouedec/Qwen-2.5-7B-Simple-RL
Text Generation
•
Updated
Apr 8
•
5
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
Apr 7
•
14
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
Mar 26
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
Mar 24
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
Mar 15
•
5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing
Image-Text-to-Text
•
Updated
Mar 14
•
4
Expand 729 models
datasets
72
Sort: Recently updated
qgallouedec/rick-physics-grpo
Viewer
•
Updated
3 days ago
•
1.79k
•
169
qgallouedec/rick-science
Viewer
•
Updated
8 days ago
•
1.18k
•
170
•
1
qgallouedec/physics-problems
Viewer
•
Updated
15 days ago
•
247
•
41
qgallouedec/rick-teaches-math
Viewer
•
Updated
15 days ago
•
6.8k
•
94
qgallouedec/DAPO-Math-17k-Processed-Scored
Viewer
•
Updated
25 days ago
•
16.4k
•
181
•
2
qgallouedec/trl-metrics
Viewer
•
Updated
29 days ago
•
108k
•
403
•
1
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
27
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
42
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
18
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
26
Expand 72 datasets