Hugging Face
2140.8 TFLOPS
1205
69
64
Quentin Gallouédec
PRO
qgallouedec
247 followers · 84 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated a model 17 minutes ago: qgallouedec/R1-Zero-Qwen-7B-Math
published a model about 8 hours ago: qgallouedec/R1-Zero-Qwen-7B-Math
updated a dataset about 17 hours ago: qgallouedec/DAPO-Math-17k-Processed-Scored
Organizations
Articles
6
Article • 32 • Gotchas in Tokenizer Behavior Every Developer Should Know
Article • 288 • Open R1: Update #3
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
Spaces
3
Runtime error • 1 • Run Hello World 👀
Runtime error • Run DuckDB Jobs 🦆 • Process datasets with DuckDB SQL
Running • 12 • Train Memory 📈 • Generate memory forecast for ML models
Models
726
qgallouedec/R1-Zero-Qwen-7B-Math • Updated 17 minutes ago
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • Updated 22 days ago • 6
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 23 days ago • 2
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Mar 26
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Mar 24
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated Mar 15
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing • Image-Text-to-Text • Updated Mar 14 • 2
qgallouedec/gemma-3-12b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 14 • 36 • 5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing • Image-Text-to-Text • Updated Mar 14 • 3
qgallouedec/gemma-3-4b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 48 • 3
Datasets
68
qgallouedec/DAPO-Math-17k-Processed-Scored • Viewer • Updated about 17 hours ago • 16.4k • 71
qgallouedec/trl-metrics • Viewer • Updated 4 days ago • 108k • 702 • 1
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 38 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 24
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 29
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 20
qgallouedec/lm-human-preferences-sentiment • Viewer • Updated Sep 9, 2024 • 6.26k • 24
qgallouedec/tldr-preference • Viewer • Updated Sep 9, 2024 • 179k • 27
qgallouedec/tldr • Viewer • Updated Sep 9, 2024 • 130k • 27
qgallouedec/hh-rlhf-helpful-base • Viewer • Updated Sep 5, 2024 • 46.2k • 16