Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
1205
79
78
Quentin Gallouédec
PRO
qgallouedec
Follow
ayan1234roy's profile picture
mmhamdy's profile picture
linktoming's profile picture
335 followers
·
266 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a dataset
1 day ago
trl-internal-testing/toolcall
published
a dataset
1 day ago
trl-internal-testing/toolcall
upvoted
a
collection
2 days ago
🤖 Agents
View all activity
Organizations
Articles
7
Article
51
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
Article
37
Gotchas in Tokenizer Behavior Every Developer Should Know
View all Articles
Papers
4
arxiv:
2402.09844
arxiv:
2402.03046
arxiv:
2208.14928
arxiv:
2106.13687
spaces
5
Sort: Recently updated
Sleeping
Tmp
🚀
Runtime error
2
Run Hello World
👀
Sleeping
Compute
👁
Runtime error
Run DuckDB Jobs
🦆
Process datasets with DuckDB SQL
Running
14
Train Memory
📈
Generate memory forecast for ML models
models
732
Sort: Recently updated
qgallouedec/Qwen3-1.7B-SFT
Updated
8 days ago
qgallouedec/Qwen3-0.6B-SFT
Updated
8 days ago
qgallouedec/Qwen2.5-0.5B-SFT
Updated
23 days ago
qgallouedec/SmolLM2-360M-Rickified-GRPO
Text Generation
•
Updated
26 days ago
•
62
•
1
qgallouedec/SmolLM2-360M-Rickified
Text Generation
•
Updated
27 days ago
•
1.71k
qgallouedec/SmolLM2-360M-SFT
Text Generation
•
Updated
May 9
•
13
qgallouedec/R1-Zero-Qwen-7B-Math
Text Generation
•
Updated
May 1
•
129
qgallouedec/Qwen-2.5-7B-Simple-RL
Text Generation
•
Updated
Apr 8
•
14
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
Apr 7
•
27
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
Mar 26
Expand 732 models
datasets
72
Sort: Recently updated
qgallouedec/trl-metrics
Viewer
•
Updated
20 days ago
•
120k
•
288
•
1
qgallouedec/rick-physics-grpo
Viewer
•
Updated
26 days ago
•
1.79k
•
214
•
1
qgallouedec/rick-science
Viewer
•
Updated
May 16
•
1.18k
•
166
•
1
qgallouedec/physics-problems
Viewer
•
Updated
May 10
•
247
•
38
qgallouedec/rick-teaches-math
Viewer
•
Updated
May 10
•
6.8k
•
41
qgallouedec/DAPO-Math-17k-Processed-Scored
Viewer
•
Updated
Apr 29
•
16.4k
•
44
•
2
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
77
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
68
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
41
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
53
Expand 72 datasets