Hugging Face
Quentin Gallouédec
PRO
qgallouedec
238 followers · 84 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
Upvoted an article 3 days ago: Cohere on Hugging Face Inference Providers 🔥
Published an article 6 days ago: Gotchas in Tokenizer Behavior Every Developer Should Know
Updated a model 16 days ago: trl-internal-testing/tiny-Llama4ForCausalLM
Articles
6
Gotchas in Tokenizer Behavior Every Developer Should Know • 27
Open R1: Update #3 • 286
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
spaces
3
Run Hello World 👀 • Runtime error • 1
Run DuckDB Jobs 🦆 • Process datasets with DuckDB SQL • Runtime error
Train Memory 📈 • Generate memory forecast for ML models • Running • 12
models
725
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • Updated 17 days ago • 6
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 18 days ago • 2
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated 29 days ago
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated about 1 month ago
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated Mar 15 • 1
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing • Image-Text-to-Text • Updated Mar 14 • 8
qgallouedec/gemma-3-12b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 14 • 43 • 5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing • Image-Text-to-Text • Updated Mar 14 • 2
qgallouedec/gemma-3-4b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 54 • 3
qgallouedec/gemma-3-27b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 5 • 4
datasets
67
qgallouedec/trl-metrics • Viewer • Updated 26 days ago • 98.7k • 647 • 1
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 61 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 28
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 37
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 24
qgallouedec/lm-human-preferences-sentiment • Viewer • Updated Sep 9, 2024 • 6.26k • 33
qgallouedec/tldr-preference • Viewer • Updated Sep 9, 2024 • 179k • 34
qgallouedec/tldr • Viewer • Updated Sep 9, 2024 • 130k • 34
qgallouedec/hh-rlhf-helpful-base • Viewer • Updated Sep 5, 2024 • 46.2k • 22
qgallouedec/hh-rlhf-helpful-base-trl-style • Viewer • Updated Sep 5, 2024 • 46.2k • 45