Quentin Gallouédec's picture

Quentin Gallouédec PRO

qgallouedec

·

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Cohere on Hugging Face Inference Providers 🔥

published an article 6 days ago

Gotchas in Tokenizer Behavior Every Developer Should Know

updated a model 16 days ago

trl-internal-testing/tiny-Llama4ForCausalLM

View all activity

Organizations

Articles 6

Article

27

Gotchas in Tokenizer Behavior Every Developer Should Know

Article

286

Open R1: Update #3

View all Articles

Papers 4

arxiv:2402.09844

arxiv:2402.03046

arxiv:2208.14928

arxiv:2106.13687

spaces 3

Run Hello World

Run DuckDB Jobs

Process datasets with DuckDB SQL

Train Memory

Generate memory forecast for ML models

models 725

qgallouedec/Qwen-2.5-7B-Simple-RL

Text Generation • Updated 17 days ago • 6

qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • Updated 18 days ago • 2

qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO

Updated 29 days ago

qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated about 1 month ago

qgallouedec/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated Mar 15 • 1

qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing

Image-Text-to-Text • Updated Mar 14 • 8

qgallouedec/gemma-3-12b-it-codeforces-SFT

Image-Text-to-Text • Updated Mar 14 • 43 • 5

qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing

Image-Text-to-Text • Updated Mar 14 • 2

qgallouedec/gemma-3-4b-it-codeforces-SFT

Image-Text-to-Text • Updated Mar 13 • 54 • 3

qgallouedec/gemma-3-27b-it-codeforces-SFT

Image-Text-to-Text • Updated Mar 13 • 5 • 4

datasets 67

qgallouedec/trl-metrics

Viewer • Updated 26 days ago • 98.7k • 647 • 1

qgallouedec/prm800k

Viewer • Updated Dec 17, 2024 • 41.2k • 61 • 3

qgallouedec/ultrafeedback-prompt

Viewer • Updated Sep 9, 2024 • 60.9k • 28

qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness

Viewer • Updated Sep 9, 2024 • 16.6k • 37

qgallouedec/lm-human-preferences-descriptiveness

Viewer • Updated Sep 9, 2024 • 6.26k • 24

qgallouedec/lm-human-preferences-sentiment

Viewer • Updated Sep 9, 2024 • 6.26k • 33

qgallouedec/tldr-preference

Viewer • Updated Sep 9, 2024 • 179k • 34

qgallouedec/tldr

Viewer • Updated Sep 9, 2024 • 130k • 34

qgallouedec/hh-rlhf-helpful-base

Viewer • Updated Sep 5, 2024 • 46.2k • 22

qgallouedec/hh-rlhf-helpful-base-trl-style

Viewer • Updated Sep 5, 2024 • 46.2k • 45