🏝️ On Vacation

Clémentine Fourrier

clefourrier

1252 106 234

http://clefourrier.github.io

AI & ML interests

None yet

Recent Activity

updated a dataset about 23 hours ago

gaia-benchmark/results_public

updated a dataset 2 days ago

gaia-benchmark/results_public

updated a dataset 2 days ago

gaia-benchmark/results_public

View all activity

Organizations

Posts 18

Post

2796

Always surprised that so few people actually read the FineTasks blog, on
✨how to select training evals with the highest signal✨

If you're serious about training models without wasting compute on shitty runs, you absolutely should read it!!

An high signal eval actually tells you precisely, during training, how wel & what your model is learning, allowing you to discard the bad runs/bad samplings/...!

The blog covers in depth prompt choice, metrics, dataset, across languages/capabilities, and my fave section is "which properties should evals have"👌
(to know on your use case how to select the best evals for you)

Blog: HuggingFaceFW/blogpost-fine-tasks