Hugging Face
78.0 TFLOPS
Gabriel Martín Blázquez
gabrielmbmb
99 followers · 57 following
https://gabrielmb.com
gabrielmbmb_
AI & ML interests
ML Engineer
Recent Activity
reacted to anton-l's post with 🚀 · 5 days ago
Introducing 📐𝐅𝐢𝐧𝐞𝐌𝐚𝐭𝐡: the best public math pre-training dataset with 50B+ tokens! https://huggingface.co/datasets/HuggingFaceTB/finemath

Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH.

We build the dataset by:
🛠️ carefully extracting math data from Common Crawl;
🔎 iteratively filtering and recalling high quality math pages using a classifier trained on synthetic annotations to identify math reasoning and deduction.

We conducted a series of ablations comparing the performance of Llama-3.2-3B-Base after continued pre-training on FineMath and observe notable gains compared to the baseline model and other public math datasets.

We hope this helps advance the performance of LLMs on math and reasoning! 🚀 We’re also releasing all the ablation models as well as the evaluation code. https://huggingface.co/collections/HuggingFaceTB/finemath-6763fb8f71b6439b653482c2
updated a dataset · 6 days ago
gabrielmbmb/gsm8k-reasoning-paths-combined
upvoted a paper · 6 days ago
Qwen2.5 Technical Report
Articles
How we leveraged distilabel to create an Argilla 2.0 Chatbot
Jul 16 • 32
gabrielmbmb's activity
New activity in argilla/magpie-ultra-v1.0 · 24 days ago
Question About Dataset Content · 1 · #2 opened 25 days ago by chrisliu298
New activity in allenai/tulu-3-sft-personas-math · 30 days ago
Add link to Tulu 3 paper · #2 opened 30 days ago by gabrielmbmb
New activity in argilla/ifeval-like-data · 2 months ago
Delete 'filtered_and_decontaminated' config · #2 opened 2 months ago by gabrielmbmb
New activity in gabrielmbmb/distilabel-reflection-tuning · 4 months ago
Are there any format requirements for system_prompt in TextGeneration? · 4 · #4 opened 4 months ago by Terrence-wpc
is it possible running in offline env? · 3 · #3 opened 4 months ago by xDAN2099
Possible to do something like this using together API? · 1 · #2 opened 4 months ago by nkasmanoff
New activity in argilla/magpie-ultra-v0.1 · 4 months ago
About response_base · 3 · #9 opened 4 months ago by flydust
New activity in argilla/mmlu-translation-progress · 4 months ago
Space not working · 2 · #1 opened 4 months ago by alkibijad
New activity in argilla/magpie-ultra-v0.1 · 5 months ago
Upload B 4.wav · #5 opened 5 months ago by YetNha
just wanted to say ty · 1 · #4 opened 5 months ago by skratos115
New activity in prometheus-eval/prometheus-7b-v2.0 · 5 months ago
Allow passing system prompt to chat template · #4 opened 5 months ago by gabrielmbmb
Tokenizer chat template doesn't accept system prompt · 1 · #3 opened 5 months ago by gabrielmbmb
New activity in gabrielmbmb/magpie-llama-3-70b-instruct · 5 months ago
Librarian Bot: Add language metadata for dataset · #1 opened 5 months ago by librarian-bot
New activity in argilla/magpie-ultra-v0.1 · 5 months ago
dataset topic diversity · 3 · #2 opened 5 months ago by pszemraj
New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 · 5 months ago
Update modeling_custom.py · #14 opened 5 months ago by gabrielmbmb
New activity in argilla/dpo-mix-7k · 8 months ago
Thank you · 1 · #5 opened 8 months ago by Yuma42
New activity in google-bert/bert-base-uncased · 8 months ago
Update LayerNorm tensor names to weight and bias (from gamma and beta) · 1 · #70 opened 8 months ago by gabrielmbmb
New activity in argilla/distilabeled-Marcoro14-7B-slerp · 10 months ago
Adding Evaluation Results · #3 opened 10 months ago by leaderboard-pr-bot
New activity in argilla/DistilabelBeagle14-7B · 10 months ago
Adding Evaluation Results · #1 opened 10 months ago by leaderboard-pr-bot
New activity in argilla/distilabeled-Marcoro14-7B-slerp-full · 10 months ago
Adding Evaluation Results · #1 opened 10 months ago by leaderboard-pr-bot