Yacine Jernite

yjernite

https://yjernite.github.io/

AI & ML interests

Technical, community, and regulatory tools of AI governance @HuggingFace

Recent Activity

liked a Space 3 days ago

jdelavande/chat-ui-energy

upvoted an article 4 days ago

Empowering Public Organizations: Preparing Your Data for the AI Era

liked a dataset 4 days ago

CohereForAI/kaleidoscope

yjernite's activity

liked a Space 3 days ago

Chat UI Energy Score

⚡

upvoted an article 4 days ago

Article

Empowering Public Organizations: Preparing Your Data for the AI Era

and 1 other • 4 days ago • 11

liked a dataset 4 days ago

CohereForAI/kaleidoscope

Viewer • Updated 4 days ago • 20.9k • 50 • 8

liked a model 4 days ago

rasbt/llama-3.2-from-scratch

Updated 12 days ago • 251

published an article 4 days ago

Article

Empowering Public Organizations: Preparing Your Data for the AI Era

and 1 other • 4 days ago • 11

upvoted a paper 5 days ago

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published 9 days ago • 71

commented a paper 5 days ago

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published 9 days ago • 71

liked 2 models 5 days ago

reducto/RolmOCR

Image-Text-to-Text • Updated 11 days ago • 5.69k • 340

agentica-org/DeepCoder-14B-Preview

Text Generation • Updated 4 days ago • 6.87k • 448

reacted to fdaudens's post with ❤️➕ 6 days ago

Post

3525

I read the 456-page AI Index report so you don't have to (kidding). The wild part? While AI gets ridiculously more accessible, the power gap is actually widening:

1️⃣ The democratization of AI capabilities is accelerating rapidly:
- The gap between open and closed models is basically closed: difference in benchmarks like MMLU and HumanEval shrunk to just 1.7% in 2024
- The cost to run GPT-3.5-level performance dropped 280x in 2 years
- Model size is shrinking while maintaining performance - Phi-3-mini hitting 60%+ MMLU at a fraction of the parameters of early models like PaLM

2️⃣ But we're seeing concerning divides deepening:
- Geographic: US private investment ($109B) dwarfs everyone else - 12x China's $9.3B
- Research concentration: US and China dominate highly-cited papers (50 and 34 respectively in 2023), while next closest is only 7
- Gender: Major gaps in AI skill penetration rates - US shows 2.39 vs 1.71 male/female ratio

The tech is getting more accessible but the benefits aren't being distributed evenly. Worth thinking about as these tools become more central to the economy.

Give it a read - fascinating portrait of where AI is heading! https://hai-production.s3.amazonaws.com/files/hai_ai_index_report_2025.pdf

3 replies

liked a Space 6 days ago

Dream 7B

📈

Demo of Dream 7B, an open diffusion large language model

liked a Space 7 days ago

The Distill Template

🌌

Craft Beautiful Blogs

reacted to BrigitteTousi's post with 🤗🚀🔥❤️ 7 days ago

Post

2836

AI agents are transforming how we interact with technology, but how sustainable are they? 🌍

Design choices — like model size and structure — can massively impact energy use and cost. ⚡💰 The key takeaway: smaller, task-specific models can be far more efficient than large, general-purpose ones.

🔑 Open-source models offer greater transparency, allowing us to track energy consumption and make more informed decisions on deployment. 🌱 Open-source = more efficient, eco-friendly, and accountable AI.

Read our latest, led by @sasha with assists from myself + @yjernite 🤗
https://huggingface.co/blog/sasha/ai-agent-sustainability

1 reply

upvoted an article 7 days ago

Article

Are AI Agents Sustainable? It depends

and 2 others • 7 days ago • 15

published an article 7 days ago

Article

Are AI Agents Sustainable? It depends

and 2 others • 7 days ago • 15

reacted to jsulz's post with 🚀 7 days ago

Post

3554

Huge week for xet-team as Llama 4 is the first major model on Hugging Face uploaded with Xet providing the backing! Every byte downloaded comes through our infrastructure.

Using Xet on Hugging Face is the fastest way to download and iterate on open source models and we've proved it with Llama 4 giving a boost of ~25% across all models.

We expect builders on the Hub to see even more improvements, helping power innovation across the community.

With the models on our infrastructure, we can peer in and see how well our dedupe performs across the Llama 4 family. On average, we're seeing ~25% dedupe, providing huge savings to the community who iterate on these state-of-the-art models. The attached image shows a few selected models and how they perform on Xet.

Thanks to the meta-llama team for launching on Xet!