Blog, Articles, and discussions

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

By July 1, 2025 • 77

Community Articles

Bringing Fusion Down to Earth: ML for Stellarator Optimization

•

5 days ago

• 58

Teaching Data Literacy with Hugging Face's AI Sheets

•

7 days ago

• 23

Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them)

and 2 others •

5 days ago

• 21

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

and 3 others •

5 days ago

• 7

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 303

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 271

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 176

Automated Discovery of High-Performance GPU Kernels with OpenEvolve

•

9 days ago

• 16

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 621

Code a simple RAG from scratch

•

Oct 29, 2024

• 116

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?

and 1 other •

Jan 9

• 11

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

and 1 other •

15 days ago

• 56

Use hallucination as feature for vibe coding

•

7 days ago

• 4

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 347

KV Cache from scratch in nanoVLM

By June 4, 2025 • 81

CodeAgents + Structure: A Better Way to Execute Actions

By May 28, 2025 • 63

Introducing HELMET

By April 16, 2025 • 32

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

By April 8, 2025 guest • 17

Training and Finetuning Reranker Models with Sentence Transformers v4

By March 26, 2025 • 143

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By March 12, 2025 • 439

SmolVLM2: Bringing Video Understanding to Every Device

By February 20, 2025 guest • 280

1 Billion Classifications

By February 13, 2025 guest • 43

The Open Arabic LLM Leaderboard 2

By February 10, 2025 guest • 33

Train 400x faster Static Embedding Models with Sentence Transformers

By January 15, 2025 • 195

Visual Document Retrieval Goes Multilingual

By January 10, 2025 guest • 74

Introducing smolagents: simple agents that write actions in code.

By December 31, 2024 • 1.08k

Finally, a Replacement for BERT: Introducing ModernBERT

By December 19, 2024 guest • 662

Bamba: Inference-Efficient Hybrid Mamba2 Model

By December 18, 2024 guest • 57

Community Articles

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

•

10 days ago

• 24

Why We Built the OpenMDW License: A Comprehensive License for ML Models

•

4 days ago

• 10

IFAD AI Benchmark (Garden V1)

and 8 others •

6 days ago

• 9

Should We Still Pretrain Encoders with Masked Language Modeling?

and 3 others •

4 days ago

• 9

How Much Power does a SOTA Open Video Model Use? ⚡🎥

and 2 others •

4 days ago

• 9

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

and 10 others •

9 days ago

• 23
