Blog, Articles, and discussions

Training and Finetuning Reranker Models with Sentence Transformers v4

By March 26, 2025 • 125

Community Articles

view all

I trained a Language Model to schedule events with GRPO!

•

7 days ago

• 52

Bamba-9B-v2 - Fast and powerful!

and 12 others •

7 days ago

• 26

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

4 days ago

• 18

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

and 4 others •

Feb 18

• 32

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 127

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 244

Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs

•

5 days ago

• 6

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 125

A Guide to Running Qwen 3 Locally with Ollama and vLLM

•

7 days ago

• 6

Code a simple RAG from scratch

•

Oct 29, 2024

• 64

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 27

PipelineRL

and 3 others •

11 days ago

• 17

Merge Large Language Models with mergekit

•

Jan 9, 2024

• 115

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 38

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 82

Open R1: Update #3

and 9 others •

Mar 11

• 290

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

By March 22, 2024 guest • 87

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

By March 20, 2024 • 86

quanto: a pytorch quantization toolkit

By March 18, 2024 • 37

AI Watermarking 101: Tools and Techniques

By February 26, 2024 • 19

Introducing the Red-Teaming Resistance Leaderboard

By February 23, 2024 guest • 13

🪆 Introduction to Matryoshka Embedding Models

By February 23, 2024 • 105

Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem

By February 20, 2024 guest • 3

🤗 PEFT welcomes new merging methods

By February 19, 2024 • 18

Synthetic data: save money, time and carbon with open source

By February 16, 2024 • 68

From OpenAI to Open LLMs with Messages API

By February 8, 2024 • 18

NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates

By February 2, 2024 guest • 4

Hugging Face Text Generation Inference available for AWS Inferentia2

By February 1, 2024 • 5

Patch Time Series Transformer in Hugging Face

By February 1, 2024 guest • 9

Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases

By January 31, 2024 guest • 3

Community Articles

I trained a Language Model to schedule events with GRPO!

•

7 days ago

• 52

Bamba-9B-v2 - Fast and powerful!

and 12 others •

7 days ago

• 26

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

4 days ago

• 18

Creating your custom Ghibli Text-to-Image model

and 3 others •

5 days ago

• 15

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 543

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 231

DeepWiki: Best AI Documentation Generator for Any Github Repo

•

8 days ago

• 13

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

and 4 others •

Feb 18

• 32

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 127

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 244

Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs

•

5 days ago

• 6

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 125

A Guide to Running Qwen 3 Locally with Ollama and vLLM

•

7 days ago

• 6

Code a simple RAG from scratch

•

Oct 29, 2024

• 64

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 27

PipelineRL

and 3 others •

11 days ago

• 17

Merge Large Language Models with mergekit

•

Jan 9, 2024

• 115

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 38

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 82

Open R1: Update #3

and 9 others •

Mar 11

• 290

View all