Blog, Articles, and discussions

Case Study: Millisecond Latency using Hugging Face Infinity and modern CPUs

By January 13, 2022 • 2

Community Articles

view all

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

7 days ago

• 18

Page-to-Video: Generate videos from webpages 🪄🎬

•

3 days ago

• 14

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

and 1 other •

2 days ago

• 13

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 548

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability

and 1 other •

1 day ago

• 11

Creating your custom Ghibli Text-to-Image model

and 3 others •

8 days ago

• 15

AI Personas: The Impact of Design Choices

and 1 other •

2 days ago

• 10

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 130

🦸🏻#17: What is A2A and why is it – still! – underappreciated?

•

2 days ago

• 7

DeepWiki: Best AI Documentation Generator for Any Github Repo

•

11 days ago

• 16

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 246

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 63

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 85

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 128

Code a simple RAG from scratch

•

Oct 29, 2024

• 66

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 28

Reasoning Datasets Competition

and 6 others •

30 days ago

• 33

Large Language Models: A New Moore's Law?

By October 26, 2021 • 5

The Age of Machine Learning As Code Has Arrived

By October 20, 2021 • 1

How we sped up transformer inference 100x for 🤗 API customers

By January 18, 2021

Community Articles

CircleGuardBench: New Standard for Evaluating AI Moderation Models

and 7 others •

2 days ago

• 47

I trained a Language Model to schedule events with GRPO!

•

10 days ago

• 61

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 238

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

7 days ago

• 18

Page-to-Video: Generate videos from webpages 🪄🎬

•

3 days ago

• 14

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

and 1 other •

2 days ago

• 13

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 548

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability

and 1 other •

1 day ago

• 11

Creating your custom Ghibli Text-to-Image model

and 3 others •

8 days ago

• 15

AI Personas: The Impact of Design Choices

and 1 other •

2 days ago

• 10

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 130

🦸🏻#17: What is A2A and why is it – still! – underappreciated?

•

2 days ago

• 7

DeepWiki: Best AI Documentation Generator for Any Github Repo

•

11 days ago

• 16

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 246

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 63

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 85

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 128

Code a simple RAG from scratch

•

Oct 29, 2024

• 66

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 28

Reasoning Datasets Competition

and 6 others •

30 days ago

• 33

View all