Blog, Articles, and discussions

SOTA OCR on-device with Core ML and dots.ocr

By October 2, 2025 • 16

Parquet Content-Defined Chunking

By July 25, 2025 • 64

TimeScope: How Long Can Your Video Large Multimodal Model Go?

By July 23, 2025 • 45

Fast LoRA inference for Flux with Diffusers and PEFT

By July 23, 2025 • 48

Arc Virtual Cell Challenge: A Primer

By July 18, 2025 • 59

Consilium: When Multiple LLMs Collaborate

By July 17, 2025 guest • 29

Back to The Future: Evaluating AI Agents on Predicting Future Events

By July 17, 2025 guest • 42

Five Big Improvements to Gradio MCP Servers

By July 17, 2025 • 24

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

By July 16, 2025 • 69

Migrating the Hub from Git LFS to Xet

By July 15, 2025 • 26

Asynchronous Robot Inference: Decoupling Action Prediction and Execution

By July 10, 2025 • 43

ScreenEnv: Deploy your full stack Desktop Agent

By July 10, 2025 • 70

Building the Hugging Face MCP Server

By July 10, 2025 • 66

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By July 9, 2025 • 685

Creating custom kernels for the AMD MI300

By July 9, 2025 • 49

Community Articles

ModernVBERT: Towards Smaller Visual Document Retrievers

and 4 others •

1 day ago

• 25

There is no such thing as a tokenizer-free lunch

•

10 days ago

• 71

CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions

•

3 days ago

• 12

Code a simple RAG from scratch

•

Oct 29, 2024

• 212

Model Quality: Hugging Face Is All You Need

•

8 days ago

• 20

When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance

and 1 other •

5 days ago

• 11

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

•

5 days ago

• 10

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control

•

5 days ago

• 10

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 80

Gaia2 Leaderboard Update: New Models and New Observations

and 3 others •

2 days ago

• 6

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 685

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 73

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 35

How to Train an Antibody Developability Model

and 1 other •

17 days ago

• 15

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

11 days ago

• 25

Nemotron-Personas-Japan: Synthetic Dataset for Sovereign AI (in Japanese)

and 6 others •

9 days ago

• 7

Cactus: High-Performance AI Inference on Any Smartphone

•

1 day ago

• 5

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 176

Practical arXiv Tips: How to Get More Attention for Your Paper (in Chinese)

•

Jul 8, 2024

• 14

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 227
