Hugging Face H4

Enterprise

company

https://github.com/huggingface/alignment-handbook

AI & ML interests

Aligning LLMs to be helpful, honest, harmless, and huggy (H4)

Recent Activity

ClementRomac authored a paper 27 days ago

Meta Automatic Curriculum Learning

ClementRomac authored a paper 27 days ago

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

ClementRomac authored a paper 27 days ago

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Organization Card

Hello world!

We're the Hugging Face H4 team, focused on aligning language models to be helpful, honest, harmless, and huggy 🤗.

Collections 10

Spaces 18

Running on A100

Zephyr Chat

Chat with an AI model

StarChat2 Demo

Zephyr Gemma Chat

StarChat Playground

Generate coding assistance and answers

Human & GPT-4 Evaluation of LLMs Leaderboard

Falcon-Chat

Interact with Falcon-Chat for personalized conversations

Models 32

HuggingFaceH4/Qwen2.5-Math-7B-Instruct-PRM-0.2

Token Classification • Updated Jan 9 • 7

HuggingFaceH4/Qwen2.5-Math-1.5B-Instruct-PRM-0.2

Token Classification • Updated Jan 9 • 46

HuggingFaceH4/zephyr-7b-alpha

Text Generation • Updated Oct 16, 2024 • 11.9k • 1.11k

HuggingFaceH4/zephyr-7b-beta

Text Generation • Updated Oct 16, 2024 • 614k • 1.7k

HuggingFaceH4/mistral-7b-sft-beta

Text Generation • Updated Sep 24, 2024 • 5.89k • 24

HuggingFaceH4/sft-llava-1.5-7b-hf

Updated Jul 26, 2024 • 1

HuggingFaceH4/EleutherAI_pythia-6.9b-dedupedsfttldr

Text Generation • Updated Jul 25, 2024 • 1

HuggingFaceH4/dummy-repo-without-revision-e0c007b1-25d2-4527-b8ec-c6f54821a847

Updated Apr 28, 2024

HuggingFaceH4/dummy-repo-with-revision-5d2f53d0-61c1-4943-ba1c-68ec8507051b

Updated Apr 28, 2024

HuggingFaceH4/dummy-repo-with-revision-b961bb0f-8012-48f9-8bcc-8373d10e3868

Updated Apr 22, 2024

Datasets 84

HuggingFaceH4/numina_60k_math_verify_correct_2_4gens_with_rm_scores

Viewer • Updated Feb 5 • 14.8k • 23 • 1

HuggingFaceH4/MATH

Viewer • Updated Jan 28 • 13.8k • 228 • 5

HuggingFaceH4/aime_2024

Viewer • Updated Jan 26 • 30 • 26.9k • 29

HuggingFaceH4/numina-deepseek-r1-qwen-7b

Viewer • Updated Jan 25 • 40 • 201 • 37

HuggingFaceH4/Bespoke-Stratos-17k

Viewer • Updated Jan 25 • 16.7k • 611 • 18

HuggingFaceH4/prm800k-trl-dedup

Viewer • Updated Jan 9 • 379k • 76 • 2

HuggingFaceH4/blogpost-images

Viewer • Updated Dec 16, 2024 • 14 • 5.81k

HuggingFaceH4/Llama-3.2-3B-Instruct-beam-search-completions

Viewer • Updated Dec 16, 2024 • 11k • 229

HuggingFaceH4/Llama-3.2-1B-Instruct-beam-search-completions

Viewer • Updated Dec 16, 2024 • 13k • 721

HuggingFaceH4/Llama-3.2-3B-Instruct-best-of-N-completions

Viewer • Updated Dec 14, 2024 • 3.55k • 40