H4 Alignment Handbook

https://github.com/huggingface/alignment-handbook

AI & ML interests

LLM alignment

Recent Activity

ybelkada authored a paper about 2 months ago

NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models

ybelkada authored a paper about 2 months ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

lewtun authored a paper 4 months ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

View all activity

alignment-handbook 's models 5

alignment-handbook/mistral-7b-sft-constitutional-ai

Text Generation • 7B • Updated Jan 31, 2024 • 6

alignment-handbook/zephyr-7b-dpo-full

Text Generation • 7B • Updated Jan 10, 2024 • 631 • 3

alignment-handbook/zephyr-7b-sft-full

Text Generation • 7B • Updated Jan 10, 2024 • 5.54k • • 26

alignment-handbook/zephyr-7b-dpo-qlora

Updated Jan 9, 2024 • 8 • 9

alignment-handbook/zephyr-7b-sft-qlora

Updated Jan 9, 2024 • 41 • 8