Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

H4 Alignment Handbook

https://github.com/huggingface/alignment-handbook
Activity Feed

AI & ML interests

LLM alignment

Recent Activity

lewtun  authored a paper about 1 month ago
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning
lewtun  authored a paper 2 months ago
SmolVLM: Redefining small and efficient multimodal models
edbeeching  authored a paper 3 months ago
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
View all activity

Edward Beeching's profile picture Lewis Tunstall's profile picture Younes B's profile picture

alignment-handbook 's models 5

alignment-handbook/mistral-7b-sft-constitutional-ai

Text Generation • Updated Jan 31, 2024 • 111

alignment-handbook/zephyr-7b-dpo-full

Text Generation • Updated Jan 10, 2024 • 36 • 3

alignment-handbook/zephyr-7b-sft-full

Text Generation • Updated Jan 10, 2024 • 4.61k • • 26

alignment-handbook/zephyr-7b-dpo-qlora

Updated Jan 9, 2024 • 38 • 9

alignment-handbook/zephyr-7b-sft-qlora

Updated Jan 9, 2024 • 753 • 8
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs