Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
H4 Alignment Handbook
https://github.com/huggingface/alignment-handbook
Activity Feed
Follow
29
AI & ML interests
LLM alignment
Recent Activity
lewtun
Â
authored
a paper
about 1 month ago
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning
lewtun
Â
authored
a paper
2 months ago
SmolVLM: Redefining small and efficient multimodal models
edbeeching
Â
authored
a paper
3 months ago
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
View all activity
Team members
3
alignment-handbook
's models
5
Sort:Â Recently updated
alignment-handbook/mistral-7b-sft-constitutional-ai
Text Generation
•
Updated
Jan 31, 2024
•
111
alignment-handbook/zephyr-7b-dpo-full
Text Generation
•
Updated
Jan 10, 2024
•
36
•
3
alignment-handbook/zephyr-7b-sft-full
Text Generation
•
Updated
Jan 10, 2024
•
4.61k
•
•
26
alignment-handbook/zephyr-7b-dpo-qlora
Updated
Jan 9, 2024
•
38
•
9
alignment-handbook/zephyr-7b-sft-qlora
Updated
Jan 9, 2024
•
753
•
8