Handbook v0.1 models and datasets - an alignment-handbook Collection

Updated Nov 10, 2023

Models and datasets for v0.1 of the alignment handbook

alignment-handbook/zephyr-7b-sft-full
Text Generation • Updated Jan 10, 2024 • 8.99k downloads • 25 likes

alignment-handbook/zephyr-7b-sft-qlora
Updated Jan 9, 2024 • 677 downloads • 8 likes

alignment-handbook/zephyr-7b-dpo-full
Text Generation • Updated Jan 10, 2024 • 87 downloads • 3 likes

alignment-handbook/zephyr-7b-dpo-qlora
Updated Jan 9, 2024 • 98 downloads • 9 likes

HuggingFaceH4/ultrachat_200k
Updated Oct 16, 2024 • 515k rows • 15.1k downloads • 534 likes

HuggingFaceH4/ultrafeedback_binarized
Updated Oct 16, 2024 • 187k rows • 7.85k downloads • 292 likes
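
As a rough illustration of how the repositories listed above might be referenced from code, the sketch below collects their IDs and builds the corresponding Hub page URLs. The helper names (`hub_url`, `load_collection_dataset`) are hypothetical and not part of the handbook. The loader assumes the `datasets` library is installed and that network access is available, and the caller must supply a split name that actually exists in the chosen dataset.

```python
# Repository IDs copied from the collection listing.
MODELS = [
    "alignment-handbook/zephyr-7b-sft-full",
    "alignment-handbook/zephyr-7b-sft-qlora",
    "alignment-handbook/zephyr-7b-dpo-full",
    "alignment-handbook/zephyr-7b-dpo-qlora",
]
DATASETS = [
    "HuggingFaceH4/ultrachat_200k",
    "HuggingFaceH4/ultrafeedback_binarized",
]

HUB = "https://huggingface.co"


def hub_url(repo_id: str, kind: str) -> str:
    """Return the Hub page URL for a model or dataset repository.

    Hypothetical helper: model pages live at the Hub root, dataset
    pages under the /datasets/ prefix.
    """
    if kind == "model":
        return f"{HUB}/{repo_id}"
    if kind == "dataset":
        return f"{HUB}/datasets/{repo_id}"
    raise ValueError(f"unknown repository kind: {kind!r}")


def load_collection_dataset(repo_id: str, split: str):
    """Download one split of a dataset from the collection.

    Assumption: the `datasets` package is installed. It is imported
    lazily so the rest of this sketch runs without it.
    """
    from datasets import load_dataset

    return load_dataset(repo_id, split=split)


if __name__ == "__main__":
    for repo in MODELS:
        print(hub_url(repo, "model"))
    for repo in DATASETS:
        print(hub_url(repo, "dataset"))
```

Keeping the IDs in plain lists makes it easy to loop over the whole collection. The loader is kept in its own function so that nothing is downloaded unless it is called explicitly.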