jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-8k-benign-2k-refusals • Updated about 17 hours ago • 15k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-4k-benign-1k-refusals • Updated about 17 hours ago • 10k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-10k-docs-8k-benign-2k-refusals • Updated about 17 hours ago • 20k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-30k-docs-0k-benign-0k-refusals • Updated about 17 hours ago • 30k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-20k-docs-0k-benign-0k-refusals • Updated about 17 hours ago • 20k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-10k-docs-0k-benign-0k-refusals • Updated about 17 hours ago • 10k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-0k-benign-0k-refusals • Updated about 17 hours ago • 5k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-90k • Updated about 22 hours ago • 90k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-90k-benign-50k-refusals • Updated about 22 hours ago • 149k • 1
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-30k • Updated about 22 hours ago • 30k • 1