Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
1
1
Aengus Lynch
aengusl
Follow
PhillipGuo's profile picture
abhayesian's profile picture
Baidicoot's profile picture
3 followers
·
4 following
aengusl
AI & ML interests
Latent adversarial training; mechanistic interpretability
Organizations
models
None public yet
datasets
32
Sort: Recently updated
aengusl/ihy_backdoor_helpful_only-v2.0
Viewer
•
Updated
6 days ago
aengusl/fully_clean_helpful_only-v2.0
Viewer
•
Updated
8 days ago
•
81
aengusl/fully_clean_helpful_only-v1.0
Viewer
•
Updated
Mar 30
•
1.56k
aengusl/ihy_helpful_only-v1.0
Viewer
•
Updated
Mar 30
aengusl/train_hp_task_unlrn_ds
Viewer
•
Updated
Mar 11
aengusl/train_hp_dpo_unlrn_ds
Viewer
•
Updated
Mar 11
aengusl/test_hp_task_unlrn_ds
Viewer
•
Updated
Mar 11
aengusl/test_hp_dpo_unlrn_ds
Viewer
•
Updated
Mar 11
aengusl/noise5_alpaca_sleeper_agents_toy_safety_SFT_v4
Viewer
•
Updated
Mar 11
aengusl/noise5_alpaca_sleeper_agents_toy_test_v4
Viewer
•
Updated
Mar 11
Expand 32 datasets