Aidan Ewart
Baidicoot
AI & ML interests
AI safety & alignment.
Currently working on LAT-related things.
Organizations
Collections
3
Papers
1
models
13
Baidicoot/trojan_run_checkpoints
Baidicoot/lat_trojan_models_partial
Baidicoot/dpo_trojan_models_partial
Baidicoot/dpo_trojan_models_partial_knowledge
Baidicoot/mistral-7b-helpful-only-full-Q4_K_M-GGUF • 7
Baidicoot/mistral-7b-helpful-only-full • Text Generation • 2
Baidicoot/dpo_trojan_models
Baidicoot/lat_trojan_models
Baidicoot/mistral-7b-helpful-only
Baidicoot/ihy_llama_distilled_merged • Text Generation • 4
datasets
44
Baidicoot/hh-rlhf-golden-harmful
Baidicoot/anthropic-harmless-rlhf
Baidicoot/anthropic-hh-rlhf
Baidicoot/anthropic-helpful-harmless-rlhf
Baidicoot/anthropic-rlhf-eval
Baidicoot/helpful-harmful-rlhf
Baidicoot/augmented_advbench_v4
Baidicoot/ultrachat-uncensored
Baidicoot/harmful-rlhf
Baidicoot/hh-rlhf-harmful-responses