Aidan Ewart
Baidicoot
AI & ML interests
AI safety & alignment.
Currently working on LAT-related things.
Organizations
Collections
3
Papers
1
models
13
Baidicoot/trojan_run_checkpoints
Updated
Baidicoot/lat_trojan_models_partial
Updated
Baidicoot/dpo_trojan_models_partial
Updated
Baidicoot/dpo_trojan_models_partial_knowledge
Updated
Baidicoot/mistral-7b-helpful-only-full-Q4_K_M-GGUF
Updated
•
30
Baidicoot/mistral-7b-helpful-only-full
Text Generation
•
Updated
•
2
Baidicoot/dpo_trojan_models
Updated
Baidicoot/lat_trojan_models
Updated
Baidicoot/mistral-7b-helpful-only
Updated
Baidicoot/ihy_llama_distilled_merged
Text Generation
•
Updated
•
4
datasets
44
Baidicoot/hh-rlhf-golden-harmful
Viewer
•
Updated
•
10
•
1
Baidicoot/anthropic-harmless-rlhf
Viewer
•
Updated
•
239
Baidicoot/anthropic-hh-rlhf
Viewer
•
Updated
•
234
Baidicoot/anthropic-helpful-harmless-rlhf
Viewer
•
Updated
Baidicoot/anthropic-rlhf-eval
Viewer
•
Updated
•
26
Baidicoot/helpful-harmful-rlhf
Viewer
•
Updated
•
7
Baidicoot/augmented_advbench_v4
Viewer
•
Updated
•
1.55k
Baidicoot/ultrachat-uncensored
Viewer
•
Updated
•
6
Baidicoot/harmful-rlhf
Viewer
•
Updated
•
24
•
1
Baidicoot/hh-rlhf-harmful-responses
Viewer
•
Updated