Thomas Winninger

Sckathach

https://le-magicien-quantique.github.io/

AI & ML interests

Graphs

Sckathach's activity

upvoted a paper about 1 month ago

Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Paper • 2503.06269 • Published Mar 8 • 4

commented on a paper about 1 month ago

Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Paper • 2503.06269 • Published Mar 8 • 4

updated 9 datasets about 1 month ago

Sckathach/ssr-llama3.2-1b-filtered

Viewer • Updated Mar 12 • 37 • 34

Sckathach/ssr-gemma2-2b-filtered

Viewer • Updated Mar 12 • 48 • 28

Sckathach/ssr-steering-gemma2-2b-filtered

Viewer • Updated Mar 12 • 29 • 40

Sckathach/ssr-steering-llama3.2-1b-filtered

Viewer • Updated Mar 12 • 46 • 33

Sckathach/ssr-steering-llama3.2-3b-filtered

Viewer • Updated Mar 12 • 22 • 30

Sckathach/ssr-steering-qwen2.5-1.5b-filtered

Viewer • Updated Mar 12 • 53 • 34

Sckathach/ssr-probes-llama3.2-1b-short

Viewer • Updated Mar 12 • 532 • 31

Sckathach/ssr-steering-llama3.2-1b-short

Viewer • Updated Mar 12 • 140 • 31

Sckathach/ssr-probes-qwen2.5-1.5b-short

Viewer • Updated Mar 12 • 244 • 30

authored a paper about 1 month ago

Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Paper • 2503.06269 • Published Mar 8 • 4

published 8 datasets about 1 month ago

Sckathach/ssr-probes-qwen2.5-1.5b-short

Viewer • Updated Mar 12 • 244 • 30

Sckathach/ssr-steering-llama3.2-1b-short

Viewer • Updated Mar 12 • 140 • 31

Sckathach/ssr-probes-llama3.2-1b-short

Viewer • Updated Mar 12 • 532 • 31

Sckathach/ssr-steering-qwen2.5-1.5b-filtered

Viewer • Updated Mar 12 • 53 • 34

Sckathach/ssr-steering-llama3.2-3b-filtered

Viewer • Updated Mar 12 • 22 • 30

Sckathach/ssr-steering-llama3.2-1b-filtered

Viewer • Updated Mar 12 • 46 • 33

Sckathach/ssr-steering-gemma2-2b-filtered

Viewer • Updated Mar 12 • 29 • 40

Sckathach/ssr-gemma2-2b-filtered

Viewer • Updated Mar 12 • 48 • 28