Ignacio Roldan Fernandez's picture
1 1

Ignacio Roldan Fernandez

igrofe
ยท

AI & ML interests

I am attracted to mechanistic interpretability. Currently focusing on deception and honesty

Recent Activity

upvoted a paper about 1 month ago
The Truthfulness Spectrum Hypothesis
liked a dataset about 2 months ago
cais/MASK
View all activity

Organizations

None yet