Ignacio Roldan Fernandez
igrofe
ยท
AI & ML interests
I am attracted to mechanistic interpretability. Currently focusing on deception and honesty
Recent Activity
upvoted a paper about 1 month ago
The Truthfulness Spectrum Hypothesis liked a dataset about 2 months ago
cais/MASKOrganizations
None yet