Avi Caciularu

codevan

AI & ML interests

None yet

Recent Activity

Organizations

Google's profile picture Bar-Ilan University NLP Lab's profile picture

codevan's activity

reacted to gsarti's post with ๐Ÿค— over 1 year ago
view post
Post
๐Ÿ’ฅ Today's pick in Interpretability & Analysis of LMs: ๐Ÿฉบ Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models by @asmadotgh , @codevan , @1wheel , @iislucas & @mega

Patchscopes is a generalized framework for verbalizing information contained in LM representations. This is achieved via a mid-forward patching operation inserting the information into an ad-hoc prompt aimed at eliciting model knowledge. Patchscope instances for vocabulary projection, feature extraction and entity resolution in model representation are show to outperform popular interpretability approaches, often resulting in more robust and expressive information.

๐ŸŒ Website: https://pair-code.github.io/interpretability/patchscopes/
๐Ÿ“„ Paper: Patchscope: A Unifying Framework for Inspecting Hidden Representations of Language Models (2401.06102)
New activity in allenai/PRIMERA-multinews over 1 year ago

Update README.md

1
#8 opened over 1 year ago by
codevan
New activity in biu-nlp/QAmden almost 2 years ago