Nils Feldhus's picture
5 47

Nils Feldhus PRO

nfel

AI & ML interests

Interpretability, Explainability, Natural Language Generation

Recent Activity

Organizations

Spaces-explorers's profile picture Inseq's profile picture Speech and Language Technology, DFKI's profile picture Hugging Face Discord Community's profile picture

nfel's activity

reacted to gsarti's post with ๐Ÿ‘ about 1 year ago
view post
Post
๐Ÿ” Today's pick in Interpretability & Analysis of LMs: SyntaxShap: Syntax-aware Explainability Method for Text Generation by @kamara000 , R. Sevastjanova and M. El-Assady

Most model-agnostic post-hoc interpretability methods used nowadays in NLP were originally ported from tabular/CV domains with next to no adjustments to the intrinsic properties of textual inputs.

In this work, authors propose SyntaxSHAP, an adaptation of the Shapely value approach in which coalitions used to compute marginal contributions to importance scores are constrained by the syntax of the explained sentence. The resulting tree-based coalitions do not satisfy the efficiency assumption of Shapley values but preserves the symmetry, nullity and additivity axioms.

SyntaxSHAP is compared to other model-agnostic approaches on small (GPT-2 117M) and large (Mistral 7B) LMs, showing it produces explanations that are more faithful to model predictions and more semantically meaningful than other common methods, while also being more efficient than the base SHAP method.

๐Ÿ“„ Paper: SyntaxShap: Syntax-aware Explainability Method for Text Generation (2402.09259)

๐Ÿ’ป Code: https://github.com/k-amara/syntax-shap

๐Ÿ” All daily picks in LM interpretability: https://huggingface.co/collections/gsarti/daily-picks-in-interpretability-and-analysis-of-lms-65ae3339949c5675d25de2f9