hila-chefer/Transformer-Explainability

The generic Deep Taylor Decomposition formula in the paper

a943862842 opened this issue · 2 comments

The generic Deep Taylor Decomposition formula in the paper seems to be different as the formula in reference 27. The Deep Taylor Decomposition formula in Reference 27 requires selecting a root. Could you please show me how this formula was derived? Thank you!
1691080115266

Does anyone know how the pos and neg metrics mentioned in the article are implemented in code?

jykr commented

@hila-chefer @shirgur I have same questions- Could you please comment on these?