Bias Also Matters: Bias Attribution for Deep Neural Network Explanation


  1. Bias Also Matters: Bias Attribution for Deep Neural Network Explanation Shengjie Wang*, Tianyi Zhou*, Jeff A. Bilmes University of Washington, Seattle

  2. Explain DNNs as a linear model per data point
     • A DNN with piecewise linear activations such as ReLU, when applied to a data point $x$, is equal to a linear model $f(x) = wx + b$ whose $w$ and $b$ depend on $x$.
     • The gradient term, i.e., $w$ in $f(x)$, has been widely studied to explain the DNN output on a given data point.
     • The bias $b$, however, is usually overlooked.
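
The per-data-point linear model can be checked numerically. The following is a minimal sketch, assuming a hypothetical two-layer ReLU network with random parameters; the names `W1`, `b1`, `W2`, `b2`, `f` and `local_linear_model` are illustrative and do not come from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical two-layer ReLU network with random parameters (illustration only).
W1, b1 = rng.normal(size=(8, 4)), rng.normal(size=8)
W2, b2 = rng.normal(size=(1, 8)), rng.normal(size=1)

def f(x):
    """Forward pass: f(x) = W2 relu(W1 x + b1) + b2."""
    return W2 @ np.maximum(W1 @ x + b1, 0.0) + b2

def local_linear_model(x):
    """Return (w, b) of the linear model that the network equals at this x."""
    mask = (W1 @ x + b1 > 0).astype(float)  # ReLU activation pattern at x
    w = (W2 * mask) @ W1                    # gradient term, shape (1, 4)
    b = (W2 * mask) @ b1 + b2               # bias term, shape (1,)
    return w, b

x = rng.normal(size=4)
w, b = local_linear_model(x)
print(np.allclose(w @ x + b, f(x)))  # compares gradient term + bias term with f(x)
```

Both `w` and `b` change whenever the activation pattern changes, which is why the bias carries information specific to the data point.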

  3. Bias contains important information of DNNs
     • Decomposition of a DNN for every data point $x$:
       $f(x) = W_m \psi_{m-1}(W_{m-1} \psi_{m-2}(\ldots \psi_1(W_1 x + b_1) \ldots) + b_{m-1}) + b_m$,
       where $W_\ell$ and $b_\ell$ are the weight matrix and bias term for layer $\ell$, and $\psi_\ell$ is the corresponding activation function.
     • The bias term, although only a scalar, results from a complicated process involving both the weights and the biases of the DNN layers.
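
For ReLU activations, the way layer weights and biases mix into the final bias can be written out explicitly. The derivation below is a sketch under stated assumptions: $D_\ell$ is notation introduced here (not in the slides) for the diagonal 0/1 matrix recording which units of layer $\ell$ are active at $x$, so that the activation acts as multiplication by $D_\ell$ at this particular $x$, and the last layer has no activation.

```latex
\begin{aligned}
W^x_\ell &= D_\ell W_\ell, \quad b^x_\ell = D_\ell b_\ell \quad (\ell < m),
\qquad W^x_m = W_m, \quad b^x_m = b_m, \\
f(x) &= \Big(\prod_{i=1}^{m} W^x_i\Big)\, x
      \;+\; \sum_{\ell=1}^{m} \Big(\prod_{i=\ell+1}^{m} W^x_i\Big)\, b^x_\ell .
\end{aligned}
```

Here each product is ordered with the highest index on the left, and the empty product (for $\ell = m$) is the identity. The first term is the gradient term; the second is the bias of the local linear model, which depends on every layer's weights as well as every layer's bias.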

  4. Bias is important for DNN performance
     • A linear model with the gradient term only may produce wrong predictions.
     • The bias term corrects it.

     Dataset    Train Without Bias   Train With Bias
                                     Test All   Test Only wx   Test Only b
     CIFAR10    87.0                 90.9       71.5           62.2
     CIFAR100   62.8                 66.8       40.3           36.5
     FMNIST     94.1                 94.7       76.1           24.6

     Our method, “Bias Backpropagation (BBp)”, explicitly attributes the bias term to each input feature.

  5. Bias Backpropagation (BBp)
     • Start from the final layer and attribute the bias in a backpropagation style.
     • For every layer:
       • Receive the bias attribution from the previous layer.
       • Combine the received bias attribution with the effective bias of this layer.
       • Attribute the combined term to the input of this layer.
     • The sum of the attributions on all input features exactly recovers the bias $b$ for the data point $x$.

     Algorithm 1 Bias Backpropagation (BBp)
     input: $x$, $\{W_\ell\}_{\ell=1}^{m}$, $\{b_\ell\}_{\ell=1}^{m}$, $\{\psi_\ell(\cdot)\}_{\ell=1}^{m}$
      1  Compute $\{W^x_\ell\}_{\ell=1}^{m}$ and $\{b^x_\ell\}_{\ell=1}^{m}$ for $x$ by Eq. (5);  // Get data point specific weight/bias
      2  $\beta_m \leftarrow b_m$;  // $\beta_\ell$ holds the accumulated attribution for layer $\ell$
      3  for $\ell \leftarrow m$ to $2$ by $-1$ do
      4      for $p \leftarrow 1$ to $d_\ell$ by $1$ do
      5          Compute $\alpha_\ell[p]$ by Eq. (15)-(17) or Eq. (18);  // Compute attribution score
      6          $B_\ell[p, q] \leftarrow \alpha_\ell[p, q] \times \beta_\ell[p]$, $\forall q \in [d_{\ell-1}]$;  // Attribute to the layer input
      7      end
      8      for $q \leftarrow 1$ to $d_{\ell-1}$ by $1$ do
      9          $\beta_{\ell-1}[q] \leftarrow \prod_{i=\ell}^{m} W^x_i b^x_{\ell-1} + \sum_{p=1}^{d_\ell} B_\ell[p, q]$;  // Combine with bias of layer $\ell-1$
     10      end
     11  end
     12  return $\beta_1 \in \mathbb{R}^{d_{\mathrm{in}}}$
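
The backward pass described on this slide can be sketched in NumPy. This is an illustration under several assumptions, not the authors' implementation: the network is a plain ReLU MLP with a linear last layer, a single output unit `out_idx` is explained, the effective bias of a unit is its masked bias scaled by the gradient of the output with respect to that unit, and the attribution scores use a simple stand-in rule (shares proportional to the absolute value of weight times input) because Eq. (15)-(18) are not reproduced in the slides. The names `forward` and `bbp_sketch` are hypothetical.

```python
import numpy as np

def forward(x, weights, biases):
    """Plain ReLU MLP with a linear last layer (assumed architecture)."""
    h = x
    for l, (W, b) in enumerate(zip(weights, biases)):
        z = W @ h + b
        h = z if l == len(weights) - 1 else np.maximum(z, 0.0)
    return h

def bbp_sketch(x, weights, biases, out_idx):
    """Attribute the local bias of output `out_idx` to the input features of x."""
    m = len(weights)
    h, inputs, masks = x, [], []
    for l, (W, b) in enumerate(zip(weights, biases)):
        inputs.append(h)                      # input seen by layer l
        z = W @ h + b
        mask = np.ones_like(z) if l == m - 1 else (z > 0).astype(float)
        masks.append(mask)
        h = mask * z
    # Data-point-specific weights and biases (ReLU assumption).
    Wx = [mk[:, None] * W for mk, W in zip(masks, weights)]
    bx = [mk * b for mk, b in zip(masks, biases)]
    Wx[-1], bx[-1] = Wx[-1][[out_idx]], bx[-1][[out_idx]]  # keep one output unit

    beta = bx[-1].copy()   # accumulated attribution, starts as the last-layer bias
    g = np.eye(1)          # running product of the W^x above the current layer
    for l in range(m - 1, -1, -1):
        # Stand-in attribution scores: each row of alpha sums to 1.
        score = np.abs(Wx[l] * inputs[l][None, :])
        total = score.sum(axis=1, keepdims=True)
        alpha = np.where(total > 0, score / np.where(total > 0, total, 1.0),
                         1.0 / score.shape[1])
        beta = (alpha * beta[:, None]).sum(axis=0)  # attribute to this layer's input
        g = g @ Wx[l]
        if l > 0:
            beta = beta + g[0] * bx[l - 1]          # combine with the bias of the layer below
    return beta, g[0]      # per-feature bias attribution, gradient term

rng = np.random.default_rng(0)
dims = [5, 7, 6, 3]
weights = [rng.normal(size=(dims[i + 1], dims[i])) for i in range(3)]
biases = [rng.normal(size=dims[i + 1]) for i in range(3)]
x = rng.normal(size=5)
attr, grad = bbp_sketch(x, weights, biases, out_idx=1)
```

Because every row of `alpha` sums to 1, each step preserves the total attribution, so `attr.sum()` equals the bias of the local linear model for the chosen output and `attr.sum() + grad @ x` matches `forward(x, weights, biases)[1]`. Any other scoring rule with rows summing to 1 would keep that property.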

  6. Examples of Attribution Results on Images
     [Figure: grid of attribution maps. Panels: original, label, integrad. attrib., grad. attrib., bias.1 attrib., bias.2 attrib., bias.3 attrib., and the normalized versions norm. integrad., norm. grad., norm. bias.1, norm. bias.2, norm. bias.3. Image labels: Teddy Bear, Brambling, Longhorn Beetle, Fireguard, Folding Chair, Fountain Pen, Piggy Bank.]

  7. Bias Attribution of various layers
     • We can use BBp to analyze the biases of different layers.
     • Bias from lower layers results in more noise in the attribution.
     • Bias from deeper layers reveals high-level features (e.g., the head parts of the dog and the bird).
     • “bias.1(2,3)” corresponds to the three variants of BBp.
     [Figure: columns: original, all layers, all except first 2 layers, all except first 4 layers, all except first 6 layers; rows: bias.1 attrib., bias.2 attrib., bias.3 attrib.]

  8. Quantitative evaluation on MNIST digit flip test
     • Mask input image pixels based on the attribution scores.
     • Check the change in the predictions.
     • Measure the log-odds scores of the target vs. source class before and after masking pixels.
     • BBp is class-sensitive and comparable to methods such as Integrated Gradients and DeepLIFT.
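
The masking procedure can be sketched as follows. This is a toy illustration with assumptions stated up front: the classifier is a hypothetical random linear softmax model rather than a trained MNIST network, masking means setting a pixel to zero, the `k` highest-scoring pixels are masked, and the log-odds are taken as source minus target. The names `log_odds` and `flip_test` are hypothetical.

```python
import numpy as np

def log_odds(logits, a, b):
    """log p(a) - log p(b) under a softmax, which reduces to a logit difference."""
    return logits[a] - logits[b]

def flip_test(x, scores, predict_logits, source, target, k):
    """Zero the k highest-scoring pixels; return the drop in source-vs-target log-odds."""
    top = np.argsort(scores)[::-1][:k]   # indices of the k largest scores
    x_masked = x.copy()
    x_masked[top] = 0.0
    before = log_odds(predict_logits(x), source, target)
    after = log_odds(predict_logits(x_masked), source, target)
    return before - after, x_masked

rng = np.random.default_rng(0)
W = rng.normal(size=(10, 784))           # hypothetical linear classifier (assumption)
predict_logits = lambda v: W @ v
x = rng.random(784)                      # stand-in for a flattened 28x28 image
source, target, k = 8, 3, 50

# For a linear model, (W[source] - W[target]) * x is an exact per-pixel contribution.
scores = (W[source] - W[target]) * x
drop, x_masked = flip_test(x, scores, predict_logits, source, target, k)
```

A larger `drop` for the same `k` means the attribution scores identify pixels that matter more for telling the source class from the target class, which is how attribution methods are compared in this kind of test.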

  9. Thank you!
     • For more details, please come to our poster session: Wednesday 06:30 - 09:00 PM, Pacific Ballroom #147
