


SLIDE 1 / 10

  • I. Sato, et al., Breaking Inter-Layer Co-Adaptation by Classifier Anonymization, ICML 2019


Breaking Inter-Layer Co-Adaptation by Classifier Anonymization

Ikuro Sato¹  Kohta Ishikawa¹  Guoqing Liu¹  Masayuki Tanaka²

¹ Denso IT Laboratory, Inc., Japan
² National Institute of Advanced Industrial Science and Technology, Japan

ICML 2019

SLIDE 2 / 10


Summary first

  • About what? Breaking co-adaptation between the feature extractor and the classifier.
  • How? By a classifier anonymization technique.
  • Theory? Proved: features form a simple point-like distribution.
  • In reality? The point-like property is largely confirmed on real datasets.

SLIDE 3 / 10


E2E optimization scheme flourishes. Is it always good?

The standard end-to-end (E2E) objective optimizes the feature extractor and the classifier jointly:

ϱ⋆, ι⋆ = arg min_{ϱ, ι} (1/|S|) Σ_{(x,t)∈S} ℓ(D_ι(G_ϱ(x)), t)

where G_ϱ is the feature extractor, D_ι is the classifier (together forming the DNN), x is the input, t is the target, and ℓ is the loss.

[Figure: 2-D feature space (dim-1 vs. dim-2), colored by the D_ι value for classes '+1' and '-1'.]

Toy example) 2-class regression: the features may form an excessively complex distribution.

  • Disjointed
  • Split

Under E2E optimization, the feature extractor G_ϱ⋆ adapts to one particular classifier D_ι.
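The joint update behind the E2E objective can be sketched with a toy linear model; the data, model shapes, and names (`rho`, `iota`) are illustrative assumptions, not the paper's setup.

```python
import numpy as np

# Minimal sketch of E2E optimization: feature extractor G_rho and classifier
# D_iota are trained jointly on the same loss, so G_rho can co-adapt to this
# one particular D_iota. Toy 2-class regression with targets in {+1, -1}.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))
t = np.where(X[:, 0] + X[:, 1] > 0, 1.0, -1.0)

rho = rng.normal(size=(2, 2)) * 0.1   # G_rho parameters: features z = X @ rho
iota = rng.normal(size=2) * 0.1       # D_iota parameters: output y = z @ iota

def loss(rho, iota):
    return np.mean((X @ rho @ iota - t) ** 2)

loss_before = loss(rho, iota)
lr = 0.05
for _ in range(300):
    z = X @ rho
    g = 2 * (z @ iota - t) / len(X)           # dL/dy
    iota = iota - lr * z.T @ g                # classifier step
    rho = rho - lr * X.T @ np.outer(g, iota)  # feature-extractor step (joint)
loss_after = loss(rho, iota)
```

Because both parameter sets descend the same loss, the learned features are only required to work for this single classifier, which is the co-adaptation the slide warns about.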

SLIDE 4 / 10


FOCA: Feature-extractor Optimization through Classifier Anonymization

ϱ⋆ = arg min_ϱ (1/|S|) Σ_{(x,t)∈S} 𝔼_{ι∼Θ_ϱ}[ ℓ(D_ι(G_ϱ(x)), t) ]

FOCA optimizes the feature extractor against a random weak classifier ι ∼ Θ_ϱ.

Want to know more about Θ_ϱ? Please come to the poster!

Under some conditions, the features form a simple point-like distribution per class.

The feature extractor G_ϱ⋆ adapts to a set of weak classifiers {D_ι}.

[Figure: 2-D feature space (dim-1 vs. dim-2).]
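One way to picture the FOCA objective is the sketch below: only the feature extractor is updated, and at every step the classifier is re-drawn as a weak one, here fitted with a few gradient steps on a small random batch of current features. The weak-classifier construction, sizes, and all names are illustrative assumptions, not the paper's Θ_ϱ.

```python
import numpy as np

# Hedged FOCA-style sketch: the feature extractor G_rho never sees one fixed
# classifier; its gradient is averaged over freshly sampled weak classifiers,
# approximating the expectation over iota ~ Theta_rho.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
t = np.where(X[:, 0] - X[:, 1] > 0, 1.0, -1.0)
rho = np.eye(2) * 0.3                     # feature extractor: z = X @ rho

def weak_classifier(z_b, t_b, steps=20, lr=0.05):
    """A deliberately weak linear classifier: few steps, small batch."""
    iota = np.zeros(z_b.shape[1])
    for _ in range(steps):
        iota -= lr * 2 * z_b.T @ (z_b @ iota - t_b) / len(t_b)
    return iota

def eval_loss(rho):
    """Loss under a fixed weak classifier fitted on the first 32 points."""
    z = X @ rho
    iota = weak_classifier(z[:32], t[:32])
    return np.mean((z @ iota - t) ** 2)

loss_before = eval_loss(rho)
lr = 0.02
for _ in range(300):
    z = X @ rho
    grad = np.zeros_like(rho)
    for _ in range(4):                    # Monte-Carlo average over classifiers
        idx = rng.choice(len(X), size=32, replace=False)
        iota = weak_classifier(z[idx], t[idx])
        g = 2 * (z @ iota - t) / len(X)
        grad += X.T @ np.outer(g, iota)
    rho -= lr * grad / 4                  # only the feature extractor moves
loss_after = eval_loss(rho)
```

Since no single classifier is available to co-adapt to, the extractor is pushed toward features that any weak classifier from the family can use, which is the mechanism behind the point-like distribution.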

SLIDE 5 / 10


Proposition about the point-like property

Please see the paper for the proof.

In words: if the feature extractor has enough representation ability, then under certain conditions all input data of the same class are projected to a single point in the feature space, in a class-separable way.
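The property in the proposition can be quantified directly: if same-class inputs collapse to one point, the within-class spread of features is negligible next to the distance between class centroids. The metric name and the toy features below are assumptions for illustration only.

```python
import numpy as np

# Illustrative measure of the point-like property for a 2-class feature set:
# mean within-class standard deviation divided by the distance between the
# two class centroids. Zero means each class is literally a single point.
def point_likeness(features, labels):
    classes = np.unique(labels)
    means = np.array([features[labels == c].mean(axis=0) for c in classes])
    within = np.mean([features[labels == c].std(axis=0).mean() for c in classes])
    between = np.linalg.norm(means[0] - means[1])
    return within / between

y = np.repeat([0, 1], 50)
f_point = np.repeat([[0.0, 0.0], [5.0, 5.0]], 50, axis=0)       # point-like
rng = np.random.default_rng(0)
f_spread = f_point + rng.normal(scale=2.0, size=f_point.shape)  # scattered

print(point_likeness(f_point, y))   # 0.0: each class collapses to one point
print(point_likeness(f_spread, y))  # clearly larger than zero
```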

SLIDE 6 / 10


Toy problem demonstration

[Figure: trajectories from start to end in the input plane (x-axis vs. y-axis) and the feature plane (feature dim. #1 vs. #2); highlighted points are the data used to generate the classifier decision boundary.]

Small perturbations lead to a point-like distribution. A small-batch classifier works as a weak classifier with respect to the entire dataset.

SLIDE 7 / 10


Experiment #1: partial-dataset training

What we wish to confirm: given a fixed feature extractor G_ϱ⋆, do a full-dataset classifier and a partial-dataset classifier perform similarly?
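The logic of this experiment can be sketched on toy features: freeze the extractor, train one classifier on all features and another on a small subset, and compare error rates. The helper names and the fabricated point-like features are illustrative assumptions, not the CIFAR-10 setup.

```python
import numpy as np

# Experiment-#1-style comparison on frozen features: if features are
# point-like, a classifier fitted on a handful of samples should match one
# fitted on the full set, so the performance gap should be tiny.
rng = np.random.default_rng(0)

def fit_linear(z, t):
    """Least-squares linear classifier (with bias) on fixed features."""
    zb = np.hstack([z, np.ones((len(z), 1))])
    w, *_ = np.linalg.lstsq(zb, t, rcond=None)
    return w

def error_rate(w, z, t):
    zb = np.hstack([z, np.ones((len(z), 1))])
    return np.mean(np.sign(zb @ w) != t)

# Point-like features: each class forms one tight cluster.
t = np.repeat([-1.0, 1.0], 100)
z = t[:, None] * 2.0 + rng.normal(scale=0.05, size=(200, 2))

w_full = fit_linear(z, t)               # "full-dataset" classifier
w_small = fit_linear(z[::20], t[::20])  # "partial-dataset" classifier (10 samples)
gap = abs(error_rate(w_small, z, t) - error_rate(w_full, z, t))
```

On the deck's real CIFAR-10 experiment this gap is what separates FOCA (small gap) from the baseline methods (large gap).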

SLIDE 8 / 10


Experiment #1: partial-dataset training

CIFAR-10 test error rates, comparing a classifier trained with a large dataset against one trained with a small dataset: the performance gap is much smaller for FOCA and large for the other methods.

One indication of the point-like property.

(The same, fixed feature extractor is used within each method.)

SLIDE 9 / 10


More experiments …

including:

  • Approximate geodesic distance measurements between large- and small-dataset solutions
  • Low-dimensional analyses

to further study the point-like property.

SLIDE 10 / 10

  • What? Breaking co-adaptation between the feature extractor and the classifier.
  • How? By classifier anonymization.
  • Theory? Proved: features form a simple point-like distribution.
  • Reality? The point-like property is largely confirmed on real datasets.

Poster #28 tonight