Machine Learning Classification over Encrypted Data Raphal Bost - PowerPoint PPT Presentation

Machine Learning Classification over Encrypted Data Raphaël Bost   Raluca Ada Popa, Université Rennes 1 ETH Zürich   MIT MIT Stephen Tu Shafi Goldwasser MIT MIT

Classification (Machine Learning) • Supervised learning (training) • Classification data classification training model data set phase phase prediction server client

Problem • The provider’s model is sensitive financial model, genetic sequences, … • Client’s private data medical records, credit history, …

Problem • The provider’s model is sensitive financial model, genetic sequences, … • Client’s private data medical records, credit history, … MPC / 2PC

Using General 2PC ? + Works for every circuit + Constant number of interactions - Have to build circuits - Hard to ‘compose’ - Not easily reusable

Using General 2PC ? + Works for every circuit + Constant number of interactions - Have to build circuits - Hard to ‘compose’ - Not easily reusable ➡ Ad Hoc protocols

Goal • Enable classification without sacrificing privacy • Secure classification, no learning the model is already known • Practical performance

Approach • Classifiers as specialized 2PC • Identify and construct reusable building blocks • Threat model: passive (honest-but-curious) adversary

Insight ML Algorithm Classifier Perceptron Linear Least squares Linear Fischer linear discriminant Linear Support vector machine Linear Naïve Bayes Naïve Bayes ID3/C4.5 Decision trees

Insight • Identify core operations • Construct reusable/composable building blocks • Choose the best fitted primitives Homomorphic Encryption, FHE, Garbled Circuits, …

Related Work • Privacy-preserving training • Using FHE, linear means classifier [GLN12] • Specific techniques for Naïve Bayes [VKC08], decision trees [BDMN05,LP00], linear discriminant [DHC04], kernel methods [LLM06] • Privacy-preserving classification • Using FHE, outsource computation [BLN13] • Secure branching programs [BFK+09, BFL+09] • Specific classifiers (face recognition/detection) [SSW09, AB07]

Building Blocks • Dot product • Encrypted Comparison • Encrypted (arg)max • Decision trees • Encryption scheme switching

Classifiers from blocks Naïve Bayes Decision Tree Linear Classifier Classifier Classifier Private Enc. Enc. ES Dot Product Decision Compare Argmax Switching Trees

Classifiers In Practice • Linear Classifier • Naïve Bayes Classifier • Decision Trees

Linear Classifier • Separate two sets of points • Very common classifier • Dot product + Encrypted compare

Linear Classifier Time / protocol Model Size Total Comm. Inter. Dot Product Enc. Comp. 30 <0.01s 0.194 s 0.204 s 35.84 kB 7 47 0.024 s 0.194 s 0.217 s 40.19 kB 7 Evaluation on UC Irvine ML databases   40 ms network latency   2,66 GHz Intel Core i7

Naïve Bayes Classifier d Y argmax p ( C = c i ) p ( X j = x j | C = c i ) i ∈ [ k ] j =1

Naïve Bayes Classifier d Y argmax p ( C = c i ) p ( X j = x j | C = c i ) i ∈ [ k ] j =1 d X argmax log p ( C = c i ) log p ( X j = x j | C = c i ) i ∈ [ k ] j =1 • Additive homomorphism + Encrypted argmax

Naïve Bayes Classifier # Cat. # Features Argmax Total Time Comm. Inter. 2 9 0.40 s 0.48 s 72.47 kB 14 5 9 1.33 s 1.42 s 150.7 kB 42 24 70 3.38 s 3.81 s 1911 kB 166 Evaluation on UC Irvine ML databases   40 ms network latency   2,66 GHz Intel Core i7

Decision Trees y x ≥ x 2 x < x 2 B D y 2 y < y 1 y > y 2 y 1 E D B A C x ≥ x 1 x < x 1 E C A x 1 x 2 x

Decision Tree • Combination of other classifiers • In this example, linear classifiers • Linear classifier + ES Switching + Decision Trees

Decision Tree Tree Time / Protocol Specs. Total Comm. Inter. Decision Nodes Depth Lin. Class. ES Switch Tree (FHE) 4 4 0.45 s 1.64 s 0.27 s 2.3 s 2639 kB 30 6 4 1.41 s 7.41 s 0.93 s 9.8 s 3555 kB 44 Evaluation on UC Irvine ML databases   40 ms network latency   2,66 GHz Intel Core i7

Decision Tree Tree Time / Protocol Specs. Total Comm. Inter. Decision Nodes Depth Lin. Class. ES Switch Tree (FHE) 4 4 0.45 s 1.64 s 0.27 s 2.3 s 2639 kB 30 6 4 1.41 s 7.41 s 0.93 s 9.8 s 3555 kB 44 Run sequentially, can be parallelized

Building blocks library • Designed to be modular Easy composition • Easy to construct new secure classifiers Face detection algorithm (Viola & Jones)

Building blocks library E.g.: Linear Classifier Client Server PK SK v w Dot Product Dot Product SK J h v, w i K PK Enc. Compare Enc. Compare h v, w i > 0

  Building blocks library E.g.: Linear Classifier Client Server bool Linear_Classifier_Client::run() void Linear_Classifier_Server_session:: run_session()   { {   exchange_keys(); exchange_keys(); // enc_model_ is the encrypted model vector   // values_ is a vector of integers   // compute the dot product   // compute the dot product   help_compute_dot_product(enc_model_, true);   mpz_class v = compute_dot_product(values_);   mpz_class w = 1; // encryption of 0 // help the client to get   // compare the dot product with 0 // the sign of the dot product   return enc_comparison(v, w, bit_size_, false);   help_enc_comparison(bit_size_, false);   } }

In conclusion • Composable building blocks for secure classifiers • Library with practical performances Future work : • Less roundtrips (work on the protocols) • More parallelism (work on the implementation)

Questions?

Machine Learning Classification over Encrypted Data Raphal Bost - PowerPoint PPT Presentation

Machine Learning Classification over Encrypted Data Raphal Bost Raluca Ada Popa, Universit Rennes 1 ETH Zrich MIT MIT Stephen Tu Shafi Goldwasser MIT MIT Classification (Machine Learning) Supervised learning (training)

Machine Learning Classification over Encrypted Data Raphael Bost, Raluca Ada Popa, Stephen Tu,

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Traceback for End-to-End Encrypted Messaging Nirvan Tyagi Ian Miers Tom Ristenpart CCS 2019 1

TLS 1.3 Encrypted SNI ekr: ekr@rtfm.com dkg: dkg@aclu.org IETF 94 TLS 1.3 Encrypted SNI 1

Challenges With Building End-to-End Encrypted Challenges With Building End-to-End Encrypted

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

x ? Machine Learning 5/4/20 Tim Althoff, UW CS547: Machine Learning for Big Data,

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

MSc Course MACHINE LEARNING TECHNIQUES AND APPLICATIONS Classification with GMM + Bayes 1

Machine learning for finance Nathan George Data Science Professor DataCamp Machine Learning

Searches Through Encrypted Data presenter: Reza Curtmola Advanced Topics in Network Security

Agenda Announcements Course Evaluation Tuples Hangman help 1/14/2013 CompSci101

Polychromatic Colorings of Complete Graphs with Respect to 1-,2-factors and Hamiltonian Cycles

-Testing Sofya Raskhodnikova Penn State University, visiting Boston University and

Certification of Minimal Approximant Bases Pascal Giorgi 1 , Vincent Neiger 2 1 Universit 2

Shafi Goldwasser, Michael Sipser Shafi Goldwasser, Michael Sipser l (| w |) (uniformly)

Computer-aided cryptographic proofs Gilles Barthe MPI-SP , Germany IMDEA Software Institute,

The (all guards move) Eternal Domination number for 3 n Grids Margaret-Ellen Messinger Mount

ANALYZING YZING MA MASS SSIVE VE DATASETS SETS WITH ITH MI MISSI SSING NG ENTRIES TRIES

Sambuz

Useful Links

Newsletter

Mail Us