PRIVACY PRESERVING KEY-DERIVATION FROM BIOMETRICS: CLOSING THE GAP BETWEEN THEORY AND PRACTICE
BY MAHMOOD SHARIF JOINT WORK WITH ORR DUNKELMAN AND RITA OSADCHY
Key-derivation: generating a secret key from information possessed by the user. Passwords, the most widely used means for key derivation, are problematic: they are often weak and easy to guess (e.g., "peekaboo").
Suggestion: use biometric data for key generation. Problems: biometric samples are noisy, and biometric data cannot be replaced in case it gets compromised.
Outline:
- Motivation
- Background: The Fuzziness Problem; Cryptographic Constructions; Challenges
- SecureFace: Overview of the System; Experiments; Challenge; New Construction
- Conclusions
Two images of the same face are rarely identical (due to lighting, pose, and expression changes), yet we want to consistently derive the same key every time. Most biometric systems handle the fuzziness in the samples in three stages:
- Feature extraction ➔ lowers susceptibility to noise
- Binarization ➔ further decreases the noise; necessary for utilizing the cryptographic constructions
- Error correction ➔ handles the remaining noise
Two kinds of features:
- User-specific features, e.g., Eigenfaces (PCA), Fisherfaces (FLD): require training and store user-specific parameters
- Generic features, e.g., histograms (LBPs, SIFT) and filters (Gabor features): do not require training or store user-specific data
[FYJ10] used Fisherfaces. Problem: the public data looks like the users :( If privacy is a goal, user-specific features cannot be used!
Binarization: essential for using the cryptographic constructions; some also claim non-invertibility [TGN06]. Done by either the sign of the projection or quantization. Quantization is more accurate, but requires additional private information [TKL08].
Biometric features can be approximated
Secure sketch [JW99]; other constructions: Fuzzy Vault [JS06], Fuzzy Extractors [DORS08]
Enrollment: sample xenroll (the binary representation of the biometrics), pick a random key k⬅{0,1}*, and publish the sketch s = Enc(k) ⊕ xenroll.
Key generation: a fresh sample xgenuine gives s ⊕ xgenuine = Enc(k) ⊕ (xenroll ⊕ xgenuine); as long as xgenuine is within Hamming distance t of xenroll, decoding recovers k. An impostor sample ximpostor lies farther than t from xenroll, so decoding fails.
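The code-offset construction above can be sketched with a toy 3x repetition code standing in for the real ECC (all names and parameters here are illustrative, not the system's actual code):

```python
# Minimal code-offset (secure sketch) demo with a 3x repetition code.
import secrets

def encode(bits):            # repetition code: write each key bit 3 times
    return [b for bit in bits for b in (bit, bit, bit)]

def decode(bits):            # majority vote over each group of 3
    return [1 if sum(bits[i:i + 3]) >= 2 else 0
            for i in range(0, len(bits), 3)]

key = [secrets.randbelow(2) for _ in range(4)]        # k <- {0,1}*
x_enroll = [secrets.randbelow(2) for _ in range(12)]  # binary template
s = [c ^ x for c, x in zip(encode(key), x_enroll)]    # public sketch

x_genuine = x_enroll[:]
x_genuine[5] ^= 1                                     # one noisy bit
recovered = decode([si ^ xi for si, xi in zip(s, x_genuine)])
print(recovered == key)   # -> True: noise within t is corrected
```

One flipped bit lands inside a single 3-bit group, so the majority vote still recovers the key; an impostor template would flip too many groups.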
Secure sketch provides zero entropy loss given s only if the biometric templates are i.i.d. Hao et al. proposed a system that derives keys with 140 bits of entropy from iris images [HAD05].
Statistical attacks exploit dependencies in the biometrics and, given s, guess the keys in ~2^10 attempts [RU12, ZKB12].
Fuzzy Extractors can be used, but their entropy loss is too high. The result: short keys and weak privacy protection.
Alignment
Face landmark localization and an affine transformation to a canonical pose: an essential step, since alignment cannot be performed between the enrolled template and a newly presented one.
Local Binary Pattern (LBP) descriptors are computed from 21 regions defined on the face; the same is done with Scale-Invariant Feature Transform (SIFT) descriptors. Histograms of Oriented Gradients (HoGs) are computed on the whole face.
Post-processing toward independent bits:
- Dimension reduction and concatenation of the feature vectors
- Removing correlations between the features
- Rescaling each feature to the [0,1] interval
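A minimal sketch of such a pipeline, assuming PCA is used for both dimension reduction and decorrelation (the paper's exact transforms and sizes may differ):

```python
# Sketch: reduce dimension, decorrelate, and rescale features to [0,1].
import numpy as np

rng = np.random.default_rng(1)
F = rng.standard_normal((200, 300))   # 200 samples, concatenated features

# Dimension reduction + decorrelation: project onto principal components
Fc = F - F.mean(axis=0)               # center the data
U, S, Vt = np.linalg.svd(Fc, full_matrices=False)
G = Fc @ Vt[:128].T                   # 128 decorrelated components

# Rescale each feature to the [0,1] interval
G = (G - G.min(axis=0)) / (G.max(axis=0) - G.min(axis=0))
print(G.shape, float(G.min()), float(G.max()))
```

Principal-component scores have a diagonal covariance matrix, and the per-column min-max rescaling preserves that decorrelation.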
Binarization: projecting onto hyperplanes.
Requirements from the binary representation: it should discriminate between users, with (near-)independent bits.
We find a discriminative projection space W = [w1, w2, ..., wK] by generalizing an algorithm from [WKC10] (originally for the approximate-nearest-neighbor problem). Given training samples X = [x1, x2, ..., xn], the aim is to find hyperplanes w1, ..., wK such that each bit h_k(x) = sgn(w_k^T x) satisfies:
- h_k(xi) = h_k(xj) if (xi, xj) ∈ C (the pair belongs to the same user)
- h_k(xi) ≠ h_k(xj) if (xi, xj) ∈ T (the pair belongs to different users)
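A toy sketch of the sign-of-projection step, with random hyperplanes standing in for the learned matrix W (the system learns W as above; all sizes here are illustrative):

```python
# Sign-of-projection binarization: h_k(x) = sgn(w_k^T x).
import numpy as np

rng = np.random.default_rng(0)
d, K = 128, 511                          # feature dim, number of hyperplanes
W = rng.standard_normal((d, K))          # random stand-in for the learned W

def binarize(x, W):
    return (x @ W >= 0).astype(np.uint8)  # one bit per hyperplane

x = rng.standard_normal(d)                     # a feature vector
x_noisy = x + 0.01 * rng.standard_normal(d)    # small sensor noise
dist = int((binarize(x, W) != binarize(x_noisy, W)).sum())
print(dist)   # small Hamming distance: nearby inputs map to similar codes
```

Only hyperplanes passing close to x flip under small noise, which is why the binary code tolerates sample-to-sample fuzziness.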
W will be public, yet we do not want it to reveal information about the users ➔ training cannot be performed on images of the enrolled users. Solution: transfer learning; training is performed once, on different subjects.
Instead of learning a representation for each enrolled user, we learn attribute classifiers:
- w1: does the subject have attribute #1?
- w2: does the subject have attribute #2?
- …
- wn: does the subject have attribute #n?
Enrollment: Feature Extraction ➔ Binarization; choose k⬅{0,1}* and Encode it against the binary template to produce the helper data s.
Key derivation: Feature Extraction ➔ Binarization; Decode (using s) and Hash to recover the key.
Training of the projections is performed only once, on subjects different than the ones enrolled in the system.
Training set: number of subjects: 949; images per subject: 3-4; number of hyperplanes: 511.
Data: a set of frontal images under controlled conditions, collected at the University of Haifa; 474 subjects in total, 26 with two sessions; 6.41 images per subject on average.
Tests: 9,602 genuine attempts and 4,609,678 impersonation attempts.
Results (ROC curve): at FPR = 0%, TPR = 88.59%! The minimum distance in an impersonation attempt was 164 bits.
Setting the threshold at t = 163 errors requires a binary code of length n = 511 with minimum distance d = 2·163+1 = 327. Since d > n/2, the Plotkin bound gives at most 2·⌊(d+1)/(2d+1−n)⌋ = 2·⌊328/144⌋ = 4 codewords. This implies keys with less than 3 bits :-(
Best option we found in the literature: Reed-Solomon + repetition [MVV12]. Idea: encode the key with an RS code, then repeat the codeword bits as much as the template length allows. In our case, for an 80-bit key, the best parameters are an RS(31,16) ECC with 5-bit symbols and 3 repetitions, yielding 465-bit codewords: the key symbols k1, ..., k16 are encoded into codeword symbols k'1, ..., k'31, and each codeword bit is repeated 3 times (majority vote on decoding).
How many errors do we have? Up to 163 unstructured errors out of 511 bits ➔ a 31.89% chance of a bit error. The probability of a correct bit after 3-way majority is (1−p)³ + 3p(1−p)² ≈ 0.76, so the probability of a correct 5-bit symbol is only ≈ 0.76⁵ ≈ 0.25. RS would thus need to correct ~23 symbol errors out of 31, while RS(31,16) corrects only ⌊(31−16)/2⌋ = 7. The problem: a correct bit does not guarantee a correct symbol.
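The arithmetic above can be checked directly, assuming i.i.d. bit errors at the observed rate of 163/511:

```python
# Why RS(31,16) + 3x repetition fails at a 163/511 bit-error rate.
from math import comb

p = 163 / 511                       # per-bit error probability (~31.89%)
# bit survives if 2 or 3 of its 3 copies are correct (majority vote)
bit_ok = sum(comb(3, k) * (1 - p) ** k * p ** (3 - k) for k in (2, 3))
sym_ok = bit_ok ** 5                # a 5-bit symbol needs all bits correct
exp_sym_err = 31 * (1 - sym_ok)    # expected symbol errors out of 31
print(round(bit_ok, 3), round(sym_ok, 3), round(exp_sym_err, 1))
# -> 0.76 0.253 23.2  (far beyond the 7 symbols RS(31,16) can correct)
```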
New construction. Idea: perform the repetition on the biometric template instead. Select an RS(2^m−1, l) ECC with m-bit symbols, repeat each bit of the biometric template m times (so that template bit xi covers all m bits of codeword symbol i), and compute the helper data s = Enc(k) ⊕ (the repeated template). The key symbols k1, ..., kl are encoded into k'1, ..., k'_{2^m−1}, and symbol i is masked by m copies of xi. It can be seen that a correct template bit ➔ a correct symbol.
The length of the biometric template is 511, i.e., 2^m−1 = 511 ➔ m = 9. To correct up to 163 errors ➔ a key of length l = 184 symbols (RS(511,184) corrects ⌊(511−184)/2⌋ = 163 symbol errors).
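The symbol-alignment idea can be illustrated with toy parameters; the all-zero "codeword" below is a placeholder for a real RS encoding (the alignment argument is independent of the codeword's contents):

```python
# Template-repetition layout: template bit i masks all m bits of symbol i.
m = 3                                     # toy size: 3-bit symbols
n = 2 ** m - 1                            # n = 7 symbols / template bits
template = [1, 0, 1, 1, 0, 0, 1]          # toy biometric bits, length n
codeword = [[0] * m for _ in range(n)]    # placeholder for RS symbols

# helper data: XOR each codeword symbol with m copies of its template bit
helper = [[c ^ template[i] for c in codeword[i]] for i in range(n)]

# key generation with a noisy template: an error in template bit i
# corrupts only symbol i (all m bits of it), never two symbols
noisy = template[:]
noisy[2] ^= 1                             # one template-bit error
recovered = [[h ^ noisy[i] for h in helper[i]] for i in range(n)]
wrong = [i for i in range(n) if recovered[i] != codeword[i]]
print(wrong)   # -> [2]: exactly one corrupted symbol
```

This is why a correct template bit guarantees a correct symbol, unlike the bit-level repetition of the RS+repetition scheme.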
Privacy: if x is i.i.d., then H(x|s) = l; if H(x) = (2^m−1) − r (i.e., x is almost i.i.d.), then H(x|s) = l − r. Similarly for security: if x is i.i.d., then H(k|s) = l; if H(x) = (2^m−1) − r, then H(k|s) ≥ l − r.
Low correlation between the bits #1: high degrees of freedom, γ = p(1 − p)/σ² = 509.69, where p is the average relative distance between two representations of different persons and σ is its standard deviation.
No correlation between the bits #2: the representation has a diagonal covariance matrix. In addition, the representation has high min-entropy.
No correlation between the bits + high min-entropy ⇒ uniform distribution
Conclusions: SecureFace, a system for fast key-derivation from face images, provides long keys, privacy protection (no user-specific information needs to be stored), and high accuracy.
Questions?