Detecting Similar ID Documents Using Deep Learning Burkay Gur - - PowerPoint PPT Presentation

detecting similar id documents using deep learning
SMART_READER_LITE
LIVE PREVIEW

Detecting Similar ID Documents Using Deep Learning Burkay Gur - - PowerPoint PPT Presentation

Detecting Similar ID Documents Using Deep Learning Burkay Gur QCon.ai, Apr 2018 Our mission is to create an open financial system for the world Risk & Data What do we do? - Limit Coinbases exposure to risk - Fight Identity


slide-1
SLIDE 1

Detecting Similar ID Documents Using Deep Learning

Burkay Gur QCon.ai, Apr 2018

slide-2
SLIDE 2

Our mission is to create an 


  • pen financial

system 


for the world

slide-3
SLIDE 3

Risk & Data

  • What do we do?
  • Limit Coinbase’s exposure to risk
  • Fight Identity Fraud
slide-4
SLIDE 4
slide-5
SLIDE 5

Attempt 1: Shazam

slide-6
SLIDE 6

Attempt 1: Shazam

  • Fingerprint for each document
  • Perceptual Hashing (256 bit)
  • Store hashes in a DB (Hamming distance)
slide-7
SLIDE 7

Evaluation of Shazam

Pros Cons

  • Translations
  • Large datasets
  • Domain Specificity
  • Color differences
  • Minor cropping
  • Easy to implement
slide-8
SLIDE 8

Attempt 2: Vision

slide-9
SLIDE 9

Attempt 2: Vision

X {

slide-10
SLIDE 10

Evaluation of Vision

Pros Cons

  • Domain Specificity
  • Iteration Speed
  • Infra and Security
  • Cropping
  • Translation

imgcrypt

slide-11
SLIDE 11

New Challenge: Iterate Fast in Highly Secure Environments

slide-12
SLIDE 12

imgcrypt +

Coinbase ML Infrastructure

NostradamusCLI

slide-13
SLIDE 13

Takeaways

  • Start with naive approach and improve
  • Iteration speed is top priority
  • Watch out for adversarial attacks

Contact: burkay.gur@coinbase.com