Fingerprinting digital documents: a survey
Gábor Tardos, Rényi Institute & Central European University

1. Government secrets
• Government meeting on Monday to discuss secret plans for hospital reorganizations in the face of COVID-19
• All the details of the plan are front-page news on Index on Tuesday:
  List of hospital departments to be closed
  - János Hospital, internal medicine
  - Margit Hospital, obstetrics
  - …

2. Industry secrets
Director of an engineering company:
- Good news: We have just sold the thousandth copy of our video on how to build cratoons.
- Bad news: This was the last one. Somebody uploaded it to YouTube, and now anybody can watch it for free.

How to protect the secret
• Sue the medium (Index or YouTube), or at least make sure they stop sharing our information
• Sue the illegitimate end user (the guy who builds cratoons with our video but did not pay for it)
• In this talk: Find the legitimate user who illegally shared the secret (the cabinet member / one of the thousand customers who paid for the video)

Embed unique ID in every copy of document
• Hide the embedded ID. If a user finds it, they can remove the ID and make the leaked copy untraceable.
• Easy for video / image / software (lots of irrelevant places to hide the ID), harder (but doable) for text.
• Practical if the number of legitimate users is small and they are known. Example: Hollywood movies distributed to the members of the American Academy before the vote for the Oscars.
[Figure: four copies of a document, stamped TOP SECRET Copy # 1 to TOP SECRET Copy # 4]

Example
Digital document:
0010010110101111101010110011010010001010001100110100111111

Example
Find irrelevant positions:
0010010110101111101001011100110100100010010001100110100111111

Example
Duplicate:
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100010010001100110100111111

Example
Insert distinct code (ID) in every copy:
0010010110101111101001010100110100100010010001100110100111111
0010010110101111101001010100110100100011010001100110100111111
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100011010001100110100111111
0010010110101111101011010100110100100010010001100110100111111
0010010110101111101011010100110100100011010001100110100111111
0010010110101111101011011100110100100010010001100110100111111
• If the code positions remain hidden:
  • the code is not changed
  • the leaking participant is easily traced
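
The embedding and tracing steps of this example can be sketched in a few lines of Python. This is only an illustration, not code from the talk: the names `embed_id` and `trace_id` are hypothetical, and the code positions 20, 24 and 39 (counted from 0) are an assumption read off by comparing the seven copies above, where those three bits appear to count up from 000 to 110.

```python
# Illustrative sketch only; function names and 0-based positions are assumptions.
BASE = "0010010110101111101001011100110100100010010001100110100111111"
CODE_POSITIONS = [20, 24, 39]  # positions where the copies on the slide differ


def embed_id(document: str, positions: list[int], user_id: int) -> str:
    """Write user_id in binary (most significant bit first) into the given positions."""
    if user_id < 0 or user_id >= 2 ** len(positions):
        raise ValueError("user_id does not fit into the code positions")
    bits = format(user_id, f"0{len(positions)}b")
    chars = list(document)
    for pos, bit in zip(positions, bits):
        chars[pos] = bit
    return "".join(chars)


def trace_id(leaked: str, positions: list[int]) -> int:
    """Read the ID back from an unmodified leaked copy."""
    return int("".join(leaked[pos] for pos in positions), 2)


# Seven copies carrying the IDs 0..6, in the same order as on the slide.
copies = [embed_id(BASE, CODE_POSITIONS, uid) for uid in range(7)]
leaker = trace_id(copies[5], CODE_POSITIONS)
```

Tracing this way only works as long as the leaked copy is unchanged at the code positions, which is exactly the condition stated in the bullets above.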

No mathematics?! it’s coming…

Collusion attack
Two (or more) participants compare copies:
0010010110101111101001010100110100100010010001100110100111111
0010010110101111101001010100110100100011010001100110100111111
0010010110101111101001011100110100100010010001100110100111111
0010010110101111101001011100110100100011010001100110100111111
0010010110101111101011010100110100100010010001100110100111111
0010010110101111101011010100110100100011010001100110100111111
0010010110101111101011011100110100100010010001100110100111111
• Differences between documents: these positions of the code can be altered arbitrarily, which makes tracing much harder (and more interesting!)
• Some positions of the code may remain hidden: tracing must be based on these
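
What two colluders learn by comparing their copies can be illustrated with a short Python sketch. It is an illustration under stated assumptions, not code from the talk: the helper name `detectable_positions` is hypothetical, positions are counted from 0, and the two codewords are the second and fifth copies listed above.

```python
# Illustrative sketch only; the helper name and the choice of colluders are assumptions.
def detectable_positions(copies: list[str]) -> list[int]:
    """Return the 0-based positions where at least two of the given copies differ."""
    return [i for i, column in enumerate(zip(*copies)) if len(set(column)) > 1]


copy_2 = "0010010110101111101001010100110100100011010001100110100111111"
copy_5 = "0010010110101111101011010100110100100010010001100110100111111"

found = detectable_positions([copy_2, copy_5])

# The colluders may overwrite the positions they found with anything, here with "0".
forged_chars = list(copy_2)
for pos in found:
    forged_chars[pos] = "0"
forged = "".join(forged_chars)
```

In this run the pair discovers two positions of the code, while a third position on which their copies agree stays hidden from them. Writing 0 into both discovered positions reproduces the first copy in the list, so reading the ID directly off the forged copy would point at a participant who leaked nothing.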

Boneh-Shaw fingerprinting model
• A limited number of malicious participants (the pirates) collaborate to forge an untraceable copy of the document.
• They cannot find or change the positions of the code where all the codewords they hold agree: the Marking Assumption.
• They are not restricted in their output in any other way.
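
The Marking Assumption can be written as a simple check on a forged word. The Python sketch below is a minimal illustration, assuming codewords are strings of equal length; the function name `satisfies_marking_assumption` and the four-symbol toy codewords are hypothetical and do not come from the talk.

```python
# Illustrative sketch only; the function name and the toy codewords are assumptions.
def satisfies_marking_assumption(pirate_codewords: list[str], forged: str) -> bool:
    """True if forged agrees with the pirates wherever all pirate codewords agree."""
    if not pirate_codewords:
        raise ValueError("at least one pirate codeword is required")
    if any(len(word) != len(forged) for word in pirate_codewords):
        return False
    for column, symbol in zip(zip(*pirate_codewords), forged):
        # A position is undetectable when every pirate sees the same symbol there.
        if len(set(column)) == 1 and symbol != column[0]:
            return False
    return True


pirates = ["0101", "0110"]  # agree on positions 0 and 1, differ on positions 2 and 3

allowed = satisfies_marking_assumption(pirates, "0100")
forbidden = satisfies_marking_assumption(pirates, "1101")
```

The first forged word changes only positions where the pirates' codewords differ, so it is allowed. The second changes position 0, where both pirates hold the same symbol, so it violates the assumption.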

Boneh-Shaw fingerprinting model
[Diagram: Code generation → codewords of users; codewords of pirates → Pirate strategy → forged word → Tracing algorithm → identity of accused users]
• Code generation and the tracing algorithm are controlled by the distributor and have access to a random key.
• (Randomness and nonzero error are unavoidable.)
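
The three stages of the model can be lined up in a small Python sketch. Everything in it is an assumption made for illustration: the function names, the uniformly random binary code, the particular pirate strategy, and above all the nearest-codeword tracer, which is a naive placeholder and not the tracing algorithm discussed in the talk.

```python
# Illustrative sketch only; all names, parameters and strategies are assumptions.
import random


def generate_code(num_users: int, length: int, key: int) -> list[str]:
    """Distributor side: draw one random binary codeword per user from the secret key."""
    rng = random.Random(key)
    return ["".join(rng.choice("01") for _ in range(length)) for _ in range(num_users)]


def pirate_strategy(pirate_codewords: list[str], rng: random.Random) -> str:
    """Pirate side: at each position copy the symbol of a randomly chosen pirate.
    Positions where all pirate codewords agree are therefore left unchanged."""
    return "".join(rng.choice(column) for column in zip(*pirate_codewords))


def trace(code: list[str], forged: str) -> int:
    """Distributor side: accuse the user whose codeword is closest in Hamming distance.
    A naive placeholder tracer, used here only to complete the pipeline."""
    def distance(word: str) -> int:
        return sum(a != b for a, b in zip(word, forged))
    return min(range(len(code)), key=lambda user: distance(code[user]))


code = generate_code(num_users=8, length=64, key=2020)
pirates = [1, 4]
forged = pirate_strategy([code[user] for user in pirates], random.Random(7))
accused = trace(code, forged)
```

The distributor owns `generate_code` and `trace` and keeps `key` secret, while the pirates only see their own codewords. A tracer this simple can accuse a user outside the coalition, which is one way to see why the error probability has to be analyzed rather than assumed away.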
