releasing search queries and clicks privately
play

Releasing Search Queries and Clicks Privately Arne Bayer July 24, - PowerPoint PPT Presentation

A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Search Queries and Clicks Privately Arne Bayer July 24, 2017 Arne Bayer Releasing Search Queries and Clicks Privately A new Approach Releasing Data


  1. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Search Queries and Clicks Privately Arne Bayer July 24, 2017 Arne Bayer Releasing Search Queries and Clicks Privately

  2. A new Approach Releasing Data Select-Queries Noisy Counts Results Table of Contents 1 A new Approach 2 Releasing Data Releasing Algorithm 3 Select-Queries q ∗ ∈ D 1 q ∗ / ∈ D 1 Arbitary d 4 Noisy Counts 5 Results Arne Bayer Releasing Search Queries and Clicks Privately

  3. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Lists releasing anonymized lists tracing back possible Arne Bayer Releasing Search Queries and Clicks Privately

  4. A new Approach Releasing Data Select-Queries Noisy Counts Results Graphical Approach Let G ( E , V ) where: Vertices represent visited sites or search queries Edges represent links between sites Arne Bayer Releasing Search Queries and Clicks Privately

  5. A new Approach Releasing Data Select-Queries Noisy Counts Results D 10 B 5 11 10 3 P 13 10 C 9 4 4 M 10 L Arne Bayer Releasing Search Queries and Clicks Privately

  6. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D Arne Bayer Releasing Search Queries and Clicks Privately

  7. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D noise parameters as scale parameter for Laplace distribution b , b c , b q Arne Bayer Releasing Search Queries and Clicks Privately

  8. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D noise parameters as scale parameter for Laplace distribution b , b c , b q b general noise b c noise on clicked URLs b q noise on queires Arne Bayer Releasing Search Queries and Clicks Privately

  9. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D noise parameters as scale parameter for Laplace distribution b , b c , b q b general noise b c noise on clicked URLs b q noise on queires d maximum queries kept per user Arne Bayer Releasing Search Queries and Clicks Privately

  10. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D noise parameters as scale parameter for Laplace distribution b , b c , b q b general noise b c noise on clicked URLs b q noise on queires d maximum queries kept per user d c maximum URL clicks kept per user Arne Bayer Releasing Search Queries and Clicks Privately

  11. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D noise parameters as scale parameter for Laplace distribution b , b c , b q b general noise b c noise on clicked URLs b q noise on queires d maximum queries kept per user d c maximum URL clicks kept per user M ( q , D ) = # of times q appeared in a given search log D Arne Bayer Releasing Search Queries and Clicks Privately

  12. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D noise parameters as scale parameter for Laplace distribution b , b c , b q b general noise b c noise on clicked URLs b q noise on queires d maximum queries kept per user d c maximum URL clicks kept per user M ( q , D ) = # of times q appeared in a given search log D K minimum threshold of occurences M ( q , D ) + Lap ( b ) > K Arne Bayer Releasing Search Queries and Clicks Privately

  13. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm Parameters search log D noise parameters as scale parameter for Laplace distribution b , b c , b q b general noise b c noise on clicked URLs b q noise on queires d maximum queries kept per user d c maximum URL clicks kept per user M ( q , D ) = # of times q appeared in a given search log D K minimum threshold of occurences M ( q , D ) + Lap ( b ) > K K > d user limit smaller than threshold Arne Bayer Releasing Search Queries and Clicks Privately

  14. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm A Algorithm Releasing Algorithm A 1: Input: D , d , d c , b , b q , b c , K Arne Bayer Releasing Search Queries and Clicks Privately

  15. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm A Algorithm Releasing Algorithm A 1: Input: D , d , d c , b , b q , b c , K 2: Limit User Activity: Keep only d entries per user in D Arne Bayer Releasing Search Queries and Clicks Privately

  16. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm A Algorithm Releasing Algorithm A 1: Input: D , d , d c , b , b q , b c , K 2: Limit User Activity: Keep only d entries per user in D 3: Count Queries calculate absolute commonness of all Queries q : M ( q , D ) Arne Bayer Releasing Search Queries and Clicks Privately

  17. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm A Algorithm Releasing Algorithm A 1: Input: D , d , d c , b , b q , b c , K 2: Limit User Activity: Keep only d entries per user in D 3: Count Queries calculate absolute commonness of all Queries q : M ( q , D ) 4: Select-Queries: add all Queries to Q that exceed K : Q ← { q : M ( q , D ) + Lap ( b ) > K } Arne Bayer Releasing Search Queries and Clicks Privately

  18. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm A Algorithm Releasing Algorithm A 1: Input: D , d , d c , b , b q , b c , K 2: Limit User Activity: Keep only d entries per user in D 3: Count Queries calculate absolute commonness of all Queries q : M ( q , D ) 4: Select-Queries: add all Queries to Q that exceed K : Q ← { q : M ( q , D ) + Lap ( b ) > K } add fuzziness to to Queries: 5: Get-Query-Counts: � q , M ( q , D ) + Lap ( b q ) � Arne Bayer Releasing Search Queries and Clicks Privately

  19. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Algorithm A Algorithm Releasing Algorithm A 1: Input: D , d , d c , b , b q , b c , K 2: Limit User Activity: Keep only d entries per user in D 3: Count Queries calculate absolute commonness of all Queries q : M ( q , D ) 4: Select-Queries: add all Queries to Q that exceed K : Q ← { q : M ( q , D ) + Lap ( b ) > K } add fuzziness to to Queries: 5: Get-Query-Counts: � q , M ( q , D ) + Lap ( b q ) � 6: Get-Click-Counts: calculate top ten clicks and add fuzziness: � q , u , # u q + Lap ( b c ) � Arne Bayer Releasing Search Queries and Clicks Privately

  20. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Privacy Guarantee � � � � A ( D 1 ) ∈ ˆ A ( D 2 ) ∈ ˆ Pr D ≤ α Pr D + δ 1 (1) Arne Bayer Releasing Search Queries and Clicks Privately

  21. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Privacy Guarantee � � � � A ( D 1 ) ∈ ˆ A ( D 2 ) ∈ ˆ Pr D ≤ α Pr D + δ 1 (1) � � � � A ( D 2 ) ∈ ˆ A ( D 1 ) ∈ ˆ Pr D ≤ α Pr D + δ 1 (2) Arne Bayer Releasing Search Queries and Clicks Privately

  22. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Privacy Guarantee � � � � A ( D 1 ) ∈ ˆ A ( D 2 ) ∈ ˆ Pr D ≤ α Pr D + δ 1 (1) � � � � A ( D 2 ) ∈ ˆ A ( D 1 ) ∈ ˆ Pr D ≤ α Pr D + δ 1 (2) ǫ alg = d · ln ( α ) + d / b q + d c / b c � d − K δ alg = d � 2 exp b � � e 1 / b , 1 + 1 with α = max 2 e ( K − 1) / b − 1 Arne Bayer Releasing Search Queries and Clicks Privately

  23. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Steps to prove, that privacy is garanteed Arne Bayer Releasing Search Queries and Clicks Privately

  24. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Steps to prove, that privacy is garanteed 4 Select-Queries Limit User: d = 1 Arbitary d Arne Bayer Releasing Search Queries and Clicks Privately

  25. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Steps to prove, that privacy is garanteed 4 Select-Queries Limit User: d = 1 Arbitary d 5 Get-Query-Counts Arne Bayer Releasing Search Queries and Clicks Privately

  26. A new Approach Releasing Data Select-Queries Noisy Counts Results Releasing Algorithm Steps to prove, that privacy is garanteed 4 Select-Queries Limit User: d = 1 Arbitary d 5 Get-Query-Counts 6 Get-Click-Counts Arne Bayer Releasing Search Queries and Clicks Privately

  27. A new Approach Releasing Data Select-Queries Noisy Counts Results Part 1: d = 1 limit of 1 query per user Arne Bayer Releasing Search Queries and Clicks Privately

  28. A new Approach Releasing Data Select-Queries Noisy Counts Results Part 1: d = 1 limit of 1 query per user let D 1 and D 2 differ in exactly q ∗ Arne Bayer Releasing Search Queries and Clicks Privately

  29. A new Approach Releasing Data Select-Queries Noisy Counts Results Part 1: d = 1 limit of 1 query per user let D 1 and D 2 differ in exactly q ∗ q ∗ ∈ D 1 and q ∗ ∈ D 2 Arne Bayer Releasing Search Queries and Clicks Privately

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend