Towards Human Interactive Proofs in the Text-Domain Richard - PowerPoint PPT Presentation

Towards Human Interactive Proofs in the Text-Domain Richard Bergmair University of Derby in Austria and Stefan Katzenbeisser Technische Universität München Institut für Informatik Towards Human Interactive Proofs in the Text-Domain – p.1/29

Introduction & Prior Work Many serious threats to Information Security rely on attacks that can only be carried out by computers, not by humans: • manipulation of online polls • bulk subscription to web-services • distribution of spam and worms • privacy infringement by unwanted data mining • denial-of-service attacks • dictionary attacks Towards Human Interactive Proofs in the Text-Domain – p.2/29

Introduction & Prior Work Moni Naor. Verification of a human in the loop or identification via the turing test. Unpublished Manuscript. http://www.wisdom.weizmann.ac.il/~naor/ PAPERS/human.ps , 1997. Towards Human Interactive Proofs in the Text-Domain – p.3/29

Introduction & Prior Work Luis von Ahn, Manuel Blum, Nicholas J. Hopper, and John Langford. CAPTCHA: using hard ai problems for security. In Advances in Cryptology, Eurocrypt 2003 , May 2003. Towards Human Interactive Proofs in the Text-Domain – p.4/29

Introduction & Prior Work Luis von Ahn, Manuel Blum, Nicholas J. Hopper, and John Langford. CAPTCHA: using hard ai problems for security. In Advances in Cryptology, Eurocrypt 2003 , May 2003. Towards Human Interactive Proofs in the Text-Domain – p.5/29

Introduction & Prior Work Luis von Ahn, Manuel Blum, and John Langford. Telling humans and computers apart automatically. Communications of the ACM , 47(2):56–60, 2004. Towards Human Interactive Proofs in the Text-Domain – p.6/29

Introduction & Prior Work Unpublished Abstract from First Workshop on Human Interactive Proofs , January 2002. Towards Human Interactive Proofs in the Text-Domain – p.7/29

Sense Ambiguity George A. Miller, Richard Beckwith, Christiane Fellbaum, Derek Gross, and Katherine Miller. Introduction to WordNet: An on-line lexical database. http://www.cogsci.princeton.edu/~wn/5papers.ps , August 1993. Towards Human Interactive Proofs in the Text-Domain – p.8/29

Sense Ambiguity • It should move through several more drafts. • It should run through several more drafts. • It should go through several more drafts. • All articles must move through copy-editing. • All articles must run through copy-editing. • All articles must go through copy-editing. syn ( move ) = { move , run , go } ?? Towards Human Interactive Proofs in the Text-Domain – p.9/29

Sense Ambiguity • That sermon will move people. • That sermon will impress people. • That sermon will strike people. • Your speech must move the audience. • Your speech must impress the audience. • Your speech must strike the audience. syn ( move ) = { move , impress , strike } ?? Towards Human Interactive Proofs in the Text-Domain – p.10/29

Sense Ambiguity Can we conclude that all these words are generally synonymous to move? syn ( move ) = { move , run , go , impress , strike } Unfortunately, we can’t. Towards Human Interactive Proofs in the Text-Domain – p.11/29

Sense Ambiguity • It should move through several more drafts. • It should run through several more drafts. • It should go through several more drafts. BUT • Your speech must move the audience. • * Your speech must run the audience. • * Your speech must go the audience. Towards Human Interactive Proofs in the Text-Domain – p.12/29

Sense Ambiguity • That sermon will move people. • That sermon will impress people. • That sermon will strike people. BUT • All articles must move through copy-editing. • * All articles must impress through copy-editing. • * All articles must strike through copy-editing. Towards Human Interactive Proofs in the Text-Domain – p.13/29

Sense Ambiguity George A. Miller, Richard Beckwith, Christiane Fellbaum, Derek Gross, and Katherine Miller. Introduction to WordNet: An on-line lexical database. http://www.cogsci.princeton.edu/~wn/5papers.ps , August 1993. Towards Human Interactive Proofs in the Text-Domain – p.14/29

Sense Ambiguity We cannot include a synset like syn ( move ) = { move , run , go , impress , strike } in a dictionary! All we can do is to state that syn ( c 1 , move ) = { move , run , go } syn ( c 2 , move ) = { move , impress , strike } for some linguistic contexts c 1 � = c 2 . Towards Human Interactive Proofs in the Text-Domain – p.15/29

Sense Ambiguity Pick the sentences that are meaningful replacements of each other: � It should move through several more drafts. � It should run through several more drafts. � It should go through several more drafts. � It should impress through several more drafts. � It should strike through several more drafts. syn ( c 1 , move ) = { move , run , go } , or syn ( c 2 , move ) = { move , impress , strike } ? Towards Human Interactive Proofs in the Text-Domain – p.16/29

Sense Ambiguity The problem of automatic word-sense disambiguation has been under investigation in a computational context • since the 1950s and is of central importance for • machine translation • text mining • spell checking • text classification • ... Towards Human Interactive Proofs in the Text-Domain – p.17/29

Sense Ambiguity Rada Mihalcea, Timothy Chklovski, and Adam Kilgarriff. The senseval-3 english lexical sample task. In Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text , pages 25–28, Barcelona, Spain, July 2004. Towards Human Interactive Proofs in the Text-Domain – p.18/29

Sense Ambiguity We have introduced sense ambiguity making use of a function syn : C × W �→ 2 W that assigns to a word w ∈ W used in context c ∈ C the set s ⊂ W of all words that are correct replacements of w . We have presented evidence to suggest that no machine can reproduce syn with high accuracy. Humans can produce an annotation, by hand-crafting a table of associations sa ⊂ syn , such that | sa | ≪ | syn | . Towards Human Interactive Proofs in the Text-Domain – p.19/29

Lexical HIP What do we need? • A public lexicon of words organized into sets of words that are synonymous in some linguistic context. (like WordNet) • A corpus: A set of sentences that contain words also contained in multiple synsets of the dictionary. • An initially hand-craftet secret annotation sa that is a subset of syn . Towards Human Interactive Proofs in the Text-Domain – p.20/29

Lexical HIP: Generation Phase  � c It should move through ...    � c 1 ∈ R ( c )  It should run through ...    � c 2 ∈ R ( c ) t 1 It should go through ...  � c 3 ∈ Q ( c ) It should impress through ...      � c 4 ∈ Q ( c ) It should strike through ...   � d We’ll send your order ...   t 2 � d 1 ∈ R ( d ) We’ll ship your order ...  � d 2 ∈ Q ( d ) We’ll broadcast your order ...  Towards Human Interactive Proofs in the Text-Domain – p.21/29

Lexical HIP: Testing Phase  � c It should move through ...    � c 1 ∈ R ( c )  It should run through ...    � c 2 ∈ R ( c ) t 1 It should go through ...  � c 3 ∈ Q ( c ) It should impress through ...      � c 4 ∈ Q ( c ) It should strike through ...   � d We’ll send your order ...   t 2 � d 1 ∈ R ( d ) We’ll ship your order ...  � d 2 ∈ Q ( d ) We’ll broadcast your order ...  Towards Human Interactive Proofs in the Text-Domain – p.22/29

Lexical HIP: Verification Phase √  � c It should move through ...    � c 1 ∈ R ( c ) ×  It should run through ...   √  � c 2 ∈ R ( c ) t 1 It should go through ...  � c 3 ∈ Q ( c ) × It should impress through ...   √    � c 4 ∈ Q ( c ) It should strike through ...  √  � d We’ll send your order ...  √  t 2 � d 1 ∈ R ( d ) We’ll ship your order ... √  � d 2 ∈ Q ( d ) We’ll broadcast your order ...  Towards Human Interactive Proofs in the Text-Domain – p.23/29

Lexical HIP: Learning We have to trust in sa to be private at any time. If we hand-craft it once , it will soon loose this property because whenever an association is used it is in fact published to the testee and to the adversary. We have to think about sa as a dynamic resource, where we have to • add new private associations • remove associations if they are published Towards Human Interactive Proofs in the Text-Domain – p.24/29

Lexical HIP: Learning Phase √  � c We’ll send your order ...  √   � c 1 ∈ R ( c )  We’ll ship your order ...   √   � c 2 ∈ Q ( c ) We’ll broadcast your order ...      √  � d ∈ P ( c ) t 2 We’ll cough your order ...    � e ∈ P ( c ) ?  We’ll take your order ...     � e 1 ∈ Q ( e ) ? We’ll accept your order ...      � e 1 ∈ Q ( e ) ? We’ll hire your order ...  Towards Human Interactive Proofs in the Text-Domain – p.25/29

Towards Human Interactive Proofs in the Text-Domain Richard - PowerPoint PPT Presentation

Towards Human Interactive Proofs in the Text-Domain Richard Bergmair University of Derby in Austria and Stefan Katzenbeisser Technische Universitt Mnchen Institut fr Informatik Towards Human Interactive Proofs in the Text-Domain

Interactive Proofs Lecture 18 AM 1 Interactive Proofs 2 Interactive Proofs IP[k] 2

Zero-Knowledge Proofs Lecture 15 Interactive Proofs Interactive Proofs Interactive Proofs

Zero-Knowledge Proofs 1 Zero-Knowledge Proofs Lecture 15 1 Interactive Proofs 2 Interactive

Welcome to PROOFS! PROOFS: Security Proofs for Embedded Systems Introduction to the fifth

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat

CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here

Foundation of Cryptography (0368-4162-01), Lecture 5 Interactive Proofs and Zero Knowledge

Post-Conference Presentation Sunday Oladayo Oladejo Table of Content A Introduction B

Lecture 5: SOS Proofs and the Motzkin Polynomial Lecture Outline Part I: SOS proofs and

Proofs, Continued Today Proofs : A style guide Proofs should be easy to

Interactive Proof System We have seen interactive proofs, in various disguised forms, in the

Enhancing ICANN Text Accountability 26 June 2014 Text #ICANN50 Text #ICANN50 Text #ICANN50

Add Your Title Here Replace your text here! Replace your text here! Insert your title here 1

Text Text #ICANN51 15 October 2014 Text Text IDN Root Zone LGR Sarmad Hussain IDN Program

Text Text #ICANN51 Contractual Compliance Text Text Contractual Compliance Update

Safer Pass the salt 2020 Together. Open Source Collaborative Dynamic Security engine Crowd

ONLINE ACTIVISM, TECH LESSONS DAVID FIALA SPRING 2014 WICS LIGHTNING TALK BACKGROUND July

Data-transformer library Michael Raskin Moscow Center for Continuous Mathematical Education

What's next for adversarial ML? (and why ad-blockers should care) Florian Tramr EPFL July 9 th

CAP Theorem From CAP 12 Years Later: How The Rules

NoSQL Introduction CS 377: Database Systems Recap: Data Never Sleeps

Feasibility of Consistent, Feasibility of Consistent, Feasibility of Consistent, Feasibility of

Eric Brewer Professor, UC Berkeley VP Infrastructure, Google QCon SF November 8, 2012 Charles

Sambuz

Useful Links

Newsletter

Mail Us