Matryoshka: Hiding Secret Communication in Plain Sight (Iris Safaka, Christina Fragouli, Katerina Argyraki)



SLIDE 1

Matryoshka: Hiding Secret Communication in Plain Sight

Iris Safaka, Christina Fragouli, Katerina Argyraki

SLIDE 2

  • Free communication systems → Give away some privacy
SLIDE 3

Wanna play tennis today?

  • Free communication systems → Give away some privacy
SLIDE 4

Rackets today only $5, click here!

Wanna play tennis today?

  • Free communication systems → Give away some privacy
  • Users are mostly aware of this trade-off
SLIDE 5

I don't like the government

SLIDE 6

I don't like the government

Alice likes tennis but not the government...

We want your user data

  • Governments and courts request user data from tech companies

– E.g., Google handed over data for 100K user accounts (2014)

SLIDE 7

Alice Bob Eve

Messaging Provider

  • Alice and Bob wish to communicate privately
  • Eve always wants to know what they talk about

Encryption?

SLIDE 8

Alice Bob Eve

Messaging Provider

  • Alice and Bob wish to communicate privately
  • Eve always wants to know what they talk about

Encryption? Interruption of free service

SLIDE 9

Alice Bob Eve

Messaging Provider

  • Alice and Bob wish to communicate privately
  • Eve always wants to know what they talk about

Encryption? Looking suspicious

Let's check them out

How about hiding the secret communication?

SLIDE 10

Steganography

  • Hide secret data within other “innocent” data

I love you

Alice Bob Eve

Messaging Provider Stego Stego

I love you

SLIDE 11

Steganography

  • Hide secret data within other “innocent” data

I love you

Alice Bob Eve

Messaging Provider Stego Stego

I love you

Alice likes cats

SLIDE 12

Linguistic steganography

  • Traditional approaches apply automated modifications

– Embed secret message into a given text
– E.g., synonym substitution, sentence manipulation, etc.

  • Drawbacks

– Introduce unnaturalness to the text
– Require off-line access to resources
– Modest covert rates

Our goal: human-like text, implementable, high rate
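
To make the traditional approach concrete, here is a minimal sketch of synonym substitution in Python. It is not taken from the talk: the synonym pairs and the function names `embed` and `extract` are hypothetical, and the scheme assumes sender and receiver share the same pair table. Each substitutable word carries one bit, which illustrates why covert rates are modest.

```python
# Hypothetical shared table: each carrier word maps to its (bit 0, bit 1) pair.
PAIRS = {
    "big": ("big", "large"), "large": ("big", "large"),
    "quick": ("quick", "fast"), "fast": ("quick", "fast"),
}

def embed(cover, bits):
    """Hide a bitstring by choosing one synonym per carrier word."""
    remaining = list(bits)
    out = []
    for word in cover.split():
        pair = PAIRS.get(word)
        if pair and remaining:
            out.append(pair[int(remaining.pop(0))])
        else:
            out.append(word)
    if remaining:
        raise ValueError("cover text has too few carrier words for the payload")
    return " ".join(out)

def extract(stego):
    """Read one bit back from every carrier word (assumes the payload
    used every carrier word in the cover text)."""
    return "".join(str(PAIRS[w].index(w)) for w in stego.split() if w in PAIRS)

stego = embed("a big dog ran quick", "10")
print(stego, "->", extract(stego))
```

The five-word cover text above carries only two bits, and a forced synonym choice can easily read as unnatural.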

SLIDE 13

Matryoshka

SLIDE 14

Alice Bob Eve

Messaging Provider

Alice: I love you → Compression → 01100111 → Bits to words → nice weather → User enhancement → Such a nice weather today!

Bob: Such a nice weather today! → Text cleaning → nice weather → Words to bits → 01100111 → Decompression → I love you

Challenge: minimize user intervention

SLIDE 15

I love you 00011111

nice play cool cool weather run

nice weather Such a nice weather today!

Mixed Huffman Coding

Text Corpus

N-gram Language Model

nice weather run 0.8 0.1

Dictionary

0001 → cat, cook, nice
0000 → nice, play, cool
1111 → cool, weather, run
...

User Enhancement Interface
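
The bits-to-words step in the diagram above can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the dictionary entries are copied from the slide, but the fixed 4-bit block size, the greedy word choice, the function name `bits_to_words`, and all bigram probabilities are hypothetical.

```python
# Dictionary entries as shown on the slide: bit block -> set of words.
DICTIONARY = {
    "0001": ["cat", "cook", "nice"],
    "0000": ["nice", "play", "cool"],
    "1111": ["cool", "weather", "run"],
}

# Hypothetical bigram model P(word | previous word); "<s>" marks sentence start.
BIGRAM = {
    ("<s>", "nice"): 0.5,
    ("<s>", "cat"): 0.3,
    ("<s>", "cook"): 0.2,
    ("nice", "weather"): 0.8,
    ("nice", "run"): 0.1,
    ("nice", "cool"): 0.1,
}

def bits_to_words(bits, block=4):
    """Greedily pick, for each bit block, the candidate word the bigram
    model considers most likely after the previously chosen word."""
    if len(bits) % block:
        raise ValueError("bitstring length must be a multiple of the block size")
    words, prev = [], "<s>"
    for i in range(0, len(bits), block):
        candidates = DICTIONARY[bits[i:i + block]]
        best = max(candidates, key=lambda w: BIGRAM.get((prev, w), 0.0))
        words.append(best)
        prev = best
    return words

print(bits_to_words("00011111"))  # ['nice', 'weather']
```

The user then only has to wrap the chosen words in a natural sentence, for example "Such a nice weather today!".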

SLIDE 16

Encoder design

  • Mixed Huffman Compression

– Character Huffman → names, unusual words, etc.
– Word Huffman → frequent English words

  • Dictionary

– Maps bit sequences to sets of words
– Favors frequent over infrequent words; words may repeat across entries

  • N-gram Language Model

– Models how dictionary words appear in Natural Language

  • User Enhancement Interface

– Assists the user in completing the sentences
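
One way to read "mixed" Huffman compression is a word-level code for frequent words with a character-level fallback for everything else. The sketch below follows that reading. The one-bit flag that selects the codebook, the end-of-word marker, and the frequency tables are all assumptions made for illustration; the slides do not specify how the two codes are combined.

```python
import heapq
from itertools import count

def huffman_code(freqs):
    """Build a prefix-free code {symbol: bitstring}; needs >= 2 symbols."""
    tie = count()  # tie-breaker so the heap never compares dicts
    heap = [(f, next(tie), {s: ""}) for s, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, c1 = heapq.heappop(heap)
        f2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + b for s, b in c1.items()}
        merged.update({s: "1" + b for s, b in c2.items()})
        heapq.heappush(heap, (f1 + f2, next(tie), merged))
    return heap[0][2]

END = "\n"  # hypothetical end-of-word marker for character mode
WORD_FREQS = {"the": 60, "i": 50, "you": 40, "love": 10}  # made-up counts
CHAR_FREQS = {c: 1 for c in "abcdefghijklmnopqrstuvwxyz"}
CHAR_FREQS[END] = 5

WORD_CODE = huffman_code(WORD_FREQS)
CHAR_CODE = huffman_code(CHAR_FREQS)
INV_WORD = {b: w for w, b in WORD_CODE.items()}
INV_CHAR = {b: c for c, b in CHAR_CODE.items()}

def encode(message):
    """Flag '1' + word code for known words, flag '0' + character codes otherwise.
    Only lowercase a-z is supported in character mode."""
    out = []
    for word in message.lower().split():
        if word in WORD_CODE:
            out.append("1" + WORD_CODE[word])
        else:
            out.append("0" + "".join(CHAR_CODE[ch] for ch in word + END))
    return "".join(out)

def _read_symbol(bits, i, inverse):
    """Read one codeword starting at position i; return (symbol, next index)."""
    for j in range(i + 1, len(bits) + 1):
        if bits[i:j] in inverse:
            return inverse[bits[i:j]], j
    raise ValueError("truncated or invalid bitstring")

def decode(bits):
    words, i = [], 0
    while i < len(bits):
        flag, i = bits[i], i + 1
        if flag == "1":
            word, i = _read_symbol(bits, i, INV_WORD)
        else:
            chars = []
            while True:
                ch, i = _read_symbol(bits, i, INV_CHAR)
                if ch == END:
                    break
                chars.append(ch)
            word = "".join(chars)
        words.append(word)
    return " ".join(words)

bits = encode("I love you Matryoshka")
print(len(bits), decode(bits))
```

Frequent words cost a few bits each, while a name such as "Matryoshka" falls back to the longer character-level encoding.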

SLIDE 17

Decoder design

  • Repeating words in the dictionary creates ambiguity
  • Probabilistic decoder

– K-order Markov model of English characters
– Drops improbable sequences early

Dictionary

0001 → cat, cook, nice
0000 → nice, play, cool
1111 → cool, weather, run
...

Such a nice weather today!

Candidate bit sequences: 00011111, 00001111
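
The ambiguity in this example can be reproduced with a short sketch. The dictionary is the one on the slide; everything else is assumed for illustration: `clean` simply drops tokens that are not dictionary words, and `toy_score` is a placeholder standing in for the K-order character Markov model, which in a real decoder would score the decompressed text of each candidate.

```python
import re
from itertools import product

DICTIONARY = {
    "0001": ["cat", "cook", "nice"],
    "0000": ["nice", "play", "cool"],
    "1111": ["cool", "weather", "run"],
}

# Invert the dictionary: word -> every bit block that could have produced it.
WORD_TO_BITS = {}
for block, words in DICTIONARY.items():
    for word in words:
        WORD_TO_BITS.setdefault(word, []).append(block)

def clean(text):
    """Text cleaning (assumed rule): keep only tokens that are dictionary words."""
    return [t for t in re.findall(r"[a-z']+", text.lower()) if t in WORD_TO_BITS]

def candidate_bitstrings(words):
    """Enumerate every bit sequence consistent with the received words."""
    return ["".join(combo) for combo in product(*(WORD_TO_BITS[w] for w in words))]

def toy_score(bits):
    """Placeholder score: penalize block 0000. A real decoder would use the
    probability of the decompressed characters under a Markov model."""
    return -sum(bits[i:i + 4] == "0000" for i in range(0, len(bits), 4))

def beam_decode(words, score, beam_width=2):
    """Extend bit prefixes word by word, dropping low-scoring prefixes early."""
    beams = [""]
    for word in words:
        extended = [prefix + block for prefix in beams for block in WORD_TO_BITS[word]]
        beams = sorted(extended, key=score, reverse=True)[:beam_width]
    return beams[0]

words = clean("Such a nice weather today!")
print(words)                        # ['nice', 'weather']
print(candidate_bitstrings(words))  # ['00011111', '00001111']
print(beam_decode(words, toy_score, beam_width=1))
```

Pruning per word keeps the number of live candidates bounded, whereas full enumeration grows multiplicatively with every repeated dictionary word.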

SLIDE 18

Evaluation

  • Experimentation with human users in Amazon's Mechanical Turk

“I have become tired of facebook's many years of existence. The change over the years by the engineers sucks. It seems facebook's wacky algorithm will never make sense. The posts make the code on facebook obsolete.”

“Does facebook's CEO feed people feed dogs. Can't yet use data base set book. Two posts are uses people facebook apps. Mary Cox able humans into keeping up.”

SLIDE 19

Evaluation

  • Experimentation with human users in Amazon's Mechanical Turk
  • User effort

– Average task completion time approx. 5 min
– Average of 5 extra words inserted per sentence

  • End-to-end covert rate

– Average 3 bits per word
– E.g., to hide 5 words we need to send 73 words

  • Decoder error rate

– Zero error rate (~95%)
– Partially corrupted messages (~15% chars.)

SLIDE 20

Evaluation

  • Automatic test: Is a sentence natural language (NL) or not?
SLIDE 21

Summary

  • Linguistic steganography for reclaiming some privacy
  • Human-like text, implementable, high covert rate
  • Prototype implementation
  • Experimentation on Mechanical Turk
  • Automated steganalysis test
SLIDE 22

Next steps

  • Investigate alternative automated steganalysis tests

– E.g., using Word Embeddings

  • Identify further vulnerabilities and test
  • Finalize system implementation

Questions?