Matryoshka: Hiding Secret Communication in Plain Sight
Iris Safaka, Christina Fragouli, Katerina Argyraki
Free communication systems
Give away some privacy
2
3
Wanna play tennis today?
4
Rackets today only $5.…, click here!
Wanna play tennis today?
5
I don't like the government
6
Alice likes tennis but not the government...
We want your user data
I don't like the government
– E.g., Google handed over data for 100K user accounts (2014)
7
Alice Bob Eve
Messaging Provider
Encryption?
8
Alice Bob
Encryption? Interruption of free service
Messaging Provider
Eve
9
Alice Bob Eve
Encryption? Looking suspicious
How about hiding the secret communication?
Messaging Provider
Let's check them out
10
I love you
Alice Bob Eve
Messaging Provider Stego Stego
I love you
11
I love you
Alice Bob Eve
Messaging Provider Stego Stego
I love you
Alice likes cats
12
– Embed secret message into a given text
– E.g., synonym substitution, sentence manipulation, etc.
– Introduce unnaturalness to the text
– Require off-line access to resources
– Modest covert rates
Our goal: human-like text, implementable, high rate
13
14
I love you
Alice Bob Eve
Compression Decompression Words to bits Text cleaning
01100111
Bits to words
nice weather Such a nice weather today! nice weather 01100111 I love you
User enhancement
Challenge: minimize user intervention
Messaging Provider
15
I love you 00011111
nice play cool cool weather run
nice weather Such a nice weather today!
Mixed Huffman Coding
Text Corpus
N-gram Language Model
nice weather run 0.8 0.1
Dictionary
0001: cat, cook, nice
0000: nice, play, cool
1111: cool, weather, run
...
User Enhancement Interface
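The encoder path on this slide (compressed bits, then dictionary lookup, then a language model choosing among candidate words) can be sketched roughly as below. This is only a minimal illustration: the dictionary entries are the three shown on the slide, while the bigram scores, `DEFAULT_SCORE`, and all function names are hypothetical stand-ins for the real n-gram language model, and the exhaustive search is only practical for very short inputs.

```python
from itertools import product

# Toy dictionary from the slide: each 4-bit block maps to a set of words.
DICTIONARY = {
    "0001": ["cat", "cook", "nice"],
    "0000": ["nice", "play", "cool"],
    "1111": ["cool", "weather", "run"],
}

# Hypothetical bigram scores standing in for the n-gram language model.
BIGRAM_SCORE = {("nice", "weather"): 0.8, ("nice", "run"): 0.1}
DEFAULT_SCORE = 0.01  # assumed score for any unseen word pair


def split_blocks(bits, size=4):
    """Cut the bit string into fixed-size blocks (toy assumption: 4 bits)."""
    if len(bits) % size != 0:
        raise ValueError("bit string length must be a multiple of the block size")
    return [bits[i:i + size] for i in range(0, len(bits), size)]


def sequence_score(words):
    """Multiply the bigram scores of consecutive word pairs."""
    score = 1.0
    for prev, cur in zip(words, words[1:]):
        score *= BIGRAM_SCORE.get((prev, cur), DEFAULT_SCORE)
    return score


def bits_to_words(bits):
    """Pick one word per block so that the whole sequence scores highest."""
    candidates = [DICTIONARY[block] for block in split_blocks(bits)]
    return list(max(product(*candidates), key=sequence_score))


print(bits_to_words("00011111"))
```

With these made-up scores, `bits_to_words("00011111")` selects "nice weather", which the user then extends into a sentence such as "Such a nice weather today!".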
16
– Character Huffman → names, unusual words, etc.
– Word Huffman → frequent English words
– Maps bit sequences to sets of words
– More frequent than infrequent words & repetitions
– Models how dictionary words appear in Natural Language
– Assist the user in completing the sentences
17
– K-order Markov model of English characters
– Drops improbable sequences early
Dictionary
0001: cat, cook, nice
0000: nice, play, cool
1111: cool, weather, run
...
Such a nice weather today!
00011111 00001111
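The ambiguity on this slide arises because a word such as "nice" belongs to more than one dictionary set, so the receiver recovers several candidate bit strings. A minimal sketch of that step follows. It assumes that text cleaning simply drops every token that is not a dictionary word, which may differ from the actual system, and the function names are hypothetical. Ranking the candidates with the character-level Markov model is left out.

```python
from itertools import product

# Toy dictionary from the slide: each 4-bit block maps to a set of words.
DICTIONARY = {
    "0001": ["cat", "cook", "nice"],
    "0000": ["nice", "play", "cool"],
    "1111": ["cool", "weather", "run"],
}


def invert(dictionary):
    """Map each word to every bit block whose set contains it."""
    inverse = {}
    for code, words in dictionary.items():
        for word in words:
            inverse.setdefault(word, []).append(code)
    return inverse


def clean_text(text, known_words):
    """Assumed cleaning rule: keep only tokens that are dictionary words."""
    tokens = (tok.strip(".,!?;:\"'").lower() for tok in text.split())
    return [tok for tok in tokens if tok in known_words]


def candidate_bitstrings(text, dictionary=DICTIONARY):
    """Enumerate every bit string consistent with the received text."""
    inverse = invert(dictionary)
    words = clean_text(text, inverse)
    options = [inverse[word] for word in words]
    return ["".join(codes) for codes in product(*options)]


print(candidate_bitstrings("Such a nice weather today!"))
```

For the slide's sentence this yields the two candidates 00011111 and 00001111. The decoder would then decompress each one and keep the sequence the Markov model finds plausible.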
18
“I have become tired of facebook's many years of existence. The change over the years by the engineers sucks. It seems facebook's wacky algorithm will never make sense. The posts make the code on facebook obsolete.”
“Does facebook's CEO feed people feed dogs. Can't yet use data base set book. Two posts are uses people facebook apps. Mary Cox able humans into keeping up.”
19
– Average task completion time approx. 5 mins
– Average of 5 extra words inserted per sentence
– Average 3 bits per word
– E.g., to hide 5 words we need to send 73 words
– Zero error rate (~95%)
– Partially corrupted messages (~15% chars.)
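As a rough reading of the rate figures above: the slide does not say whether the 73 sent words include the words inserted by the user, so the following arithmetic assumes, for illustration only, that all 73 words carry 3 bits each.

```latex
\[
  73 \text{ words} \times 3 \,\tfrac{\text{bits}}{\text{word}} = 219 \text{ bits},
  \qquad
  \frac{219 \text{ bits}}{5 \text{ hidden words}} \approx 44 \,\tfrac{\text{bits}}{\text{hidden word}},
  \qquad
  \frac{5}{73} \approx 0.068 .
\]
```

Under that assumption, roughly 7% of the transmitted words correspond to hidden words.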
20
21
22
– E.g., using word embeddings