Rendezvous: A search engine for binary code
Wei Ming Khoo, Alan Mycroft, Ross Anderson University of Cambridge MSR 2013 19 May 2013 Demo: http://www.rendezvousalpha.com
1
Rendezvous: A search engine for binary code Wei Ming Khoo, Alan - - PowerPoint PPT Presentation
Rendezvous: A search engine for binary code Wei Ming Khoo, Alan Mycroft, Ross Anderson University of Cambridge MSR 2013 19 May 2013 Demo: http://www.rendezvousalpha.com 1 To audit or not to audit You cant trust code that you did not
Wei Ming Khoo, Alan Mycroft, Ross Anderson University of Cambridge MSR 2013 19 May 2013 Demo: http://www.rendezvousalpha.com
1
You can’t trust code that you did not totally create yourself (Ken Thompson, 1984)
Microsoft (Win7 USB/DVD download tool 2009) included
2
Software RE is sometimes necessary for audit
execute” [Balakrishnan, Reps 2005]
expertise
[VanEmmerik’04]
3
And increasingly so due to advances in software mining and SBSE
application complexity, quality of reusable components [Schmidt’99, ’00, ’06]
base was external [Haefliger’08]
compliant, most famously Linksys WRT54G
4
5
necessary for audit
search problem, relying on existing software to obtain source code
compiler-agnostic?
6
7
analysis (Assumption: no obfuscation)
that can be extracted directly from a disassembly:
8
Executable
Disassemble Tokenise Token-specific processing
Disassembly Mnemonic n-grams Alphabetic strings (Query terms) Control-flow sub-graphs
9
10
nodes, compute them all (k = 3, 4, 5, 6, 7)
canonical form, rep as string (Nauty graph library)
11
12
13
corpus alphabetic strings alphabetic strings
14
Combining features increases F2, implying independence
for audit
search problem
combining mnemonics, k-graphs and constants http://www.rendezvousalpha.com
15