Hash Pile Ups: Using Collisions to Identify Unknown Hash Functions - PowerPoint PPT Presentation

Hash Pile Ups: Using Collisions to Identify Unknown Hash Functions R. Joshua Tobin and David Malone 11 October 2012

Hash Functions We are talking about hash functions for consistent assignment. For example, • Hash tables, • Network balancing packets (CEF, LAG, ECMP), • Service load balancing (BIG-IP), • Packets to CPUs (Microsoft RSS), • etc. These are not usually cryptographic strength! Collisions relatively easy to find.

Outline 1. Background motivation. 2. Idea — learning and generating collisions. 3. 3 examples 3.1 the hash, 3.2 the attack, 3.3 the results. 4. Conclusion. There is an analysis of each attack in the paper.

Background Motivation • Algorithmic Complexity Attacks (Crosby and Wallach, 2003). • Some algorithms have different typical and worst case. • Attack by choosing input to be worst case. • Can be applied to hash tables, sorting, string matching, . . . • Hashes are canonical examples.

Demonstration attack 60 Random Attack Complexity Attack 50 40 Packets Forwarded (pps) 30 20 10 0 0 5 10 15 20 Time (s)

How to Fix? • In general use algorithm with good worst case. • Hash functions too useful though. • Using crypto-strength hashes often too slow? • What happens if the hash used is a secret? Choose your hash randomly from a family on startup. (Advisories still being released on this issues.)

Hash Costs 16 Xor Jenkins Pearson Universal 14 MD5 SHA SHA256 12 10 CPU Time (us) 8 6 4 2 0 Geode Core 2 Duo Athlon 64 Xeon Atom 500MHz 2.66GHz 2.6GHz 3GHz 1.6GHz

Idea — Learning from collisions 1. You usually can’t observe hash output. 2. You can often observe collisions (e.g. time hash lookups, processing time, reordering, traceroute, server IDs, . . . ). 3. By design, your hashes should have different collisions. 4. Observing collisions leaks information about hash in use Can we use this to identify the hash function or generate collisions?

Example 1: Small Hash Family 1. Often the hash is keyed by an integer or a few bits. 2. Suppose the number of hashes is small enough to iterate through. 3. For example, Bob Jenkins’s hash in RFC 5475. 4. Use 4 bits of output (e.g. 16 routes).

Example 1: Small Hash Family Attack: 1. Make a list of all hashes. 2. Find two colliding inputs (Birthday Paradox). 3. Remove hashes that do not collide on these inputs. 4. Repeat until one hash left.

Example 1: Small Hash Family 30 25 Number of Probe Strings 20 15 10 5 Attempts Optimistic Estimate Conservative Estimate 0 10 100 1000 10000 100000 1e+06 1e+07 1e+08 1e+09 Number of Hashes

Example 2: Pearson’s Hash In 1990 Pearson proposed a neat, fast, randomly keyed hash, using a random permutation T of a byte and xor ( ⊗ ). To hash a string of bytes: 1. h ← 0 2. foreach ( byte [ i ]) h ← T [ byte [ i ] ⊕ h ] 3. return h Family is really big — 256!

Example 2: Pearson’s Hash Attack: Recover the permutation. 1. Insert all strings x000. . . 0 and 0y00. . . 0 2. Algebra: collide in pairs ( a , b ) where T ( a ) = T (0) ⊗ b . 3. From collisions, we know pairs (using 2*256 strings). 4. T (0) is remaining unknown (small family, get in 256+small strings). Attack generalises to replacing bytes and xor with any group.

Example 2: Pearson’s Hash 0.7 1,000,000 trials predicted 0.6 0.5 fraction of trials 0.4 0.3 0.2 0.1 0 1 2 3 4 5 6 7 8 number of random strings hashes to recover T

Example 3: Toeplitz Hash Microsoft have a standard for network cards to hand off packet to CPUs (RSS). The key K is a longish bit string. 1. r ← 0 2. foreach bit b in input if (b == 1) r ← r ⊗ left-most 32 bits of K shift K left 1 bit position 3. return r In practice you use 1–7 bits and might pass through a lookup table to choose CPU.

Example 3: Toeplitz Hash Attack: It’s linear over Z 2 , use some linear algebra. 1. Choose the bits of the input you control. Set one to zero at a time. 2. Group the bits according to which collide ( E 1 , . . . , E l ). 3. For any even-sized subsets E ′ 1 , . . . , E ′ l of E 1 , . . . , E l   � �  = h ( x ) +  x + h ( e ) = h ( x ) , h e e ∈ � E ′ e ∈ � E ′ i i 4. So every even-sized subset collection gives a collision. Can work with other linear functions too, but more effective for low index.

Example 3: Toeplitz Hash 60000 Base Attack on Linear Indirection Base Attack on Non-Linear Indirection Modified Attack on Non-Linear Indirection 50000 40000 Mean lookup time 30000 20000 10000 0 0 20 40 60 80 100 120 140 160 Basis bits used by attacker

Conclusion 1. Algorithmic Complexity Attacks. 2. For hashes, choosing from a family is useful. 3. However, collisions leak information. 4. Means you need to choose family carefully. 5. Small family is bad. 6. Structure like linear or group is bad.

Hash Pile Ups: Using Collisions to Identify Unknown Hash Functions - PowerPoint PPT Presentation

Hash Pile Ups: Using Collisions to Identify Unknown Hash Functions R. Joshua Tobin and David Malone 11 October 2012 Hash Functions We are talking about hash functions for consistent assignment. For example, Hash tables, Network

External Pile cages Internal Pile Cages Alternative Methods Chain around Pile Alternative

Table of Contents FOUNDATION DRAINAGE BORED PILE 2 DRAINAGE PIPE 28 SOCKET H-PILE 3 DRAIN

This image cannot currently be displayed. PILE GROUP (TOWER) MULTI-MEDIA CAP PENETRATION

ETO Industrial UPS Tamilnadu Engineers Forum in Kuwait Alex Mayr Head of Services ETO

Hash Functions in Action Hash Functions in Action Lecture 12 Hash Functions Hash Functions

Hash Functions in Action Hash Functions in Action Lecture 11 Hash Functions Hash Functions

Hash Functions Hash Functions 1 Cryptographic Hash Function Crypto hash function h(x) must

Hash Functions and Hash Tables (2.5.2) A hash function h maps keys of a given type to

Pile Driving Setup for Ohio Soils mer Bilgin, PhD, PE University of Dayton Dayton, Ohio 2019

Video Imagery Video looking straight down onto rubble pile showing: Very dynamic

Uninterruptible Power Supply (UPS) 4/15/2014 2 Uninterruptible Power Supply (UPS) No

What are Scale Ups? 20% growth by turnover/employees Ambition or potential to grow by 50%

Generics Asumu Takikawa RacketCon 2012 1 What are generics? 2 What are generics? hash-ref

KILL MD5 DEMYSTIFYING HASH COLLISIONS Ange AlBertini With the help of Marc Stevens TL;DR This

Time-Memory Tradeoffs for Short Hash Collisions Akshima University of Chicago Joint work with

CS261 Data Structures Hash Tables Buckets/Chaining Hash Tables:

Spring 2010: CS419 Computer Security MAC, HMAC, Hash functions and DSA Vinod Ganapathy Lecture

13 - Computer Security Hashing 1 Solution : hash

Hash Functions, Message Authentication Codes Ahmet Burak Can Hacettepe University

IS511 Introduction to Information Security Lecture 3 Cryptography 2 Yongdae Kim Recap

Recent Results on Stream Ciphers Willi Meier 1 / 47 Overview - Stream Ciphers with Small State

Hash Functions and MACs Properties of Cryptographic Hash Functions Introduction to Message

A Family of Fast Syndrome Based Cryptographic Hash Functions Daniel Augot, Matthieu Finiasz and

Structural Attacks on Two SHA-3 Candidates: Blender- n and DCH- n Mario Lamberger and Florian