The Mathematics of Geographic Profiling Towson University Applied - PowerPoint PPT Presentation

The Mathematics of Geographic Profiling Towson University Applied Mathematics Laboratory Dr. Mike O'Leary Crime Hot Spots: Behavioral, Computational and Mathematical Models Institute for Pure and Applied Mathematics January 29 - February 2, 2007 Supported by the NIJ through grant 2005–IJ–CX–K036

Project Participants Towson University Applied Mathematics Laboratory Undergraduate research projects in applied mathematics. Founded in 1980 National Institute of Justice Special thanks to Stanley Erickson (NIJ) and Andrew Engel (SAS)

Students 2005-2006: Paul Corbitt Brooke Belcher Laurel Mount Brandie Biddy Ruozhen Yao Gregory Emerson Melissa Zimmerman 2006-2007: Chris Castillo Jonathan Vanderkolk Adam Fojtik Grant Warble

Geographic Profiling The Question: Given a series of linked crimes committed by the same offender, can we make predictions about the anchor point of the offender? The anchor point can be a place of residence, a place of work, or some other commonly visited location.

Geographic Profiling Our question is operational . This places limitations on available data. Example A series of 9 linked vehicle thefts in Baltimore County

Example ADDRESS DATE_FROM TIME DATE_TO TIME REMARKS 918 M 01/18/2003 0800 01/18/2003 0810 VEHICLE IS 01 TOYT CAMRY, LEFT VEH RUNNING 1518 L 01/22/2003 0700 01/22/2003 0724 VEHICLE IS 99 HOND ACCORD STL-REC, ...B/M PAIR,DRIVING MAROON ACCORD. 731 CC 01/22/2003 0744 01/22/2003 0746 VEHICLE IS 02 CHEV MALIBU STL-REC 1527 K 01/27/2003 1140 01/27/2003 1140 VEHICLE IS 97 MERC COUGAR, LEFT VEH RUNNING 1514 G 01/29/2003 0901 01/29/2003 0901 VEHICLE IS 99 MITS DIAMONTE, LEFT VEH RUNNING 1415 K 01/29/2003 1155 01/29/2003 1156 VEHICLE IS 00 TOYT 4RUNNER STL-REC, (4) ARREST NFI 5943 R 12/31/2003 0632 12/31/2003 0632 VEHICLE IS 92 BMW 525, WARMING UP VEH 1427 G 02/17/2004 0820 02/17/2004 0830 VEHICLE IS 00 HOND ACCORD, WARMING VEH 4449 S 05/15/2004 0210 05/15/2004 0600 VEHICLE IS 04 SUZI ENDORO

Existing Methods Spatial distribution strategies Probability distance strategies Notation:  1  , z  2   Anchor point- z = z Crime sites- x 1 , x 2 , ⋯ , x n Number of crimes- n

Distance Euclidean d 2  x , y =   x  1  − y  1    2  − y  2   2  x 2 Manhattan  1  − y  1  ∣∣ x  2  − y  2  ∣ d 1  x , y =∣ x Street grid

Spatial Distribution Strategies Centroid: n  centroid = 1 n ∑ x i i = 1 Crime locations Anchor Point Average Average

Spatial Distribution Strategies  cmd Center of minimum distance: is the value of that minimizes y n D  y = ∑ d  x i , y  i = 1 Crime locations Distance sum = 10.63 Anchor Point Distance sum = 9.94 Smallest possible sum!

Spatial Distribution Strategies Circle Method: Anchor point contained in the circle whose diameter are the two crimes that are farthest apart. Crime locations Anchor Point

Probability Distribution Strategies The anchor point is located in a region with a high “hit score”. The hit score has the form S  y  n S  y  = ∑ f  d  y , x i  i = 1 = f  d  z , x 1  f  d  z , x 2 ⋯ f  d  z , x n  where are the crime locations and is a f x i decay function and is a distance. d

Probability Distribution Strategies Linear: f  d  = A − Bd Hit Score Crime Locations

Rossmo Manhattan distance metric. Decay function f  d = { k if d  B h d g − h k B if d  B g  2 B − d  The constants and are empirically k , g ,h B defined

Rossmo B = 1 h = 2 g = 3

Canter, Coffey, Huntley & Missen Euclidean distance Decay functions − d f  d = Ae f  d = { if d  A , 0 if A ≤ d  B B , − d if d ≥ B . Ce

Dragnet A = 1 = 1

Levine Euclidean distance Decay functions f  d = A  Bd Linear Negative − d f  d = Ae exponential 2 exp [− d − 2 d  A Normal f  d = ]  2  S 2 2S 2 exp [− ln d − 2 d  A Lognormal f  d = ] d  2  S 2 2S

CrimeStat From Levine (2004)

CrimeStat

Shortcomings These techniques are all ad hoc . What is their theoretical justification? What assumptions are being made about criminal behavior? What mathematical assumptions are being made? How do you choose one method over another?

Shortcomings The convex hull effect: The anchor point always occurs inside the convex hull of the crime locations. Crime locations Convex Hull

Shortcomings How do you add in local information? How could you incorporate socio- economic variables into the model? Snook, Individual differences in distance travelled by serial burglars Malczewski, Poetz & Iannuzzi, Spatial analysis of residential burglaries in London, Ontario Bernasco & Nieuwbeerta, How do residential burglars select target areas? Osborn & Tseloni , The distribution of household property crimes

A New Approach In previous methods, the unknown quantity was: The anchor point (spatial distribution strategies) The hit score (probability distance strategies) We use a different unknown quantity.

A New Approach Let be the density function for the P  x ; z  probability that an offender with anchor point z commits a crime at location . x This distribution is our new unknown. This has criminological significance. In particular, assumptions about the form of are equivalent to P  x ; z  assumptions about the offender's behavior.

The Mathematics Given crimes located at the x 1 , x 2 , ⋯ , x n maximum likelihood estimate for the anchor  mle point is the value of that maximizes y n L  y = ∏ P  x i , y  i = 1 = P  x 1 , y  P  x 2 , y ⋯ P  x n , y  or equivalently, the value that maximizes n  y = ∑ ln P  x i , y  i = 1 = ln P  x 1 , y  ln P  x 2 , y ⋯ ln P  x n , y 

Relation to Spatial Distribution Strategies If we make the assumption that offenders choose target locations based only on a distance decay function in normal form, then 2 ] 2 exp [ −∣ x − z ∣ 2 1 P  x ; z = 2  2  The maximum likelihood estimate for the anchor point is the centroid.

Relation to Spatial Distribution Strategies If we make the assumption that offenders choose target locations based only on a distance decay function in exponentially decaying form, then 2 exp [ −∣ x − z ∣ 2  ] 1 P  x ; z = 2  The maximum likelihood estimate for the anchor point is the center of minimum distance.

Relation to Probability Distance Strategies What is the log likelihood function? [ − ln  2   ] n 2 −∣ x i − y ∣  y = ∑ i = 1 This is the hit score provided we use S  y  Euclidean distance and the linear decay for f  d = A  Bd 2  A =− ln  2  B =− 1 /

Parameters The maximum likelihood technique does not require a priori estimates for parameters other than the anchor point. 2 ] 2 exp [ −∣ x − z ∣ 2 1 P  x ; z , = 2  2  The same process that determines the best choice of also determines the best choice z of . 

Better Models We have recaptured the results of existing techniques by choosing P  x ; z  appropriately. P  x ; z  These choices of are not very realistic. Space is homogeneous and crimes are equi-distributed. Space is infinite. Decay functions were chosen arbitrarily.

Better Models Our framework allows for better choices of . P  x ; z  Consider P  x ; z = D  d  x , z ⋅ G  x ⋅ N  z  Normalization Distance Decay Geographic (Dispersion Kernel) factors

The Simplest Case Suppose we have information about crimes committed by the offender only for a portion of the region. W Ω E

The Simplest Case Regions Ω : Jurisdiction(s). Crimes and anchor points may be located here. E : “elsewhere”. Anchor points may lie here, but we have no data on crimes here. W : “water”. Neither anchor points nor crimes may be located here. In all other respects, we assume the geography is homogeneous .

The Simplest Case We set G  x = { x ∈ 1 x ∉ 0 We choose an appropriate decay function 2 ] D ∣ x − z ∣= exp [ −∣ x − z ∣ 2 2  The required normalization function is 2  dy N  x ; z = [ ∬  2  ] exp  −∣ y − z ∣ − 1 2  1  dy 2  

The Simplest Case  mle Our estimate of the anchor point is the choice of that maximizes y 2  exp  − ∑ n ∣ x i − y ∣ 2 2  i = 1 2  d  [ ∬  2  ] exp  −∣− y ∣ n 2  1  d  2  

The Mathematics of Geographic Profiling Towson University Applied - PowerPoint PPT Presentation

The Mathematics of Geographic Profiling Towson University Applied Mathematics Laboratory Dr. Mike O'Leary Crime Hot Spots: Behavioral, Computational and Mathematical Models Institute for Pure and Applied Mathematics January 29 - February 2,

Profiling of Data-Parallel Processors Daniel Kruck 09/02/2014 09/02/2014 Profiling Daniel

Leaving no one behind The role of evidence-building and profiling to include displacement in

Expression Profiling Mark Voorhies 4/4/2011 Mark Voorhies Expression Profiling Review

Web User Profiling using Data Redundancy http://aminer.org/profiling Xiaotao Gu, Hong Yang, Jie

COZ : Finding Code that Counts with Causal Profiling Anuja Golechha Agenda Profiling

Optimization Profiling VisualVM Exercise Meme Credit: Randall Munroe, hrefhttp://xkcd.comxkcd

Profiling of Algorithms Profiling refers to the experimental measurement of the performance of

An introduction to Profiling Physics Coding Club: 09/06/2017 D. Dickinson

Geographic Centroid Routing for Vehicular Networks Effects of GPS Error on Geographic Routing

Provider Profiling Prepared by Melissa Reagan, MSW, LSW, Quality Performance Specialist Agenda

Continuous Profiling in Production: What, Why and How Richard Warburton (@richardwarburto) Sadiq

MALT : MALloc Tracker A memory profiling tool 3/02/2019 MALT, Sbastien Valat 1 Questions

Integrating mol Integrating mol ecular Profiling ecular Profiling Into Patient Se election for

Twitter User Profiling: Bot and Gender Identification 7 th Author Profiling Task PAN 2019 CLEF

author profiling shared task on: Bots and gender profiling Francisco Rangel & Paolo Rosso

Expression Profiling Mark Voorhies 4/3/2012 Mark Voorhies Expression Profiling Its hard

Dual Language Program: Board of Education Presentation Grade 5 Expansion Study Findings May 9,

Athletic Hall of Fame Presents the 2018 Hall of Fame Class Natalie Baird Etheridge Athlete

Morley-Ellenbrook Line Industry Briefing 11 December 2019 Welcome to Country Trevor Walley

Predictors of outcome and Renal clearance Ulf.Bergman@Karolinska.se ENCePP and Karolinska

RJR Polymers Overview The Total Electronic Package Solution May 16, 2012 Company Vitals

Deploying and Monitoring Ruby on Rails A practical guide Mathias Meyer and Jonathan Weiss,

The Other mod_rails : Easy Rails Deployment with JRuby Nick Sieger Sun Microsystems, Inc

Trends and market analysis: automotive sector 2017 Executive Summary Study focus This study

The Mathematics of Geographic Profiling Towson University Applied - PowerPoint PPT Presentation

The Mathematics of Geographic Profiling Towson University Applied Mathematics Laboratory Dr. Mike O'Leary Crime Hot Spots: Behavioral, Computational and Mathematical Models Institute for Pure and Applied Mathematics January 29 - February 2,

Profiling of Data-Parallel Processors Daniel Kruck 09/02/2014 09/02/2014 Profiling Daniel

Leaving no one behind The role of evidence-building and profiling to include displacement in

Expression Profiling Mark Voorhies 4/4/2011 Mark Voorhies Expression Profiling Review

Web User Profiling using Data Redundancy http://aminer.org/profiling Xiaotao Gu, Hong Yang, Jie

COZ : Finding Code that Counts with Causal Profiling Anuja Golechha Agenda Profiling

Optimization Profiling VisualVM Exercise Meme Credit: Randall Munroe, hrefhttp://xkcd.comxkcd

Profiling of Algorithms Profiling refers to the experimental measurement of the performance of

An introduction to Profiling Physics Coding Club: 09/06/2017 D. Dickinson

Geographic Centroid Routing for Vehicular Networks Effects of GPS Error on Geographic Routing

Provider Profiling Prepared by Melissa Reagan, MSW, LSW, Quality Performance Specialist Agenda

Continuous Profiling in Production: What, Why and How Richard Warburton (@richardwarburto) Sadiq

MALT : MALloc Tracker A memory profiling tool 3/02/2019 MALT, Sbastien Valat 1 Questions

Integrating mol Integrating mol ecular Profiling ecular Profiling Into Patient Se election for

Twitter User Profiling: Bot and Gender Identification 7 th Author Profiling Task PAN 2019 CLEF

author profiling shared task on: Bots and gender profiling Francisco Rangel &amp; Paolo Rosso

Expression Profiling Mark Voorhies 4/3/2012 Mark Voorhies Expression Profiling Its hard

Dual Language Program: Board of Education Presentation Grade 5 Expansion Study Findings May 9,

Athletic Hall of Fame Presents the 2018 Hall of Fame Class Natalie Baird Etheridge Athlete

Morley-Ellenbrook Line Industry Briefing 11 December 2019 Welcome to Country Trevor Walley

Predictors of outcome and Renal clearance Ulf.Bergman@Karolinska.se ENCePP and Karolinska

RJR Polymers Overview The Total Electronic Package Solution May 16, 2012 Company Vitals

Deploying and Monitoring Ruby on Rails A practical guide Mathias Meyer and Jonathan Weiss,

The Other mod_rails : Easy Rails Deployment with JRuby Nick Sieger Sun Microsystems, Inc

Trends and market analysis: automotive sector 2017 Executive Summary Study focus This study

author profiling shared task on: Bots and gender profiling Francisco Rangel & Paolo Rosso