Proximal Graphical Event Model IBM Research Debarun Bhattacharjya, Dharmashankar Subramanian, Tian Gao Objective: To learn statistical and causal relationships between event types in the form of graphical models using event datasets Home health visit Hospital admission Prescription refill Event datasets: Occurrences of various event types over time β’ Examples: web logs; customer transactions; network notifications; political events; financial events; insurance claims; health episodes; other medical events β’ Notation: π¬ = π π , π’ π , π = 1, β¦ , π; π π β π, π = π β Assume it is temporally ordered b/w time π’ 0 = 0 β€ π’ 1 and π’ π+1 = π β₯ π’ π β Note that there are π types of event types/labels and π events in the dataset 1
IBM Research Proximal Graphical Event Model (PGEM) w aa w ab A β’ PGEM = π», π, Ξ ; graph + set of (time) windows on each edge and conditional intensity parameters w ac B β’ Assumption: The intensity of an event label (node) depends on whether or not its parents have happened at least once in their C respective recent histories w bc Formally, denoting a node π βs parents as π½ : β’ π» = π, πΉ where π is the event label set β’ There is a window for every edge, π = π₯ π¦ : βπ β π , where π₯ π¦ = π₯ π¨π¦ : βπ β π½ β’ There is an intensity parameter for every node π and for every instantiation π of π₯ π¦ : βπ β π its parent occurrences, Ξ = π π¦|π 2
IBM Research Parameter and Structure Learning Learning problem: Given an event dataset π¬ , learn PGEM = π», π, Ξ β’ Log-likelihood: β π π¦; π : # of times π is observed and the condition π is true in the relevant windows β πΈ π : duration over the entire time period where the condition π is true β’ For a given graph, finding the optimal (MLE) conditional intensities when given the windows is easy, but finding the optimal windows is hard! β’ Contribution 1 : Analysis and proof that reduces the window search to a finite set that is algorithmically constructed . β’ Contribution 2 : A method to search over graph structures, with some theoretical results on efficient search and consistency justification 3
IBM Research Results: Synthetic Datasets Wed Dec 5, 5:00 β 7:00 pm, Room 210 & 230 AB #6 4
Recommend
More recommend