
SLIDE 1 (1/12)

Online Learning of Weighted Relational Rules for Complex Event Recognition

Nikos Katzouris1, Alexander Artikis2,1 and Georgios Paliouras1

http://cer.iit.demokritos.gr

1National Center for Scientific Research Demokritos, Athens, Greece 2University of Piraeus, Piraeus, Greece

ECML-PKDD 2018

SLIDE 2 (2/12)

The Problem Setting

Input → Recognition → Output:

Simple Events (input stream):

happensAt(active(id0), 10)
holdsAt(coord(id0, 20.88, 11.90), 10)
happensAt(active(id1), 10)
holdsAt(coord(id1, 22.34, 15.23), 10)
. . .

Complex Event Definitions:

initiatedAt(meeting(X, Y), T) ←
    happensAt(active(X), T),
    happensAt(active(Y), T),
    holdsAt(close(X, Y, 25), T).

terminatedAt(meeting(X, Y), T) ←
    happensAt(walking(X), T),
    not holdsAt(close(X, Y, 25), T).

Event Calculus as a Reasoning Engine:

holdsAt(F, T + 1) ← initiatedAt(F, T).
holdsAt(F, T + 1) ← holdsAt(F, T), not terminatedAt(F, T).

Very efficient inference: Artikis et al., An Event Calculus for Event Recognition, TKDE, 2015.

Complex Events (output):

holdsAt(meeting(id0, id1), 11)
holdsAt(meeting(id0, id1), 12)
holdsAt(meeting(id0, id1), 13)
. . .

The learning task: learn the complex event definitions from the simple-event stream and the complex-event annotation.
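The two inertia axioms can be traced mechanically over the stream. The following is a minimal illustrative sketch (not the reasoner of Artikis et al.), tracking a single fluent such as meeting(id0, id1):

```python
def recognise(initiated, terminated, horizon):
    """Apply the two Event Calculus inertia axioms tick by tick to a
    single fluent F.

    initiated(t) / terminated(t) -- whether initiatedAt(F, t) /
    terminatedAt(F, t) is derivable from the simple events at time t.
    Returns a dict {t: holdsAt(F, t)} for t in 0..horizon.
    """
    holds = {0: False}
    for t in range(horizon):
        # holdsAt(F, T+1) <- initiatedAt(F, T).
        # holdsAt(F, T+1) <- holdsAt(F, T), not terminatedAt(F, T).
        holds[t + 1] = initiated(t) or (holds[t] and not terminated(t))
    return holds
```

An initiation at time 10 makes the fluent hold from time 11 onward, by inertia, until an explicit termination, matching the holdsAt(meeting(id0, id1), 11), 12, 13 output shown above.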

SLIDE 3 (3/12)

Learning Requirements

◮ Event recognition applications deal with noisy data streams.
◮ Resilience to noise → Statistical Relational Learning.
◮ Learning should be online:
  ◮ Single-pass.
  ◮ Learn from past mistakes.

SLIDE 4 (4/12)

Contribution of this Work

Two online learners from previous work:

◮ OLED
  ◮ Katzouris N. et al., Online Learning of Event Definitions, TPLP, 2016.
  ◮ ✓ Efficient structure learning using Hoeffding bounds.
  ◮ ✗ Crisp learner.

◮ OSLα
  ◮ Michelioudakis E. et al., OSLα: Online Structure Learning using Background Knowledge Axiomatization, ECML, 2016.
  ◮ MLN learner.
  ◮ ✓ Efficient weight learning.
  ◮ ✗ Inefficient structure learning: blindly generates too many rules.

Current work:

◮ WoLED (OLED + weight learning)
  ◮ MLN learner.
  ◮ ✓ Efficient structure learning.
  ◮ ✓ Efficient weight learning.

SLIDE 5 (5/12)

OLED

Rule search, from general to specific. Candidate refinements along one path:

initiatedAt(meet(X, Y), T) ←

initiatedAt(meet(X, Y), T) ←
    happensAt(active(X), T).

initiatedAt(meet(X, Y), T) ←
    happensAt(active(X), T),
    happensAt(inactive(Y), T).

initiatedAt(meet(X, Y), T) ←
    happensAt(active(X), T),
    happensAt(inactive(Y), T),
    holdsAt(close(X, Y, 25), T).

. . .

Alternative refinements:

initiatedAt(meet(X, Y), T) ←
    happensAt(active(X), T),
    holdsAt(close(X, Y, 25), T).

initiatedAt(meet(X, Y), T) ←
    happensAt(inactive(Y), T).

initiatedAt(meet(X, Y), T) ←
    holdsAt(orientation(X, Y, 45), T).

. . .

Bottom Clause ⊥:

initiatedAt(meet(X, Y), T) ←
    happensAt(active(X), T),
    happensAt(inactive(Y), T),
    holdsAt(close(X, Y, 25), T),
    holdsAt(close(Y, X, 25), T),
    not happensAt(inactive(X), T),
    not happensAt(abrupt(X), T),
    not happensAt(running(X), T),
    not happensAt(active(Y), T),
    not happensAt(running(Y), T),
    not happensAt(abrupt(Y), T),
    holdsAt(orientation(X, Y, 45), T).

Each expansion decision uses O((1/ε²) ln(1/δ)) examples.

◮ Learns a rule with online hill-climbing, specializing it with literals from the bottom clause.
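One hill-climbing step can be sketched abstractly. This is an illustrative sketch, not OLED's implementation: rules are represented as lists of body literals, and `score` is a hypothetical stand-in for OLED's rule-scoring function:

```python
def expand(rule, bottom_literals, score):
    """One specialization step of top-down hill climbing: try adding each
    bottom-clause literal not already in the rule's body, keep the
    best-scoring refinement if it improves on the current rule, and
    otherwise keep the rule as is."""
    candidates = [rule + [lit] for lit in bottom_literals if lit not in rule]
    if not candidates:
        return rule
    best = max(candidates, key=score)
    return best if score(best) > score(rule) else rule
```

Starting from the empty-bodied rule and calling `expand` repeatedly yields exactly the general-to-specific path shown on the slide.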

SLIDE 6 (6/12)

OLED

(Same rule-search illustration as in the previous slide, each expansion decision using O((1/ε²) ln(1/δ)) examples.)

◮ Uses Hoeffding tests to make (ε, δ)-optimal decisions.
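The Hoeffding test behind each expansion decision can be sketched as follows. This is an illustrative sketch, not OLED's API, and it assumes rule scores lie in [0, 1]:

```python
import math

def hoeffding_expand(best_mean, runner_up_mean, n, delta=0.05):
    """Expand a rule with its best-scoring refinement only once the
    observed mean-score difference over n examples exceeds the Hoeffding
    bound eps = sqrt(ln(1/delta) / (2n)), so that with probability at
    least 1 - delta the apparent best refinement is truly the best."""
    eps = math.sqrt(math.log(1.0 / delta) / (2.0 * n))
    return (best_mean - runner_up_mean) > eps
```

Solving the bound for n recovers the O((1/ε²) ln(1/δ)) sample complexity quoted on these slides: a small score gap between the two best refinements requires more examples before a decision is made.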

SLIDE 7 (7/12)

WoLED

The same rule search, now with a weight attached to each candidate rule:

−0.829  initiatedAt(meet(X, Y), T) ←

 0.2    initiatedAt(meet(X, Y), T) ←
            happensAt(active(X), T).

 0.5    initiatedAt(meet(X, Y), T) ←
            happensAt(active(X), T),
            happensAt(inactive(Y), T).

 1.82   initiatedAt(meet(X, Y), T) ←
            happensAt(active(X), T),
            happensAt(inactive(Y), T),
            holdsAt(close(X, Y, 25), T).

. . .

 0.1    initiatedAt(meet(X, Y), T) ←
            happensAt(active(X), T),
            holdsAt(close(X, Y, 25), T).

 0.0    initiatedAt(meet(X, Y), T) ←
            happensAt(inactive(Y), T).

−1.3    initiatedAt(meet(X, Y), T) ←
            holdsAt(orientation(X, Y, 45), T).

. . .

Bottom Clause ⊥: as in the previous slides.

Each expansion decision uses O((1/ε²) ln(1/δ)) examples.

◮ The WoLED algorithm:
  ◮ Simultaneous structure & weight learning.
  ◮ Weight learning with AdaGrad.

SLIDE 8 (8/12)

The AdaGrad Weight Update Rule

w_i^{t+1} = sign( w_i^t − (η / C_i^t) Δg_i^t ) · max{ 0, | w_i^t − (η / C_i^t) Δg_i^t | − λ η / C_i^t }

where:

◮ w_i^{t+1} / w_i^t: the current / previous weight of the i-th rule.
◮ η: the learning rate.
◮ Δg_i^t (the i-th rule's mistakes at time t): the difference in the rule's true groundings between the true state and the MAP-inferred state.
◮ C_i^t: a term proportional to the rule's accumulated past mistakes.
◮ λ: the regularization rate.
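As a concrete sketch, the update can be written per rule as below. This is illustrative, not WoLED's code; in particular, taking C_i^t as the square root of the accumulated squared gradients (plus a small constant for stability) is an assumption, matching standard AdaGrad with L1 regularization:

```python
import math

def adagrad_l1_update(w, g, hist_sq, eta=1.0, lam=0.01, stab=1e-8):
    """One AdaGrad step with L1 regularization for a single rule weight.

    w       -- current weight w_i^t
    g       -- current mistakes Δg_i^t (difference in true groundings
               between the true and the MAP-inferred state)
    hist_sq -- accumulated sum of squared past gradients for this rule
    Returns (w_i^{t+1}, updated hist_sq).
    """
    hist_sq += g * g
    C = math.sqrt(hist_sq) + stab        # C_i^t: grows with past mistakes
    step = w - (eta / C) * g             # plain adaptive gradient step
    # Soft thresholding: shrink |weight| by lam * eta / C, clipping at 0.
    new_w = math.copysign(1.0, step) * max(0.0, abs(step) - lam * eta / C)
    return new_w, hist_sq
```

The max{0, ...} term acts as soft thresholding: weights of rules that contribute nothing are driven exactly to zero, which supports pruning.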

SLIDE 9 (9/12)

WoLED Overview

Per training interpretation, WoLED performs: MAP inference, weight updates, theory expansion (Hoeffding tests / rule expansion) and pruning.

Background Knowledge:

holdsAt(F, T + 1) ← initiatedAt(F, T).
holdsAt(F, T + 1) ← holdsAt(F, T), not terminatedAt(F, T).

Mode Declarations:

head(initiatedAt(move(+id, +id), +time))
head(terminatedAt(move(+id, +id), +time))
body(happensAt(walking(+id, +id), +time))
body(not happensAt(walking(+id, +id), +time))
body(distLessThan(+id, +id, #dist, +time))
body(dirLessThan(+id, +id, #dist, +time))

Current MLN Theory H_t:

1.345  initiatedAt(move(X, Y), T) ←
           happensAt(walking(X), T),
           happensAt(walking(Y), T),
           distLessThan(X, Y, 34, T)

0.865  terminatedAt(move(X, Y), T) ←
           happensAt(inactive(X), T),
           not distLessThan(X, Y, 34, T)

Training Stream:

Training Interpretation I_t:

holdsAt(move(id1, id2), 10)
happensAt(walking(id1), 9)
happensAt(walking(id2), 9)
coords(id1, 23, 104, 9)
coords(id2, 42, 84, 9)
direction(id1, 212, 9)
direction(id2, 78, 9)

. . .

Training Interpretation I_t′:

not holdsAt(move(id1, id2), 100)
happensAt(walking(id1), 99)
happensAt(walking(id2), 99)
coords(id1, 205, 23, 99)
coords(id2, 462, 24, 99)
direction(id1, 23, 99)
direction(id2, 798, 99)

. . .
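The per-interpretation loop can be sketched end to end with toy stand-ins. Nothing here is WoLED's actual code: `map_infer` replaces proper MAP inference by a weighted vote of firing rules, and the weight nudge is a crude stand-in for the AdaGrad update of the previous slide:

```python
def map_infer(theory, interp):
    """Toy stand-in for MAP inference: predict the complex event iff the
    summed weight of rules whose bodies fire in interp is positive.
    theory is a list of (weight, body) pairs, body a boolean test."""
    return sum(w for w, body in theory if body(interp)) > 0

def online_step(theory, interp, annotated, eta=0.25):
    """One online step: infer with the current weighted theory, then move
    each firing rule's weight in the direction that corrects the error."""
    error = int(annotated) - int(map_infer(theory, interp))
    return [(w + eta * error, body) if body(interp) else (w, body)
            for w, body in theory]
```

For example, starting from theory = [(-0.5, walking_body), (0.25, close_body)], a missed annotated event (error +1) raises both firing weights by eta; once the prediction is correct, the weights stay put.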

SLIDE 10 (10/12)

WoLED Evaluation on the CAVIAR Dataset

(a) Moving

Method      Precision   Recall   F1-score   Theory size   Time (sec)
ECcrisp     0.909       0.634    0.751      28            –
OLED        0.867       0.724    0.789      34            28
WoLED       0.882       0.835    0.857      30            59
OSLα        0.837       0.590    0.692      3316          1300
OSL         –           –        –          –             > 25 hrs
MaxMargin   0.844       0.941    0.890      28            1692
XHAIL       0.779       0.914    0.841      14            7836

(a) Meeting

Method      Precision   Recall   F1-score   Theory size   Time (sec)
ECcrisp     0.687       0.855    0.762      23            –
OLED        0.947       0.760    0.843      31            22
WoLED       0.892       0.888    0.889      29            52
OSLα        0.902       0.863    0.882      1231          180
OSL         –           –        –          –             > 25 hrs
MaxMargin   0.919       0.813    0.863      23            1133
XHAIL       0.804       0.927    0.861      15            7248

(b) Moving

Method      Precision   Recall   F1-score   Theory size   Time (sec)
ECcrisp     0.721       0.639    0.677      28            –
OLED        0.682       0.787    0.730      38            63
WoLED       0.783       0.821    0.801      51            108

(b) Meeting

Method      Precision   Recall   F1-score   Theory size   Time (sec)
ECcrisp     0.644       0.855    0.735      23            –
OLED        0.701       0.886    0.782      41            43
WoLED       0.808       0.877    0.841      56            98

Table 1: Experimental results on (a) a fragment of CAVIAR (top) and (b) the complete CAVIAR dataset (bottom).

SLIDE 11 (11/12)

WoLED Holdout Evaluation

[Two line plots of F1-score (0.1 to 1) against the number of training time points (×1000, from 2 to 14), comparing OSLα, WoLED and OLED on a holdout set.]

SLIDE 12 (12/12)

Summary

◮ An efficient, online MLN learner (structure + weights).
◮ Built on top of the LoMRF1 platform.
◮ https://github.com/nkatzz/OLED

Future work:

◮ Further evaluation.
◮ Different weight learning schemes.
◮ Distributed learning.

1https://github.com/anskarl/LoMRF