Learning Perceptual Causality from Video
Amy Fire and Song-Chun Zhu
Center for Vision, Cognition, Learning, and Art
UCLA
1
Learning Perceptual Causality from Video Amy Fire and Song-Chun Zhu - - PowerPoint PPT Presentation
Learning Perceptual Causality from Video Amy Fire and Song-Chun Zhu Center for Vision, Cognition, Learning, and Art UCLA 1 Ideally: Learn Causality from Raw Video 2 Inference Using Learned Causal Structure c) STC-Parsing a) Input: Video
1
2
3
a) Input: Video b) Event Parsing c) STC-Parsing d) Inference Over Time Agent Actions Fluents
Action Hidden Fluent Causal Link
UNKNOWN
Time
ON OFF THIRSTY NOT
Light Agent
Time Time
Drink Flip Switch
(generally)
4
5
– E.g., PADS (Albanese, et al. 2010) – Model Newtonian mechanics (Mann, Jepson, and Siskind 1997)
– E.g., Prabhakar, et al. 2010
– Using cognitive science (Brand 1997)
6
– Constraint satisfaction (Pearl 2009) – Bayesian formulations (Heckerman 1995) – Intractable on vision sensors
– Do not allow for ambiguity/probabilistic solutions
– Intractable – Used for action detection (Tran and Davis 2008)
7
8
– Agentive actions are causes (Saxe, Tenenbaum, and Carey 2005) – Co-occurrence of events and effects (Griffiths and Tenenbaum 2005) – Temporal lag between the two is short (Carey 2009) – Cause precedes effect (Carey 2009)
9
10
11
t
Door Opens Door Closed Inertially Door Closes Light On Inertially
Light
ON OFF
Door
OPEN CLOSED
Light Off Inertially Light Turns Off Door Open Inertially Light Turns On
– Mueller – Commonsense Reasoning 2006
– And-Or Graph
12
And-Nodes Compose Single Causes (multiple sub-actions)
A
Unlock Door Push Door
a a
Or-Nodes Give Alternative Causes
Open Other Side
A a
13
Light fluent
a01 a01
OR OR AND AND
14
fluent1
A
parent(A) children(A) terminates
R(A) fluent2 S-AOG fragment T-AOG fragment C-AOG fragment
f1 f2 O2 O1
15
parent(O) children(O) fluents
f1
actions terminates grounding templates(O)
O
S-AOG fragment
f2 f3
A
A1 A2 A3 T-AOG fragment C-AOG fragment
T1(o)
Grounded
Preliminary Theory
16
DellaPietra, DellaPietra,Lafferty, 97 Zhu, Wu, Mumford, 97
17
True contingency table distribution Initial distribution Repeat
I E I E p
f p 2 2 2
: cr cr
Select cause/effect relationship that makes p1 closest (KL) to p to preserve learning history while maximizing Info. Gain
f p 1 1 1
Match the statistics of the contingency table
– Model formed by min KL (p+ || p), matching statistics
18
f p p p p p
k
1
Ep+ cr
+
+
cr , exp 1 pg p z pg p
(On ST-AOG)
cr cr Effect
Action co c2
c1 c3
+ = argmax cr
cr
hi is ci under p fi is ci from f
2 1 2 1
A A A A A
f f f f f
1
A
f
) ( F T
A
A1 A2
2
A
f
1
A
f
) ( F T
A
A1 A2
p
p 1
2 1
) 1 (
A A A
f p pf f
Or-node And-node
20
21
22
0% misdetection 10% misdetection 20% misdetection
23
24