Real-Time In-Situ Intelligent Video Analytics for Mobile Platforms
Christiaan Gribble
Applied Technology Operation SURVICE Engineering GTC 2017
Real-Time In-Situ Intelligent Video Analytics for Mobile Platforms - - PowerPoint PPT Presentation
Real-Time In-Situ Intelligent Video Analytics for Mobile Platforms Christiaan Gribble Applied Technology Operation SURVICE Engineering GTC 2017 Acknowledgments University of Washington Steve Brunton, PhD Ben Erichson, PhD
Christiaan Gribble
Applied Technology Operation SURVICE Engineering GTC 2017
– Steve Brunton, PhD – Ben Erichson, PhD – Nathan Kutz, PhD
– Rob Baltrusch – Mark Butkiewicz – Shawn Recker, PhD
2
Combines modern data reduction & analysis techniques with machine learning via DNNs and massively parallel computing architectures to enable next-gen ISR
3
NVIDIA GPUs Advanced UIs Machine learning Compressed DMD
4
NVIDIA GPUs Advanced UIs Machine learning Compressed DMD
5
Compressed DMD NVIDIA GPUs Advanced UIs Machine learning
6
Compressed DMD Machine learning Advanced UIs
NVIDIA GPUs
7
Compressed DMD Machine learning NVIDIA GPUs Advanced UIs
8
[Grosek & Kutz 2014]
9
[Grosek & Kutz 2014]
10
video stream space … modes reshaped video … space time amplitudes
dynamic mode decomposition evolution time … flattened frame [Erichson et al. 2016]
11
[Grosek & Kutz 2014]
12
[Erichson et al. 2016]
X, X’
data modes full DMD
𝚾Y, 𝚳Y
cDMD
Y, Y’
compressed C
𝚾, 𝚳
𝚾 = X’VY SY
13
compressed video [Erichson et al. 2016] … compression matrix reshaped video …
video stream reshaped video … space time … flattened frame
14
space … modes compressed video … space time amplitudes
dynamic mode decomposition evolution time [Erichson et al. 2016]
15
[Erichson et al. 2016]
16
17
18
Compression Level
50 100 150 200 250 300 350 400 450 500 3840x2160 1920x1080 1280x720 1024x768 640x480
Runtime Performance (frames per second) Video Resolution
1 0.5 0.1 0.01 0.001
19
Compression Level
100 200 300 400 500 600 700 800 900 3840x2160 1920x1080 1280x720 1024x768 640x480
Runtime Performance (frames per second) Video Resolution
1 0.5 0.1 0.01 0.001
20
Compression Level
5 10 15 20 25 30 35 40 45 50 1920x1080 1280x720 1024x768 640x480
Runtime Performance (frames per second) Video Resolution
1 0.5 0.1 0.01 0.001
21
Compression Level
10 20 30 40 50 60 70 80 90 100 1920x1080 1280x720 1024x768 640x480
Runtime Performance (frames per second) Video Resolution
1 0.5 0.1 0.01 0.001
DMD Haar classifier Traditional approach DNN approach
22
CNN
Mini-quad DJI[Image source: NVIDIA Parallel Forall]
23
Input FCN Output Error
24
[Kutz et al. 2016]
25
Sentinel-enabled IVA module
26
27
JTARV – Hoverbike
28
Brunton, B. W., Brunton, S. L., Proctor, J. L. & Kutz, J. N. (2015) Optimal sensor placement and enhanced sparsity for classification. SIAM Journal on Applied Mathematics, https://arxiv.org/abs/1310.4217. Erichson, N. B., Brunton, S. L., & Kutz, J. N. (2015) Compressed dynamic mode decomposition for background modeling. Journal of Real-Time Image Processing, https://arxiv.org/abs/1512.04205v2. Grosek, J. & Kutz, J. N. (2014) Dynamic mode decomposition for real-time background/foreground separation in video. IEEE Transactions on Pattern Analysis & Machine Learning, https://arxiv.org/abs/1404.7592. Kutz, J. N., Fu, X., & Brunton, S. L. (2016) Multi-resolution dynamic mode decomposition. SIAM Journal on Applied Dynamical Systems, https://arxiv.org/abs/1506.00564.
29
Applied Technology Operation SURVICE Engineering 4695 Millennium Drive Belcamp, MD 21017
christiaan.gribble@survice.com
http://www.survice.com/employees/~cgribble/
30