Analytics on Sensor Networks
Jure Leskovec
Joint work with D.
- D. Ha
Hallac, S. Vare, S. Bhooshan, R. Sosic, S. Boyd, and VW
Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. - - PowerPoint PPT Presentation
Analytics on Sensor Networks Joint work with D. D. Ha Hallac , S. Vare, S. Bhooshan, R. Sosic, S. Boyd, and VW Jure Leskovec Jure Leskovec 2 Sensors are Everywhere Sequences of time stamped observations Jure Leskovec, Stanford 3 Sensor
Joint work with D.
Hallac, S. Vare, S. Bhooshan, R. Sosic, S. Boyd, and VW
2 Jure Leskovec
Jure Leskovec, Stanford 3
Jure Leskovec, Stanford 4
§ High-dimensional § Unlabeled § High-velocity § Dynamic § Heterogeneous
5
Jure Leskovec, Stanford University
§ Realized that in certain regimes they needed to re-optimize their engine configuration parameters
6 Jure Leskovec, Stanford University
§ Large window and door manufacturing
§ Large % of costs comes from energy bill
§ To monitor usage and provide real-time feedback to operators
7 Jure Leskovec, Stanford University
8 Jure Leskovec, Stanford University
9
10 Jure Leskovec, Stanford University
§ Deep learning § Open-source tools § Applications
11 Jure Leskovec, Stanford University
12
Value in “breaking down” the data into a sequence
13 Jure Leskovec, Stanford University
14
§ In general, these “states” are not predefined § We do not know what they are, nor what they refer to…
§ Instead, we need to discover these states in an uns unsup upervised way!
Jure Leskovec, Stanford University
§ x1, x2, …, xT
§ i.e., coming from n different sensors
§ For example, if certain sensors are sampled at a higher rate than others
15 Jure Leskovec, Stanford University
16 Jure Leskovec, Stanford University
Convert a sequence of timestamped
Jure Leskovec, Stanford University 17
§ Each cluster is defined by a multilayer correlation network, or a Markov Random Field (MRF)
§ Contains both intra-layer and inter-layer edges
§ MRFs encode st structural relationsh ships between the sensors
18 Jure Leskovec, Stanford University
19 Jure Leskovec, Stanford University
20 Jure Leskovec, Stanford University
21 Jure Leskovec, Stanford University
22
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data. D. Hallac, S. Vare, S. Boyd, J.
§ Sparsity in the Toeplitz matrix defines the MRF edge structure § Toeplitz constraint enforces time invariance
§ But we can use an EM-like approach to solve it!
§ Assigning points to clusters in a temporally consistent way § Updating the cluster parameters
2 5
2 6
Jure Leskovec, Stanford 28
CVXPY SnapVX
SnapVX: A Network-Based Convex Optimization Solver. D. Hallac, C. Wong, S. Diamond, A. Sharang, R. Sosič, S. Boyd, J. Leskovec. Journal of Machine Learning Research (JMLR), 18(4):1−5, 2017.
§ Segmentation of the time series § Structural network defining each state
§ Statistical methods of choosing the
29 Jure Leskovec, Stanford University
§ We analyzed 1 hour of driving data
§ 36,000 samples @ 10Hz
§ We observed seven sensors
§ Brake pedal position § Forward (X-)acceleration § Lateral (Y-)acceleration § Steering wheel angle § Vehicle velocity § Engine RPM § Gas Pedal Position
30 Jure Leskovec, Stanford University
31 Jure Leskovec, Stanford University
32 Jure Leskovec, Stanford University
33 Jure Leskovec, Stanford University
34 Jure Leskovec, Stanford University
35 Jure Leskovec, Stanford University
§ Green = straight, white = slowing down, red = turning, blue = speeding up § Results are very consistent across the data!
36 Jure Leskovec, Stanford University
§ Big cost for autonomous vehicles
§ Lane changes, near-accidents, etc.
37 Jure Leskovec, Stanford University
38
Jure Leskovec, Stanford University
39
[Hallac et al., 2018]
Jure Leskovec, Stanford University
Jure Leskovec, Stanford University 40
Jure Leskovec, Stanford University 41
42 Jure Leskovec, Stanford University
§ Left/Right blinker § Accelerate (gas pedal > threshold) § Hard braking (brake pedal < threshold)
4 3
Driver Identification Using Automobile Sensor Data from a Single Turn. D. Hallac, A. Sharang, R. Stahlmann, A. Lamprecht, M. Huber, M. Roehder, R. Sosic, J. Leskovec IEEE International Conference on Intelligent Transportation Systems (ITSC), 2016.
Jure Leskovec, Stanford University
44 Jure Leskovec, Stanford University
§ A) Predict ex exact act sensor values in short- term § B) Predict long-term av aver erag age sensor values § C) Correctly identify driver (out of 29 potential drivers) § D) Be used as a kn knowledge base to identify potentially risky scenarios
45 Jure Leskovec, Stanford University
§ All numbers are reported using the sa same 64-dimensional embedding
46 Jure Leskovec, Stanford University
47 Jure Leskovec, Stanford University
§ MSE vs. “time in future” of short-term prediction
48
0.0 0.5 1.0 1.5 2.0 2.5 3.0 Future Time of Prediction (seconds after end of input) 0.01 0.02 0.03 0.04 0.05 0.06 Test Set MSE Drive2Vec Long-only D2V Short-only D2V
Jure Leskovec, Stanford University
49
50 100 150 200 250 Drive2Vec Embedding Size (Number of Floats) 0.02 0.03 0.04 MSE of 1-Second Future Prediction
Jure Leskovec, Stanford University
50 Jure Leskovec, Stanford University
51 Jure Leskovec, Stanford University
§ Predict 0.1s before a “brake slam”
52 Jure Leskovec, Stanford University
§ Large shocks occur from highway to rural (both short + long expected values change)
53 Jure Leskovec, Stanford University
54 Jure Leskovec, Stanford University
5 5 Jure Leskovec, Stanford University
5 6 Jure Leskovec, Stanford University
57
§ Long short-term memory (LSTMs)
§ Type of recurrent neural network (RNN)
§ Becoming a increasingly powerful method
§ However, results are less interpretable
58 Jure Leskovec, Stanford University
§ Analytics engine that pr prioriti tizes user atte ttenti tion by combining outlier detection and high- dimensional feature selection routines at scale
59 Jure Leskovec, Stanford University
§ Not everything is this clean…
60 Jure Leskovec, Stanford University
§ What if you can predict failures before they
§ Potentially huge cost/safety benefits
61 Jure Leskovec, Stanford University
§ Bridging the gap between online and
62 Jure Leskovec, Stanford University
§ Lots of exciting research directions § More and more applications by the day
§ Bringing innovations from the online world to the real world
§ However, new and improved methods are required to keep innovating
§ Interpreting and acting on sensor data in an unsupervised way
§ We’re only at the tip of the iceberg!
63 Jure Leskovec, Stanford University
§ High-dimensional unlabeled time series data collected in real-time
Jure Leskovec, Stanford University 64
65
PhD Students Post-Doctoral Fellows Funding Collaborators Industry Partnerships
Alexandra Porter Camilo Ruiz Claire Donnat Emma Pierson Jiaxuan You Bowen Liu Mohit Tiwari Rex Ying Baharan Mirzasoleiman Marinka Zitnik Michele Catasta Srijan Kumar Rok Sosic
Research Staff
Adrijan Bradaschia Dan Jurafsky, Linguistics, Stanford University David Grusky, Sociology, Stanford University Stephen Boyd, Electrical Engineering, Stanford University David Gleich, Computer Science, Purdue University VS Subrahmanian, Computer Science, University of Maryland Sarah Kunz, Medicine, Harvard University Russ Altman, Medicine, Stanford University Jochen Profit, Medicine, Stanford University Eric Horvitz, Microsoft Research Jon Kleinberg, Computer Science, Cornell University Sendhill Mullainathan, Economics, Harvard University Scott Delp, Bioengineering, Stanford University James Zou, Medicine, Stanford University Shantao Li Hingwei Wang Weihua Hu
Jure Leskovec, Stanford University
Pan Li
§ Drive2Vec: Multiscale State-Space Embedding of Vehicular Sensor Data. D. Hallac, S. Bhooshan, M. Chen, K. Abida, R. Sosic, J. Leskovec. IEEE International Conference on Intelligent Transportation Systems (ITSC), 2018. § Data-Driven Model Predictive Control of Autonomous Mobility-on-Demand Systems. R. Iglesias, F. Rossi, K. Wang,
§ Network Inference via the Time-Varying Graphical Lasso. D. Hallac, Y. Park, S. Boyd, J. Leskovec.ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2017. § Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data. D. Hallac, S. Vare, S. Boyd, J.
§ Learning the Network Structure of Heterogeneous Data via Pairwise Exponential Markov Random Fields. Y. Park,
§ SnapVX: A Network-Based Convex Optimization Solver. D. Hallac, C. Wong, S. Diamond, A. Sharang, R. Sosič, S. Boyd, J. Leskovec. Journal of Machine Learning Research (JMLR), 18(4):1−5, 2017. § Driver Identification Using Automobile Sensor Data from a Single Turn. D. Hallac, A. Sharang, R. Stahlmann, A. Lamprecht, M. Huber, M. Roehder, R. Sosic, J. Leskovec IEEE International Conference on Intelligent Transportation Systems (ITSC), 2016. § Network Lasso: Clustering and Optimization in Large Graphs. D. Hallac, J. Leskovec, S. Boyd. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2015.
Jure Leskovec, Stanford University 66