1
Pick up a handout
- n the front table
Pick up a handout on the front table 1 Welcome to DS504/CS586: - - PowerPoint PPT Presentation
Pick up a handout on the front table 1 Welcome to DS504/CS586: Big Data Analytics --Review Prof. Yanhua Li Time: 6:00pm 8:50pm R Location: AK232 Fall 2016 Today 1. Review Key topics, techniques, discussed in the semester 2.
1
Time: 6:00pm –8:50pm R Location: AK232 Fall 2016
– Key topics, techniques, discussed in the semester
– Big data analytics – Urban Computing
10 min Break
4
5
Graph Mining, Data Clustering Recommender systems, Deep Learning
Urban Computing, Social Network Analysis Networking Indexing, Query Processing Error Correction, Map-Matching Representative data collection: Sampling Techniques Sampling and index Clustering
More techniques
Topics in Big Data Mining 1 Graph Mining: 2 Clustering Hierarchical K-means, BFR DBScan, DENCLUE Graph Sampling Node Importance Ranking
Deep Neural Networks 3 Recommender Systems Content-Based Collaborative Filtering User-User Based Item-Item Based Facebook/Social graph estimation Social influence Topic sensitive PageRank Trajectory clustering Location-based recommender sys Personalized Geo-Social Recom. Alpha Go
– Random prefix/region/zoomin/region sampling – Index structure: B-Tree, Quad-tree, R-tree, etc
– Hirachical – K-means, DBScan
etc
9
1.1 Random sampling (uniform & independent)
} vertex sampling } BFS sampling
9
} random walk sampling } edge sampling
10
11
v We’ll learn about – Advanced Techniques for Big Data Analytics
– Applications with Big Data Analytics
v Learning outcomes
– Understand & Explain challenges and advances in the state-of-art in big data analytics. – Design, develop and fully execute a big data analytics project. – Communicate the ideas effectively in the form of a presentation and written documents to a technical audience.
Graph Mining, Data Clustering Recommender systems, Deep Learning
Urban Computing, Social Network Analysis Networking Indexing, Query Processing Error Correction, Map-Matching Representative data collection: Sampling Techniques Sampling and index Clustering
More techniques
13
14
Logistics 15
v Focus more on critical thinking, problem
v Understand, formulate and solve problems v Read and critique research papers v Two Course Projects v Oral presentation v Team Work, v Coding,
– Projects (40%)
– Final reports in the discussion forum (by 11:59pm 12/13); – Self-and-peer evaluation form for project 2 (by 11:59PM 12/13);
– Written work (30%):
– Oral work (30%):
fp fg t Nt ɵ Na v dv fr w Np α
Categories Regions Categories Categories Regions FeaturesA
X = R×U Z Time slots Regions Y Y = T×RT XYt-1 Fm(t-1)
t-1
Ft(t-1) Fh(t-1) Fm(t)
t
Ft(t) Fh(t) Fm(t+1)
t+1
Ft(t+1) Fh(t+1) Yt Yt-1 cx
ANN
w'11 w'qr w1 wr wpq w11 b1 bq b'r b'1
b''
18
v 12/15 R
v 22 min each team (including Q&A)
v Team 1 v Team 2 v Team 3 v Team 4 v Team 5 v Team 6 v Snacks and Drinks will be provided.
19
c1 c2 c3 c4 g1 g2 g3 Tr1 Tr2 Tr3 Tr4 Tr5 n1 n2 n3
(a) EV Charging Station Placement (b) Advertisement Placement (c) Observation Station Placement
c5 g1 u2 u1 u3 u4
24