Applied Spark
From Concepts to Bitcoin Analytics
Andrew F. Hart – ahart@apache.org | @andrewfhart
Applied Spark From Concepts to Bitcoin Analytics Andrew F. Hart - - PowerPoint PPT Presentation
Applied Spark From Concepts to Bitcoin Analytics Andrew F. Hart ahart@apache.org | @andrewfhart My Day Job CTO, Pogoseat Upgrade technology for live events 3/28/16 QCON-SP Andrew Hart 2 Additionally Member, Apache Software
Andrew F. Hart – ahart@apache.org | @andrewfhart
3/28/16 QCON-SP Andrew Hart 2
3/28/16 QCON-SP Andrew Hart 3
3/28/16 QCON-SP Andrew Hart 4
3/28/16 QCON-SP Andrew Hart 5
3/28/16 QCON-SP Andrew Hart 6
3/28/16 QCON-SP Andrew Hart 7
3/28/16 QCON-SP Andrew Hart 8
3/28/16 QCON-SP Andrew Hart 9
3/28/16 QCON-SP Andrew Hart 10
3/28/16 QCON-SP Andrew Hart 11
3/28/16 QCON-SP Andrew Hart 12
3/28/16 QCON-SP Andrew Hart 13
3/28/16 QCON-SP Andrew Hart 14
3/28/16 QCON-SP Andrew Hart 15
3/28/16 QCON-SP Andrew Hart 16
3/28/16 QCON-SP Andrew Hart 17
3/28/16 QCON-SP Andrew Hart 18
3/28/16 QCON-SP Andrew Hart 19
3/28/16 QCON-SP Andrew Hart 20
3/28/16 QCON-SP Andrew Hart 21
3/28/16 QCON-SP Andrew Hart 22
3/28/16 QCON-SP Andrew Hart 23
3/28/16 QCON-SP Andrew Hart 24
3/28/16 QCON-SP Andrew Hart 25
3/28/16 QCON-SP Andrew Hart 26
3/28/16 QCON-SP Andrew Hart 27
3/28/16 QCON-SP Andrew Hart 28
3/28/16 QCON-SP Andrew Hart 29
3/28/16 QCON-SP Andrew Hart 30
3/28/16 QCON-SP Andrew Hart 31
3/28/16 QCON-SP Andrew Hart 32
3/28/16 QCON-SP Andrew Hart 33
Data on Disk Map-1 Map-2 . . . Map-n Tuples
Reduce-1 Reduce-2 . . . Reduce-n Tuples
HDFS Read HDFS Write HDFS Read HDFS Write
3/28/16 QCON-SP Andrew Hart 34
Data on Disk Map-1 Map-2 . . . Map-n Cluster Memory Reduce-1 Reduce-2 . . . Reduce-n Data on Disk HDFS Read HDFS Write RDD
3/28/16 QCON-SP Andrew Hart 35
3/28/16 QCON-SP Andrew Hart 36
3/28/16 QCON-SP Andrew Hart 37
3/28/16 QCON-SP Andrew Hart 38
3/28/16 QCON-SP Andrew Hart 39
3/28/16 QCON-SP Andrew Hart 40
3/28/16 QCON-SP Andrew Hart 41
3/28/16 QCON-SP Andrew Hart 42
3/28/16 QCON-SP Andrew Hart 43
3/28/16 QCON-SP Andrew Hart 44
3/28/16 QCON-SP Andrew Hart 45
3/28/16 QCON-SP Andrew Hart 46
3/28/16 QCON-SP Andrew Hart 47
3/28/16 QCON-SP Andrew Hart 48
3/28/16 QCON-SP Andrew Hart 49
3/28/16 QCON-SP Andrew Hart 50
3/28/16 QCON-SP Andrew Hart 51
3/28/16 QCON-SP Andrew Hart 52
3/28/16 QCON-SP Andrew Hart 53
3/28/16 QCON-SP Andrew Hart 54
3/28/16 QCON-SP Andrew Hart 55
3/28/16 QCON-SP Andrew Hart 56
3/28/16 QCON-SP Andrew Hart 57
3/28/16 QCON-SP Andrew Hart 58
3/28/16 QCON-SP Andrew Hart 59
3/28/16 QCON-SP Andrew Hart 60
3/28/16 QCON-SP Andrew Hart 61
3/28/16 QCON-SP Andrew Hart 62
3/28/16 QCON-SP Andrew Hart 63
Master Node 6 CPU Cores 8GB RAM 6 CPU Cores 8GB RAM 6 CPU Cores 8GB RAM
3/28/16 QCON-SP Andrew Hart 64
3/28/16 QCON-SP Andrew Hart 65
3/28/16 QCON-SP Andrew Hart 66
3/28/16 QCON-SP Andrew Hart 67
3/28/16 QCON-SP Andrew Hart 68
3/28/16 QCON-SP Andrew Hart 69
3/28/16 QCON-SP Andrew Hart 70
3/28/16 QCON-SP Andrew Hart 71
image credit: https://databricks.com/spark/about
3/28/16 QCON-SP Andrew Hart 72
image credit: https://amplab.cs.berkeley.edu/software/
3/28/16 QCON-SP Andrew Hart 73
3/28/16 QCON-SP Andrew Hart 74