AI and Predictive Analytics in Data-Center Environments
Distributed Computing using Spark
An Introduction to Spark Environments Josep Ll. Berral @BSC
Intel Academic Education Mindshare Initiative for AI
AI and Predictive Analytics in Data-Center Environments Distributed - - PowerPoint PPT Presentation
AI and Predictive Analytics in Data-Center Environments Distributed Computing using Spark An Introduction to Spark Environments Josep Ll. Berral @BSC Intel Academic Education Mindshare Initiative for AI Presentation Distributed computing
Intel Academic Education Mindshare Initiative for AI
d1
exp
d2
exp
d3
exp
d1
exp
d2
exp
d3
exp Data
in HDFS
d1 d2 r1 r2 r1 r2
Data Exchange
r2 r1
X 2 X 16GB X 1TB
CPU Mem Disk
X 4 X 32GB X 2TB
CPU Mem Disk
VM/Container manager: “Deploy N workers and 1 master” “Create a virtual network to let them see each other” ”Give them a common configuration (master can find the workers, workers can find the DFS