From Big Data Management to Big Data Science



SLIDE 1

From Big Data Management to Big Data Science

SLIDE 2

What is next?

Real big data is widely available
Only a few people know how to deal with it
You’re now one of them

Applications
The project is a start
Keep your hands dirty
Consider using the public cloud (e.g., AWS, Google Cloud, or Microsoft Azure)

SLIDE 3

Job Market

https://www.techicy.com/5-best-programming-languages-to-watch-out-in-2019-for-data-science.html

SLIDE 4

Data Science

Credits: Drew Conway

SLIDE 5

Data Science

https://mashimo.wordpress.com/2016/05/28/big-data-data-science-and-machine-learning-explained/

SLIDE 6

Data Scientist

SLIDE 7

Next Steps

CS
Big data tools
Python/R/Scala

Math/Stats
Linear algebra
Correlation analysis
Hypothesis tests

Collaboration with domain experts
Visualization
Prototyping
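
As a small illustration of the correlation analysis and hypothesis tests listed under Math/Stats, here is a minimal sketch that is not part of the original slides. It uses only the Python standard library. The data values and the helper names pearson_r and permutation_p_value are made up for illustration, and the permutation test is just one of several ways to test for correlation.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def permutation_p_value(xs, ys, trials=2000, seed=0):
    """Two-sided permutation test of the null hypothesis 'no correlation'."""
    rng = random.Random(seed)
    observed = abs(pearson_r(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(trials):
        # Shuffling ys breaks any real pairing between xs and ys.
        rng.shuffle(shuffled)
        if abs(pearson_r(xs, shuffled)) >= observed:
            hits += 1
    # Add-one smoothing keeps the estimated p-value strictly above zero.
    return (hits + 1) / (trials + 1)


# Made-up illustration data (assumption): hours studied vs. exam score.
hours = [1, 2, 3, 4, 5, 6, 7, 8]
score = [52, 55, 61, 60, 68, 72, 75, 80]

r = pearson_r(hours, score)
p = permutation_p_value(hours, score)
print(f"r = {r:.3f}, permutation p-value = {p:.4f}")
```

A small p-value here suggests that a correlation this strong would rarely arise if the pairing between the two samples were random. It says nothing about causation, which is where collaboration with domain experts comes in.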

SLIDE 8

CS

https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize

SLIDE 9

CS/Big Data

https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize

SLIDE 10

Math/Stats

https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize

SLIDE 11

Online Courses

https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize

SLIDE 12

Data Analytics

https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize

SLIDE 13

Big Data Landscape

Distributed Storage
HDFS
KV stores
LSM trees
Column stores

Query Processing
MapReduce
RDD
Hyracks

High level APIs
Pig Latin
Spark SQL
HBase

Big data packages
Algebricks
MLlib
GraphX
SparkR
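
To make the query-processing layer concrete, here is a minimal single-machine sketch of the MapReduce programming model, applied to word counting. It is an illustration only and not part of the original slides. It runs in plain Python without Hadoop or Spark, and the function names map_phase, shuffle, reduce_phase, and word_count are hypothetical.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values of one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence on an in-memory input."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Made-up input records (assumption), one string per "document".
docs = ["big data management", "big data science", "data science"]
counts = word_count(docs)
print(counts)
```

In a real cluster, the map and reduce functions run in parallel on many machines and the shuffle moves data across the network. The sketch keeps only the logical structure of the three phases.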
