A Big Data Arsenal for the 21 st Century Matt Asay VP, Marketing - - PowerPoint PPT Presentation

a big data arsenal for the 21 st century
SMART_READER_LITE
LIVE PREVIEW

A Big Data Arsenal for the 21 st Century Matt Asay VP, Marketing - - PowerPoint PPT Presentation

A Big Data Arsenal for the 21 st Century Matt Asay VP, Marketing & Business Development, MongoDB MongoDB Inc. Proprietary and Confidential 7 million downloads 150,000 online education registrations 1,000 active subscribers 20,000


slide-1
SLIDE 1

MongoDB Inc. Proprietary and Confidential

A Big Data Arsenal for the 21st Century

VP, Marketing & Business Development, MongoDB

Matt Asay

slide-2
SLIDE 2

7 million downloads 1,000 active subscribers 150,000

  • nline

education registrations 30,000 user group members 20,000 MongoDB Days attendees World’s fastest- growing database

slide-3
SLIDE 3

3

What We Don’t Do

“The relational database market is a $9 billion a year market. I want to shrink it to $3 billion and take a third of the market.”

  • Marten Mickos
slide-4
SLIDE 4

4

What We Do

Enable a Generation of Innovative, Modern Applications Previously Impossible Or Too Difficult to Achieve

slide-5
SLIDE 5

5

The Big Data Unknown

slide-6
SLIDE 6

6

Top Big Data Challenges?

Translation? Most struggle to know what Big Data is, how to manage it and who can manage it

Source: Gartner

slide-7
SLIDE 7

7

  • More than 90% of today’s data was created in the

last 2 years

  • Moore’s Law for data: Doubles at regular intervals

Big Data Is Sort of a Matter of Volume

2250 4500 6750 9000 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011

1 4 10 24 55 120 250 500 1,000 2,150 4,400 9,000

slide-8
SLIDE 8

8

Big(ger) Is the New Normal

2014 2013 2011 2010 2009 2008

2007

2006

slide-9
SLIDE 9

9

Understanding Big Data – It’s Not Very “Big”

from Big Data Executive Summary – 50+ top executives from Government and F500 firms

64% - Ingest diverse, new data in real-time 15% - More than 100TB

  • f data

20% - Less than 100TB (average of all? <20TB)

slide-10
SLIDE 10

10

Modern, Big Data Is Messy

slide-11
SLIDE 11

11

Data Now Looks Like This

slide-12
SLIDE 12

12

And This

slide-13
SLIDE 13

13

And This

slide-14
SLIDE 14

14

Doesn’t Fit Neatly into a “Spreadsheet”

  • 90% of the world’s data

was created in the last two years

  • 80% of enterprise data

is unstructured

  • Unstructured data

growing 2X faster than structured

slide-15
SLIDE 15

15

Back in 1970…Cars Were Great!

slide-16
SLIDE 16

16

So Were Computers!

slide-17
SLIDE 17

17

Lots of Great Innovations Since 1970

slide-18
SLIDE 18

18

New Tools for New Data

slide-19
SLIDE 19

Innovation As Iteration

slide-20
SLIDE 20

“I have not failed. I've just found 10,000 ways that won't work.”

― Thomas A. Edison

slide-21
SLIDE 21

21

Must Be Open Source

slide-22
SLIDE 22

22

Must Not Require Big Upfront Payment

slide-23
SLIDE 23

23

Must Not Penalize Success

“Clients can also opt to run zEC12 without a raised datacenter floor -- a first for high-end IBM mainframes.” IBM Press Release 28 Aug, 2012

slide-24
SLIDE 24

24

Must Not Impede Iteration

New Table New Table New Column Name Pet Phone Email New Column

3 months later…

slide-25
SLIDE 25

25

Must Be Based on Industry Standards

DB-Engines.com Database Ranking

Ranking Database Type Score Changes 1 Oracle Relational 1491.8

  • 8.43

2 MySQL Relational 1290.21 1.83 3 Microsoft SQL Relational 1205.28

  • 8.99

4 PostgreSQL Relational 235.06 4.61 5 MongoDB Document 199.99 4.81 6 DB2 Relational 187.32

  • 1.14

7 Microsoft Access Relational 146.48

  • 6.4

8 SQLite Relational 92.98

  • 0.03

9 Sybase Relational 81.55

  • 6.33

10 Cassandra Wide Column 78.09

  • 2.23
slide-26
SLIDE 26

26

Must Be Easy to Find Skills

slide-27
SLIDE 27

27

Must Be Easy to Learn/Use

“Organizations already have people who know their own data better than mystical data scientists….Learning Hadoop [or MongoDB] is easier than learning the company’s business.” (Gartner, 2012)

slide-28
SLIDE 28

When To Use Hadoop, Modern Databases

slide-29
SLIDE 29

29

Enterprise Big Data Stack

EDW Hadoop

Management & Monitoring Security & Auditing

RDBMS CRM, ERP, Collaboration, Mobile, BI OS & Virtualization, Compute, Storage, Network RDBMS

Applications Infrastructure Data Management Online Data Offline Data

slide-30
SLIDE 30

30

Consideration – Online vs. Offline

  • Long-running
  • High-Latency
  • Availability is lower priority
  • Real-time
  • Low-latency
  • High availability

Online Offline

vs.

slide-31
SLIDE 31

31

Hadoop Is Good for…

Risk Modeling Churn Analysis Recommendation Engine Ad Targeting Transaction Analysis Trade Surveillance Network Failure Prediction Search Quality Data Lake

slide-32
SLIDE 32

32

MongoDB/NoSQL Is Good for…

360° View of the Customer Mobile & Social Apps Fraud Detection User Data Management Content Management & Delivery Reference Data Product Catalogs Machine to Machine Apps Data Hub

slide-33
SLIDE 33

How To Use The Two Together?

slide-34
SLIDE 34

34

Finding Waldo

slide-35
SLIDE 35

35

Predictive Analytics

Government

  • Predictive analytics system

for crime, health issues

  • Diverse, unstructured (incl.

geospatial) data from 30+ agencies

  • Correlate data in real-time
  • Long-form trend analysis
  • MongoDB data dumped into

Hadoop, analyzed, re-inserted into MongoDB for better real- time response

Algorithms

MongoDB + Hadoop

slide-36
SLIDE 36

36

Machine Learning

Ad-Serving

  • Catalogs and products
  • User profiles
  • Clicks
  • Views
  • Transactions
  • User segmentation
  • Recommendation engine
  • Prediction engine

Algorithms

MongoDB Connector for Hadoop

slide-37
SLIDE 37

37

  • Modern data is messy
  • Your data infrastructure must support iteration
  • Modern data infrastructure market is crowded

– But clear winners are distinguishing themselves – Bet on general purpose over niche, popular over

  • bscure, open source over proprietary
  • Use MongoDB + Hadoop together

Remember…

slide-38
SLIDE 38

@mjasay Don’t believe me? MongoDB booth on Floor 3