What questions do you have? Analytics Accelerator 1 1 My - - PowerPoint PPT Presentation

what questions do you have
SMART_READER_LITE
LIVE PREVIEW

What questions do you have? Analytics Accelerator 1 1 My - - PowerPoint PPT Presentation

What questions do you have? Analytics Accelerator 1 1 My background Math, Stats, Data Science, Solution Architect, CS / ML Data Engineer Presales Entity Resolution and Data Quality Problem Traditional Methods Dont Scale with Number of


slide-1
SLIDE 1

1

Analytics Accelerator

What questions do you have?

1

slide-2
SLIDE 2

My background

Math, Stats, CS / ML Data Science, Data Engineer Solution Architect, Presales

slide-3
SLIDE 3

Entity Resolution and Data Quality Problem

slide-4
SLIDE 4

Traditional Methods Don’t Scale with Number of Sources

Number of Siloed Data Sources Marginal Cost Opportunity For Strategic Data Asset

slide-5
SLIDE 5

Total Landed Cost $300M+ 0.5% of Direct Spend Customers Suppliers Parts Logistics

8

Divisions

500+

ERP Systems

100K+

Suppliers

10M+

Customers

Problem Solution

Why I get excited about Enterprise: Scale, Scale, Scale

Results

Sales Revenue Marketing Impact People

Confidential

slide-6
SLIDE 6

6

CONFIDENTIAL

What most people think enterprise data looks like What enterprise data is really like - “random data salad” Prone to constant change/entropy Restructuring Leadership Changes Politics Dynamic Schema DBs - Mongo et al “Data Hoarding” Legacy Burden M&A

Dirty Little Secret: Data Variety in Enterprise

slide-7
SLIDE 7

Customer Insights

Unified buyer profiles across siloed dealer systems in 30+ geos

$500M+ Savings

From Sourcing Analytics Across Businesses

10X Reduction

In New Data Set Integration From 6 Months to 2 Weeks

5000+ Studies

Unified clinical study data to empower researchers

What Tamr Does

CONFIDENTIAL

Tamr solves the enterprise data variety problem to power transformative analytic and operational outcomes.

Video Case Study Video Case Study Video Case Study Case Study

slide-8
SLIDE 8

Most data is untreated + unprepared for expensive analytics tools

Reality for Global Corporate IT as Data Broker

HR Sales Engineering Manufacturing Marketing Divisions Finance

slide-9
SLIDE 9

Some Options

Option #1 - Deny Variety - use information that is easiest/closest Option #2 - Manage Variety incrementally - using traditional approaches:

  • Standardization
  • Aggregation
  • Master Data Management
  • Rationalize Systems
  • Throw Bodies at it
  • Improve Individual Productivity

Option #3 - Embrace Variety using probabilistic/model based approach - Tamr

slide-10
SLIDE 10

Option #1: “Deny” Variety

Use only the information that is closest, most familiar, easiest to obtain

slide-11
SLIDE 11

Traditional Data Management Approaches: Necessary but not sufficient

  • Standardization
  • Aggregation
  • Master Data Management
  • Rationalize Systems
  • Throw Bodies at it
  • Improve Individual Productivity

Option #2: “Manage” Variety Using Traditional Approaches

One Schema to Rule them All

slide-12
SLIDE 12

Traditional Data Management Approaches: Necessary but not sufficient

  • Standardization
  • Aggregation
  • Master Data Management
  • Rationalize Systems
  • Throw Bodies at it
  • Improve Individual Productivity

Option #2: “Manage” Variety Using Traditional Approaches

slide-13
SLIDE 13

Traditional Data Management Approaches: Necessary but not sufficient

  • Standardization
  • Aggregation
  • Master Data Management
  • Rationalize Systems
  • Throw Bodies at it
  • Improve Individual Productivity

Option #2: “Manage” Variety Using Traditional Approaches

slide-14
SLIDE 14

Traditional Data Management Approaches: Necessary but not sufficient

  • Standardization
  • Aggregation
  • Master Data Management
  • Rationalize Systems
  • Throw Bodies at it
  • Improve Individual Productivity

Option #2: “Manage” Variety Using Traditional Approaches

slide-15
SLIDE 15

Traditional Data Management Approaches: Necessary but not sufficient

  • Standardization
  • Aggregation
  • Master Data Management
  • Rationalize Systems
  • Throw Bodies at it
  • Improve Individual Productivity

Option #2: “Manage” Variety Using Traditional Approaches

slide-16
SLIDE 16

Traditional Data Management Approaches: Necessary but not sufficient

  • Standardization
  • Aggregation
  • Master Data Management
  • Rationalize Systems
  • Throw Bodies at it
  • Improve Individual Productivity

Option #2: “Manage” Variety Using Traditional Approaches

slide-17
SLIDE 17

Logical Evolution to Probabilistic/Model-Based Approach

Probabilistic Deterministic Probabilistic Deterministic

Today Future

Probabilistic (Tamr) Complements, NOT Replaces, Deterministic (MDM)

slide-18
SLIDE 18

Managing enterprise information as an asset requires a new, bottom-up design pattern

Combine Consolidate Classify

ALL your metadata and map it to logical entities Entities and attributes to remove information silos Organize your data into an analytics-ready hierarchy

Option #3: “Embrace” Variety -- Tamr’s NextGen Approach

slide-19
SLIDE 19

19

CONFIDENTIAL

The Two Second Rule.

Aythig that takes a hua loger tha two seconds is probably unlikely for ML to autoatically lear.

  • Andrew Ng, Chief Scientist, Baidu