Big data projects, by Juan Carlos Plaza (juancarlos.plaza@bbva.com)


slide-1
SLIDE 1

Big data projects

Juan Carlos Plaza juancarlos.plaza@bbva.com

slide-2
SLIDE 2

slide-3
SLIDE 3

slide-4
SLIDE 4

26M transactions / day

slide-5
SLIDE 5

slide-6
SLIDE 6

Credit access

slide-7
SLIDE 7

15% 40%

slide-8
SLIDE 8
slide-9
SLIDE 9

Diagram: Sector / My customers / Area

  • My customers in other sectors & areas?
  • Non customers in other sectors & areas?
  • My customers in my sector & area?
  • My customers in my sector?
  • My customers in my area?
  • Non customers in my sector?
  • Non customers in my area?
  • Non customers in my sector & area?

slide-10
SLIDE 10

Diagram: Sector / My customers / Area

  • Cross-selling
  • Diversification
  • Loyalty
  • Expansion
  • Opening hours
  • New portfolio
  • Offers

slide-11
SLIDE 11

slide-12
SLIDE 12

slide-13
SLIDE 13

slide-14
SLIDE 14

MADRID RESIDENTS / CUSTOMERS

Pattern: Madrid residents vs customers

Purchase pattern before/after

slide-15
SLIDE 15

  • Commerce 360
slide-16
SLIDE 16

  • C360 is a data-driven application suite for retailers of any size to help them understand and act upon their business, customers and context.

Decision examples:

slide-17
SLIDE 17
slide-18
SLIDE 18

world

2011-2012 spending in Madrid by nationality

slide-19
SLIDE 19

country

slide-20
SLIDE 20

region

slide-21
SLIDE 21

What residents outside Sant Cugat purchase

What residents inside Sant Cugat purchase

city

slide-22
SLIDE 22

neighbourhood

Madrid Gay Pride

slide-23
SLIDE 23

Madrid Gay Pride

Increase MADO 2011 vs 2012

13.8% -0.8% 9.1% 3.7% 9.8% 4.01% 23.62% 19.48%

Increase 2011 vs control week 2011

slide-24
SLIDE 24

Transactions by street / week of the event

Madrid Gay Pride

slide-25
SLIDE 25

InnovaChallenge Data API

slide-26
SLIDE 26

Innova Challenge API

What is it?

The InnovaChallenge Data API offers aggregated spending statistics by geographical area, time period and commercial category, accessible through a REST API.

slide-27
SLIDE 27

Innova Challenge API

Which are the data sources? (1/2)

The statistics exposed through the InnovaChallenge Data API come from a dataset that contains transactions performed with BBVA cards, properly anonymized and aggregated.

Scope:

  • Madrid and Barcelona provinces
  • Timespan: 2012-11 – 2013-04
  • Classified into commercial sectors of activity

Some metrics about the data:

  • More than 30 million transactions
  • More than 2 million cards
  • More than 200,000 stores

Overall, BBVA has a 15-20% market share in card payments.

slide-28
SLIDE 28

Innova Challenge API

Which are the data sources? (2/2)

Each transaction has several associated parameters that describe the context of the purchase:

  • Transaction amount
  • Time & date of the purchase (timestamp)
  • Store location (lat/lon coordinates).
  • Commercial category of the store.
  • Demographic segment of the cardholder.
  • Zip code of residence of the cardholder.

These data, processed and aggregated, provide the basis for the API services. The services give relevant insights into the commercial activity in a given geographical area, for a specific activity sector, timeframe and customer segment.
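As an illustration of the parameters listed above, one transaction could be modelled as the record below. This is only a sketch: the field names, types and sample values are assumptions made for the example, not the actual schema behind the API.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Transaction:
    """One card purchase; field names are hypothetical."""
    amount: float             # transaction amount (currency not stated in the slides)
    timestamp: datetime       # time and date of the purchase
    store_lat: float          # store location, latitude
    store_lon: float          # store location, longitude
    store_category: str       # commercial category of the store
    cardholder_segment: str   # demographic segment of the cardholder
    cardholder_zip: str       # zip code of residence, kept as text to preserve leading zeros


# Invented sample values, for illustration only.
example = Transaction(
    amount=23.50,
    timestamp=datetime(2012, 11, 5, 13, 30),
    store_lat=40.4168,
    store_lon=-3.7038,
    store_category="Bars and restaurants",
    cardholder_segment="segment-1",
    cardholder_zip="28013",
)
```

Keeping the zip code as a string matters for Barcelona, where codes start with 0.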

slide-29
SLIDE 29

Innova Challenge API

Statistics: spatial detail

The API offers data about geographical areas in two different partitions for the Madrid and Barcelona provinces:

  • Zip codes
  • Zoom 2 cells (450x550m)

These cells have a size of half a hundredth of a degree (0.005°) in latitude and longitude. The cells are centered on coordinates with the third decimal place set to 0 or 5. Some examples:

(40.415, -3.705), (40.420, -3.705), (40.415, -3.710), (40.420, -3.710)

To ask for a cell, it is enough to call the services with a point contained in that cell.
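The cell grid can be sketched as a rounding step: snap each coordinate to the nearest multiple of 0.005. The function below is a minimal illustration, assuming each cell extends half a step (0.0025) on either side of its center; `cell_center` is a hypothetical helper, not part of the API.

```python
from decimal import Decimal, ROUND_HALF_EVEN

CELL_STEP = Decimal("0.005")  # half of a hundredth of a degree


def cell_center(lat, lon):
    """Return the center of the grid cell assumed to contain (lat, lon)."""
    def snap(value):
        # Count whole steps from zero, rounding to the nearest step.
        steps = (Decimal(str(value)) / CELL_STEP).to_integral_value(
            rounding=ROUND_HALF_EVEN
        )
        return float(steps * CELL_STEP)

    return snap(lat), snap(lon)


center = cell_center(40.4168, -3.7038)
```

Since the services accept any point inside the cell, this snapping is not needed to call them; it is mainly useful for grouping your own points by cell.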

slide-30
SLIDE 30

Innova Challenge API

Statistics: temporal detail

  • Timeframe: 2012-11 – 2013-04
  • Data is aggregated by weeks and months.
  • Week number 1 in a given year is considered to be the week that has more than 4 days within the given year.
  • To ask for statistics of a given week or month, just specify a day that belongs to the period of interest in the API call.
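The week-numbering rule can be sketched as follows. Two assumptions are made here that the slides do not state: weeks start on Monday, and "more than 4 days" is read literally as 5 or more. `week_one_start` and `week_of` are hypothetical helpers.

```python
from datetime import date, timedelta


def week_one_start(year, min_days=5):
    """Monday that opens week 1 of `year` (assumed Monday-based weeks)."""
    jan1 = date(year, 1, 1)
    monday = jan1 - timedelta(days=jan1.weekday())  # Monday of the week holding Jan 1
    days_in_year = 7 - jan1.weekday()               # days of that week inside `year`
    return monday if days_in_year >= min_days else monday + timedelta(days=7)


def week_of(day):
    """Return (year, week number) for `day` under the rule above."""
    # A late-December day may belong to week 1 of the next year, and an
    # early-January day to the last week of the previous one.
    for year in (day.year + 1, day.year, day.year - 1):
        start = week_one_start(year)
        if day >= start:
            return year, (day - start).days // 7 + 1


first_week = week_of(date(2012, 11, 5))
```

Within the 2012-11 to 2013-04 timeframe this reading gives the same numbers as ISO 8601 week numbering; the two can differ in years where 1 January falls on a Thursday.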

slide-31
SLIDE 31

Innova Challenge API

Statistics: store categories

There are 16 different store categories / types of activity:

  • Travel
  • Groceries
  • Hypermarkets
  • Hotels
  • Real estate
  • Automotion
  • Bars and restaurants
  • Personal care
  • Sports & toys
  • Technology
  • Home
  • Contents
  • Fashion
  • Leisure
  • Health
  • Transportation

There are also aggregates for all categories at once.

slide-32
SLIDE 32

Innova Challenge API

Available statistics services

Commercial categories

The statistics in the services always refer to an area, a commercial category and a temporal aggregation.

slide-33
SLIDE 33

Innova Challenge API

Available statistics services

1st service: customer segments

Given an area, a commercial category and a temporal aggregation, the service returns:

  • Average spending
  • Number of transactions
  • Number of unique cards that have performed the transactions

for each one of 14 demographic segments and one segment belonging to corporate cards.

It provides insights on how each customer segment spends their money.

Restriction: no results are given if based on fewer than 3 cards per segment or fewer than 5 stores per category.
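The per-segment aggregation and its restriction can be sketched as below. This is an illustration only: the input layout, the output keys and the sample rows are assumptions, and the real service computes its results server-side.

```python
from collections import defaultdict

MIN_CARDS = 3   # minimum unique cards per segment
MIN_STORES = 5  # minimum stores in the queried category


def segment_stats(rows):
    """rows: (segment, card_id, store_id, amount) for one area, category and period."""
    rows = list(rows)
    # Too few stores in the category: publish nothing at all.
    if len({store for _, _, store, _ in rows}) < MIN_STORES:
        return {}
    by_segment = defaultdict(list)
    for segment, card, _, amount in rows:
        by_segment[segment].append((card, amount))
    stats = {}
    for segment, items in by_segment.items():
        cards = {card for card, _ in items}
        if len(cards) < MIN_CARDS:
            continue  # suppress segments backed by too few cards
        amounts = [amount for _, amount in items]
        stats[segment] = {
            "avg": sum(amounts) / len(amounts),
            "num_payments": len(amounts),
            "num_cards": len(cards),
        }
    return stats


# Invented sample rows: segment "B" has only two cards and is suppressed.
sample_rows = [
    ("A", "c1", "s1", 10.0), ("A", "c2", "s2", 20.0), ("A", "c3", "s3", 30.0),
    ("A", "c1", "s4", 40.0), ("B", "c4", "s5", 5.0), ("B", "c5", "s1", 7.0),
]
result = segment_stats(sample_rows)
```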

slide-34
SLIDE 34

Innova Challenge API

Available statistics services

2nd service: purchase patterns

Given an area, a commercial category and a month, the service returns:

  • For the transaction amounts: average, minimum, maximum, standard deviation and mode
  • Number of transactions
  • Number of unique cards that have performed the transactions

for each hour of the day and each day of the week (aggregated patterns computed over the course of a month).

It provides insights on typical purchase patterns.

Restriction: no results are given if based on fewer than 3 cards or fewer than 5 stores per category.
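The amount statistics of the purchase patterns service can be sketched with Python's `statistics` module. The example groups by hour of the day only; grouping by `timestamp.weekday()` would work the same way. It uses the population standard deviation, which is an assumption (the slides do not say which variant the service uses), and it leaves out the card and store restriction for brevity.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean, mode, pstdev


def hourly_pattern(purchases):
    """purchases: (timestamp, amount) pairs for one area, category and month."""
    by_hour = defaultdict(list)
    for timestamp, amount in purchases:
        by_hour[timestamp.hour].append(amount)
    return {
        hour: {
            "avg": mean(amounts),
            "min": min(amounts),
            "max": max(amounts),
            "std": pstdev(amounts),  # population standard deviation (assumed)
            "mode": mode(amounts),
            "num_payments": len(amounts),
        }
        for hour, amounts in sorted(by_hour.items())
    }


# Invented sample purchases, for illustration only.
pattern = hourly_pattern([
    (datetime(2012, 11, 5, 13, 5), 10.0),
    (datetime(2012, 11, 6, 13, 40), 10.0),
    (datetime(2012, 11, 7, 13, 15), 40.0),
    (datetime(2012, 11, 5, 20, 0), 25.0),
])
```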
slide-35
SLIDE 35

Innova Challenge API

Available statistics services

3rd service: zip codes of residence of the customers.

Given an area, a commercial category and a temporal aggregation, the service returns the list of the top 100 zip codes of origin of customers making purchases in that area, ordered by the following criteria:

  • Total spending
  • Number of transactions
  • Number of unique cards that have performed the transactions

The service also returns the values for these criteria.

It provides insights on the area of influence of the stores located in a given area.

Restriction: no results are given if based on fewer than 3 cards per zip code or fewer than 5 stores per category.
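The ranking behind this service can be sketched as follows. The input layout, the key names and the sample rows are assumptions for the example, and only the per-zip-code card restriction is applied (the store restriction is left out for brevity).

```python
from collections import defaultdict


def top_zip_codes(rows, order_by="spending", limit=100, min_cards=3):
    """rows: (home_zip, card_id, amount) for one area, category and period."""
    totals = defaultdict(lambda: {"spending": 0.0, "num_payments": 0, "cards": set()})
    for zip_code, card, amount in rows:
        entry = totals[zip_code]
        entry["spending"] += amount
        entry["num_payments"] += 1
        entry["cards"].add(card)
    ranked = [
        {
            "zip": zip_code,
            "spending": entry["spending"],
            "num_payments": entry["num_payments"],
            "num_cards": len(entry["cards"]),
        }
        for zip_code, entry in totals.items()
        if len(entry["cards"]) >= min_cards  # suppress zip codes with too few cards
    ]
    ranked.sort(key=lambda item: item[order_by], reverse=True)
    return ranked[:limit]


# Invented sample rows: zip code "08001" has only two cards and is suppressed.
sample = [
    ("28013", "a", 10.0), ("28013", "b", 20.0), ("28013", "c", 30.0),
    ("28004", "d", 5.0), ("28004", "e", 5.0), ("28004", "f", 5.0), ("28004", "d", 5.0),
    ("08001", "g", 99.0), ("08001", "h", 99.0),
]
by_spending = [item["zip"] for item in top_zip_codes(sample)]
by_payments = [item["zip"] for item in top_zip_codes(sample, order_by="num_payments")]
```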

slide-36
SLIDE 36

Innova Challenge API

Additional services

Commercial categories

This service returns the tree of commercial categories that can be used to call the other API services. It returns the category id and the description string in English and Spanish.

slide-37
SLIDE 37

Innova Challenge API

Access to the API

The data is accessible through a REST API that provides the three data services and the additional commercial categories service. You must first register at the BBVA Developer Center:

http://developer.bbva.com

The registration process currently requires you to specify an application. Once the application is registered, the Developer Center provides an app_key and app_secret pair that allows you to authenticate and consume the services.
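A request carrying the key pair could be prepared as in the sketch below. The slides do not say how app_key and app_secret are transmitted, so sending them as HTTP Basic credentials is an assumption, and the URL is a placeholder rather than a real endpoint; check the Developer Center documentation for the actual scheme. The request is only built, not sent.

```python
import base64
from urllib.request import Request


def build_request(url, app_key, app_secret):
    """Build (but do not send) a request with assumed HTTP Basic credentials."""
    raw = f"{app_key}:{app_secret}".encode("utf-8")
    token = base64.b64encode(raw).decode("ascii")
    return Request(
        url,
        headers={
            "Authorization": f"Basic {token}",  # assumed scheme, not confirmed by the slides
            "Accept": "application/json",
        },
    )


# Placeholder URL and credentials, for illustration only.
request = build_request(
    "https://api.example.invalid/categories", "my_app_key", "my_app_secret"
)
```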

slide-38
SLIDE 38

Thanks!