Big data projects
Juan Carlos Plaza juancarlos.plaza@bbva.com
Big data projects Juan Carlos Plaza juancarlos.plaza@bbva.com 1 1 - - PowerPoint PPT Presentation
Big data projects Juan Carlos Plaza juancarlos.plaza@bbva.com 1 1 1 0 1 1 1 1 1 0 0 0 1 0 0 1 0 1 0 0 0 1 1 0 1 1 0 0 0 1 0 0 1 0 0 0 0 0 1 1 0 1 0 1 0 0 0 0 0 1 0 0 1 0 0 0 0 1 1 1 0 0
Juan Carlos Plaza juancarlos.plaza@bbva.com
2 2 2
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
3 3 3
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
4
5 5
6 6
7
9
Sector My customers Area
in other sectors & areas? Non customers in
& areas My customers in my sector & area? My customers in my sector? mers tor & My customers in my area? Non customers in my sector? Non customers in my area? Non customers in my sector & area?
10
Sector My customers Area
Diversification Diversification Loyalty Expansion y Expansion Opening hours New portfolio Offers Crosselling
11
12
13
14
MADRILEÑOS CLIENTES
Patrón madrileños vs clientes Patrón de compra antes/después
15
16
application suite for retailers
understand and act upon their business, customers and context.
Decision Examples:
16
18
2011-2012 spending in Madrid by nationality
19
21 21
Qué adquieren los residentes fuera de Sant Cugat Qué adquieren los residentes dentro de Sant Cugat
Madrid Gay Pride
Madrid Gay Pride
Incremento MADO 2011 vs 2012
13,8% -0,8% 9,1% 3,7% 9,8% 4,01% 23,62% 19,48%
Incremento 2011 vs semana control 2011
24
Transactions by street / week of the event
Madrid Gay Pride
Innova Challenge API
What is it?
The InnovaChallenge Data API offers aggregated statistics of spending by geographical area, temporal period and commercial category, accesible through a REST services API.
Innova Challenge API
Which are the data sources? (1/2)
The statistics exposed through the InnovaChallenge Data API come from a dataset that contains transactions performed with BBVA cards, properly anonymized and aggregated. Scope:
Some metrics about the data:
Overall, BBVA has a 15-20% market share in card payments.
Innova Challenge API
Which are the data sources? (2/2)
Each transaction has several parameters associated that describe the context of the purchase made:
This data, processed and aggregated, provide the basis for the API
a given geographical area, for a specific activity sector, for a specific timeframe and a customer segment.
The API offers data about geographical areas in two different partitions for the Madrid and Barcelona provinces:
lon coordinate. The cells are centered in coordinates with the third decimal place set to 0 or 5. Some examples:
(40.415, -3.705), (40.420, -3.705), (40.415, -3.710), (40.420, -3.710)
such cell.
Statistics: spatial detail
Innova Challenge API
considered to be the week that has more than 4 days within the given year.
month, just specify a day that belongs to the period of interest in the API call.
Statistics: temporal detail
Innova Challenge API
41 40 42 43 44
There are 16 different store categories / types of activity:
Statistics: store categories
There are also aggregates for all categories at once.
Innova Challenge API
Available statistics services
Commercial categories
Available s
Innova Challenge API
The statistics in the services are always referred to an area, a commercial category and a temporal aggregation.
1st service: customer segments
Given an area, a commercial category and a temporal aggregation, the service returns:
transactions for each one of 14 demographic segments and one segment belonging to corporate cards. It provides insights on how each customer segment spends their money. Restriction: no results are given if based on less than 3 cards per segment or less than 5 stores per category.
Innova Challenge API
Available statistics services
Innova Challenge API
Available statistics services
2nd service: purchase patterns
Given an area, a commercial category and a month, the service returns:
standard deviation and mode.
transactions for each hour of the day and each day of the week (aggregated patterns computed over the course of a month) Provides insights on typical purchase patterns Restriction: no results are given if based on less than 3 cards
Innova Challenge API
Available statistics services
3rd service: zip codes of residence of the customers.
Given an area, a commercial category and a temporal aggregation, the service returns the list of the top 100 zip codes of procedence of customers making purchases in that given area, ordered by the following criteria:
transactions The service returns also the values for these criteria. Provides insights on the area of influence of the stores that are located in a given area. Restriction: no results are given if based on less than 3 cards per zip code or less than 5 stores per category.
Innova Challenge API
Additional services
Commercial categories This service returns the commercial categories tree that can be used to call the other API services. It returns the category id and the description string in English and Spanish.
Innova Challenge API
Access to the API
The data is accesible through a REST API that provides the three data services and the additional commercial categories service. It is necessary to register first at the BBVA Developer Center:
http://developer.bbva.com
The registration process currently requires the user to specify an application. Once it is registered, the Developer Center provides a pair of app_key and app_secret that will allow you to authenticate yourself and consume the services.