Helixa
Audience Projection of Target Consumers over Multiple Domains: a NER and Bayesian approach
Gianmario Spacagna Chief Scientist @ Helixa O’Reilly AI Conference London, 16th October 2019
Helixa Audience Projection of Target Consumers over Multiple - - PowerPoint PPT Presentation
Helixa Audience Projection of Target Consumers over Multiple Domains: a NER and Bayesian approach Gianmario Spacagna Chief Scientist @ Helixa OReilly AI Conference London, 16th October 2019 About Me 7+ years experience in Data Science and
Audience Projection of Target Consumers over Multiple Domains: a NER and Bayesian approach
Gianmario Spacagna Chief Scientist @ Helixa O’Reilly AI Conference London, 16th October 2019
About Me
7+ years experience in Data Science and Machine Learning Currently leading a team of ML Scientists and ML Engineers Background in Telematics and Software Engineering of Distributed Systems Ongoing MBA Student Co-author of Python Deep Learning Contributor of the Professional Data Science Manifesto Blogger of Data Science Vademecum Founder of the Data Science Milan community (1.4k members) Stockholm, London, Milan Gianmario Spacagna Chief Scientist, Helixa gspacagna@helixa.ai
DEMOGRAPHICS
HHI < 40K Female 18 - 24
INFLUENCERS
ODESZA Cardi B Shane Dawson James Charles
Helixa is Market Research platform that uses AI to integrate disparate data sources into an enriched view of the consumers who matter to your business.
INTERESTS
Listen to Podcasts Kylie Cosmetics Fan Starbucks Chipotle
PSYCHOGRAPHICS
Fast Food Fans Fashion Enthusiasts Entertainment Junkies
In the next 40 minutes... OUR GOAL: Discuss some of the current challenges of traditional market research and propose a novel solution based on Named Entity Recognition (NER) and Bayesian Inference.
Applied Social Science
What is Market Research? Gain Insights for Strategic Decisions
Information about individuals and organizations Statistical Inference
Why Market Research matters?
Brands Perceptions Consumers Preferences and Behaviors Buyer Personas Market Segmentation Identify Opportunities Market Trends
Opinions and individual experiences In-depth interviews Smaller sample
Qualitative Quantitative
Numbers and Data Statistics Larger sample
Quantitative Market Research is conducted with Surveys
Define Analyze Distribute Collect Design
Limitations of Surveys
Expensive Invasive Response Bias Predefined questions Narrow coverage
Market Research using “Implicit Consumers Feedback”
Define Analyze Distribute Collect Design
e.g. Social Listening
Inferring Interests from Twitter Interactions
Advantages of Implicit Consumer Feedback Approaches
Flexible costs Wide view Opportunities for Big Data and AI Mass coverage Spontaneous
What about other information?
The Universe of Consumers Datasets
Social Media Financial and Properties Behaviors First Party (CSM) Consumer Research Surveys
SCATTERED PARTIAL SKEWED
M A L E F E M A L E
18-30 31-43 44-56 57-70
Individual Consumers Datasets are Far From Being Exhaustive
ALL IN ONE COMPLETE REPRESENTATIVE
M A L E F E M A L E
18-30 31-43 44-56 57-70
The Holy Grail of Market Research
What is look-alike fusion?
Left: Social Network Panel Right: Consumptions Survey Panel
Assignment Optimization Problem
Well-known solutions:
Datasets Fusion
X X X X X X X X X X X
Left User Right User left-only entities right-only entities Target Audience =
Look-alike Fusions Requires a Main Panel Centrality
Look-alike Fusions Don’t Scale Well
Differences in feature space Craftsmanship required at each change of data Universal objective function to optimize
Audience Projection defined as “User Binary Classification”
Source: Social Network Panel Destination: Consumptions Survey Panel
70M Social accounts 200M U.S. consumers 1.6M / 26M / TRUE FALSE TRUE FALSE
Target Audience =
PROJECTION
Ben & Jerry’s: bought in last 6 months? Affinity: 1.80x Venmo: paid in last 30 days? Affinity: 1.6x Angry Orchard: drunk in last 6 months? Affinity: 1.50x
Solution = Named Entity Recognition (NER) + Bayesian Model
Social Pages Consumption Questions NER NER BAYESIAN MODEL ENTITY LINKING (NEL)
Destination: Consumptions Survey Panel Source: Social Network Panel Projected Users Probabilities Target Audience
Entities Represent an Universal Feature Space
Social Pages Consumption Questions Listed Products NER NER NER
The Coca-Cola Company is a total beverage company, offering over 500 brands in more than 200 countries and territories.
Named Entity Recognition(NER) in each Domain
Social Pages Consumption Questions Listed Products
Adidas Originals Men's Relaxed Strapback Cap Coca-Cola KWC-4 6-Can Personal Mini 12V DC Car and 110V AC Cooler, Red
NLP Libraries with NER capability
Why for Production?
Fast Accurate Industry-grade maturity
example of NER usage
Same Entity May Exist with Different Spellings
Linking and Normalizing Entities via
en.wikipedia.org/wiki/Coca-Cola en.wikipedia.org/wiki/The_Coca-Cola_Company Entity Relationship
Normalized Entities means a Common Feature Space
Stacked Heterogeneous Feature Space
X X ? ? X X ? ? ? ? X X X X ? ? X X X ? ? X X X
Source Users Destination Users source-only entities common entities destination-only entities Latent interests Target Audience =
Common Entities translate Source to Destination
Source: Social Network Panel Destination: Consumptions Survey Panel Target Audience = C
m
E n t i t i e s
Bayesian Model
Source Target Size 1.6M / 70M = 2.3% Share of Interests
“Share of interests” encode the DNA of the Target Audience
Global share of interests: 100% Common Entities Target audience share of interests: 50% 17% 50% Target Audience slice
Bayesian Model
Posterior Probability of user belonging to projected target given the Share of Interests on common entities
Evidence Prior Source Target Size=2.3% Likelihood
Evidence Decomposition
Evidence
Marginal Positive Likelihood
Binomial distribution
p=17%
Joint Likelihood under Naive Assumption
50% 17% 50%
17%
50%
50%
Destination variables TeenNick Robot Chicken Bob’s Burgers Ben & Jerry’s Venmo Angry Orchard Nintendo DSi XL Video Games Audio or Video Chat Affinity 8.9x 7.27x 2.36x 1.80x 1.62x 1.55.x 1.47x 1.45x 1.23x
Predicted Probabilities provides Insights on the Projected Users
PROJECTION
Target Audience =
Projected Users Probabilities Insights on Destination Variables
Audience Projection In a Nutshell
Social Panel Consumptions Survey Panel Common Entities
Bayesian Model
Target Audience = Affinity: 1.80x Affinity: 1.55x Affinity: 1.62x
Binary Classifier Evaluation
Bayesian Model
Projected Users Probabilities Ground Truth
Evaluation techniques
Validate via Common Entities
X X X X X X X X
Source Users Destination Users common entities Target Audience OR = Projected Audience OR =
Exact Query Replica Ground Truth
Validate via Self Reconstruction Within the Same Domain
X X X X X X X X X X X X X X X X X X X
Source Users Destination Users source-only entities common entities destination-only entities Target Audience =
Ground Truth
Validate via Double-step Reconstruction
PROJECTION PROJECTION Predicted probabilities Ground Truth
Repeat Test Cases Stratifying by Category
Demographics Skewness
PROJECTION
Golden Benchmarks Comparison on Aggregated Insights
Many Linked Views of the Same Global Population
Audience Projection
Multiple Perspectives Reinforce Reliability
Social Panel Target Audience = Interacted with Game Informer social page Affinity: 2.17x Have you read any Game Informer issue? Affinity: 1.73x Game Informer Single Issue Magazine purchased online Affinity: 2.51x
Generalize Audience Projection as a Domain Adaptation Problem
Bayesian Model
Gianmario Spacagna Chief Scientist at Helixa.ai gspacagna@helixa.ai @gm_spacagna
Natural Language Processing (NLP) Pipeline
"Mark Watney visited Mars"
The spaCy NER Model Overview
Embedding Words
Features token lower prefix suffix shape Apple apple app ple Wwwww U.K. uk uk uk W.W. Fahrenheit 451 fahrenheit 451 fah 451 Wwwwwwwwww ddd
Efficiently Embedding Words
Encoding Sequences of Words
Raw tri-gram chunk Enriched tri-gram matrix Mark Watney visited “Mark Watney visited Mars”
Crafting the Attention Vector
“Mark Watney visited Mars” Attention vector Tri-gram matrix Enriched tri-gram vector
Predicting the Recognized Entities
Actions: SHIFT OUT REDUCE (Entity Tagging) Stack Buffer Segment “Mark Watney visited Mars” Actions: 1.SHIFT 2.SHIFT 3.REDUCE (PER) 4.OUT 5.SHIFT 6.REDUCE (LOC) Mark Watney Mars Mark Watney visited Mars Enriched tri-gram vector Update attention Attention vector Tri-gam matrix
Official Explanation of spaCy NER Model
https://www.youtube.com/watch?v=sqDHBH9IjRU
Projecting the Share of Interests on Common Entities
Target Audience Projection 50% 17% 50%
Share of Interests:
SIZE: 60M SIZE: 200M SIZE: ? SIZE: 40M
Global Audience (average american) = Target Audience
evidence prior
Evidence Statistics on Share of Interests
N = 180M users in U.S. population sampling rate = 1 : 10k n = 18k users in sample panel p = 17% of market penetration x = 3k expected projected users
SIZE: 200M SIZE: 40M
statistics: evidence
Binomial Positive Likelihood
n = 17999 x = 2999 log(p)=-5.56323 Probability of selecting 3000 / 18000 McDonald’s panel users given that the user IS part of the target
n = 18000 x = 3000 log(p)=-5.54342 is smaller than
p=17%
Binomial Negative Likelihood
n = 17999 x = 2999 log(p)=-5.53942 Probability of selecting 3000 / 18000 McDonald’s panel users given that the user IS NOT part of the target
n = 18000 x = 3000 log(p)=-5.54342
p=17%
is greater than