Data Science methods for treatment personalization in Persuasive - PowerPoint PPT Presentation

Data Science methods for treatment personalization in Persuasive Technology Prof. dr. M.C.Kaptein Professor Data Science & Health Principal Investigator @ JADS 12 April 2019

Undergraduate Topics in Computer Science Maurits Kaptein Edwin van den Heuvel Statistics for Data Scientists An introduction to probability, statistics, and data analysis

Personalization With personalization we try to find the “right content for the right person at the right time” [10]. Applications in Communication, Persuasive technology, Marketing, Healthcare, etc., etc. More formally, we assume we have a population of N units, which represent themselves sequentially . For each unit i = 1 , . . . , i = N we first observe their properties � x i , and subsequently, using some decision policy π (), we choose a treatment a i , (i.e. π : ( x i , d ) → a t ). After the content is shown, we observe the associated outcome, or reward r i , and our aim is to choose π () such that we maximize � N i =1 r i .

Overview Selecting persuasive interventions Selecting personalized persuasive interventions Applications in persuasive technology design Available software

Section 1 Selecting persuasive interventions

The multi-armed bandit problem For i = 1 , . . . , i = N ◮ We select and action a i . (Often actions k = 1 , . . . , k = K , not always). ◮ Observe reward r i . Select actions according to some policy π : { a 1 , . . . , a i − 1 , r 1 , . . . , r i − 1 } �→ a i . Aim: Maximize (expected) cumulative reward � N i =1 r i (or, minimize Regret which is simply � N i =1 ( π max − π ()) [3].

The canonical solution: the “experiment” For i = 1 , . . . , i = n (where n << N ): ◮ Choose k with Pr( a i = k ) = 1 K . ◮ Observe reward. r K and create guideline / business rule. r 1 , . . . , ¯ Compute ¯ For i > n : r 1 , . . . , ¯ ◮ Choose a i = arg max k (¯ r K ) [12, 6, 9].

Alternative solutions 1. ǫ -Greedy: For i = 1 , . . . , i = N : ◮ w. Probability ǫ choose k with Pr( a i = k ) = 1 K . ◮ w. Probability 1 − ǫ choose a = arg max k (¯ r 1 , . . . , ¯ r K ) (given the data up to that point) [2]. 2. Thompson sampling: Setup a Bayesian model for r 1 , . . . , r K For i = 1 , . . . , i = N : ◮ Play arm with a probability proportional to your belief that it is the best arm. ◮ Update model parameters Easily implemented by taking a draw from the posterior [4, 1].

Performance of different content selection policies EpsilonFirst 30 EpsilonGreedy ThompsonSampling 25 Cumulative regret 20 15 10 5 0 0 200 400 600 800 1000 Time step Figure: Comparison in terms of regret between three different bandit policies on a 3-arm Bernoulli bandit problem with true probabilities p 1 = . 6 , p 2 = p 3 = . 4 in terms of regret. Figure averages over m = 10 . 000 simulation runs. Thompson sampling outperforms the other policies.

Intuition behind a well performing allocation policy A good policy effectively weights exploration and exploitation: ◮ Exploration: Try out the content that we are unsure about: learn. ◮ Exploitation: Use the knowledge we have / choose the content we think are effective: earn. We can think about the experiment as moving all exploration up front. In this case, it is a) hard to determine how much we need to explore (since there is no outcome data yet), and b) we might make a wrong decision.

Section 2 Selecting personalized persuasive interventions

The problem For i = 1 , . . . , i = N ◮ We observe the context � x i . ◮ We select and action a i . ◮ Observe reward r i Aim remains the same, but problem more challenging: the best action might depend on the context.

The current approach ◮ Do experiments within subgroups of users (or, re-analyze existing RCT data to find heterogeneity). ◮ Subgroup selection driven by a theoretical understanding of the underlying mechanism. ◮ Effectively solve a non-contextual problem within each context . Thus, we see the problem as many separate problems In the limit: no room for exploration when users are fully unique! ( N = 1)

Searching the context × action space 0.6 0.4 S u r v i v a 0.2 l 1.0 0.6 0.0 0.8 20 0.6 e s 40 0.4 D o Weight 0.2 60 80 0.0 0.4 0.6 S u 0.5 r v i v 0.4 Survival a l Weight = 20 0.2 0.3 0.2 1.0 0.1 0.0 0.8 0.0 0.2 0.4 0.6 0.8 1.0 Dose 0.6 20 0.6 Dose 0.5 40 0.4 0.4 Survival Weight = 60 Weight 0.3 0.2 60 0.2 0.1 0.0 80 0.0 0.2 0.4 0.6 0.8 1.0 Dose ◮ Different outcome for each action for each covariate. ◮ We need to learn this relation efficiently .

An alternative approach It is easy to extend Thompson sampling to include a context For i = 1 , . . . , i = N : ◮ Create a model to predict E ( r t ) = f ( a t , x t ) and quantify your uncertainty (e.g., Bayes) ◮ Exploration: Choose actions with uncertain outcomes. ◮ Exploitation: Choose action with high expected outcomes. Very flexible models available for E ( r t ) = f ( a t , x t ) [8, 7, 13] and efficient procedures are available for incorporating uncertainty: LinUCB [5], Thompson Sampling [11], Bootstrap Thompson Sampling [6], etc.

Performance 0.70 Cumulative reward rate 0.60 0.60 Average reward 0.50 0.50 EpsilonFirst EpsilonFirst EpsilonGreedy EpsilonGreedy 0.40 0.40 LinUCBDisjoint LinUCBDisjoint 0 100 200 300 400 0 100 200 300 400 Figure: Simple comparison of LinUCB (“frequentist Thompson sampling”) with non-contextual approaches for a 3-armed Bernoulli bandit with a single, binary, context variable. Already in this very simple case the difference between the contextual and non-contextual approach is very large.

Section 3 Applications in persuasive technology design

Data Science methods for treatment personalization in Persuasive - PowerPoint PPT Presentation

Data Science methods for treatment personalization in Persuasive Technology Prof. dr. M.C.Kaptein Professor Data Science & Health Principal Investigator @ JADS 12 April 2019 Undergraduate Topics in Computer Science Maurits Kaptein Edwin

These slides are available at http://tiny.cc/directedfeedback Overview Personalization

Towards Usable Privacy in Cross-System Personalization Yang Wang CMU Usable Privacy and Security

DYNAMIC WEBSITE PERSONALIZATION AGENDA Defining dynamic website personalization Why

THE POTENTIAL FOR PERSONALIZATION IN WEB SEARCH Susan Dumais, Microsoft Research Sept 30, 2016

Google News Personalization: Scalable Google News Personalization: Scalable Online Collaborative

This Time, Its Personalized Preparing Your Site for Effective Personalization AGENDA 1.

MALE INFERTILITY CASE-I Before Treatment: After Treatment: After Treatment: CASE 2 BEFORE

Computational Personalization: Data science methods for personalized health Maurits Kaptein

PERSONALIZATION Customers expect personalized, tailored messaging no matter which channel or

Mining Minds Interpreter Service Curation Layer MMV-2.5 Overview 2 / Personalization is a

Adaptivity and Personalization in Learning System s Sabine Graf School of Computing and

Anonymous Personalization Without Leaving Drupal by Mike Lander Michael Lander Technical

4. Personalization Outline 4.1. Objectives 4.2. Concerns 4.3. Potential 4.4. Link Analysis

Web Personalization & Recommender Systems COSC 488 Slides are based on: - Bamshad Mobasher,

Deal Personalization Systems @ Groupon Ameya Kanitkar ameya@groupon.com

Web Adaptation and Personalization Marios Belk Outline Overview and Importance of

Internship Defense David Taralla University of Lige Thursday 19 December 2013 Contents

Bandit-based Search for Constraint Programming Manuel Loth 1 , 2 , 4 , Mich` ele Sebag 2 , 4 , 1 ,

By Stian Berg Supervisor Ole-Christoffer Granmo, University of Agder Introduction Thesis

City of Somerville Zoning Amendment Union Square Zoning Amendment Meeting #11 4-12-17

Lecture #1: Introduction to CS109A aka STAT121A, AC209A, CSCIE-109A CS109A Introduction to Data

Welcome to Day 2 Lets brainstorm! 1. How can you reliably ensure your staff do not

University of Kansas School of Nursing ngodfrey@kumc.edu 4. Shift from an emphasis on

FASHION WORKS AT MANCHESTER Outline 1. Current Status (RS) 2. Methodology (GV) 3.

Data Science methods for treatment personalization in Persuasive - PowerPoint PPT Presentation

Data Science methods for treatment personalization in Persuasive Technology Prof. dr. M.C.Kaptein Professor Data Science & Health Principal Investigator @ JADS 12 April 2019 Undergraduate Topics in Computer Science Maurits Kaptein Edwin

These slides are available at http://tiny.cc/directedfeedback Overview Personalization

Towards Usable Privacy in Cross-System Personalization Yang Wang CMU Usable Privacy and Security

DYNAMIC WEBSITE PERSONALIZATION AGENDA Defining dynamic website personalization Why

THE POTENTIAL FOR PERSONALIZATION IN WEB SEARCH Susan Dumais, Microsoft Research Sept 30, 2016

Google News Personalization: Scalable Google News Personalization: Scalable Online Collaborative

This Time, Its Personalized Preparing Your Site for Effective Personalization AGENDA 1.

MALE INFERTILITY CASE-I Before Treatment: After Treatment: After Treatment: CASE 2 BEFORE

Computational Personalization: Data science methods for personalized health Maurits Kaptein

PERSONALIZATION Customers expect personalized, tailored messaging no matter which channel or

Mining Minds Interpreter Service Curation Layer MMV-2.5 Overview 2 / Personalization is a

Adaptivity and Personalization in Learning System s Sabine Graf School of Computing and

Anonymous Personalization Without Leaving Drupal by Mike Lander Michael Lander Technical

4. Personalization Outline 4.1. Objectives 4.2. Concerns 4.3. Potential 4.4. Link Analysis

Web Personalization &amp; Recommender Systems COSC 488 Slides are based on: - Bamshad Mobasher,

Deal Personalization Systems @ Groupon Ameya Kanitkar ameya@groupon.com

Web Adaptation and Personalization Marios Belk Outline Overview and Importance of

Internship Defense David Taralla University of Lige Thursday 19 December 2013 Contents

Bandit-based Search for Constraint Programming Manuel Loth 1 , 2 , 4 , Mich` ele Sebag 2 , 4 , 1 ,

By Stian Berg Supervisor Ole-Christoffer Granmo, University of Agder Introduction Thesis

City of Somerville Zoning Amendment Union Square Zoning Amendment Meeting #11 4-12-17

Lecture #1: Introduction to CS109A aka STAT121A, AC209A, CSCIE-109A CS109A Introduction to Data

Welcome to Day 2 Lets brainstorm! 1. How can you reliably ensure your staff do not

University of Kansas School of Nursing ngodfrey@kumc.edu 4. Shift from an emphasis on

FASHION WORKS AT MANCHESTER Outline 1. Current Status (RS) 2. Methodology (GV) 3.

Web Personalization & Recommender Systems COSC 488 Slides are based on: - Bamshad Mobasher,