Generation & Evaluation of Personalised Push-Notifications - - PowerPoint PPT Presentation

generation evaluation of personalised push notifications
SMART_READER_LITE
LIVE PREVIEW

Generation & Evaluation of Personalised Push-Notifications - - PowerPoint PPT Presentation

EvalUMAPWorkshop Generation & Evaluation of Personalised Push-Notifications Kieran Fraser, Bilal Yousuf, Owen Conlan ADAPT Centre, Trinity College Dublin The ADAPT Centre is funded under the SFI Research Centres Programme (Grant 13/RC/2106)


slide-1
SLIDE 1

Generation & Evaluation of Personalised Push-Notifications

Kieran Fraser, Bilal Yousuf, Owen Conlan ADAPT Centre, Trinity College Dublin

The ADAPT Centre is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.

EvalUMAPWorkshop

slide-2
SLIDE 2

www.adaptcentre.ie

Content

❖ Proposed Challenge ❖ Gym-Push ❖ Evaluation Metrics ❖ Challenge Entry ❖ Results & Discussion ❖ Limitations ❖ Final Thoughts

slide-3
SLIDE 3

www.adaptcentre.ie

Proposed Challenge

Currently no established or standardized means for repeatable and comparative evaluation of algorithms and systems in the UMAP space.

  • 1. Proposal for a Shared Challenge in the UMAP Space, EvalUMAP Whitepaper 2019

Goal Shared Task “focuses on user model generation using logged mobile phone data, with an assumed purpose of supporting mobile phone notification suggestion.” 1

slide-4
SLIDE 4

www.adaptcentre.ie

Social Influencer Problems

slide-5
SLIDE 5

www.adaptcentre.ie

Proposed Challenge

“the challenge is to create an approach to generate personalized notifications

  • n individuals’ mobile phones, whereby such personalization would consist of

deciding what events (SMS received, etc.) to show to the individual and when to show them.”1

  • 1. Proposal for a Shared Challenge in the UMAP Space, EvalUMAP Whitepaper 2019

Challenge 1

  • Given 3 months historical notification data (for training)
  • Develop a user model which generates a personalized notification given a context
  • Using Gym-Push, user model is evaluated using test data and evaluation metrics

Challenge 2

  • Given small sample of notification data (no training)
  • Develop an adaptive user model which generates a personalized notification given a context
  • Using Gym-Push, user model is evaluated, in simulated “real-time”, using test data and

evaluation metrics

slide-6
SLIDE 6

www.adaptcentre.ie

Gym-Push OpenAI Gym Open source toolkit for “developing and comparing reinforcement learning algorithms” 1

  • 1. https://gym.openai.com/

Gym-Push Custom OpenAI Gym environment simulating push-notification overload

  • n mobile device users
slide-7
SLIDE 7

www.adaptcentre.ie

Gym-Push Gym-Push

  • Ease of installation – pip, docker, hosted
  • Multiple communities – RL, UMAP, HCI
  • End-user interface
  • Established Online Leaderboard
slide-8
SLIDE 8

www.adaptcentre.ie

Gym-Push

Challenge 1

slide-9
SLIDE 9

www.adaptcentre.ie

Gym-Push

Challenge 2

slide-10
SLIDE 10

www.adaptcentre.ie

Gym-Push

slide-11
SLIDE 11

www.adaptcentre.ie

Gym-Push

Train on Real, Test on Synthetic 1 RMSE F1 scores differ in range 0.02 – 0.07 indicating synthetic data imitates real world data.

  • 1. Esteban, C., Hyland, S.L., R¨atsch, G.: Real-valued (medical) time series generation
slide-12
SLIDE 12

www.adaptcentre.ie

Gym-Push

slide-13
SLIDE 13

www.adaptcentre.ie

Evaluation Metrics

Performance Diversity Response Time Learning Rate

slide-14
SLIDE 14

www.adaptcentre.ie

Evaluation Metrics

Accuracy Precision Recall F1

Simulated User

  • AdaBoost Classifier chosen
  • Trained on 3 months of historical user data
  • Acc avg = 83.8%, F1 avg = 72.8%
slide-15
SLIDE 15

www.adaptcentre.ie

Challenge Entry

  • MLP used for Generator & Discriminator
  • Notifications OHE vector length 28
  • Trained using RMSProp in 128 mini-batch chunks
  • ver 2000 epochs

Challenge 1

slide-16
SLIDE 16

www.adaptcentre.ie

Results & Discussion

slide-17
SLIDE 17

www.adaptcentre.ie

Results & Discussion

slide-18
SLIDE 18

www.adaptcentre.ie

Limitations

Simulated Data Simulated User Evaluation metrics domain specific

Proposed Challenge

Diversity Online-learning

Challenge Entry

slide-19
SLIDE 19

www.adaptcentre.ie

Final Thoughts

Simulated game environments More challenge domains Cross domain metrics Domain specific metrics Create a group of evaluators per domain Integrate with existing evaluation services e.g. TIRA

slide-20
SLIDE 20

www.adaptcentre.ie

Thank you. Questions?

Email: kieran.fraser@adaptcentre.ie