slide-1
SLIDE 1

Unifying recommendation and active learning for human-algorithm interactions

Scott Cheng-Hsin Yang1, Jake Alden Whritner1, Olfa Nasraoui2 & Patrick Shafto1


1 Department of Mathematics & Computer Science, Rutgers University–Newark
2 Department of Computer Engineering and Computer Science, University of Louisville

CogSci 2017

slide-2
SLIDE 2

21st century online shopping

[Cartoon: FAmazGoog and a customer]

FAmazGoog: "Would you like to buy this phone?"
Customer: "I do like this phone!"

slide-3
SLIDE 3

Problem

Active learning:

  • Goal: figure out customers' preferences
  • Way: test the user's preference on items the algorithm is uncertain whether the user will like
  • Problem: may show too many disliked items and hence drive customers away

Recommender system:

  • Goal: recommend items that customers will buy
  • Way: recommend items similar to those that are known to be liked
  • Problem: creates "filter bubbles" that limit customers to seeing only a restricted set of items

Figuring out preferences vs. recommending likable items

slide-4
SLIDE 4

Exploration-exploitation tradeoff

[Cartoon: FAmazGoog and a customer]

Customer: "Should I stick to what I know to be OK, or should I risk trying something new to see if it is better?"

slide-5
SLIDE 5

Cognitive science + Human-algorithm interaction

Specific Q: is there a way to overcome the trade-off?
General Q: given an algorithm, can we predict what the interaction will be like?

Human-algorithm interaction research (e.g., Pariser 2011; Baeza-Yates 2016):

  • big data approach (e.g., collaborative filtering)
  • uncontrolled decision factors

CogSci research (e.g., Bruner et al 1956, Shepard et al 1961):

  • controlled decision factors
  • traditionally no interaction with algorithms

CogSci + Human-algorithm interaction:

  • human-algorithm interaction with controlled decision factors
  • compare idealized responses with actual human responses
slide-6
SLIDE 6

The framework

[Figure: 2-D feature space (feature 1 vs. feature 2) with a decision boundary; x_rec and x_act marked]

x_rec = argmax_{x*} P(y = 1 | x*, D)
x_act = argmin_{x*} |0.5 − P(y = 1 | x*, D)|

slide-7
SLIDE 7

Active recommendation

x_act = argmin_{x*} |0.5 − P(y = 1 | x*, D)|

Generalize to a family of selection rules:

x_α = argmin_{x*} |α − P(y = 1 | x*, D)|,  α ∈ [0.5, 1]

[Figure: recommendation accuracy vs. prediction accuracy; α = 0.5 corresponds to active learning, α = 1 to recommendation]
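The selection rules above reduce to one line each: pick the candidate whose predicted like-probability is closest to a target α. A minimal Python sketch, assuming a hypothetical vector of model probabilities over candidate items (the deck's actual learning model is not shown here):

```python
import numpy as np

def select_item(probs, alpha):
    """Return the index of the candidate minimizing |alpha - P(y=1|x, D)|.
    alpha = 0.5 recovers active learning (most uncertain item);
    alpha = 1.0 recovers pure recommendation (most likely liked item);
    intermediate alpha gives active recommendation."""
    return int(np.argmin(np.abs(alpha - np.asarray(probs))))

# Hypothetical predicted like-probabilities for 5 candidate items.
probs = [0.1, 0.45, 0.6, 0.8, 0.97]

x_act = select_item(probs, alpha=0.5)   # index 1: closest to 0.5
x_rec = select_item(probs, alpha=1.0)   # index 4: highest probability
x_mid = select_item(probs, alpha=0.75)  # index 3: active recommendation
```

Note that x_rec = argmax P(y=1|x*, D) coincides with the α = 1 case of the α-family, which is why a single rule spans both regimes.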

slide-8
SLIDE 8

Experiment

[Stimuli figure: items varying in diameter, orientation, radius, and size; labels Beat/Sonic, responses Like/Dislike]

  1. Training phase:
    • train the subject to associate labels (Beat or Sonic) with stimuli
    • phase ends when the subject gets 19 of the last 20 trials correct
  2. Interaction phase:
    • instruct the subject which stimuli are preferred
    • the algorithm chooses a stimulus; the subject labels it like/dislike; the algorithm updates; 20 trials
  3. Check phase:
    • subject labels 20 stimuli sampled from a grid

Markant & Gureckis 2014

slide-9
SLIDE 9

Conditions & subjects

  • 6 interaction conditions:
    • random; α = 0.5 (active); α = 1 (recommend)
    • α = 0.55, α = 0.75, α = 0.95 (active recommendation)
  • 30 subjects per condition
  • Subjects with a check score below 18/20 were omitted (~4 subjects per condition)
  • Consistency score: the fraction of a subject's interaction-phase responses that matched the responses expected from the predefined boundary
  • Like/dislike responses were flipped for subjects with a consistency score below 50% (~3 subjects per condition)
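The consistency screening described above amounts to a match rate against the predefined boundary, with a label flip below chance level. A minimal sketch with made-up response data (the variable names are illustrative, not from the paper):

```python
def consistency_score(responses, expected):
    """Fraction of interaction-phase responses matching the
    responses expected from the predefined boundary."""
    matches = sum(r == e for r, e in zip(responses, expected))
    return matches / len(expected)

# Hypothetical subject who mostly inverted the like/dislike labels.
responses = [0, 1, 0, 0, 1, 0, 0, 0, 1, 0]
expected  = [1, 0, 1, 1, 0, 1, 1, 0, 1, 1]

score = consistency_score(responses, expected)  # 0.2
if score < 0.5:
    # Below-chance consistency: flip the subject's binary labels.
    responses = [1 - r for r in responses]
```

Flipping (rather than discarding) such subjects preserves their data under the interpretation that they consistently inverted the label mapping.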
slide-10
SLIDE 10

Results

Recommendation accuracy = the fraction of likes in the interaction phase. Prediction accuracy = the fraction of correct model predictions, w.r.t. the true boundary, on 100 stimuli sampled from a grid in the feature space.
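The two metrics just defined can be written directly from their definitions; a minimal sketch with hypothetical data (4 trials and 4 grid probes stand in for the actual 20 and 100):

```python
import numpy as np

def recommendation_accuracy(interaction_labels):
    """Fraction of 'like' (1) responses in the interaction phase."""
    return float(np.mean(interaction_labels))

def prediction_accuracy(model_preds, true_labels):
    """Fraction of model predictions that match the true boundary,
    evaluated on stimuli sampled from a grid in the feature space."""
    return float(np.mean(np.asarray(model_preds) == np.asarray(true_labels)))

rec_acc = recommendation_accuracy([1, 1, 0, 1])             # 0.75
pred_acc = prediction_accuracy([1, 0, 1, 1], [1, 1, 1, 0])  # 0.5
```

The tradeoff claim is that the first metric rewards exploitation and the second rewards exploration; the result below is that intermediate α scores well on both.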

Active recommendation overcomes the tradeoff!

slide-11
SLIDE 11

The distribution of interaction examples

Active recommendation selects uncertain examples within the relevant category.

slide-12
SLIDE 12

Active recommendation

x_act = argmin_{x*} |0.5 − P(y = 1 | x*, D)|
x_α = argmin_{x*} |α − P(y = 1 | x*, D)|,  α ∈ [0.5, 1]

[Figure: recommendation accuracy vs. prediction accuracy across α, from active learning (α = 0.5) to recommendation (α = 1)]

slide-13
SLIDE 13

The effect of human variability

Looking only at fully consistent subjects reveals a strict ordering across conditions. Noisy responses close to the boundary lead to imperfect prediction accuracy.

[Panels: fully consistent vs. humanly consistent subjects]

slide-14
SLIDE 14

Active recommendation

x_act = argmin_{x*} |0.5 − P(y = 1 | x*, D)|
x_α = argmin_{x*} |α − P(y = 1 | x*, D)|,  α ∈ [0.5, 1]

[Figure: recommendation accuracy vs. prediction accuracy across α, with α = 0.95 highlighted]

slide-15
SLIDE 15

Conclusions

  • Studied human-algorithm interaction as a cognitive concept-learning experiment.
  • Formalized a unification of recommendation and active learning.
  • Challenged the explore-or-exploit dichotomy:
    • showed a case where the tradeoff doesn't really exist;
    • active recommendation can overcome the tradeoff by selecting uncertain examples within the relevant category.

slide-16
SLIDE 16

Acknowledgments

Jake Whritner Pat Shafto Olfa Nasraoui

slide-17
SLIDE 17

The core idea

Active recommendation bypasses the tradeoff if the model captures both the global and the local structure.