SLIDE 1

CMU 15-896

Social networks 2: Influence Maximization

Teacher: Ariel Procaccia

SLIDE 2

15896 Spring 2016: Lecture 24

Motivation

  • Firm is marketing a new product
  • Collect data on the social network
  • Choose a set S of early adopters and market to them directly
  • Customers in S generate a cascade of adoptions
  • Question: How to choose S?

SLIDE 3

Influence functions

  • Assume: a finite graph and a progressive process (nodes that become active stay active)
  • Fixing a cascade model, define the influence function σ: σ(S) = expected number of active nodes at the end of the process starting with S active
  • Maximize σ(S) over sets S of size k
  • Theorem [Kempe et al. 2003]: Under the general cascade model, influence maximization is NP-hard to approximate to a factor of n^(1−ε) for any ε > 0

SLIDE 4

Proof of theorem

  • SET COVER: subsets S_1, …, S_m of {u_1, …, u_n}; is there a cover of size k?
  • Bipartite graph: s_1, …, s_m on one side, u_1, …, u_n and x_1, …, x_c (a large number of extra nodes) on the other
  • u_j becomes active if some active s_i has S_i ∋ u_j
  • x_t becomes active if u_1, …, u_n are all active
  • Min set cover of size ≤ k ⇒ seeding the corresponding s_i nodes makes all k + n + c nodes active
  • Min set cover of size > k ⇒ any k seeds activate at most k + n nodes; taking c large enough yields the n^(1−ε) gap ∎
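The cascade in this reduction is deterministic, so the gap can be checked by direct simulation on a toy instance (the SET COVER instance and the value c = 100 below are made up for illustration):

```python
import itertools

def reduction_cascade(sets, n, c, seeds):
    """Deterministic cascade of the reduction. seeds are indices of
    set-nodes s_i; element-node u_j activates if some seeded set covers j;
    each of the c extra x-nodes activates only when ALL u_j are active.
    Returns the total number of active nodes."""
    covered = set()
    for i in seeds:
        covered |= sets[i]
    active = len(seeds) + len(covered)
    if len(covered) == n:          # all u_j active -> every x-node fires
        active += c
    return active

# Hypothetical instance: a cover of size 2 exists ({0,1} and {2,3}).
sets = [{0, 1}, {1, 2}, {2, 3}, {0, 3}]
n, c, k = 4, 100, 2
best = max(reduction_cascade(sets, n, c, seeds)
           for seeds in itertools.combinations(range(len(sets)), k))
print(best)  # 106 = k + n + c, since a size-2 cover exists
```

With k = 1 no cover exists, and the best seed activates only 1 + 2 = 3 nodes, which is the source of the inapproximability gap as c grows.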

SLIDE 5

Submodularity for approximation

  • Try to identify broad subclasses where good approximation is possible
  • f is submodular if for all S ⊆ T and v ∉ T: f(S ∪ {v}) − f(S) ≥ f(T ∪ {v}) − f(T)
  • f is monotone if for S ⊆ T, f(S) ≤ f(T)
  • The reduction above shows that σ need not be submodular under the general cascade model
  • Theorem [Nemhauser et al. 1978]: for f monotone and submodular, S* the optimal k-element subset, and S obtained by greedily adding k elements that maximize the marginal increase, f(S) ≥ (1 − 1/e) · f(S*)
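A minimal sketch of the greedy procedure from the theorem, applied to a coverage function (which is monotone and submodular); the sets below are a made-up example:

```python
def greedy_max(ground, f, k):
    """Greedy (1 - 1/e)-approximation for a monotone submodular f:
    repeatedly add the element with the largest marginal increase."""
    S = set()
    for _ in range(k):
        v = max(ground - S, key=lambda v: f(S | {v}) - f(S))
        S.add(v)
    return S

# Coverage is the textbook monotone submodular function.
sets = {1: {'a', 'b'}, 2: {'b', 'c'}, 3: {'c', 'd', 'e'}, 4: {'a'}}
f = lambda S: len(set().union(*(sets[i] for i in S))) if S else 0
S = greedy_max(set(sets), f, 2)
print(S, f(S))  # picks 3 (gain 3), then 1 (gain 2) -> covers 5 elements
```

For influence maximization, f would be the influence function σ, with marginal gains estimated by simulation.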

SLIDE 6

Independent cascade model

  • Reminder of model:
  • For each edge (u, v) there is a weight p_uv
  • When a node u becomes active, it has one chance to activate each neighbor v, succeeding with probability p_uv
  • Theorem [Kempe et al. 2003]: Under the independent cascade model:
  • Influence maximization is NP-hard
  • The influence function σ is submodular
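One run of the model can be simulated as below (the graph and weights are hypothetical; each edge gets exactly one chance to fire, when its source first activates):

```python
import random

def independent_cascade(p, seeds, rng):
    """One run of the independent cascade: p[(u, v)] is the probability
    that edge (u, v) fires. Newly activated nodes form the frontier and
    try each outgoing edge exactly once."""
    active, frontier = set(seeds), list(seeds)
    while frontier:
        new = []
        for u in frontier:
            for (a, v), puv in p.items():
                if a == u and v not in active and rng.random() < puv:
                    active.add(v)
                    new.append(v)
        frontier = new
    return active

# Hypothetical 3-node path a -> b -> c with certain activation:
p = {('a', 'b'): 1.0, ('b', 'c'): 1.0}
print(sorted(independent_cascade(p, {'a'}, random.Random(0))))
# ['a', 'b', 'c']
```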

SLIDE 7

Proof of NP-hardness

  • Almost the same proof as before
  • SET COVER: subsets S_1, …, S_m of {u_1, …, u_n}; is there a cover of size k?
  • Bipartite graph: s_1, …, s_m on one side, u_1, …, u_n on the other
  • If u_j ∈ S_i then there is an edge (s_i, u_j) with weight 1
  • Min SC of size ≤ k ⇒ n + k nodes active
  • Min SC of size > k ⇒ fewer than n + k nodes active ∎

SLIDE 8

Proof of submodularity

  • Lemma: If f_1, …, f_m are submodular functions and c_1, …, c_m ≥ 0, then f = Σ_i c_i · f_i is a submodular function
  • Proof: Let S ⊆ T and v ∉ T; then
    f(S ∪ {v}) − f(S) = Σ_i c_i (f_i(S ∪ {v}) − f_i(S)) ≥ Σ_i c_i (f_i(T ∪ {v}) − f_i(T)) = f(T ∪ {v}) − f(T) ∎

SLIDE 9

Proof of submodularity

  • Key idea: for each edge (u, v), we flip a coin of bias p_uv in advance; the edge is live if its coin comes up heads
  • Let X denote a particular one of the 2^|E| possible coin-flip combinations
  • σ_X(S) = # of activated nodes with S as seed nodes and coin flips X
  • A node v is activated iff v is reachable from S via live edges

SLIDE 10

Proof of submodularity

  • σ_X is submodular: the marginal gain of adding v to the seed set is the number of nodes reachable from v but not from the current seeds, and this can only shrink as the seed set grows
  • σ(S) = Σ_X Pr[X] · σ_X(S); that is, σ is a nonnegative weighted sum of submodular functions
  • By the lemma, σ is submodular ∎
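The live-edge view also yields a Monte Carlo estimator for σ: sample the coin flips X up front, then count nodes reachable from S. A sketch on a hypothetical two-edge path graph:

```python
import random

def sigma_estimate(p, S, trials, rng):
    """Estimate sigma(S) = sum_X Pr[X] * sigma_X(S) by sampling live-edge
    graphs X and counting the nodes reachable from the seed set S."""
    total = 0
    for _ in range(trials):
        # Flip every edge's coin in advance; keep the live edges.
        live = [e for e, pe in p.items() if rng.random() < pe]
        reached, stack = set(S), list(S)
        while stack:
            u = stack.pop()
            for a, v in live:
                if a == u and v not in reached:
                    reached.add(v)
                    stack.append(v)
        total += len(reached)
    return total / trials

# Hypothetical path a -> b -> c, both edge probabilities 0.5;
# the exact value is sigma({a}) = 1 + 1/2 + 1/4 = 1.75.
p = {('a', 'b'): 0.5, ('b', 'c'): 0.5}
est = sigma_estimate(p, {'a'}, 20000, random.Random(1))
print(est)  # close to 1.75
```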

SLIDE 11

Linear threshold model

  • Reminder of model:
  • Nonnegative weight w_uv for each edge (u, v) ∈ E; w_uv = 0 otherwise
  • Assume Σ_u w_uv ≤ 1 for every v
  • Each v has a threshold θ_v chosen uniformly at random in [0, 1]
  • v becomes active if Σ_{active u} w_uv ≥ θ_v
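A minimal simulation of the model (graph and weights are hypothetical); note that each node draws its threshold once, before the process starts:

```python
import random

def linear_threshold(w, nodes, seeds, rng):
    """One run of the linear threshold process: w[(u, v)] are edge weights
    with sum_u w[(u, v)] <= 1; each node draws theta_v u.a.r. in [0, 1]
    up front, then activates once its active in-weight reaches theta_v."""
    theta = {v: rng.random() for v in nodes}
    active = set(seeds)
    changed = True
    while changed:
        changed = False
        for v in nodes:
            if v in active:
                continue
            if sum(w.get((u, v), 0.0) for u in active) >= theta[v]:
                active.add(v)
                changed = True
    return active

# Hypothetical: b is influenced by a and c, each with weight 0.5.
w = {('a', 'b'): 0.5, ('c', 'b'): 0.5}
runs, rng = 10000, random.Random(2)
hits = sum('b' in linear_threshold(w, ['a', 'b', 'c'], {'a'}, rng)
           for _ in range(runs))
print(hits / runs)  # about 0.5: b activates iff theta_b <= 0.5
```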

SLIDE 12

Linear threshold model

Poll 1: What is … ? [refers to an example network; the figure and answer choices are not reproduced]

SLIDE 13

Linear threshold model

Poll 2: Given that a node is inactive, what is the probability that it becomes active after a neighbor becomes active? [refers to the same example network; the figure and answer choices are not reproduced]

SLIDE 14

Linear threshold model

  • Theorem [Kempe et al. 2003]: Under the linear threshold model:
  • Influence maximization is NP-hard
  • The influence function σ is submodular
  • Difficulty: fixing the coin flips (the thresholds θ_v) in advance, σ_X is not submodular

[Example network with edge weights 1/3 not reproduced]

SLIDE 15

Proof of submodularity

  • Each node v chooses at most one of its incoming edges at random: edge (u, v) is selected with probability w_uv, and no edge is selected with probability 1 − Σ_u w_uv
  • If we can show that these choices of live edges induce the same influence function as the linear threshold model, then the theorem follows from the same arguments as before
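Sampling this live-edge distribution is straightforward; in the sketch below (hypothetical weights), node b keeps its edge from a with probability 0.5, from c with probability 0.3, and no edge otherwise:

```python
import random

def sample_live_edges(w, nodes, rng):
    """Each node v picks at most one incoming edge: (u, v) with
    probability w[(u, v)], and no edge with the remaining probability
    1 - sum_u w[(u, v)]. Returns a map v -> source of its live edge."""
    live = {}
    for v in nodes:
        r, acc = rng.random(), 0.0
        for (u, vv), wuv in w.items():
            if vv != v:
                continue
            acc += wuv
            if r < acc:
                live[v] = u   # v's unique live in-edge comes from u
                break
    return live

w = {('a', 'b'): 0.5, ('c', 'b'): 0.3}   # hypothetical weights, sum <= 1
counts = {'a': 0, 'c': 0, None: 0}
rng = random.Random(3)
for _ in range(10000):
    counts[sample_live_edges(w, ['a', 'b', 'c'], rng).get('b')] += 1
print(counts)  # roughly 5000 / 3000 / 2000
```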

SLIDE 16

Proof of submodularity

  • We sketch the equivalence of the two models
  • Linear threshold:
  • A_t = active nodes at the end of iteration t
  • Pr[v ∈ A_{t+1} | v ∉ A_t] = Σ_{u ∈ A_t ∖ A_{t−1}} w_uv / (1 − Σ_{u ∈ A_{t−1}} w_uv)
  • Live edges:
  • At every time step, determine whether v's live edge comes from the current active set
  • If not, the source of the live edge remains unknown, subject to being outside the active set
  • Same probability as before ∎

SLIDE 17

Progressive vs. nonprogressive

  • The nonprogressive threshold model is identical except that at each round, v chooses a fresh threshold θ_v u.a.r. in [0, 1]
  • Suppose the process runs for τ steps
  • At each step t, we can target nodes for activation, with a fixed budget of interventions overall
  • Goal: maximize Σ_v #rounds v was active
  • Reduces to the progressive case (on a layered graph with one copy of each node per round)
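One common way to realize such a reduction is a layered graph with a copy of each node per round, where an edge (u, v) of the original graph connects u's copy at round t−1 to v's copy at round t. The construction below is a sketch under that assumption, not the lecture's exact definition:

```python
def layered_graph(edges, tau):
    """Sketch of a layered-graph reduction (assumed construction):
    node copies are (v, t) pairs; each original edge (u, v) yields an
    edge from (u, t-1) to (v, t) for every round t in 1..tau-1."""
    layered = []
    for t in range(1, tau):
        for (u, v) in edges:
            layered.append(((u, t - 1), (v, t)))
    return layered

edges = [('a', 'b'), ('b', 'c')]      # hypothetical graph
print(layered_graph(edges, 3))
# [(('a', 0), ('b', 1)), (('b', 0), ('c', 1)),
#  (('a', 1), ('b', 2)), (('b', 1), ('c', 2))]
```

Targeting v at round t then corresponds to seeding the copy (v, t), so a progressive algorithm on the layered graph covers the nonprogressive objective.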