Power Indices and Game Theory (Applications to Bioinformatics) - - PowerPoint PPT Presentation

power indices and game theory applications to
SMART_READER_LITE
LIVE PREVIEW

Power Indices and Game Theory (Applications to Bioinformatics) - - PowerPoint PPT Presentation

Power Indices and Game Theory (Applications to Bioinformatics) Stefano MORETTI stefano.moretti@dauphine.fr LAMSADE (CNRS), Paris Dauphine von Neumann and Morgenstern, 1944 basic formal language for modeling economic phenomena. Dominant


slide-1
SLIDE 1

Power Indices and Game Theory (Applications to Bioinformatics)

Stefano MORETTI stefano.moretti@dauphine.fr LAMSADE (CNRS), Paris Dauphine

slide-2
SLIDE 2

Dominant strategies Nash eq. (NE) Subgame perfect NE NE & refinements … Core Shapley value Nucleolus τ‐value PMAS …. Nash sol. Kalai‐ Smorodinsky …. CORE NTU‐value Compromise value … No binding agreements No side payments Q: Optimal behaviour in conflict situations binding agreements side payments are possible (sometimes) Q: Reasonable (cost, reward)‐sharing von Neumann and Morgenstern, 1944 basic formal language for modeling economic phenomena.

slide-3
SLIDE 3

A building with three owners

slide-4
SLIDE 4

Each owner has a weight (in thousandths) Decision rule: a group of owners with at least 667 thousandths is winning they may force a decision concerning common facilities (e.g., “to construct an elevator”) Q: How to measure the power of each

  • wner?
slide-5
SLIDE 5

370 480 150

“Power meter”

Power index

Which properties should a power index satisfy?

slide-6
SLIDE 6

150

This group has less than 667 thousands

520

This group has less than 667 thousands

630

This group has less than 667 thousands

1000

This group has more than 667 thousands

slide-7
SLIDE 7

This group has less than 667 thousands

370

This group has less than 667 thousands

480

This group has less than 667 thousands

850

This group has more than 667 thousands

slide-8
SLIDE 8

= 0

Null player property: The power of the owners who never contribute to make a winning group must be zero.

slide-9
SLIDE 9

Anonimity property: The power index should not depend on the names of the

  • wners
slide-10
SLIDE 10

+ + = 1

Efficiency property: the sum of the powers must be 1

slide-11
SLIDE 11

Transfer property: How to sum the power between two different interactive situations…(see later)

+

slide-12
SLIDE 12

Potere

Shapley&Shubik power index (1954) Satisfies anonymity, efficiency, null player and transfer properties … it is the unique power index which satisfies such properties on the class of simple games…

slide-13
SLIDE 13

370 150 480

480<667 losing

pivotal

480+370>667

winning

480+370+150>1000

still winning

370 150 480 P pivotal 370 150 480 pivotal 370 150 480 pivotal 370 150 480 pivotal 370 150 480 pivotal

slide-14
SLIDE 14

Shapley&Shubik power index (1954)

=

#( pivotal)

#(all permutations of players) = 3 3! = 3 6 = ½

slide-15
SLIDE 15

… a power index which satisfies such properties…

= ½ = 0 = ½

370 480 150

slide-16
SLIDE 16

Simple games

A simple game is a (voting or similar) situation in which every potential coalition (set of players/voters) can be either winning or losing.

  • DEF. A simple game is a pair (N,v) where

N is a finite set (players set) and v is map (characteristic function) defined on the power set 2N such that

v(S)∈{0,1} for each coalition S∈2N By convention v(∅)=0. We will assume v(N)=1.

slide-17
SLIDE 17

Example (weighted majority game)

Three owners Green (G), White (W), and Red (R) with 48%, 37% and 15% of weights, respectively. To take a decision the 2/3 majority is required. We can model this situation as a simple game({G,W,R},w) s.t.: w(G) =0 w(W) =0 w(R) = 0 w(G,W) =1 w(G,R) = 0 w(W,R) = 0 w(G,W,R) = 1

slide-18
SLIDE 18

A solution Φ is map assigning to each simple game (N,v) an n-vector of real numbers. For any two simple games (N,v),(N,w), Φ satisfies the transfer proeprty if it holds that Φ(v ∨ w)+Φ(v ∧ w) = Φ(v)+Φ(w).

Here v ∨ w is defined as (v ∨ w)(S) = (v(S) ∨ w(S)) = max{v(S),w(S)}, and v ∧ w is defined as (v ∧ w)(S) = (v(S) ∧ w(S)) = min{v(S),w(S)},

EXAMPLE Two TU-games v and w on N={1,2,3}.

+

v(1) =0 v(2) =1 v(3) = 0 v(1, 2) =1 v(1, 3) = 1 v(2, 3) = 0 v(1, 2, 3) = 1

=

w(1) =1 w(2) =0 w(3) = 0 w(1, 2) =1 w(1, 3) = 0 w(2, 3) = 1 w(1, 2, 3) = 1

Φ Φ

v∧w(1) =0 v∧w(2) =0 v∧w(3) = 0 v∧w(1, 2) =1 v∧w(1, 3) = 0 v∧w(2, 3) = 0 v∧w(1, 2, 3) = 1

Φ

v∨w(1) =1 v∨w(2) =1 v∨w(3) = 0 v∨w(1, 2) =1 v∨w(1, 3) = 1 v∨w(2, 3) = 1 v∨w(1, 2, 3) = 1

Φ ∧ ∨

+

Transfer property

slide-19
SLIDE 19

Real applications of simple games

Voting by disciplined party groups in multi-party parliaments (probably elected on the basis of proportional representation); USA President election UN Security Council voting in the EU Council of Ministers voting by stockholders (holding varying amounts of stock). lawmaking power of the United States …

slide-20
SLIDE 20

Weighted majority example

Suppose that four parties receive these vote shares:

Party A, 27%; Party B, 25%; Party C, 24%; Party D 24%.

Seats are apportioned in a 100-seat parliament: – Party A: 27 seats Party C: 24 seats – Party B: 25 seats Party D: 24 seats Seats (voting weights) have been apportioned in a way that is precisely proportional to vote support, but voting power has not been so apportioned (and cannot be).

slide-21
SLIDE 21

Weighted majority example (2)

A:27 seats; B:25 seats; C:24 seats; D:24 seats

Party A has voting power that greatly exceeds its slight advantage in seats. This is because: Party A can form a winning coalition with any one of the

  • ther parties; and

the only way to exclude Party A from a winning coalition is for Parties B, C, and D to form a three-party coalition.

slide-22
SLIDE 22

A:27 seats; B:25 seats; C:24 seats; D:24 seats; Quota: 51 A:2 seats; B:1 seats; C:1 seats; D:1 seats; Quota: 3 …

w(A) =1 w(B) =0 w(C) = 0 W(D)=0 w(A, B) =1 w(A, C) = 1 w(A, D) = 1 w(B, C) = 0 w(B, D) = 0 w(C, D) = 0 w(A, B, C) = 1 w(A, B, D) = 1 w(A, C, D) = 1 w(B, C, D) = 1 w(A, B, C, D) = 1

slide-23
SLIDE 23

Power Indices

Several power indices have been proposed to quantify the share of power held by each player in simple games. These particularly include: the Shapley-Shubik power index (1954); And the Banzhaf power index (1965). Such power indices provide precise formulas for evaluating the voting power of players in weighted voting games.

slide-24
SLIDE 24

The Shapley‐Shubik Index

Let (N,v) be a simple game (assume v is monotone: for each S,T ∈2N. S⊆T⇒ v(S) ≤v(T)) “Room parable”: Players gather one by one in a room to create the “grand coalition”, At some point a winning coalition forms. For each ordering in which they enter, identify the pivotal player who, when added to the players already in the room, converts a losing coalition into a winning coalition.

slide-25
SLIDE 25

The Shapley-Shubik Index (cont.)

Player i’s Shapley-Shubik power index value is simply Number of orderings in which the voter i is pivotal Total number of orderings Power index values of all voters add up to 1. Counting up, we see that A is pivotal in 12 orderings and each of B, C, and D is pivotal in 4 orderings. Thus: Voter Sh-Sh Power A 1/2 B 1/6 C 1/6 D 1/6 So according to the Shapley-Shubik index, Party A has 3 times the voting power of each other party.

slide-26
SLIDE 26

The Banzhaf Index

The Banzhaf power index works as follows: A player i is critical for a winning coalition if i belongs to the coalition, and the coalition would no longer be winning if i defected from it. Voter i’s Banzhaf power Bz(i) is Number of winning coalitions for which i is critical Total number of coalitions to which i belongs.

slide-27
SLIDE 27

The Banzhaf Index (2)

Given the seat shares before the election, and looking first at all the coalitions to which A belongs, we identify: {A},{A,B},{A,C}, {A,D}, {A,B,C}, {A,B,D}, {A,C,D}, (A,B,C,D}. Checking further we see that A is critical for all but two of these coalitions, namely {A} (because it is not winning); and {A,B,C,D} (because {B,C,D} can win without A). Thus: Bz(A) = 6/8 = .75

slide-28
SLIDE 28

The Banzhaf Index (3)

Looking at the coalitions to which B belongs, we identify:

{B},{A,B}, {B,C}, {B,D}, {A,B,C}, {A,B,D}, {B,C,D}, (A,B,C,D}.

Checking further we see that B is critical to only two

  • f these coalitions:

{B}, {B,C}, {B,D} are not winning; and {A,B,C}, {A,B,D}, and {A,B,C,D} are winning even if B defects.

The positions of C and D are equivalent to that of B. Thus: Bz(B) = Bz(C) = Bz(D) = 2/8 = .25

slide-29
SLIDE 29

Power indices: a general formulation

Let pi(S), for each S∈2N\{∅}, i∉S, be the probability of coalition S∪{i} to form (of course ∑S⊆N:i∉S pi(S)=1) A power index ψi(v) is defined as the probability of player i to be critical in v according to p:

ψi(v)=∑S⊆N:i∉S pi(S) [v(S∪{i})-v(S)]

slide-30
SLIDE 30

Power indices: a general formulation (2)

According to the Banzhaf power index, every coalitions has the same probability to form: pi(S)=1/(2n-1), for each S∈2N\{∅}, i∉S According to the Shapley-Shubick power index, compute pi(S) according to the following procedure to create at random from N a subset S to which i does not belong:

Draw at random a number out of the urn consisting of possible sizes 0,1,2,…,n-1 where each number has probability 1/n to be drawn If size s is chosen, draw a set out of the urn consisting of subsets of N\{i} of size s, where each set has the same probability, i.e. 1/combinations(n-1,s) indeed, pi(S)=(s! (n-s-1)!)/n!

slide-31
SLIDE 31

UN Security Council

  • 15 member states:

– 5 Permanent members: China, France, Russian Federation, United Kingdom, USA – 10 temporary seats (held for two‐year terms ) (http://www.un.org/)

slide-32
SLIDE 32

UN Security Council decisions

  • Decision Rule: substantive resolutions need

the positive vote of at least nine Nations but… …it is sufficient the negative vote of one among the permanent members to reject the decision.

  • Q: quantify the power of nations inside the

ONU council to force a substantive decision?

  • Game Theory gives an answer using the

Shapley-Shubik power index:

slide-33
SLIDE 33

Power

≅ 19.6%

Shapley-Shubik power index

Power

≅ 0.2%

temporary seats since January 1st 2007 until January 1st 2009

slide-34
SLIDE 34

Power

≅ 5%

Power

≅ 0.5%

temporary seats since January 1st 2007 until January 1st 2009

Banzhaf power index

slide-35
SLIDE 35

Central “dogma” of molecular biology (Crick (1958)

Gene expression occurs when genetic information contained within DNA is transcripted into mRNA molecules and then translated into the proteins. Nowadays, microarray technology is available for taking “pictures” of gene expressions. Within a single experiment of this sophisticated technology, the level of expression of thousands of genes can be estimated in a sample of cells under a given condition.

slide-36
SLIDE 36

Hybridize to microarray A B C D Normal Cell

geneA geneB geneC mRNAs geneC

Tumor cell

geneA geneB mRNAs geneC geneB

Fluorescent labelling reaction with reverse transcription

geneA geneB geneC geneC geneA geneB geneC geneB

A B C Scan image D

Normal Tumor

Gene A 1 1 Gene B 1 2 Gene C 2 1 Gene D 0

slide-37
SLIDE 37

Expression level of gene 5 in array 4 Array1 Array2 Array3 …

slide-38
SLIDE 38

The dimension of information

  • A typical experiment: a table of numbers with

more than 22000 rows (genes) e 60 of arrays (samples).

  • If we would print the entire table with a

character of 12pt, it would be necessary almost 3700 pages A4…

  • …a surface of almost 220 square meters!
slide-39
SLIDE 39

From political and social science to genomics…

  • Players are genes
  • Who knows the decision rule in this context?
  • IDEA: we can make a rule on microarray gene expression

profiles.

  • Example: we define a criterion to establish which genes

have abnormal expressions on each array

3.586 gene3 2.453 gene2 1.121 gene1 array1 1 gene3 1 gene2 gene1 array1

slide-40
SLIDE 40

Decision rule

A group of genes is winning on a single array if all genes that have abnormal expressions belong to that group

1 gene3 1 gene2 gene1 array1

Both groups {gene2, gene3} and group {gene1, gene2, gene3} are winning.

slide-41
SLIDE 41

… Array1 Array2 Array3

gene3 gene2 gene1 1 1 array1 1 1 array2 1 array3

  • coalition {gene2, gene3} is winning two times out of three;
  • coalition {gene1, gene2} is winning one time out of three;
  • And so on for each coalition...
slide-42
SLIDE 42

Example 1 1 g3 1 1 g2 1 g1

Array3 Array2 Array1

The corresponding microarray game <{g1,g2,g3},v> tale che v(∅)=v({g1})=v({g2})=0 v({g1,g3})=v({g1,g2})=v({g3})=1/3 v({g2,g3})=2/3 v({g1,g2,g3})=1.

The Shapley value is

Shg1=1/6 Shg2=1/3 Shg3=1/2

slide-43
SLIDE 43

Axioms for the Shapley value on microarray games Property 1: Null Gene (NG) A gene which does not contribute to change the worth of any coalition of genes, should receive zero power. Prop.2:Equal Splitting (ES) Each sample should receive the same level of reliability. So the power of a gene on two samples should be equal to the sum of the power on each sample divided by two.

1 s1 1 1 s2 g3 g2 g1 1 g3 1 g2 1 g1 s2 s1

ψ1 ψ2 ψ3 ψ’1 ψ’2 ψ’3

+ =

(ψ1+ψ’1)/2 (ψ2+ ψ’2)/2 (ψ3+ ψ’3)/2

slide-44
SLIDE 44

Partnership of genes

A group of genes S such that does not exist a proper (⊂) subset of S which contributes in changing the worth of genes outside S.

1 1 g3 1 1 g2 1 1 g1 s3 s2 s1 Example

These two sets are partnerships of genes in the corresponding Microarray game

slide-45
SLIDE 45

Property 3: Partnership Monotonicity (PM) (N,v) a microarray game. If two partnerships of genes S and T, with |T|≥|S| are such that they are

  • disjoint (S∩T=∅),
  • equivalent (v(S)=v(T))
  • exhaustive (v(S∪T)=v(N)),

then genes in the smaller partnership S must receive more relevance then genes in T.

ψi ≥ ψk For each i∈{1,2} k∈{3,4,5} 1 g3 1 g2 1 g1 s2 s1 1 g5 1 g4

ψ1 ψ2 ψ3 ψ4 ψ5

Example

slide-46
SLIDE 46

Property 4: Partnership Rationality (PR) The total amount of power index received from players of a partnership S should not be smaller than v(S) Property 5: Partnership Feasibility (PF) The total amount of power index received from players of a partnership S should not be greater than v(N) Theorem (Moretti, Patrone, Bonassi (2007)): The Shapley value is the unique solution which satisfies NG, ES, PM, PR, PF on the class of microarray games.

slide-47
SLIDE 47

Real data analysis

slide-48
SLIDE 48

Application (1): Neuroblastic Tumors data

(Cancer, 113(6), 1412 – 1422) Neuroblastic Tumors (NTs) is a group of pediatric cancers with a great tissue heterogeneity. Most of NTs are composed of undifferentiated, poorly differentiated or differentiating neuroblastic (Nb) cells with very few or absence of Schwannian Stromal (SS) cells: these tumors are grouped as Neuroblastoma (Schwannian stroma- poor) (labeled as NTs-SP). The remaining NTs are composed of abundant SS cells and classified as Ganglioneuroblastoma (Schwannian stroma-rich) intermixed or nodular and Ganglioneurom (labeled as NTs-SR). The evolution of the disease is strongly influenced by the istology of the tumor and children with NTs-SR have a better prognosis w.r.t, NTs-SP.

slide-49
SLIDE 49 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 1 2

Minimum expression Maximum expression 2 geni 22283 geni

slide-50
SLIDE 50

NT-SR NT-SP

1589_GNB.CEL 1172_GNB.CEL 1761_GNB.CEL 1134_RICH.CEL 1999_RICH.CEL 1591_GNB.CEL SR2_RICH.CEL SR1_RICH.CEL SR3_RICH.CEL 1547_POOR.CEL 1919_POOR.CEL 2182_POOR.CEL 1538_POOR.CEL 2056_POOR.CEL 2237_POOR.CEL 2259_POOR.CEL 2181_POOR.CEL 2215_POOR.CEL 2216_POOR.CEL

RP11-35N6 NOL4 NHLH2 SLC17A6 TTK CENPF PBK TNRC9 214046_at CALB1 GREB1 IL7 MMP12 CDH10 MYCN TMSL8 RRM2 EYA1 TFAP2B MAB21L2 INSM1 POU4F2 TOP2A GATA3 CDH1 MEOX2 TNNC1 CXCL13 UTS2 SLC22A3 GPR126 MYOT TSPAN8 NR4A2 P2RY14 CYP1B1 ANGPTL7 PMP2 GPM6B ASPA CHL1 CXCL14 CALCA SST CFI MAL PLP1 CDH19 ABCA8 APOD 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 1 2

Minimum expression Maximum expression

slide-51
SLIDE 51

Application (2): effects of air pollution

BMC Bioinformatics (IF 3.49), 9:361).

  • Study population:

23 children from 12 families (2 siblings) from the areas of Teplice (TP) in Czech Republic TP is infamous for air pollution 24 children from the rural, less polluted are of Prachatice (PR) Hybridization to Agilent Human 1A Oligo Microarray (v2) G4110B, containing over 22000 60mer probes Individual samples were hybridized with a sample of the common reference (a pool of PR individuals) Data have been normalized, condensed and filtered by Genedata, Basel (CH)

slide-52
SLIDE 52

Selection based on two criteria: Shapley value and CASh

838 genes 889 genes

CASh

slide-53
SLIDE 53

Game theory Gene selection

Application (2): effects of air pollution

Distance: Euclidean Agglomerative method: Ward

47 biological samples (columns) and 159 genes (rows) with highest Shapley values and with un-adjusted p-value smaller than 0.01. yellow = high expression blue = low expression

3_0 4_0 2_0 20_0 13_0 14_0 15_0 1_0 21_0 23_0 9_0 5_0 8_0 35_1 10_0 11_0 18_0 19_0 12_0 16_0 7_0 6_0 22_0 24_0 32_1 43_1 45_1 29_1 38_1 28_1 33_1 37_1 25_1 47_1 30_1 39_1 34_1 40_1 46_1 17_0 26_1 27_1 41_1 42_1 44_1 31_1 36_1

A_23_P42991 A_23_P127475 A_23_P22487 A_23_P122445 A_23_P154766 A_23_P29248 A_23_P35534 A_23_P357374 A_23_P130926 A_23_P60520 A_23_P154849 A_23_P401524 A_23_P57089 A_23_P38876 A_23_P109643 A_23_P8571 A_23_P143385 A_23_P97652 A_23_P207160 A_23_P219060 A_23_P218476 A_23_P61222 A_23_P80398 A_23_P101380 A_23_P101761 A_23_P150207 A_23_P100263 A_23_P104589 A_23_P109837 A_23_P500824 A_23_P23194 A_23_P17587 A_23_P78061 A_23_P102940 A_23_P204511 A_23_P37866 A_23_P169599 A_23_P116264 A_23_P70915 A_23_P163903 A_23_P153562 A_23_P29124 A_23_P9582 A_23_P16163 A_23_P32715 A_23_P141044 A_23_P69652 A_23_P67411 A_23_P502710 A_23_P213640 A_23_P74138 A_23_P252369 A_23_P15135 A_23_P164760 A_23_P58266 A_23_P87219 A_23_P315430 A_23_P132285 A_23_P116890 A_23_P414308 A_23_P2990 A_23_P7963 A_23_P27697 A_23_P366812 A_23_P154876 A_23_P211397 A_23_P254605 A_23_P134433 A_23_P31195 A_23_P426989 A_23_P124931 A_23_P151166 A_23_P382775 A_23_P203994 A_23_P141699 A_23_P371787 A_23_P106002 A_23_P206466 A_23_P320095 A_23_P55854 A_23_P118916 A_23_P56969 A_23_P150876 A_23_P119502 A_23_P376599 A_23_P16597 A_23_P258381 A_23_P6866 A_23_P304530 A_23_P147623 A_23_P53033 A_23_P50081 A_23_P59069 A_23_P339744 A_23_P25790 A_23_P2582 A_23_P255569 A_23_P312924 A_23_P101500 A_23_P64860 A_23_P111042 A_23_P150664 A_23_P98581 A_23_P257296 A_23_P136413 A_23_P51926 A_23_P74641 A_23_P57941 A_23_P8293 A_23_P412409 A_23_P106641 A_23_P16722 A_23_P8412 A_23_P25551 A_23_P337201 A_23_P84870 A_23_P110742 A_23_P118392 A_23_P409386 A_23_P201432 A_23_P126241 A_23_P86034 A_23_P126970 A_23_P136635 A_23_P66758 A_23_P143885 A_23_P258272 A_23_P28869 A_23_P159195 A_23_P140277 A_23_P843 A_23_P501435 A_23_P78438 A_23_P137832 A_23_P112412 A_23_P359111 A_23_P145146 A_23_P200325 A_23_P112825 A_23_P212515 A_23_P92786 A_23_P19790 A_23_P78509 A_23_P27315 A_23_P151459 A_23_P69908 A_23_P71591 A_23_P15937 A_23_P213551 A_23_P164536 A_23_P411814 A_23_P104201 A_23_P88710 A_23_P218160 A_23_P258340 A_23_P217776 A_23_P165214 A_23_P112652 A_23_P87150

Cluster A Cluster B Sub-cluster C