Power Indices and Game Theory (Applications to Bioinformatics)
Stefano MORETTI stefano.moretti@dauphine.fr LAMSADE (CNRS), Paris Dauphine
Power Indices and Game Theory (Applications to Bioinformatics) - - PowerPoint PPT Presentation
Power Indices and Game Theory (Applications to Bioinformatics) Stefano MORETTI stefano.moretti@dauphine.fr LAMSADE (CNRS), Paris Dauphine von Neumann and Morgenstern, 1944 basic formal language for modeling economic phenomena. Dominant
Stefano MORETTI stefano.moretti@dauphine.fr LAMSADE (CNRS), Paris Dauphine
Dominant strategies Nash eq. (NE) Subgame perfect NE NE & refinements … Core Shapley value Nucleolus τ‐value PMAS …. Nash sol. Kalai‐ Smorodinsky …. CORE NTU‐value Compromise value … No binding agreements No side payments Q: Optimal behaviour in conflict situations binding agreements side payments are possible (sometimes) Q: Reasonable (cost, reward)‐sharing von Neumann and Morgenstern, 1944 basic formal language for modeling economic phenomena.
“Power meter”
This group has less than 667 thousands
This group has less than 667 thousands
This group has less than 667 thousands
This group has more than 667 thousands
This group has less than 667 thousands
This group has less than 667 thousands
This group has less than 667 thousands
This group has more than 667 thousands
370 150 480
pivotal
480+370>667
480+370+150>1000
370 150 480 P pivotal 370 150 480 pivotal 370 150 480 pivotal 370 150 480 pivotal 370 150 480 pivotal
v(S)∈{0,1} for each coalition S∈2N By convention v(∅)=0. We will assume v(N)=1.
Three owners Green (G), White (W), and Red (R) with 48%, 37% and 15% of weights, respectively. To take a decision the 2/3 majority is required. We can model this situation as a simple game({G,W,R},w) s.t.: w(G) =0 w(W) =0 w(R) = 0 w(G,W) =1 w(G,R) = 0 w(W,R) = 0 w(G,W,R) = 1
A solution Φ is map assigning to each simple game (N,v) an n-vector of real numbers. For any two simple games (N,v),(N,w), Φ satisfies the transfer proeprty if it holds that Φ(v ∨ w)+Φ(v ∧ w) = Φ(v)+Φ(w).
Here v ∨ w is defined as (v ∨ w)(S) = (v(S) ∨ w(S)) = max{v(S),w(S)}, and v ∧ w is defined as (v ∧ w)(S) = (v(S) ∧ w(S)) = min{v(S),w(S)},
EXAMPLE Two TU-games v and w on N={1,2,3}.
v(1) =0 v(2) =1 v(3) = 0 v(1, 2) =1 v(1, 3) = 1 v(2, 3) = 0 v(1, 2, 3) = 1
w(1) =1 w(2) =0 w(3) = 0 w(1, 2) =1 w(1, 3) = 0 w(2, 3) = 1 w(1, 2, 3) = 1
v∧w(1) =0 v∧w(2) =0 v∧w(3) = 0 v∧w(1, 2) =1 v∧w(1, 3) = 0 v∧w(2, 3) = 0 v∧w(1, 2, 3) = 1
v∨w(1) =1 v∨w(2) =1 v∨w(3) = 0 v∨w(1, 2) =1 v∨w(1, 3) = 1 v∨w(2, 3) = 1 v∨w(1, 2, 3) = 1
Party A, 27%; Party B, 25%; Party C, 24%; Party D 24%.
A:27 seats; B:25 seats; C:24 seats; D:24 seats; Quota: 51 A:2 seats; B:1 seats; C:1 seats; D:1 seats; Quota: 3 …
w(A) =1 w(B) =0 w(C) = 0 W(D)=0 w(A, B) =1 w(A, C) = 1 w(A, D) = 1 w(B, C) = 0 w(B, D) = 0 w(C, D) = 0 w(A, B, C) = 1 w(A, B, D) = 1 w(A, C, D) = 1 w(B, C, D) = 1 w(A, B, C, D) = 1
{B},{A,B}, {B,C}, {B,D}, {A,B,C}, {A,B,D}, {B,C,D}, (A,B,C,D}.
Draw at random a number out of the urn consisting of possible sizes 0,1,2,…,n-1 where each number has probability 1/n to be drawn If size s is chosen, draw a set out of the urn consisting of subsets of N\{i} of size s, where each set has the same probability, i.e. 1/combinations(n-1,s) indeed, pi(S)=(s! (n-s-1)!)/n!
temporary seats since January 1st 2007 until January 1st 2009
temporary seats since January 1st 2007 until January 1st 2009
Hybridize to microarray A B C D Normal Cell
geneA geneB geneC mRNAs geneC
Tumor cell
geneA geneB mRNAs geneC geneB
Fluorescent labelling reaction with reverse transcription
geneA geneB geneC geneC geneA geneB geneC geneB
A B C Scan image D
Normal Tumor
Gene A 1 1 Gene B 1 2 Gene C 2 1 Gene D 0
Expression level of gene 5 in array 4 Array1 Array2 Array3 …
3.586 gene3 2.453 gene2 1.121 gene1 array1 1 gene3 1 gene2 gene1 array1
1 gene3 1 gene2 gene1 array1
… Array1 Array2 Array3
gene3 gene2 gene1 1 1 array1 1 1 array2 1 array3
Array3 Array2 Array1
The corresponding microarray game <{g1,g2,g3},v> tale che v(∅)=v({g1})=v({g2})=0 v({g1,g3})=v({g1,g2})=v({g3})=1/3 v({g2,g3})=2/3 v({g1,g2,g3})=1.
The Shapley value is
Axioms for the Shapley value on microarray games Property 1: Null Gene (NG) A gene which does not contribute to change the worth of any coalition of genes, should receive zero power. Prop.2:Equal Splitting (ES) Each sample should receive the same level of reliability. So the power of a gene on two samples should be equal to the sum of the power on each sample divided by two.
ψ1 ψ2 ψ3 ψ’1 ψ’2 ψ’3
(ψ1+ψ’1)/2 (ψ2+ ψ’2)/2 (ψ3+ ψ’3)/2
A group of genes S such that does not exist a proper (⊂) subset of S which contributes in changing the worth of genes outside S.
These two sets are partnerships of genes in the corresponding Microarray game
Property 3: Partnership Monotonicity (PM) (N,v) a microarray game. If two partnerships of genes S and T, with |T|≥|S| are such that they are
then genes in the smaller partnership S must receive more relevance then genes in T.
ψ1 ψ2 ψ3 ψ4 ψ5
Property 4: Partnership Rationality (PR) The total amount of power index received from players of a partnership S should not be smaller than v(S) Property 5: Partnership Feasibility (PF) The total amount of power index received from players of a partnership S should not be greater than v(N) Theorem (Moretti, Patrone, Bonassi (2007)): The Shapley value is the unique solution which satisfies NG, ES, PM, PR, PF on the class of microarray games.
(Cancer, 113(6), 1412 – 1422) Neuroblastic Tumors (NTs) is a group of pediatric cancers with a great tissue heterogeneity. Most of NTs are composed of undifferentiated, poorly differentiated or differentiating neuroblastic (Nb) cells with very few or absence of Schwannian Stromal (SS) cells: these tumors are grouped as Neuroblastoma (Schwannian stroma- poor) (labeled as NTs-SP). The remaining NTs are composed of abundant SS cells and classified as Ganglioneuroblastoma (Schwannian stroma-rich) intermixed or nodular and Ganglioneurom (labeled as NTs-SR). The evolution of the disease is strongly influenced by the istology of the tumor and children with NTs-SR have a better prognosis w.r.t, NTs-SP.
Minimum expression Maximum expression 2 geni 22283 geni
NT-SR NT-SP
1589_GNB.CEL 1172_GNB.CEL 1761_GNB.CEL 1134_RICH.CEL 1999_RICH.CEL 1591_GNB.CEL SR2_RICH.CEL SR1_RICH.CEL SR3_RICH.CEL 1547_POOR.CEL 1919_POOR.CEL 2182_POOR.CEL 1538_POOR.CEL 2056_POOR.CEL 2237_POOR.CEL 2259_POOR.CEL 2181_POOR.CEL 2215_POOR.CEL 2216_POOR.CEL
RP11-35N6 NOL4 NHLH2 SLC17A6 TTK CENPF PBK TNRC9 214046_at CALB1 GREB1 IL7 MMP12 CDH10 MYCN TMSL8 RRM2 EYA1 TFAP2B MAB21L2 INSM1 POU4F2 TOP2A GATA3 CDH1 MEOX2 TNNC1 CXCL13 UTS2 SLC22A3 GPR126 MYOT TSPAN8 NR4A2 P2RY14 CYP1B1 ANGPTL7 PMP2 GPM6B ASPA CHL1 CXCL14 CALCA SST CFI MAL PLP1 CDH19 ABCA8 APOD 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 1 2Minimum expression Maximum expression
BMC Bioinformatics (IF 3.49), 9:361).
23 children from 12 families (2 siblings) from the areas of Teplice (TP) in Czech Republic TP is infamous for air pollution 24 children from the rural, less polluted are of Prachatice (PR) Hybridization to Agilent Human 1A Oligo Microarray (v2) G4110B, containing over 22000 60mer probes Individual samples were hybridized with a sample of the common reference (a pool of PR individuals) Data have been normalized, condensed and filtered by Genedata, Basel (CH)
Selection based on two criteria: Shapley value and CASh
838 genes 889 genes
CASh
Game theory Gene selection
Application (2): effects of air pollution
Distance: Euclidean Agglomerative method: Ward
47 biological samples (columns) and 159 genes (rows) with highest Shapley values and with un-adjusted p-value smaller than 0.01. yellow = high expression blue = low expression
3_0 4_0 2_0 20_0 13_0 14_0 15_0 1_0 21_0 23_0 9_0 5_0 8_0 35_1 10_0 11_0 18_0 19_0 12_0 16_0 7_0 6_0 22_0 24_0 32_1 43_1 45_1 29_1 38_1 28_1 33_1 37_1 25_1 47_1 30_1 39_1 34_1 40_1 46_1 17_0 26_1 27_1 41_1 42_1 44_1 31_1 36_1
A_23_P42991 A_23_P127475 A_23_P22487 A_23_P122445 A_23_P154766 A_23_P29248 A_23_P35534 A_23_P357374 A_23_P130926 A_23_P60520 A_23_P154849 A_23_P401524 A_23_P57089 A_23_P38876 A_23_P109643 A_23_P8571 A_23_P143385 A_23_P97652 A_23_P207160 A_23_P219060 A_23_P218476 A_23_P61222 A_23_P80398 A_23_P101380 A_23_P101761 A_23_P150207 A_23_P100263 A_23_P104589 A_23_P109837 A_23_P500824 A_23_P23194 A_23_P17587 A_23_P78061 A_23_P102940 A_23_P204511 A_23_P37866 A_23_P169599 A_23_P116264 A_23_P70915 A_23_P163903 A_23_P153562 A_23_P29124 A_23_P9582 A_23_P16163 A_23_P32715 A_23_P141044 A_23_P69652 A_23_P67411 A_23_P502710 A_23_P213640 A_23_P74138 A_23_P252369 A_23_P15135 A_23_P164760 A_23_P58266 A_23_P87219 A_23_P315430 A_23_P132285 A_23_P116890 A_23_P414308 A_23_P2990 A_23_P7963 A_23_P27697 A_23_P366812 A_23_P154876 A_23_P211397 A_23_P254605 A_23_P134433 A_23_P31195 A_23_P426989 A_23_P124931 A_23_P151166 A_23_P382775 A_23_P203994 A_23_P141699 A_23_P371787 A_23_P106002 A_23_P206466 A_23_P320095 A_23_P55854 A_23_P118916 A_23_P56969 A_23_P150876 A_23_P119502 A_23_P376599 A_23_P16597 A_23_P258381 A_23_P6866 A_23_P304530 A_23_P147623 A_23_P53033 A_23_P50081 A_23_P59069 A_23_P339744 A_23_P25790 A_23_P2582 A_23_P255569 A_23_P312924 A_23_P101500 A_23_P64860 A_23_P111042 A_23_P150664 A_23_P98581 A_23_P257296 A_23_P136413 A_23_P51926 A_23_P74641 A_23_P57941 A_23_P8293 A_23_P412409 A_23_P106641 A_23_P16722 A_23_P8412 A_23_P25551 A_23_P337201 A_23_P84870 A_23_P110742 A_23_P118392 A_23_P409386 A_23_P201432 A_23_P126241 A_23_P86034 A_23_P126970 A_23_P136635 A_23_P66758 A_23_P143885 A_23_P258272 A_23_P28869 A_23_P159195 A_23_P140277 A_23_P843 A_23_P501435 A_23_P78438 A_23_P137832 A_23_P112412 A_23_P359111 A_23_P145146 A_23_P200325 A_23_P112825 A_23_P212515 A_23_P92786 A_23_P19790 A_23_P78509 A_23_P27315 A_23_P151459 A_23_P69908 A_23_P71591 A_23_P15937 A_23_P213551 A_23_P164536 A_23_P411814 A_23_P104201 A_23_P88710 A_23_P218160 A_23_P258340 A_23_P217776 A_23_P165214 A_23_P112652 A_23_P87150Cluster A Cluster B Sub-cluster C