Detection of network motifs by local Local Statistics - - PowerPoint PPT Presentation

detection of network motifs by local
SMART_READER_LITE
LIVE PREVIEW

Detection of network motifs by local Local Statistics - - PowerPoint PPT Presentation

Detection of network motifs by local concentration Etienne Birmel e Context Detection of network motifs by local Local Statistics concentration A global statistic Motif detection Etienne Birmel e procedure Application to


slide-1
SLIDE 1

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Detection of network motifs by local concentration

Etienne Birmel´ e

Laboratoire Statistique et G´ enome, Universit´ e d’Evry Groupe SSB - ANR NeMo

slide-2
SLIDE 2

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

1 Context 2 Local Statistics 3 A global statistic 4 Motif detection procedure 5 Application to Yeast 6 Conclusion

slide-3
SLIDE 3

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Network motifs

A motif is a small graph which is over-represented in a network: it’s a candidate to be studied for a potential biological meaning. Example: the feed-forward loop

  • X

Y Z

slide-4
SLIDE 4

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Network motif detection

All previous methods look for an overall over-representation:

  • U. Alon’s group (since 2002): simulations for size 3 and 4,

Z-score

  • J. Berg and M. L¨

assig (2004): probabilistic motifs by an alignment heuristic

  • F. Picard et al (2008): mixture model for the network and

Polya-Aeppli distribution.

slide-5
SLIDE 5

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Leading ideas

  • A small graph m may be over-represented because one of

its subgraphs m′ is over-represented. In that case, m′ is the relevant motif.

  • Motifs in regulatory networks are known to be

concentrated on some places of the networks (Dobrin & al 04).

  • Z = f (X1, . . . , Xn) is highly concentrated around its mean

when the Xi’s are independent and changing the value of

  • ne of them does affect Z by less than a constant.
slide-6
SLIDE 6

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Changing the definition of a motif

Consider a small graph m and a subgraph m′ of m obtained by the deletion of a vertex in m.

  • m′

m m is a motif with respect to m′ if there exist an occurence of m′ in the network which has a surprisingly high number of extensions to occurences of m.

  • m′
slide-7
SLIDE 7

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Random graph model

We fix the number n of nodes and the underlying random graph model is defined by a n × n matrix C: the edge indicators (Xij)1≤i,j≤n are independent Bernoulli variables and P(Xij = 1) = cij In particular, our theory is valid for:

  • Edge probability proportional to didj.
  • Mixture models on graphs with fixed classes.
slide-8
SLIDE 8

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Random graph model

  • P(NN) = 1/2

P(RR) = 1/4 P(NR) = 0 P(RN) = 1/16

slide-9
SLIDE 9

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

1 Context 2 Local Statistics 3 A global statistic 4 Motif detection procedure 5 Application to Yeast 6 Conclusion

slide-10
SLIDE 10

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Notations

Let m be a small graph on k vertices (r1, . . . , rk−1, s) and m′ the subgraph obtained by deleting s. Let U = (u1, . . . , uk−1) be an ordered set of k − 1 vertices. We define:

  • NU(m) the number of occurrences of m which restriction

to U is isomorphic to m′;

  • YU(m′) = IG[U]∼m′
  • extv

U(m′, m) = 1 ⇔ ∀i, Xuiv = eris

extv

U = 1 if adding the vertex v yields an occurence of m.

  • λU = E(

v / ∈U extv U) the mean number of valid extensions.

slide-11
SLIDE 11

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Notations

Then NU(m) = YU(m′)

  • v /

∈U

extv

U(m′, m)

and YU and extv

U are independent.

  • U
slide-12
SLIDE 12

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Example

  • 1

2 3 4 5 6 7 8 9 r3 r3 r1 r1 r2 r2 s m m′ G For U = (3, 2, 4), YU(m′) = 1 and NU(m) = 3.

slide-13
SLIDE 13

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Poisson approximation

  • v /

∈U extv U is a sum of independant Bernoulli r.v.’s and can

therefore be approximated in total variation distance by a Poisson law of mean λU: ∀A ⊂ Z+, |P(NU(m) ∈ A|YU(m′)) − Po(λU)(A)| ≤ min(1, 1/λU)

  • v

p2

v

with pv = P(extv

U = 1).

In practice, pv’s are small and that bound is quite sharp (between 1.8e − 9 and 5.0e − 3 for the different positions of the feed-forward loop in the Yeast regulatory network)

slide-14
SLIDE 14

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

A local statistic

The upper bound approximation is even better for tail probabilities: If t = m−λU

λU

> 1, P

  • NU(m) ≥ m|YU(m′)

t t − 1Po(λU)([m, +∞)) ≤ t + 1 t − 1Po(λU)(m) which implies P NU(m) − λU λU > t

  • ≤ P(YU(m′) = 1)

t + 1 √ 2π(t − 1) e−((1+t) ln(1+t)−

slide-15
SLIDE 15

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Simulated exemple

1e7 graphs were generated using three vertex classes of 100 vertices each and respective probabilities of connection .25 and .05 depending on whether the vertices belonged to the same class or not. The pattern m is the feed-forward loop and the vertex deleted to obtain m′ is Z.

  • X

Y Z

The position U contains one vertex of class 1 and one vertex of class 2. The mean number of extensions is λU = 2.725

slide-16
SLIDE 16

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Simulated exemple

Black: empirical p-values Red: Po(λ) p-values

5 10 15 0.0 0.2 0.4 0.6 0.8 x 1 − f(x)

slide-17
SLIDE 17

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Zoom on large deviations

12 13 14 15 16 0.0e+00 5.0e−06 1.0e−05 1.5e−05 2.0e−05 x 1 − f(x)

slide-18
SLIDE 18

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Refining the bounds for moderate deviations

Theorem

Let η =

v p2 v and φ = η λU + η(1 + t)2.

If λU + √λU ≤ m ≤ λ2

U/η,

1 − 15φ ≤ P(NU(m) ≥ m) Po(λ)([m, +∞)) ≤ 1 + 15φ Example: ER model with p = c

  • n. For 1 ≤ m ≤

√ 15αn, (1 − α)cm ≤ P(NU(m) ≥ m) Po(λ)([m, +∞)) ≤ 1 + α with limm→∞ cm = 1.

slide-19
SLIDE 19

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

1 Context 2 Local Statistics 3 A global statistic 4 Motif detection procedure 5 Application to Yeast 6 Conclusion

slide-20
SLIDE 20

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

A nice function

h(X, Y ) = if X ≤ Y X ln( X

eY ) + Y

else

5 10 15 20 25 30 10 20 30 40 X h(X,Y) Y=1 Y=5 Y=10

At fixed Y , hY : X → h(X, Y ) is increasing, asymptotically equivalent to X ln(X).

slide-21
SLIDE 21

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

A global statistic

The local inequality can be rewritten as: ∀t > 0, P

  • h(NU(m), λU) > t
  • ≤ P(G[U] ∼ m′)e−t

As P

  • max

U (h(NU(m), λU)) > t

  • U

P

  • h(NU(m), λU) > t
  • ,

Theorem

P

  • max

U (h(NU(m), λU)) > t

  • ≤ aut(m′)ENU(m′)e−t
slide-22
SLIDE 22

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

An user-friendly corollary

Corollary

P

  • ∃U/NU(m) > e2λU + t
  • ≤ aut(m′)ENU(m′)e−t

That is: The probability that there exist any occurrence of m′ in the network which has a surprisingly high number of extensions to m decreases exponentially.

slide-23
SLIDE 23

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

1 Context 2 Local Statistics 3 A global statistic 4 Motif detection procedure 5 Application to Yeast 6 Conclusion

slide-24
SLIDE 24

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Motif selection criterion

Fix a threshold α. For every pattern m of size ≤ k, every non-disconnecting vertex s of m, do the following steps: First step Determine if m is over-represented with respect to m \ {s}. Second step If the answer is positive, determine for every non-disconnecting t distinct from s if m \ {t} is

  • ver-represented with respect to m \ {s, t}.

m is a motif with respect to m \ {s} if the answer to the the second question is negative for all t.

slide-25
SLIDE 25

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Example

  • X

X Y Y Z Z T T 2.1e − 10 2.4 The feed-forward loop beeing over-represented with respect to Z, the pattern is not a motif.

slide-26
SLIDE 26

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

1 Context 2 Local Statistics 3 A global statistic 4 Motif detection procedure 5 Application to Yeast 6 Conclusion

slide-27
SLIDE 27

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Data

Transcriptional regulatory network available at U. Alon’s lab webpage: 680 genes and 1078 interactions. Bayesian estimation of the parameters for a mixture model for graphs gives 7 vertex classes, of respective sizes 11,13,29,78,87,88 and 384.

slide-28
SLIDE 28

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Motifs of size 3, 4, 5

slide-29
SLIDE 29

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

1 Context 2 Local Statistics 3 A global statistic 4 Motif detection procedure 5 Application to Yeast 6 Conclusion

slide-30
SLIDE 30

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Conclusion

  • New definition of a motif: a motif is over-represented with

respect to a submotif.

  • Fast algorithm.
  • The known relevant motifs in the Yeast regulation network

are found.

slide-31
SLIDE 31

Detection of network motifs by local concentration Etienne Birmel´ e Context Local Statistics A global statistic Motif detection procedure Application to Yeast Conclusion

Perspectives

  • Lower bounds,
  • Deeper biological applications,
  • Network comparisons using the local score lists.