Sequence-dependent equilibrium distributions of DNA within the cgDNA - - PowerPoint PPT Presentation

sequence dependent equilibrium distributions of dna
SMART_READER_LITE
LIVE PREVIEW

Sequence-dependent equilibrium distributions of DNA within the cgDNA - - PowerPoint PPT Presentation

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix Sequence-dependent equilibrium distributions of DNA within the cgDNA coarse grain model M elissa Nicolier Supervised by J.H. Maddocks and T.


slide-1
SLIDE 1

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Sequence-dependent equilibrium distributions of DNA within the cgDNA coarse grain model

M´ elissa Nicolier Supervised by J.H. Maddocks and T. Lessinnes

LCVM2

June 16, 2016

1/49

slide-2
SLIDE 2

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Content overview

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

2/49

slide-3
SLIDE 3

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Theoretical Tools

3/49

slide-4
SLIDE 4

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

The cgDNA model1: basic idea

We assume the probability density function of the configuration w

  • f a given sequence S to be

ρ(w; S) = 1 Z exp {−U(w; S)} , where

  • U(w; S) = 1

2(w −

w(S)) · K(S)(w − w(S)) is the quadratic function approximating the free energy of the sequence in the configuration w,

w(S) is the ground state shape vector,

  • K(S) is the stiffness matrix.

1O Gonzalez, D Petkeviˇ

ci¯ ut˙ e, and J.H Maddocks. “A sequence-dependent rigid-base model of DNA”. . In: The Journal of chemical physics 138.5 (2013),

  • p. 055102.

4/49

slide-5
SLIDE 5

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Configuration vector2

w = (y1, z1, y2, . . . , zn−1, yn) ∈ R6n+6(n−1) = R12n−6

  • intra-basepair coordinates ya = (ν, ξ)a ∈ R6:

describe the position of Xa with respect to X a.

2O Gonzalez, D Petkeviˇ

ci¯ ut˙ e, and J.H Maddocks. “Supplementary material to A sequence-dependent rigid-base model of DNA”. In: The Journal of chemical physics ().

5/49

slide-6
SLIDE 6

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Configuration vector2

w = (y1, z1, y2, . . . , zn−1, yn) ∈ R6n+6(n−1) = R12n−6

  • inter-basepair coordinates za = (θ, ζ)a ∈ R6:

describe the position of (Xa+1, X a+1) with respect to (Xa, X a).

2O Gonzalez, D Petkeviˇ

ci¯ ut˙ e, and J.H Maddocks. “Supplementary material to A sequence-dependent rigid-base model of DNA”. In: The Journal of chemical physics ().

5/49

slide-7
SLIDE 7

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

cgDNA model

  • Parameter set

P = {K α, K αβ, σα, σαβ : α, β ∈ {A, T, G, C}}, where σ = K w.

  • Parameter set + Sequence S ⇒ K(S) and σ(S)

w := K −1σ. Note: different parameter sets have been constructed, the one used in this project is cgDNAparamset2.

6/49

slide-8
SLIDE 8

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

cgDNA model

7/49

slide-9
SLIDE 9

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

cgDNA model

7/49

slide-10
SLIDE 10

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

cgDNA model

7/49

slide-11
SLIDE 11

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

cgDNA model

7/49

slide-12
SLIDE 12

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

From Parameter set to Stiffness Matrix. S = X1 · · · Xn

+

8/49

slide-13
SLIDE 13

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

From Parameter set to Stiffness Matrix. S = X1 · · · Xn

8/49

slide-14
SLIDE 14

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Periodic Parameters3

3Jaros

law G

  • lowacki. “Computation and Visualization for Multiscale

Modelling of DNA Mechanics”. PhD thesis. Ecole Polytechnique F´ ed´ erale de Lausanne, 2016.

9/49

slide-15
SLIDE 15

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Periodic Parameters3

3Jaros

law G

  • lowacki. “Computation and Visualization for Multiscale

Modelling of DNA Mechanics”. PhD thesis. Ecole Polytechnique F´ ed´ erale de Lausanne, 2016.

9/49

slide-16
SLIDE 16

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Periodic Parameters3

3Jaros

law G

  • lowacki. “Computation and Visualization for Multiscale

Modelling of DNA Mechanics”. PhD thesis. Ecole Polytechnique F´ ed´ erale de Lausanne, 2016.

9/49

slide-17
SLIDE 17

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Comparing probability density functions: Kullback-Leibler Divergence

Let E be a space of probability density functions ρ, with ρ(x) ≥ 0 and

  • Ω ρ(x)dx = 1. A divergence is a function D : E × E → R

such that D(x, y) ≥ 0 and D(x, y) = 0 ⇐ ⇒ x = y. Define ρi as the probability density function associated with a sequence Si. It has a ground state shape vector wi and a stiffness matrix Ki, i = 1, 2. Then, the Kullback-Leibler divergence, or symmetrised relative entropy is defined as follows: DKL(ρ1, ρ2) = 1 2

  • ρ1 ln

ρ1 ρ2

  • + ρ2 ln

ρ2 ρ1

  • dx

= 1 4[K −1

1

: K2 + K −1

2

: K1 − 2I : I] + 1 4( w1 − w2) · (K1 + K2)( w1 − w2).

10/49

slide-18
SLIDE 18

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler divergence in 1D

Two different PDFs Corresponding density of the Kullback-Leibler integrand

11/49

slide-19
SLIDE 19

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler divergence in 1D

Two different PDFs Corresponding density of the Kullback-Leibler integrand

11/49

slide-20
SLIDE 20

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Stiffness contribution of the Kullback-Leibler divergence

For K1 = K ⊺

1 ≥ 0 and K2 = K ⊺ 2 ≥ 0, ∃ n real eigenvalues µi > 0

for the generalized eigenvalue problem K1xi = µiK2xi ⇐ ⇒ K −1

1 K2xi = 1

µi xi ⇐ ⇒ K −1

2 K1xi = µixi.

⇒ trace(K −1

1 K2) = 1 µi , and trace(K −1 2 K1) = µi.

And hence Dstiff(ρ1, ρ2) = 1 4( K −1

1

: K2

  • trace(K −1

1

K2)

+ K −1

2

: K1

  • trace(K −1

2

K1)

−2I : I) = 1 4

  • ( 1

µi + µi − 2) = 1 4 √µi − 1 √µi 2

12/49

slide-21
SLIDE 21

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Linkage4: Definition through an example

  • 4mathworks. Hierarchical Clustering.

http://ch.mathworks.com/help/stats/hierarchical-clustering.html. 2016.

13/49

slide-22
SLIDE 22

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Linkage4: Definition through an example

  • 4mathworks. Hierarchical Clustering.

http://ch.mathworks.com/help/stats/hierarchical-clustering.html. 2016.

13/49

slide-23
SLIDE 23

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Linkage4: Definition through an example

  • 4mathworks. Hierarchical Clustering.

http://ch.mathworks.com/help/stats/hierarchical-clustering.html. 2016.

13/49

slide-24
SLIDE 24

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

From linkage to clustering

14/49

slide-25
SLIDE 25

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Dimer analysis

We have

  • 16 dimers Xm, m = 1, . . . , 16 (ten independent under

Crick-Watson symmetry).

  • For each dimer, a sequence Sk

m = XmXm · · · Xm

  • k times

, k = 1, 10, 20, . . .

  • For each sequence, we compute the stiffness matrix K k

m and

the ground state shape vector wk

m using cgDNA,

  • And construct the ”Divergence matrix” D :

Dij = DKL(Sk

i , Sk j ).

  • Recall

DKL(S1, S2) = 1 4[K −1

1

: K2 + K −1

2

: K1 − 2I : I] + 1 4( w1 − w2) · (K1 + K2)( w1 − w2).

15/49

slide-26
SLIDE 26

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Divergence matrix, with an arbitrary ordering of the dimers

16/49

slide-27
SLIDE 27

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Divergence matrix manually ordered

The ordering has been obtained in two steps

  • I first reordered the dimers to

sort the first line in ascending

  • rder. This gave the 4 × 4

corner blocks.

  • The 8 × 8 inner block was still

complicated, so I repeated the first step to order it.

17/49

slide-28
SLIDE 28

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Divergence matrix of the manually ordered trimers in YR-alphabet

18/49

slide-29
SLIDE 29

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Clustering

19/49

slide-30
SLIDE 30

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence: Dimers, in ATGC-alphabet

20/49

slide-31
SLIDE 31

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence: Dimers, in YR-alphabet

21/49

slide-32
SLIDE 32

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence: Trimers, in ATGC-alphabet

22/49

slide-33
SLIDE 33

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence: Trimers, in YR-alphabet

23/49

slide-34
SLIDE 34

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence: Tetramers, in YR-alphabet

24/49

slide-35
SLIDE 35

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Dimers

25/49

slide-36
SLIDE 36

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

KL Divergence with Equivalence Classes

  • Physically, ρ(S) ∼

= ρ(S), under change of reading strand.

  • ρ(poly − XY ) should be close to ρ(poly − YX) Therefore, we

define the shift operator Tk : Tk(X1 · · · Xn) = Xn−k+1 · · · XnX1 · · · Xn−k. Example: T1(ATGC) = CATG. For convenience, we set T0(S) := S.

  • We then introduce the following equivalence class of

sequences: S1 ∼ S2 ⇐ ⇒ ∃k ≥ 0 s.t S1 = Tk(S2) or S1 = Tk(S2). Then define the Kullback-Leibler divergence with equivalence classes as Dequiv

KL

(S1, S2) = min{DKL([S1], [S2])}, where [S] is the equivalence class of S.

26/49

slide-37
SLIDE 37

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence without equivalence classes: dimers, in YR-alphabet

27/49

slide-38
SLIDE 38

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

KL Divergence with Equivalence Classes: Dimer

If S = X1X2, then

  • If

Xi ∈ {A, T, C, G}, there are 6 [S].

  • If Xi ∈ {Y , R},

there are 2 [S].

28/49

slide-39
SLIDE 39

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence without equivalence classes: trimers, in YR-alphabet

29/49

slide-40
SLIDE 40

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

KL Divergence with Equivalence Classes: Trimer

If S = X1X2X3, then

  • If

Xi ∈ {A, T, C, G}, there are 12 [S].

  • If Xi ∈ {Y , R},

there are 2 [S].

30/49

slide-41
SLIDE 41

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Kullback-Leibler Divergence without equivalence classes: tetramers, in YR-alphabet

31/49

slide-42
SLIDE 42

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

KL Divergence with Equivalence Classes: Tetramer

If S = X1X2X3X4, then

  • If

Xi ∈ {A, T, C, G}, there are 39 [S].

  • If Xi ∈ {Y , R},

there are 4 [S].

32/49

slide-43
SLIDE 43

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Codons

33/49

slide-44
SLIDE 44

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Summary of the results

  • The Kullback-Leibler divergence between equilibrium

distributions discerns purines and pyrimidines in dimers, trimers and tetramers.

  • The clusterings obtained when adding the equivalence classes
  • f reading strand and translations also distinguishes the

purines and pyrimidines, and they are more consistent with

  • ur physical knowledge.
  • The redundancy of sequences in the codons is not

immediately explained by the Kullback-Leibler divergence between equilibrium distributions.

34/49

slide-45
SLIDE 45

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Spectral Analysis of the Periodic Stiffness Matrix

35/49

slide-46
SLIDE 46

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Spectra of sequence-dependent stiffness matrices computed from Molecular Dynamic5 and cgDNA

MD: cgDNA:

5Julien Delafontaine. “Spectral analysis of a coarse-grain model of DNA”. .

MA thesis. Ecole Polytechnique F´ ed´ erale de Lausanne, 2011.

36/49

slide-47
SLIDE 47

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Reminder: Periodic Parameters6

6Jaros

law G

  • lowacki. “Computation and Visualization for Multiscale

Modelling of DNA Mechanics”. PhD thesis. Ecole Polytechnique F´ ed´ erale de Lausanne, 2016.

37/49

slide-48
SLIDE 48

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Decomposition of the Periodic Stiffness Matrix Kp(XY )

+

Σ(XY )

+

Q(XY ) Q⊺(XY )

=

Kp(XY )

∈ R24×24

38/49

slide-49
SLIDE 49

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Decomposition of the Periodic Stiffness Matrix Kp(S2(XY ))

Kp(S2(XY )) =

39/49

slide-50
SLIDE 50

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Decomposition of the Periodic Stiffness Matrix Kp(S2(XY ))

= Σ(XY ) Q(XY )+ Q(XY )⊺ Q(XY )+ Σ(XY ) Q(XY )⊺ Q(XY ) =

Σ = Σ⊺ ∈ R24×24; Q ∈ R24×24.

39/49

slide-51
SLIDE 51

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Eigenvalues of Kp(AGAG)

σ(Kp(AGAG)) = {λi, i = 1, . . . , 48}

40/49

slide-52
SLIDE 52

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Eigenvalues of Kp(AGAG)

σ(Kp(AGAG)) = σ(Kp(AG)) ∪ σ(Σ − (Q + Q⊺)) Denote by

  • ui 24 linearly independent

eigenvectors of Kp(AG),

  • vi 24 linearly independent

eigenvectors of Σ − (Q + Q⊺). Then (ui, ui) and (vi, −vi) are 48 linearly independent eigenvectors

  • f Kp(AGAG).

41/49

slide-53
SLIDE 53

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Eigenvalues of Kp(MnMn)

This result can be generalized to polymers that are composed of an n-mer repeated twice. Σ(Mn), Q(Mn) ∈ R12n×12n = Σ(Mn) Q(Mn)+ Q(Mn)⊺ Q(Mn)+ Σ(Mn) Q(Mn)⊺ ⇒ σ(Kp(MnMn)) = σ(Σ + (Q + Q⊺)) ∪ σ(Σ − (Q + Q⊺))

42/49

slide-54
SLIDE 54

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix 43/49

slide-55
SLIDE 55

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix 43/49

slide-56
SLIDE 56

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix 43/49

slide-57
SLIDE 57

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Eigenvectors of Kp(S4(XY ))

Let

  • ui ∈ R24 the eigenvectors of Kp(XY ) = Σ + (Q + Q⊺),
  • vi ∈ R24 the eigenvectors of Σ − (Q + Q⊺),
  • wi ∈ R48 the eigenvectors of
  • Σ

Q − Q⊺ Q⊺ − Q Σ

  • .

Then linear independent eigenvectors of Kp(S4(XY )) are of the form     ui ui ui ui     ,     vi −vi vi −vi     ,

  • wi

−wi

  • .

44/49

slide-58
SLIDE 58

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Spectrum of Kp(S4(AG))

σ(S4(AG)) = {µ+

i , i=1,...,24} ∪ {µ− i , i=1,...,24} ∪ {λi, i=1,...,48}

45/49

slide-59
SLIDE 59

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

The 48 remaining eigenvalues of Kp(S4(XY )) are the eigenvalues of this matrix ˜ M

= Σ Q − Q⊺ Q⊺ − Q Σ

46/49

slide-60
SLIDE 60

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

The 48 remaining eigenvalues of Kp(S4(XY )) are the eigenvalues of this matrix ˜ M

The eigenspace associated with any eigenvalue of this matrix has the form x y

  • ,

−y x

  • 47/49
slide-61
SLIDE 61

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Summary of the results

For k = 1, the eigenvalues of the periodic stiffness matrix Kp(S2k(Mn)) = Kp(MnMn) ∈ R2(12n)×2(12n) are the union of the eigenvalues of

  • Σ(Mn) + Q(Mn) + Q⊺(Mn) = Kp(Mn) ∈ R12n×12n, and
  • Σ(Mn) − (Q(Mn) + Q⊺(Mn)) ∈ R12n×12n.

48/49

slide-62
SLIDE 62

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Summary of the results

For k > 1, the eigenvalues and eigenvectors of the periodic stiffness matrix Kp(S2k(Mn)) ∈ RN×N are the union of the eigenvalues of

Kp(S2k−1(Mn)) ∈ RN/2×N/2

Σ(S2k−2

(Mn))

Q(S2k−2

(Mn))+

Q⊺(S2k−2

(Mn))

Q⊺(S2k−2

(Mn))

Σ(S2k−2

(Mn)) +Q(S2k−2 (Mn))

And ˜ M ∈ RN/2×N/2

Σ(S2k−2

(Mn))

Q(S2k−2

(Mn))−

Q⊺(S2k−2

(Mn))

Q⊺(S2k−2

(Mn))

Σ(S2k−2

(Mn)) −Q(S2k−2 (Mn))

Moreover, the multiplicity of any eigenvalue of ˜ M is even.

48/49

slide-63
SLIDE 63

Theoretical Tools Dimer analysis Clustering Spectral Analysis of the Periodic Stiffness Matrix

Thank you for your attention!

49/49