The non unfoldable self-avoiding walks Christophe Guyeux FEMTO-ST - - - PowerPoint PPT Presentation

the non unfoldable self avoiding walks
SMART_READER_LITE
LIVE PREVIEW

The non unfoldable self-avoiding walks Christophe Guyeux FEMTO-ST - - - PowerPoint PPT Presentation

The non unfoldable self-avoiding walks Christophe Guyeux FEMTO-ST - DISC Department - AND Team March 21th, 2014 Plan The PSP problem Introducing the foldable SAWs The study of foldable SAWs Conclusion FEMTO-ST Institute 2 / 34


slide-1
SLIDE 1

The non unfoldable self-avoiding walks

Christophe Guyeux FEMTO-ST - DISC Department - AND Team

March 21th, 2014

slide-2
SLIDE 2

Plan

The PSP problem Introducing the foldable SAWs The study of foldable SAWs Conclusion

FEMTO-ST Institute 2 / 34

slide-3
SLIDE 3

Self-Avoiding Walk

Let d 1. A n−step self-avoiding walk (SAW) from x ∈ Zd to y ∈ Zd is a map w : 0, n → Zd with:

  • w(0) = x and w(n) = y,
  • |w(i + 1) − w(i)| = 1,
  • ∀i, j ∈ 0, n, i = j ⇒ w(i) = w(j) (self-avoiding property).

FEMTO-ST Institute 3 / 34

slide-4
SLIDE 4

Protein Structure Prediction problem

FEMTO-ST Institute 4 / 34

slide-5
SLIDE 5

The Protein Folding Process

  • Proteins, polymers formed by different kinds of amino

acids, fold to form a specific tridimensional shape

  • This geometric pattern defines the majority of functionality

within an organism

  • Contrary to the mapping from DNA to the amino acids

sequence, the complex folding of this last sequence still remains not well-understood

FEMTO-ST Institute 5 / 34

slide-6
SLIDE 6

The 2D HP model

Hydrophilic-hydrophobic 2D square lattice model:

  • A protein conformation is a “self-avoiding walk (SAW)” on a

2D lattice (low resolution model)

  • Its free energy E must be minimal
  • Hydrophobic interactions dominate protein folding:
  • Protein core freeing up energy is formed by hydrophobic

amino acids

  • Hydrophilic a.a. tend to move in the outer surface
  • E depends on contacts between hydrophobic amino acids

that are not contiguous in the primary structure

FEMTO-ST Institute 6 / 34

slide-7
SLIDE 7

The 2D HP model

Objective: to map the labeled straight line in this latter, having more black neighbors:

FEMTO-ST Institute 7 / 34

slide-8
SLIDE 8

Resolving the PSP problem

  • Being NP-complete, the optimal conformation(s) cannot be

found exactly for large n’s

  • Conformations are thus predicted using AI tools
  • Some strategies found in the literature:
  • 1. start by predicting the 2D backbone,
  • 2. then refine the obtained conformation in a 3D shape
  • At least two strategies for 2D backbone prediction:
  • Method 1: iterating ±90◦ pivot moves on the straight line
  • Method 2: stretching 1 amino acid until obtaining an

n-steps conformation

  • ...?

FEMTO-ST Institute 8 / 34

slide-9
SLIDE 9

Various methods for solving PSP

  • 1. PSP by folding SAWs
  • 2. PSP by stretching SAWs

FEMTO-ST Institute 9 / 34

slide-10
SLIDE 10

My first example

FEMTO-ST Institute 10 / 34

slide-11
SLIDE 11

Introducing the foldable SAWs

FEMTO-ST Institute 11 / 34

slide-12
SLIDE 12

Self-avoiding walk encoding

Absolute encoding of a SAW: Movement Encoding Forward → Down ↓ 1 Backward ← 2 Up ↑ 3 Absolute encoding: 00011123322101

FEMTO-ST Institute 12 / 34

slide-13
SLIDE 13

Pivot move of ±90◦

The anticlockwise fold function is the function f : Z/4Z − → Z/4Z defined by f(x) = x − 1 (mod 4). A ±90◦ pivot move applies this function on the tail of the walk

FEMTO-ST Institute 13 / 34

slide-14
SLIDE 14

Madras and Sokal

Theorem

The pivot algorithm is ergodic for self-avoiding walks on Zd provided that all axis reflections, and:

  • either all 90◦ rotations
  • or all diagonal reflections,

are given nonzero probability. Any N−step SAW can be transformed into a straight rod by some sequence of 2N − 1 or fewer such pivots.

FEMTO-ST Institute 14 / 34

slide-15
SLIDE 15

Madras and Sokal example

Ergodicity is lost when considering single ±90◦ pivot moves

FEMTO-ST Institute 15 / 34

slide-16
SLIDE 16

A graph structure for unfolded SAWs

The graph Gn is defined as follows:

  • its vertices are the n−step self-avoiding walks, described

in absolute encoding;

  • there is an edge between two vertices si, sj ⇔ sj can be
  • btained by one pivot move of ±90◦ on si.

FEMTO-ST Institute 16 / 34

slide-17
SLIDE 17

Examples of Gn

G2 G3

FEMTO-ST Institute 17 / 34

slide-18
SLIDE 18

Method 1 vs method 2

  • Sn: all the vertices of Gn (all n-step SAWs)

⇒ An equivalence relation: w1Rnw2 ⇔ w1 is in the same connected component that w2 on Gn.

  • fSAWn: the connected component of the straight line

00 . . . 0 in Gn,

FEMTO-ST Institute 18 / 34

slide-19
SLIDE 19

Method 1 vs method 2

  • Sn: all the vertices of Gn (all n-step SAWs)

⇒ An equivalence relation: w1Rnw2 ⇔ w1 is in the same connected component that w2 on Gn.

  • fSAWn: the connected component of the straight line

00 . . . 0 in Gn, We rediscovered that for some n, fSAWn Gn.

  • It is an obvious consequence of Madras example
  • This fact is not known by some computer scientists
  • ⇒ Method 1 and Method 2 do not produce the same set of

conformations

FEMTO-ST Institute 18 / 34

slide-20
SLIDE 20

Method 1 vs method 2

  • Sn: all the vertices of Gn (all n-step SAWs)

⇒ An equivalence relation: w1Rnw2 ⇔ w1 is in the same connected component that w2 on Gn.

  • fSAWn: the connected component of the straight line

00 . . . 0 in Gn, We rediscovered that for some n, fSAWn Gn.

  • It is an obvious consequence of Madras example
  • This fact is not known by some computer scientists
  • ⇒ Method 1 and Method 2 do not produce the same set of

conformations How evolves the ratio ♯fSAWn ♯Gn ?

FEMTO-ST Institute 18 / 34

slide-21
SLIDE 21

Some subsets of SAWs

We introduce the following sets:

  • fSAWn is the equivalence class of the n−step straight

walk, or the set of all folded SAWs.

  • fSAW(n, k) is the set of equivalence classes of size k in

(Gn, Rn).

  • USAWn is the set of equivalence classes of size 1

(Gn, Rn), that is, the set of unfoldable walks. ⇒ Madras’ walk belongs in USAW223

  • f 1SAWn is the complement of USAWn in Gn. This is the set
  • f SAWs on which we can apply at least one pivot move of

±90◦.

FEMTO-ST Institute 19 / 34

slide-22
SLIDE 22

The study of foldable SAWs

FEMTO-ST Institute 20 / 34

slide-23
SLIDE 23

Current investigation techniques

  • For small n’s: brute force.
  • Nb of fSAW(n) = 4*Nb of fSAW(n) starting by 0 = 4*(Nb of

fSAW(n) starting by 00 + 2* Nb of fSAW(n) starting by 01)

  • Stop when a polyomino appears
  • For large n’s: backtracking on reduced human solutions

FEMTO-ST Institute 21 / 34

slide-24
SLIDE 24

A short list of results

  • 1. 2n+2 ♯fSAWn 4 × 3n
  • 2. ∀n 22, fSAWn = Gn (n 11 in triangular lattice)
  • 3. fSAW108 G108.
  • let νn the smallest n 2 such that USAWn = ∅. Then

23 νn 108.

  • We can obtain all Gn, n 22 by increasing the number of

cranks

  • 4. ∀n 28, f 1SAWn = Gn, while f 1SAW108 G108.
  • 5. ∃k > 2 such that fSAW(n, k) is nonempty.
  • 6. The diameter of fSAW(n) is equal to 2n.

FEMTO-ST Institute 22 / 34

slide-25
SLIDE 25

fSAWn is not fSAW ′

n

Acceptable in fSAW Not in fSAW ′ fSAWn = fSAW ′

n

FEMTO-ST Institute 23 / 34

slide-26
SLIDE 26

Current smallest (108-step) USAW

FEMTO-ST Institute 24 / 34

slide-27
SLIDE 27

♯{n | USAW(n) = ∅} = ∞

FEMTO-ST Institute 25 / 34

slide-28
SLIDE 28

Cardinalities of subsets of SAWs

FEMTO-ST Institute 26 / 34

slide-29
SLIDE 29

Cardinalities of subsets of SAWs

FEMTO-ST Institute 27 / 34

slide-30
SLIDE 30

Case of triangular SAWs

FEMTO-ST Institute 28 / 34

slide-31
SLIDE 31

Vien diagrams for some Gn

fSAW(n) = f SAW(n)

1

fSAW(n) f SAW(n) nfSAW(n)

1

Gn for n 22 Diagram of Gn for n = 108

FEMTO-ST Institute 29 / 34

slide-32
SLIDE 32

Conclusion

FEMTO-ST Institute 30 / 34

slide-33
SLIDE 33

All walks are interesting ?

Protein synthesis Intrinsically complicated prot.

FEMTO-ST Institute 31 / 34

slide-34
SLIDE 34

Some open questions

  • 1. Did these walks constitute an exponentially small subset of

SAWs ?

  • 2. The PSP problem still remains NP-complete in fSAWn ?
  • 3. For any dimension d, do we have the existence of n ∈ N∗

such that fSAW d

n Gd n ?

  • 4. fSAW 2

2 and fSAW 2 3 are Hamiltonian graphs, but they are

not Eulerian. What about fSAW k

n ?

  • 5. is there an unfoldable walk in Z3 ?
  • 6. Are the connected components of Gd

n convex ?

  • 7. ...

FEMTO-ST Institute 32 / 34

slide-35
SLIDE 35

Other open questions

  • Monte-Carlo approach ?
  • Genetic algorithm approach ?
  • Dynamic programming ?
  • Pivot algorithm ?
  • Forbidden patterns ?

FEMTO-ST Institute 33 / 34

slide-36
SLIDE 36

Thank you!

Any question/suggestion/idea ?

christophe.guyeux@univ-fcomte.fr

FEMTO-ST Institute 34 / 34