the non unfoldable self avoiding walks
play

The non unfoldable self-avoiding walks Christophe Guyeux FEMTO-ST - - PowerPoint PPT Presentation

The non unfoldable self-avoiding walks Christophe Guyeux FEMTO-ST - DISC Department - AND Team March 21th, 2014 Plan The PSP problem Introducing the foldable SAWs The study of foldable SAWs Conclusion FEMTO-ST Institute 2 / 34


  1. The non unfoldable self-avoiding walks Christophe Guyeux FEMTO-ST - DISC Department - AND Team March 21th, 2014

  2. Plan The PSP problem Introducing the foldable SAWs The study of foldable SAWs Conclusion FEMTO-ST Institute 2 / 34

  3. Self-Avoiding Walk Let d � 1. A n − step self-avoiding walk (SAW) from x ∈ Z d to y ∈ Z d is a map w : � 0 , n � → Z d with: • w ( 0 ) = x and w ( n ) = y , • | w ( i + 1 ) − w ( i ) | = 1, • ∀ i , j ∈ � 0 , n � , i � = j ⇒ w ( i ) � = w ( j ) (self-avoiding property). FEMTO-ST Institute 3 / 34

  4. Protein Structure Prediction problem FEMTO-ST Institute 4 / 34

  5. The Protein Folding Process • Proteins, polymers formed by different kinds of amino acids, fold to form a specific tridimensional shape • This geometric pattern defines the majority of functionality within an organism • Contrary to the mapping from DNA to the amino acids sequence, the complex folding of this last sequence still remains not well-understood FEMTO-ST Institute 5 / 34

  6. The 2D HP model Hydrophilic-hydrophobic 2D square lattice model: • A protein conformation is a “self-avoiding walk (SAW)” on a 2D lattice (low resolution model) • Its free energy E must be minimal • Hydrophobic interactions dominate protein folding: • Protein core freeing up energy is formed by hydrophobic amino acids • Hydrophilic a.a. tend to move in the outer surface • E depends on contacts between hydrophobic amino acids that are not contiguous in the primary structure FEMTO-ST Institute 6 / 34

  7. The 2D HP model Objective: to map the labeled straight line in this latter, having more black neighbors: FEMTO-ST Institute 7 / 34

  8. Resolving the PSP problem • Being NP-complete, the optimal conformation(s) cannot be found exactly for large n ’s • Conformations are thus predicted using AI tools • Some strategies found in the literature: 1. start by predicting the 2D backbone, 2. then refine the obtained conformation in a 3D shape • At least two strategies for 2D backbone prediction: • Method 1: iterating ± 90 ◦ pivot moves on the straight line • Method 2: stretching 1 amino acid until obtaining an n -steps conformation • ...? FEMTO-ST Institute 8 / 34

  9. Various methods for solving PSP 1. PSP by folding SAWs 2. PSP by stretching SAWs FEMTO-ST Institute 9 / 34

  10. My first example FEMTO-ST Institute 10 / 34

  11. Introducing the foldable SAWs FEMTO-ST Institute 11 / 34

  12. Self-avoiding walk encoding Absolute encoding of a SAW: Movement Encoding Forward → 0 Down ↓ 1 Backward ← 2 Up ↑ 3 Absolute encoding: 00011123322101 FEMTO-ST Institute 12 / 34

  13. Pivot move of ± 90 ◦ The anticlockwise fold function is the function f : Z / 4 Z − → Z / 4 Z defined by f ( x ) = x − 1 ( mod 4 ) . A ± 90 ◦ pivot move applies this function on the tail of the walk FEMTO-ST Institute 13 / 34

  14. Madras and Sokal Theorem The pivot algorithm is ergodic for self-avoiding walks on Z d provided that all axis reflections, and: • either all 90 ◦ rotations • or all diagonal reflections, are given nonzero probability. Any N − step SAW can be transformed into a straight rod by some sequence of 2 N − 1 or fewer such pivots. FEMTO-ST Institute 14 / 34

  15. Madras and Sokal example Ergodicity is lost when considering single ± 90 ◦ pivot moves FEMTO-ST Institute 15 / 34

  16. A graph structure for unfolded SAWs The graph G n is defined as follows: • its vertices are the n − step self-avoiding walks, described in absolute encoding; • there is an edge between two vertices s i , s j ⇔ s j can be obtained by one pivot move of ± 90 ◦ on s i . FEMTO-ST Institute 16 / 34

  17. Examples of G n G 2 G 3 FEMTO-ST Institute 17 / 34

  18. Method 1 vs method 2 • S n : all the vertices of G n (all n -step SAWs) ⇒ An equivalence relation: w 1 R n w 2 ⇔ w 1 is in the same connected component that w 2 on G n . • fSAW n : the connected component of the straight line 00 . . . 0 in G n , FEMTO-ST Institute 18 / 34

  19. Method 1 vs method 2 • S n : all the vertices of G n (all n -step SAWs) ⇒ An equivalence relation: w 1 R n w 2 ⇔ w 1 is in the same connected component that w 2 on G n . • fSAW n : the connected component of the straight line 00 . . . 0 in G n , We rediscovered that for some n , fSAW n � G n . • It is an obvious consequence of Madras example • This fact is not known by some computer scientists • ⇒ Method 1 and Method 2 do not produce the same set of conformations FEMTO-ST Institute 18 / 34

  20. Method 1 vs method 2 • S n : all the vertices of G n (all n -step SAWs) ⇒ An equivalence relation: w 1 R n w 2 ⇔ w 1 is in the same connected component that w 2 on G n . • fSAW n : the connected component of the straight line 00 . . . 0 in G n , We rediscovered that for some n , fSAW n � G n . • It is an obvious consequence of Madras example • This fact is not known by some computer scientists • ⇒ Method 1 and Method 2 do not produce the same set of conformations How evolves the ratio ♯ fSAW n ? ♯ G n FEMTO-ST Institute 18 / 34

  21. Some subsets of SAWs We introduce the following sets: • fSAW n is the equivalence class of the n − step straight walk, or the set of all folded SAWs. • fSAW ( n , k ) is the set of equivalence classes of size k in ( G n , R n ) . • USAW n is the set of equivalence classes of size 1 ( G n , R n ) , that is, the set of unfoldable walks. ⇒ Madras’ walk belongs in USAW 223 • f 1 SAW n is the complement of USAW n in G n . This is the set of SAWs on which we can apply at least one pivot move of ± 90 ◦ . FEMTO-ST Institute 19 / 34

  22. The study of foldable SAWs FEMTO-ST Institute 20 / 34

  23. Current investigation techniques • For small n ’s: brute force. • Nb of fSAW ( n ) = 4*Nb of fSAW ( n ) starting by 0 = 4*(Nb of fSAW ( n ) starting by 00 + 2* Nb of fSAW ( n ) starting by 01) • Stop when a polyomino appears • For large n ’s: backtracking on reduced human solutions FEMTO-ST Institute 21 / 34

  24. A short list of results 1. 2 n + 2 � ♯ fSAW n � 4 × 3 n 2. ∀ n � 22 , fSAW n = G n ( n � 11 in triangular lattice) 3. fSAW 108 � G 108 . • let ν n the smallest n � 2 such that USAW n � = ∅ . Then 23 � ν n � 108. • We can obtain all G n , n � 22 by increasing the number of cranks 4. ∀ n � 28 , f 1 SAW n = G n , while f 1 SAW 108 � G 108 . 5. ∃ k > 2 such that fSAW ( n , k ) is nonempty. 6. The diameter of fSAW ( n ) is equal to 2 n . FEMTO-ST Institute 22 / 34

  25. fSAW n is not fSAW ′ n Not in fSAW ′ fSAW n � = fSAW ′ Acceptable in fSAW n FEMTO-ST Institute 23 / 34

  26. Current smallest (108-step) USAW FEMTO-ST Institute 24 / 34

  27. ♯ { n | USAW ( n ) � = ∅ } = ∞ FEMTO-ST Institute 25 / 34

  28. Cardinalities of subsets of SAWs FEMTO-ST Institute 26 / 34

  29. Cardinalities of subsets of SAWs FEMTO-ST Institute 27 / 34

  30. Case of triangular SAWs FEMTO-ST Institute 28 / 34

  31. Vien diagrams for some G n 1 f SAW(n) fSAW(n) nfSAW(n) 1 fSAW(n) = f SAW(n) G n for n � 22 Diagram of G n for n = 108 FEMTO-ST Institute 29 / 34

  32. Conclusion FEMTO-ST Institute 30 / 34

  33. All walks are interesting ? Protein synthesis Intrinsically complicated prot. FEMTO-ST Institute 31 / 34

  34. Some open questions 1. Did these walks constitute an exponentially small subset of SAWs ? 2. The PSP problem still remains NP-complete in fSAW n ? 3. For any dimension d , do we have the existence of n ∈ N ∗ such that fSAW d n � G d n ? 4. fSAW 2 2 and fSAW 2 3 are Hamiltonian graphs, but they are not Eulerian. What about fSAW k n ? 5. is there an unfoldable walk in Z 3 ? 6. Are the connected components of G d n convex ? 7. ... FEMTO-ST Institute 32 / 34

  35. Other open questions • Monte-Carlo approach ? • Genetic algorithm approach ? • Dynamic programming ? • Pivot algorithm ? • Forbidden patterns ? FEMTO-ST Institute 33 / 34

  36. Thank you! Any question/suggestion/idea ? christophe.guyeux@univ-fcomte.fr FEMTO-ST Institute 34 / 34

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend