Reversal Distances for Strings with Few Blocks or Small Alphabets - - PowerPoint PPT Presentation

reversal distances for strings with few blocks or small
SMART_READER_LITE
LIVE PREVIEW

Reversal Distances for Strings with Few Blocks or Small Alphabets - - PowerPoint PPT Presentation

Introduction First results Fixed-parameter tractability Conclusion Reversal Distances for Strings with Few Blocks or Small Alphabets Laurent Bulteau, Guillaume Fertin, Christian Komusiewicz Technische Universitt Berlin, Germany Universit


slide-1
SLIDE 1

Introduction First results Fixed-parameter tractability Conclusion

Reversal Distances for Strings with Few Blocks or Small Alphabets

Laurent Bulteau, Guillaume Fertin, Christian Komusiewicz Technische Universität Berlin, Germany Université de Nantes, France

Supported by the Alexander von Humboldt Fondation, Bonn, Germany, and by Région Pays de la Loire, France

slide-2
SLIDE 2

Introduction First results Fixed-parameter tractability Conclusion

Introduction

CPM, 2014-06-18

  • L. Bulteau

2/17

slide-3
SLIDE 3

Introduction First results Fixed-parameter tractability Conclusion

Context

Comparative genomics challenges:

Understand history of genes Compute evolutionary tree Track small- or large-scale evolution events Understand interactions between genes and/or proteins etc.

Genomes get rearranged progressively during evolution: cuts, duplications, reversals, etc.

CPM, 2014-06-18

  • L. Bulteau

3/17

slide-4
SLIDE 4

Introduction First results Fixed-parameter tractability Conclusion

Context

Comparative genomics challenges:

Understand history of genes Compute evolutionary tree Track small- or large-scale evolution events Understand interactions between genes and/or proteins etc.

Genomes get rearranged progressively during evolution: cuts, duplications, reversals, etc.

CPM, 2014-06-18

  • L. Bulteau

3/17

slide-5
SLIDE 5

Introduction First results Fixed-parameter tractability Conclusion

Reversal Distance

Reversal: genome rearrangement Reversal Distance: minimum number of reversals to go from

  • ne genome to an other

CPM, 2014-06-18

  • L. Bulteau

4/17

slide-6
SLIDE 6

Introduction First results Fixed-parameter tractability Conclusion

Reversal Distance

Reversal: genome rearrangement Reversal Distance: minimum number of reversals to go from

  • ne genome to an other

CPM, 2014-06-18

  • L. Bulteau

4/17

slide-7
SLIDE 7

Introduction First results Fixed-parameter tractability Conclusion

Variants

Signed Reversals: Prefix Reversals:

CPM, 2014-06-18

  • L. Bulteau

5/17

slide-8
SLIDE 8

Introduction First results Fixed-parameter tractability Conclusion

Blocks

Block: maximum substring using only 1 letter Block in signed strings: strict ⇒ common sign free ⇒ may have different signs a a b a b b b c c a a c a a c

CPM, 2014-06-18

  • L. Bulteau

6/17

slide-9
SLIDE 9

Introduction First results Fixed-parameter tractability Conclusion

Blocks

Block: maximum substring using only 1 letter Block in signed strings: strict ⇒ common sign free ⇒ may have different signs a a b a b b b c c a a c a a c

CPM, 2014-06-18

  • L. Bulteau

6/17

slide-10
SLIDE 10

Introduction First results Fixed-parameter tractability Conclusion

Blocks

Block: maximum substring using only 1 letter Block in signed strings: strict ⇒ common sign free ⇒ may have different signs +a

  • a

+b +a +b +b

  • b

+b

  • b
  • b

+a

CPM, 2014-06-18

  • L. Bulteau

6/17

slide-11
SLIDE 11

Introduction First results Fixed-parameter tractability Conclusion

Blocks

Block: maximum substring using only 1 letter Block in signed strings: strict ⇒ common sign free ⇒ may have different signs +a

  • a

+b +a +b +b

  • b

+b

  • b
  • b

+a

CPM, 2014-06-18

  • L. Bulteau

6/17

slide-12
SLIDE 12

Introduction First results Fixed-parameter tractability Conclusion

Blocks

Block: maximum substring using only 1 letter Block in signed strings: strict ⇒ common sign free ⇒ may have different signs +a

  • a

+b +a +b +b

  • b

+b

  • b
  • b

+a

CPM, 2014-06-18

  • L. Bulteau

6/17

slide-13
SLIDE 13

Introduction First results Fixed-parameter tractability Conclusion

Blocks

Block: maximum substring using only 1 letter Block in signed strings: strict ⇒ common sign free ⇒ may have different signs +a

  • a

+b +a +b +b

  • b

+b

  • b
  • b

+a

Parameters

b: maximum number of blocks in each input string |Σ|: alphabet size, |Σ| < b.

CPM, 2014-06-18

  • L. Bulteau

6/17

slide-14
SLIDE 14

Introduction First results Fixed-parameter tractability Conclusion

Blocks

Block: maximum substring using only 1 letter Block in signed strings: strict ⇒ common sign free ⇒ may have different signs +a

  • a

+b +a +b +b

  • b

+b

  • b
  • b

+a

Parameters

b: maximum number of blocks in each input string |Σ|: alphabet size, |Σ| < b.

CPM, 2014-06-18

  • L. Bulteau

6/17

slide-15
SLIDE 15

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

alphabet

regular

NP-hard

signed

P

prefix signed prefix state of the art

  • ur contribution

[Bafna, Pevzner ’96], [Christie ’98]

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-16
SLIDE 16

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

alphabet

regular

NP-hard

signed

P

prefix

NP-hard

signed prefix

?

state of the art

  • ur contribution

[Bulteau, Fertin, Rusu ’11]

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-17
SLIDE 17

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

alphabet

regular

NP-hard NP-hard

signed

P NP-hard

prefix

NP-hard NP-hard

signed prefix

? NP-hard

state of the art

  • ur contribution

[Christie ’98] [Radcliffe, Scott, Wilmer ’06]

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-18
SLIDE 18

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

alphabet

regular

NP-hard NP-hard FPT

signed

P NP-hard

prefix

NP-hard NP-hard FPT

signed prefix

? NP-hard

state of the art

  • ur contribution

FPT algorithm: O((6b)2bn) (constant |Σ|, reversal distance) bO(b)n (other cases)

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-19
SLIDE 19

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

(strict) (free) alphabet

regular

NP-hard NP-hard FPT

signed

P NP-hard FPT

prefix

NP-hard NP-hard FPT

signed prefix

? NP-hard FPT

state of the art

  • ur contribution

FPT algorithm: O((6b)2bn) (constant |Σ|, reversal distance) bO(b)n (other cases)

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-20
SLIDE 20

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

(strict) (free) alphabet

regular

NP-hard NP-hard FPT

signed

P NP-hard FPT NP-hard

prefix

NP-hard NP-hard FPT

signed prefix

? NP-hard FPT NP-hard

state of the art

  • ur contribution

FPT algorithm: O((6b)2bn) (constant |Σ|, reversal distance) bO(b)n (other cases) NP-hardness: even with b = |Σ| = 1 (unary alphabet)

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-21
SLIDE 21

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

(strict) (free) alphabet

regular

NP-hard NP-hard FPT Exp.

signed

P NP-hard FPT NP-hard Exp.

prefix

NP-hard NP-hard FPT Exp.

signed prefix

? NP-hard FPT NP-hard Exp.

state of the art

  • ur contribution

FPT algorithm: O((6b)2bn) (constant |Σ|, reversal distance) bO(b)n (other cases) NP-hardness: even with b = |Σ| = 1 (unary alphabet) Exact algorithm: |Σ|npoly(n) (unsigned) (2|Σ|)npoly(n) (signed)

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-22
SLIDE 22

Introduction First results Fixed-parameter tractability Conclusion

Complexity of computing the reversal distance

input /

permutations strings strings with few blocks small

reversals

(strict) (free) alphabet

regular

NP-hard NP-hard FPT Exp.

signed

P NP-hard FPT NP-hard Exp.

prefix

NP-hard NP-hard FPT Exp.

signed prefix

? NP-hard FPT NP-hard Exp.

state of the art

  • ur contribution

FPT algorithm: O((6b)2bn) (constant |Σ|, reversal distance) bO(b)n (other cases) NP-hardness: even with b = |Σ| = 1 (unary alphabet) Exact algorithm: |Σ|npoly(n) (unsigned) (2|Σ|)npoly(n) (signed)

CPM, 2014-06-18

  • L. Bulteau

7/17

slide-23
SLIDE 23

Introduction First results Fixed-parameter tractability Conclusion

First results

CPM, 2014-06-18

  • L. Bulteau

8/17

slide-24
SLIDE 24

Introduction First results Fixed-parameter tractability Conclusion

Preliminary result: diameter

The reversal distance between two strings with ≤ b blocks is upper-bounded by: 2b − |Σ|

CPM, 2014-06-18

  • L. Bulteau

9/17

slide-25
SLIDE 25

Introduction First results Fixed-parameter tractability Conclusion

Preliminary result: diameter

The reversal distance between two strings with ≤ b blocks is upper-bounded by: 2b − |Σ| → Obtained by: – grouping the blocks in each string into |Σ| bigger blocks (b − |Σ| reversals for each side), – sorting the middle strings (|Σ| reversals).

CPM, 2014-06-18

  • L. Bulteau

9/17

slide-26
SLIDE 26

Introduction First results Fixed-parameter tractability Conclusion

Small alphabet: exact algorithm

The Reversal Distance with alphabet Σ can be computed in time |Σ|npoly(n)

CPM, 2014-06-18

  • L. Bulteau

10/17

slide-27
SLIDE 27

Introduction First results Fixed-parameter tractability Conclusion

Small alphabet: exact algorithm

The Reversal Distance with alphabet Σ can be computed in time |Σ|npoly(n) → There are |Σ|n possible strings over alphabet Σ. Draw the Cayley graph: add an edge between any two strings which are 1 reversal apart. Then Reversal Distance amounts to finding a shortest path.

CPM, 2014-06-18

  • L. Bulteau

10/17

slide-28
SLIDE 28

Introduction First results Fixed-parameter tractability Conclusion

Unary alphabets: NP-hardness

It is NP-hard to compute the Signed Reversal Distance, even on unary strings

CPM, 2014-06-18

  • L. Bulteau

11/17

slide-29
SLIDE 29

Introduction First results Fixed-parameter tractability Conclusion

Unary alphabets: NP-hardness

It is NP-hard to compute the Signed Reversal Distance, even on unary strings → From reversal distance on permutations . . . 5 8 2 3 1 6 4 . . . ↓ S3 = + − + + + + − − − − + −

CPM, 2014-06-18

  • L. Bulteau

11/17

slide-30
SLIDE 30

Introduction First results Fixed-parameter tractability Conclusion

Unary alphabets: NP-hardness

It is NP-hard to compute the Signed Reversal Distance, even on unary strings → From reversal distance on permutations . . . 5 8 2 3 1 6 4 . . . ↓ S3 = + − + + + + − − − − + − . . . (S5)n (S8)n (S2)n (S3)n (S1)n (S6)n (S4)n . . .

CPM, 2014-06-18

  • L. Bulteau

11/17

slide-31
SLIDE 31

Introduction First results Fixed-parameter tractability Conclusion

Unary alphabets: NP-hardness

It is NP-hard to compute the Signed Reversal Distance, even on unary strings → From reversal distance on permutations . . . 5 8 2 3 1 6 4 . . . ↓ S3 = + − + + + + − − − − + − . . . (S5)n (S8)n (S2)n (S3)n (S1)n (S6)n (S4)n . . . Si = −Si (computing an unsigned distance) no long prefix of Si is a suffix of Sj (no overlapping possible) < n reversals (one Si remains intact in each block)

CPM, 2014-06-18

  • L. Bulteau

11/17

slide-32
SLIDE 32

Introduction First results Fixed-parameter tractability Conclusion

Fixed-parameter tractability

CPM, 2014-06-18

  • L. Bulteau

12/17

slide-33
SLIDE 33

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

a a b c c a a a b b a c b b b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-34
SLIDE 34

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

a a b c c a a a b b a c b b b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-35
SLIDE 35

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

b c a b a c b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-36
SLIDE 36

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

b c a b a c b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-37
SLIDE 37

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

b c a b a c b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-38
SLIDE 38

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

b c a b a c b a c b a b a c b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-39
SLIDE 39

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

b c a b a c b a c b a b a c b a c a b a b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-40
SLIDE 40

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • a

b c a b a c b a c b a b a c b a c a b a b a c a b a b

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-41
SLIDE 41

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • 3

6 2 6 1 4 a b c a b a c b a c b a b a c b a c a b a b a c a b a b

  • CPM, 2014-06-18
  • L. Bulteau

13/17

slide-42
SLIDE 42

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • 3

6 2 6 1 4

  • CPM, 2014-06-18
  • L. Bulteau

13/17

slide-43
SLIDE 43

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • 3

6 2 6 1 4

  • CPM, 2014-06-18
  • L. Bulteau

13/17

slide-44
SLIDE 44

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • 3

6 2 6 1 4

  • CPM, 2014-06-18
  • L. Bulteau

13/17

slide-45
SLIDE 45

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • 3

6 2 6 1

4

  • CPM, 2014-06-18
  • L. Bulteau

13/17

slide-46
SLIDE 46

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • 3

6 2 6 1

4

  • CPM, 2014-06-18
  • L. Bulteau

13/17

slide-47
SLIDE 47

Introduction First results Fixed-parameter tractability Conclusion

Merge blocks, keep only the structure Guess reversals endpoints, inside or between blocks, repeat until target structure is reached Create a graph from the blocks, add arcs keeping track of the movement of letters Compute a maximum flow, accept if the flow is n

  • 3

6 2

3 2 4 2 5 2 4 5 1 1 4 6 1

4

CPM, 2014-06-18

  • L. Bulteau

13/17

slide-48
SLIDE 48

Introduction First results Fixed-parameter tractability Conclusion

Analysis

Merging the blocks:

n (trivial)

Search-tree:

O(b) choices (cf. diameter) O(b) options at each choice (bounded number of blocks) bO(b) branches

Verification:

bO(1) (flow algorithm)

Variants: works for prefix and/or signed reversal distance

(with the strict definition of signed block)

CPM, 2014-06-18

  • L. Bulteau

14/17

slide-49
SLIDE 49

Introduction First results Fixed-parameter tractability Conclusion

Analysis

Merging the blocks:

n (trivial)

Search-tree:

O(b) choices (cf. diameter) O(b) options at each choice (bounded number of blocks) bO(b) branches

Verification:

bO(1) (flow algorithm)

Variants: works for prefix and/or signed reversal distance

(with the strict definition of signed block)

CPM, 2014-06-18

  • L. Bulteau

14/17

slide-50
SLIDE 50

Introduction First results Fixed-parameter tractability Conclusion

Analysis

Merging the blocks:

n (trivial)

Search-tree:

O(b) choices (cf. diameter) O(b) options at each choice (bounded number of blocks) bO(b) branches

Verification:

bO(1) (flow algorithm)

Variants: works for prefix and/or signed reversal distance

(with the strict definition of signed block)

CPM, 2014-06-18

  • L. Bulteau

14/17

slide-51
SLIDE 51

Introduction First results Fixed-parameter tractability Conclusion

Analysis

Merging the blocks:

n (trivial)

Search-tree:

O(b) choices (cf. diameter) O(b) options at each choice (bounded number of blocks) bO(b) branches

Verification:

bO(1) (flow algorithm)

Variants: works for prefix and/or signed reversal distance

(with the strict definition of signed block)

CPM, 2014-06-18

  • L. Bulteau

14/17

slide-52
SLIDE 52

Introduction First results Fixed-parameter tractability Conclusion

Analysis

Merging the blocks:

n (trivial)

Search-tree:

O(b) choices (cf. diameter) O(b) options at each choice (bounded number of blocks) bO(b) branches

Verification:

bO(1) (flow algorithm)

Variants: works for prefix and/or signed reversal distance

(with the strict definition of signed block)

CPM, 2014-06-18

  • L. Bulteau

14/17

slide-53
SLIDE 53

Introduction First results Fixed-parameter tractability Conclusion

Conclusion

CPM, 2014-06-18

  • L. Bulteau

15/17

slide-54
SLIDE 54

Introduction First results Fixed-parameter tractability Conclusion

String reversal distance with few blocks Fixed-Parameter Tractable Signed case: only tractable for the strictest definition Open questions Diameter of b − 1? (cf. diameter of n − 1 for permutations) Or, if not, diameter of b + O(|Σ|)? Polynomial kernel for parameter b? O((|Σ| − ε)n) algorithm?

CPM, 2014-06-18

  • L. Bulteau

16/17

slide-55
SLIDE 55

Introduction First results Fixed-parameter tractability Conclusion

Thank you!

CPM, 2014-06-18

  • L. Bulteau

17/17