Algorithms for Computing the Longest Parameterized Common - PowerPoint PPT Presentation

Algorithms for Computing the Longest Parameterized Common Subsequence Costas S. Iliopoulos 1 , Marcin Kubica 2 , M. Sohel Rahman 1 and Tomasz Waleń 2 1 Algorithm Design Group Department of Computer Science, Kings College London 2 Faculty of Mathematics, Informatics and Applied Mathematics Warsaw University, Poland CPM, 2007-07-11

The LPCS Problem The LPCS ( longest parameterized common subsequence ) problem is a generalization of a well known LCS problem, containing gap-constraints . Definition In LPCS ( X , Y , K 1 , K 2 , D ) we look for such longest increasing sequences of indices P [ 1 , .., l ] and Q [ 1 , .., l ] , that: X [ P [ i ]] = Y [ P [ i ]] Common subsequence. K 1 ≤ P [ i + 1 ] − P [ i ] , Q [ i + 1 ] − Q [ i ] ≤ K 2 Gaps between consecutive matches are not shorter than K 1 and not longer than K 2 . | ( P [ i + 1 ] − P [ i ]) − ( Q [ i + 1 ] − Q [ i ]) | ≤ D The corresponding gaps in both sequences cannot differ more than D .

The LCS and LPCS Problems LCS a x x c b d e a b d c o o e LPCS , K 1 = 1, K 2 = 3, D = 1 a x x c b d e a b d c o o e

The LCS and LPCS Problems X i Y j ( i, j ) LCS ( i , j ) = 1 + max { LCS ( x , y ) : 1 ≤ x < i , 1 ≤ y < j }

The LCS and LPCS Problems X i Y D j ( i, j ) K 1 K 2 � PLCS ( x , y ) � : K 1 ≤ i − x , j − y ≤ K 2 , PLCS ( i , j ) = 1 + max | ( i − x ) − ( j − y ) | ≤ D

The FIG , ELAG , RIFIG and RELAG problems The LPCS problem is a generalization of four problems introduced by C. S. Iliopoulos and M. S. Rahman (ISAAC 2006): Definition FIG ( X , Y , K ) = LPCS ( X , Y , 1 , K , n ) LCS problem with fixed gaps. ELAG ( X , Y , K 1 , K 2 ) = LPCS ( X , Y , K 1 , K 2 , n ) LCS problem with elastic gaps. RIFIG ( X , Y , K ) = LPCS ( X , Y , 1 , K , 0 ) LCS problem with rigid fixed gaps. RELAG ( X , Y , K 1 , K 2 ) = LPCS ( X , Y , K 1 , K 2 , 0 ) LCS problem with rigid elastic gaps.

The LPCS Problem X i Y j ( i, j ) K FIG ( i , j ) = 1 + max { FIG ( x , y ) : i − x , j − y ≤ K }

The LPCS Problem X i Y j ( i, j ) K 1 K 2 ELAG ( i , j ) = 1 + max { ELAG ( x , y ) : K 1 ≤ i − x , j − y ≤ K 2 }

The LPCS Problem X i Y j ( i, j ) K RIFIG ( i , j ) = 1 + max { RIFIG ( x , y ) : i − x = j − y ≤ K }

The LPCS Problem X i Y j ( i, j ) K 1 K 2 RELAG ( i , j ) = 1 + max { RELAG ( x , y ) : K 1 ≤ i − x = j − y ≤ K 2 }

Previous Results Summary of previously known results PROBLEM Previous Results Our Results LPCS − O ( n 2 + R log log n ) O ( min ( n 2 , n + R log n )) FIG O ( n 2 + R log log n ) ELAG O ( n 2 ) RIFIG O ( n + R ) O ( n 2 + R ( K 2 − K 1 )) RELAG Where R is the total number of matches.

Max-queue data structure The max-queue data structure can be used to calculate maximum of last L elements inserted into the queue. Operations init ( Q , L ) — initialize and set the history length insert ( Q , x ) max ( Q ) — returns maximum of the last L inserted elements. All operations run in O ( 1 ) (amortized) time.

Max-queue data structure example Example For the sequence ( 1 , 7 , 5 , 2 , 6 , 3 , 1 ) and L = 4 1 7 5 2 6 3 1 5 ↑ MaxQueue = ( 1 )

Max-queue data structure example Example For the sequence ( 1 , 7 , 5 , 2 , 6 , 3 , 1 ) and L = 4 1 7 5 2 6 3 1 5 ↑ MaxQueue = ( 7 )

Max-queue data structure example Example For the sequence ( 1 , 7 , 5 , 2 , 6 , 3 , 1 ) and L = 4 1 7 5 2 6 3 1 5 ↑ MaxQueue = ( 7 , 5 )

Max-queue data structure example Example For the sequence ( 1 , 7 , 5 , 2 , 6 , 3 , 1 ) and L = 4 1 7 5 2 6 3 1 5 ↑ MaxQueue = ( 7 , 5 , 2 )

Max-queue data structure example Example For the sequence ( 1 , 7 , 5 , 2 , 6 , 3 , 1 ) and L = 4 1 7 5 2 6 3 1 5 ↑ MaxQueue = ( 6 , 3 , 1 )

The Algorithm for LPCS Algorithm X i Dynamic programming. Three-level Max-queue. Time complexity O ( n 2 ) . Y j ( i, j )

The Algorithm for FIG and ELAG Algorithm For R = o ( n 2 / log n ) : Dynamic programming. Using dictionary data-structure providing: insertion, removal and max-range queries. O ( R ) steps (for matches only), each in O ( log n ) time. Time complexity: O ( n + R log n ) . Can be extended to solve LPCS in O ( n + R · log n ) time.

The Algorithm for RIFIG and RELAG . Algorithm Dynamic programming. Each diagonal is processed separately. O ( R ) steps (for matches only). Each step in O ( 1 ) amortised time (using max-queue). Time complexity: O ( n + R ) .

Conclusions Conclusions New problem LPCS which generalizes FIG , ELAG , RIFIG , RELAG . Simplified and faster, the O ( n 2 ) algorithm for LPCS problem. The O ( n + R log n ) algorithm for ELAG and LPCS problem. The O ( n + R ) algorithm for RIFIG , RELAG .

The End Thank you for your attention!

Algorithms for Computing the Longest Parameterized Common - PowerPoint PPT Presentation

Algorithms for Computing the Longest Parameterized Common Subsequence Costas S. Iliopoulos 1 , Marcin Kubica 2 , M. Sohel Rahman 1 and Tomasz Wale 2 1 Algorithm Design Group Department of Computer Science, Kings College London 2 Faculty of

CSE 421 Longest Path in a DAG, LIS, Shortest Path with Negative Weights Shayan Oveis Gharan 1

Parameterized Power Vertex Cover Eric Angel, Evripidis Bampis, Bruno Escoffier, Michael Lampis

Parameterized Streaming Algorithms Graham Cormode Rajesh Chitnis Parameterized Streaming

Fast Parallel Longest Common Subsequence with General Integer Scoring Support Adnan Ozsoy , Arun

Parameterized Complexity of Integer Linear Programming (ILP) Sebastian Ordyniak Parameterized

On the Length of the Longest Common Subsequence Peter Rabinovitch Summary Consider two

Exact Crossing Number Exact Crossing Number Parameterized by Vertex Cover Parameterized by

Parameterized graph separation problems D aniel Marx Budapest University of Technology and

Geometrically Parameterized Interconnect Geometrically Parameterized Interconnect Performance

Computable Real Functions Parameterized Uniform Parameterized Uniform From NP -hard to polytime

Important separators and parameterized algorithms Dniel Marx 1 1 Institute for Computer Science

Exact Rank Aggregation with Parameterized Algorithms Robert Bredereck

Important separators and parameterized algorithms Dniel Marx Humboldt-Universitt zu Berlin,

Important separators and parameterized algorithms Dniel Marx 1 1 Institute for Computer Science

Important separators and parameterized algorithms Dniel Marx 1 1 Institute for Computer Science

Faster Algorithms for Computing Longest Common Increasing Subsequences Gerth Stlting Brodal

Counting CS1200, CSE IIT Madras Meghana Nasre March 26, 2020 CS1200, CSE IIT Madras Meghana

Lecture 7: Practice with Strings Predicting operations on s What does s equal after the following

Backtracking and Memoization Lecture 12 Tuesday, February 26, 2019 L A T EXed: December 27,

Universal locally finite maximally homogeneous semigroups Robert D. Gray 1 (joint work with I.

Applying Subsequence Matching to Collaborative Filtering Alejandro Bellog n Pablo S

Efficient List-based Computation of the String Subsequence Kernel Slimane Bellaouar 1 Hadda

W4231: Analysis of Algorithms Shortest Path with Budget Constraint 11/16/99 (revised 11/18/99) We

Mining Life Event Sequences Gilbert Ritschard and Matthias Studer NCCR LIVES and Institute for