Abstract | ||
---|---|---|
In this paper, we show new parallel algorithms for a set of classical string comparison problems: computation of string alignments, longest common subsequences (LCS) or edit distances, and longest increasing subsequence computation. These problems have a wide range of applications, in particular in computational biology and signal processing. We discuss the scalability of our new parallel algorithms in computation time, in memory, and in communication. Our new algorithms are based on an efficient parallel method for (min,+)-multiplication of distance matrices. The core result of this paper is a scalable parallel algorithm for multiplying implicit simple unit-Monge matrices of size n x n on p processors using time O( n log n ‾ p). communication O(n log p) ‾ p) and O(log p) supersteps. This algorithm allows us to implement scalable LCS computation for two strings of length n using time O(n2 ‾ p) and communication O(n ‾ √ p), requiring local memory of size O(n ‾ √ p) on each processor. Furthermore, our algorithm can be used to obtain the first generally work-scalable algorithm for computing the longest increasing subsequence (LIS). Our algorithm for LIS computation requires computation O(n log2 n ‾ p), communication O(n log p)/ p), and O(log2 p) supersteps for computing the LIS of a sequence of length n. This is within a log n factor of work-optimality for the LIS problem, which can be solved sequentially in time O(n log n) in the comparison-based model. Our LIS algorithm is also within a log p-factor of achieving perfectly scalable communication and furthermore has perfectly scalable memory size requirements of O(n ‾ p) per processor. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1145/1810479.1810521 | SPAA |
Keywords | Field | DocType |
computation o,communication o,log2 n,efficient parallel string comparison,log n factor,length n,time o,n log n,new algorithm,size n,n log p,new parallel algorithm,edit distance,longest increasing subsequence,signal processing,computational biology,parallel algorithm,longest common subsequence | Binary logarithm,Longest increasing subsequence,Distance matrices in phylogeny,Parallel algorithm,Matrix (mathematics),Computer science,Parallel computing,Algorithm,Multiplication,Time complexity,Computation,Distributed computing | Conference |
Citations | PageRank | References |
4 | 0.38 | 18 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Peter Krusche | 1 | 25 | 2.83 |
Alexander Tiskin | 2 | 220 | 15.50 |