Title
Approximate clustering of incomplete fingerprints
Abstract
We study the problem of clustering fingerprints with at most p missing values (CMV(p) for short) naturally arising in oligonucleotide fingerprinting, which is an efficient method for characterizing DNA clone libraries. We show that already CMV(2) is NP-hard. We also show that a greedy algorithm yields a min(1+lnn,2+plnl) approximation for CMV(p), and can be implemented to run in O(nl2^p) time. We also introduce other variants of the problem of clustering incomplete fingerprints based on slightly different optimization criteria and show that they can be approximated in polynomial time with ratios 2^2^p^-^1 and 2(1-12^2^p), respectively.
Year
DOI
Venue
2008
10.1016/j.jda.2007.01.004
J. Discrete Algorithms
Keywords
Field
DocType
p missing value,oligonucleotide fingerprinting,greedy algorithm yield,different optimization criterion,approximation algorithms,approximate clustering,clustering fingerprint,efficient method,polynomial time,clustering,dna clone library,incomplete fingerprint,np-hardness,missing values,greedy algorithm
Approximation algorithm,Discrete mathematics,Combinatorics,Greedy algorithm,Missing data,Time complexity,Cluster analysis,Mathematics
Journal
Volume
Issue
ISSN
6
1
Journal of Discrete Algorithms
Citations 
PageRank 
References 
1
0.39
2
Authors
6
Name
Order
Citations
PageRank
A. Figueroa110.39
A. Goldstein210.39
Tao Jiang31809155.32
M. Kurowski410.39
A. Lingas525045.63
M. Persson610.39