Title
Gene transcript clustering: a comparison of parallel approaches
Abstract
One of the fundamental components of large-scale gene discovery projects is the clustering of expressed sequence tags (ESTs) from complementary DNA (cDNA) clone libraries. Clustering is used to create non-redundant catalogs and indices of these sequences. In particular, clustering of ESTs is frequently used to estimate the number of genes derived from cDNA-based gene discovery efforts. This paper presents a novel parallel extension to an EST clustering program, UIcluster4, that incorporates alternative splicing information and a new parallelization strategy. The results are compared to other parallelized EST clustering systems in terms of overall processing time and accuracy of the resulting clustering.
Year
2005
DOI
10.1016/j.future.2004.05.014
Venue
Future Generation Comp. Syst.
Keywords
cdna-based gene discovery effort,genome analysis,performance measurement,gene transcript clustering,parallel algorithms,parallel approach,parallelized est,mrna clustering,resulting clustering,new parallelization strategy,large-scale gene discovery project,clone library,complementary dna,bioinformatics,est clustering program,alternative splicing information,fundamental component,expressed sequence tag,gene transcription,alternative splicing,parallel algorithm
Field
Complementary DNA,Expressed sequence tag,Gene,Parallel algorithm,Computer science,Alternative splicing,Gene Discovery,Bioinformatics,Computational biology,Cluster analysis,Distributed computing
DocType
Journal
Volume
Issue
ISSN
21
5
Future Generation Computer Systems
Citations
4
PageRank
0.68
References
3
Authors
5
Name
Order
Citations
PageRank
Todd E. Scheetz 1 3613.78
Nishank Trivedi 2 17615.98
Kevin T. Pedretti 3 19621.20
Terry A. Braun 4 239.86
Thomas L. Casavant 5 25646.65