Abstract | ||
---|---|---|
One of the fundamental components of large-scale gene discovery projects is the clustering of expressed sequence tags (ESTs) from complementary DNA (cDNA) clone libraries. Clustering is used to create non-redundant catalogs and indices of these sequences. In particular, clustering of ESTs is frequently used to estimate the number of genes derived from cDNA-based gene discovery efforts. This paper presents a novel parallel extension to an EST clustering program, UIcluster4, that incorporates alternative splicing information and a new parallelization strategy. The results are compared to other parallelized EST clustering systems in terms of overall processing time and the accuracy of the resulting clustering. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1016/j.future.2004.05.014 | Future Generation Comp. Syst. |
Keywords | Field | DocType |
cdna-based gene discovery effort,genome analysis,performance measurement,gene transcript clustering,parallel algorithms,parallel approach,parallelized est,mrna clustering,resulting clustering,new parallelization strategy,large-scale gene discovery project,clone library,complementary dna,bioinformatics,est clustering program,alternative splicing information,fundamental component,expressed sequence tag,gene transcription,alternative splicing,parallel algorithm | Complementary DNA,Expressed sequence tag,Gene,Parallel algorithm,Computer science,Alternative splicing,Gene Discovery,Bioinformatics,Computational biology,Cluster analysis,Distributed computing | Journal |
Volume | Issue | ISSN |
21 | 5 | 0167-739X |
Citations | PageRank | References |
4 | 0.68 | 3 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Todd E. Scheetz | 1 | 36 | 13.78 |
Nishank Trivedi | 2 | 176 | 15.98 |
Kevin T. Pedretti | 3 | 196 | 21.20 |
Terry A. Braun | 4 | 23 | 9.86 |
Thomas L. Casavant | 5 | 256 | 46.65 |