Abstract | ||
---|---|---|
In the sequencing process, reads of the sequence are generated, then assembled to form contigs. New technologies can produce reads faster with lower cost and higher coverage. However, these reads are shorter. With errors, short reads make the assembly step more difficult. Chaisson et al. (2004) proposed an algorithm to correct the reads prior to the assembly step. The result is not satisfactory when the error rate is high (e.g., ≥3%). We improve their approach to handle reads of higher error rates. Experimental results show that our approach is much more effective in correcting errors, producing contigs of higher quality. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1504/IJBRA.2009.024039 | IJBRA |
Keywords | Field | DocType |
dna sequencing,sequence assembly,bioinformatics,error rate,contigs,error correction | Data mining,Hybrid genome assembly,Biology,Word error rate,Error detection and correction,Contig,DNA sequencing,Bioinformatics,Genetics,Sequence assembly | Journal |
Volume | Issue | Citations |
5 | 2 | 2 |
PageRank | References | Authors |
0.54 | 2 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Thomas K. F. Wong | 1 | 89 | 9.80 |
Tak-Wah Lam | 2 | 1860 | 164.96 |
P. Y. Chan | 3 | 7 | 1.14 |
Siu-Ming Yiu | 4 | 48 | 12.43 |