Title
A tool for aligning very similar DNA sequences.
Abstract
Results: We have produced a computer program, named sim3, that solves the following computational problem. Two DNA sequences are given, where the shorter sequence is very similar to some contiguous region of the longer sequence. Sim3 determines such a similar region of the longer sequence, and then computes an optimal set of single-nucleotide changes (i.e., insertions, deletions or substitutions) that will convert the shorter sequence to that region. Thus, the alignment scoring scheme is designed to model sequencing errors, rather than evolutionary processes. The program can align a 100 kb sequence to a 1 megabase sequence in a few seconds on a workstation, provided that there are very few differences between the shorter sequence and some region in the longer sequence. The program has been used to assemble sequence data for the Genomes Division at the National Center for Biotechnology Information. Availability: A version of sim3 for UNIX machines can be obtained by anonymous ftp from ncbi.nlm.nih.gov, in the pub/sim3 directory. Contact: For portable versions for Macs and PCs, contact zjing@sunset.nlm.nih.gov.
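The problem stated in the abstract (locate a region of the longer sequence and find a minimum set of single-nucleotide insertions, deletions, and substitutions converting the shorter sequence to it) can be illustrated with a small sketch. This is not sim3's algorithm, which the abstract does not describe; it is a plain quadratic-time dynamic program with unit edit costs, suitable only for short inputs. The function name `align_to_region` and the edit-tuple layout are hypothetical, chosen for illustration.

```python
def align_to_region(short, long):
    """Find a region long[start:end] minimizing unit-cost edit distance to `short`.

    Returns (distance, start, end, edits). Each edit is a tuple
    (kind, position_in_short, base_in_short, base_in_long).
    Illustrative O(len(short) * len(long)) sketch, not sim3's method.
    """
    m, n = len(short), len(long)
    # D[i][j]: fewest edits converting short[:i] into some substring of long
    # that ends at position j. Row 0 is all zeros, so the region may start anywhere.
    D = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        D[i][0] = i  # nothing of long consumed: delete every base of short[:i]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            diag = D[i - 1][j - 1] + (short[i - 1] != long[j - 1])
            delete = D[i - 1][j] + 1   # drop short[i-1]
            insert = D[i][j - 1] + 1   # add long[j-1]
            D[i][j] = min(diag, delete, insert)

    # The region may end anywhere: take the cheapest cell in the last row.
    end = min(range(n + 1), key=lambda j: D[m][j])

    # Trace back to recover the edits and the start of the region.
    edits = []
    i, j = m, end
    while i > 0:
        if j > 0 and D[i][j] == D[i - 1][j - 1] + (short[i - 1] != long[j - 1]):
            if short[i - 1] != long[j - 1]:
                edits.append(("substitute", i - 1, short[i - 1], long[j - 1]))
            i, j = i - 1, j - 1
        elif D[i][j] == D[i - 1][j] + 1:
            edits.append(("delete", i - 1, short[i - 1], ""))
            i -= 1
        else:
            edits.append(("insert", i, "", long[j - 1]))
            j -= 1
    edits.reverse()
    return D[m][end], j, end, edits


if __name__ == "__main__":
    distance, start, end, edits = align_to_region("ACGT", "TTAGGTTT")
    print(distance, start, end, edits)
```

Leaving the first row at zero and minimizing over the last row is what lets the shorter sequence match a contiguous region rather than the whole longer sequence. A full table is impractical at the 100 kb by 1 megabase scale quoted in the abstract, so a practical tool would need a more economical strategy when differences are few.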
Year: 1997
DOI: 10.1093/bioinformatics/13.1.75
Venue: COMPUTER APPLICATIONS IN THE BIOSCIENCES
Keywords: dna sequence
Field: Genome, Sequence alignment, Computational problem, Nucleic acid sequence, Directory, Computer science, Unix, Workstation, DNA sequencing, Bioinformatics
DocType: Journal
Volume: 13
Issue: 1
ISSN: 0266-7061
Citations: 8
PageRank: 7.56
References: 11
Authors: 4
Name
Order
Citations
PageRank
Kun-mao Chao183894.05
Jinghui Zhang22715.36
James Ostell31566617.35
W Miller41301295.71