Title
Fast-Find: a novel computational approach to analyzing combinatorial motifs.
Abstract
Many vital biological processes, including transcription and splicing, require a combination of short, degenerate sequence patterns, or motifs, adjacent to defined sequence features. Although these motifs occur frequently by chance, they only have biological meaning within a specific context. Identifying transcripts that contain meaningful combinations of patterns is thus an important problem, which existing tools address poorly.Here we present a new approach, Fast-FIND (Fast-Fully Indexed Nucleotide Database), that uses a relational database to support rapid indexed searches for arbitrary combinations of patterns defined either by sequence or composition. Fast-FIND is easy to implement, takes less than a second to search the entire Drosophila genome sequence for arbitrary patterns adjacent to sites of alternative polyadenylation, and is sufficiently fast to allow sensitivity analysis on the patterns. We have applied this approach to identify transcripts that contain combinations of sequence motifs for RNA-binding proteins that may regulate alternative polyadenylation.Fast-FIND provides an efficient way to identify transcripts that are potentially regulated via alternative polyadenylation. We have used it to generate hypotheses about interactions between specific polyadenylation factors, which we will test experimentally.
Year
DOI
Venue
2006
10.1186/1471-2105-7-1
BMC Bioinformatics
Keywords
Field
DocType
microarrays,genome sequence,genome,protein binding,relational database,sequence motif,binding sites,sensitivity analysis,expressed sequence tags,polyadenylation,rna,internet,bioinformatics,algorithms,indexation,rna binding protein,rna splicing,biological process,computational biology,rna binding proteins,nucleotides
Genome,Expressed sequence tag,Biology,Amino Acid Motifs,Suffix array,RNA-binding protein,RNA splicing,Suffix tree,Bioinformatics,Genetics,DNA microarray
Journal
Volume
Issue
ISSN
7
1
1471-2105
Citations 
PageRank 
References 
76
2.58
2
Authors
4
Name
Order
Citations
PageRank
Micah Hamady11385.80
Erin Peden2762.58
Rob Knight336626.19
Ravinder Singh4762.58