Title
DECOD: fast and accurate discriminative DNA motif finding.
Abstract
Motivation: Motif discovery is now routinely used in high-throughput studies including large-scale sequencing and proteomics. These datasets present new challenges. The first is speed. Many motif discovery methods do not scale well to large datasets. Another issue is identifying discriminative rather than generative motifs. Such discriminative motifs are important for identifying co-factors and for explaining changes in behavior between different conditions. Results: To address these issues we developed a method for DECOnvolved Discriminative motif discovery (DECOD). DECOD uses a k-mer count table and so its running time is independent of the size of the input set. By deconvolving the k-mers DECOD considers context information without using the sequences directly. DECOD outperforms previous methods both in speed and in accuracy when using simulated and real biological benchmark data. We performed new binding experiments for p53 mutants and used DECOD to identify p53 co-factors, suggesting new mechanisms for p53 activation.
Year
DOI
Venue
2011
10.1093/bioinformatics/btr412
BIOINFORMATICS
Keywords
Field
DocType
algorithms,dna
Data mining,Source code,Computer science,Sequence motif,Nucleotide Motif,Artificial intelligence,Bioinformatics,Discriminative model,Machine learning
Journal
Volume
Issue
ISSN
27
17
1367-4803
Citations 
PageRank 
References 
5
0.50
11
Authors
9
Name
Order
Citations
PageRank
Peter Huggins1748.51
Shan Zhong2342.28
Idit Shiff350.50
Rachel Beckerman450.50
Oleg Laptenko550.50
Carol Prives650.50
Marcel H Schulz724024.03
Itamar Simon830229.59
Ziv Bar-Joseph91207112.00