Title
A new protein motif extraction framework based on constrained co-clustering
Abstract
Signal finding (pattern discovery) in biological sequences is a fundamental problem in both computer science and molecular biology. Many approaches have been proposed for extracting interesting patterns (or motifs) from DNA/RNA and protein sequences. Some approaches are based on simple and multiple alignment techniques; some use biological knowledge and others do not. In this paper, we propose a de novo framework that performs motif identification and exploits a constrained co-clustering technique to simultaneously find associations between groups of protein sequences and groups of motifs. We show that the presented approach is able to group together protein sequences belonging to the same families and, at the same time, to provide a set of characterizing motifs.
Year: 2009
DOI: 10.1145/1529282.1529445
Venue: SAC
Keywords: multiple alignment technique, use biological knowledge, interesting pattern, fundamental problem, biological sequence, computer science, motifs identification, protein sequence, new protein motif extraction, molecular biology, co-clustering technique, multiple alignment, protein motif
Field: RNA, Computer science, DNA, Structural motif, Computational biology, Bioinformatics, Biclustering, Multiple sequence alignment, Consensus sequence, Multiple EM for Motif Elicitation, Sequence analysis
DocType: Conference
Citations: 2
PageRank: 0.38
References: 13
Authors: 3
Name, Order, Citations, PageRank
Francesca Cordero, 1, 63, 13.42
Alessia Visconti, 2, 8, 2.85
Marco Botta, 3, 284, 41.98