Title
Predicting Molecular Functions in Plants using Wavelet-based Motifs.
Abstract
Predicting molecular functions of proteins is a fundamental challenge in bioinformatics. Commonly used algorithms are based on sequence alignments and fail when the training sequences have low percentages of identity with query proteins, as it is the case for non-model organisms such as land plants. On the other hand, machine learning-based algorithms offer a good alternative for prediction, but most of them ignore that molecular functions are conditioned by functional domains instead of global features of the whole sequence. This work presents a novel application of the Wavelet Transform in order to detect discriminant sub-sequences (motifs) and use them as input for a pattern recognition classifier. The results show that the continuous wavelet transform is a suitable tool for the identification and characterization of motifs. Also, the proposed classification methodology shows good prediction capabilities for datasets with low percentage of identity among sequences, outperforming BLAST2GO on about 11,5% and PEPSTATS-SVMon 16,4%. Plus, it offers major interpretability of the obtained results.
Year
Venue
Keywords
2013
BIOINFORMATICS 2013: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS
Amino Acid Properties,Dissimilarity based Classification,Molecular Function,Motifs,Wavelet Transform
Field
DocType
Citations 
Data mining,Computer science,Artificial intelligence,Machine learning,Wavelet
Conference
0
PageRank 
References 
Authors
0.34
0
5