Title
A new approach for the deep order preserving submatrix problem based on sequential pattern mining
Abstract
As an effective biclustering model, order-preserving submatrix (OPSM) has been widely applied to biological gene expression data mining, which can capture the general tendency of the gene expression under some experimental conditions. Recently, biologists hope to find deep OPSMs with long patterns and comparatively fewer support rows, which are important for the interpretation of gene regulatory networks. However, the traditional exact mining algorithms based on Apriori principle could not deal with the deep OPSM problem, which often take a large minimum support threshold for pattern pruning, and inevitably miss some significant deep OPSMs. In this paper, a new exact algorithm is proposed for mining deep OPSMs. Firstly all the common subsequences shared by every two rows are found out, then the row sets corresponding to the same common subsequences are formed. Finally all the deep OPSMs with support beyond the given threshold will be obtained. Experiments have been done in both real and synthetic data sets, and the results show that this new algorithm is capable of mining all the deep OPSMs over a small support. Under different thresholds of minimum support, this algorithm reveals better performance than the traditional sequential pattern mining algorithms.
Year
DOI
Venue
2018
10.1007/s13042-015-0384-z
Int. J. Machine Learning & Cybernetics
Keywords
Field
DocType
Biclustering, Deep order preserving submatrix, Common subsequence
Row,Data mining,Exact algorithm,A priori and a posteriori,Biclustering,Gene regulatory network,Sequential Pattern Mining,Synthetic data sets,Mathematics
Journal
Volume
Issue
ISSN
9
2
1868-808X
Citations 
PageRank 
References 
3
0.38
27
Authors
7
Name
Order
Citations
PageRank
Yun Xue1113.92
TieChen Li283.57
Zhiwen Liu3357.61
Chaoyi Pang431643.45
Meihang Li583.83
Zhengling Liao653.44
Xiao-Hui Hu7105.55