Title
A Partitional Approach for Genomic-Data Clustering Combined with K-Means Algorithm
Abstract
Bioinformatics is the science of managing, analyzing, extracting, and interpreting information from biological sequences and molecules. Recent advances in microarray technology allow the expression levels of a large number of genes to be monitored simultaneously across different experimental conditions. Faced with this volume of data, biologists cannot rely on the traditional techniques of biology alone; information technologies are needed. Cluster analysis is of considerable interest and importance in bioinformatics, whether it clusters genes or experimental conditions (samples). Clustering genes identifies groups of genes with similar expression patterns, which helps answer how gene expression is affected by various diseases and which genes are responsible for specific diseases. Clustering samples organizes the samples into intrinsic clusters such that samples with high similarity belong to the same cluster. This clustering assists in diagnosing the disease condition and reveals the effect of a given treatment on genes. To cluster the large amount of gathered gene expression data, we propose a new partitional clustering approach combined with the K-Means algorithm. The approach is compared with both K-Means and the approach before combination. The results, in terms of internal and external performance measures on a set of genomic benchmarks, show the correctness and competence of the proposed approach.
Year
2016
DOI
10.1109/CSE-EUC-DCABES.2016.170
Venue
2016 IEEE Intl Conference on Computational Science and Engineering (CSE) and IEEE Intl Conference on Embedded and Ubiquitous Computing (EUC) and 15th Intl Symposium on Distributed Computing and Applications for Business Engineering (DCABES)
Keywords
Bioinformatics, microarray data-mining, gene expression data, diseases, clustering, K-Means algorithm
Field
Data mining, k-means clustering, Clustering high-dimensional data, Correlation clustering, Computer science, Genomics, Consensus clustering, Gene chip analysis, Biclustering, Cluster analysis
DocType
Conference
ISBN
978-1-5090-3594-6
Citations
0
PageRank
0.34
References
1
Authors
4
Name, Order, Citations, PageRank
Billel Kenidra, 1, 0, 1.01
Mohamed Benmohammed, 2, 0, 0.34
Abdesselem Beghriche, 3, 2, 1.39
Zakaria Benmounah, 4, 1, 2.39