Abstract | ||
---|---|---|
Data mining finds many uses in biotechnology and one of them may be to analyze multi-platform data in order to allow searching for genomic cancer signatures. The importance of the topic arises as nowadays cancer is noted one of the leading causes of deaths in highly developed countries. The goal of this work was to search for colorectal cancer signatures, consisting of somatic mutations, somatic gene copy number alterations (SCNAs) as well as abnormal expression levels. After acquiring mutation, SCNA and expression data from cBioPortal, frequent itemset mining was performed using basket analysis and apriori algorithm. We also performed survival analysis of colorectal cancer patients using the discovered signatures as differentiating factor for Kaplan-Meier curve comparison. Frequent itemset mining returned modifications of genes that can be regarded as potential colorectal cancer signatures or signatures of carcinogenic processes in general. While methods used in the project consisted of use of simple or even basic tools, the results suggest that searching for cancer signatures amidst multi-platform data may be worth developing and improving. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1007/978-3-319-67792-7_16 | MAN-MACHINE INTERACTIONS 5, ICMMI 2017 |
Keywords | DocType | Volume |
Data mining,Colorectal cancer,Survival analysis,Apriori,Basket analysis | Conference | 659 |
ISSN | Citations | PageRank |
2194-5357 | 0 | 0.34 |
References | Authors | |
1 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Marta Micek | 1 | 0 | 0.34 |
Marcin Pacholczyk | 2 | 1 | 1.64 |