Title | ||
---|---|---|
Mining Associations For Organism Characteristics In Prokaryotes - An Integrative Approach |
Abstract | ||
---|---|---|
Correlations and associations between specific organism characteristics (such as genome size, genome GC content, optimal growth temperature, habitat, oxygen requirements) may provide for deeper comprehension of evolutionary processes as well as for some prediction possibilities, e.g., trends prediction of some pandemia. There is a plenty of genotype data and gene sequences for different organisms, which is usually well structured and deposited into databases. On the other side, data on phenotypic characteristics of organisms are often scattered across different text documents, e.g., scientific papers or encyclopedias. We reconsider correlations between organism characteristics for superkingdoms Archaea and Bacteria and extend the study in a number of ways. We use a larger dataset of prokaryotes as well as a larger set of characteristics by integrating several existing databases with data obtained by literature mining. We recalculate some high-expectation correlations between genomic characteristics (genome size, GC content, distribution among functional groups of proteins) and apply algorithms for association rule mining in order to identify the most confident associations between specific modalities of both genotype and phenotype characteristics. |
Year | Venue | Keywords |
---|---|---|
2014 | PROCEEDINGS IWBBIO 2014: INTERNATIONAL WORK-CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1 AND 2 | association rule mining, prokaryotes, genotype characteristics, phenotype characteristics |
DocType | Citations | PageRank |
Conference | 0 | 0.34 |
References | Authors | |
10 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Gordana Pavlovic-Lazetic | 1 | 35 | 7.82 |
Vesna Pajic | 2 | 3 | 3.18 |
Nenad S. Mitic | 3 | 26 | 3.39 |
Jovana J. Kovacevic | 4 | 12 | 2.15 |
Milos V. Beljanski | 5 | 26 | 3.05 |