Title
Machine Learning Based Metagenomic Prediction of Inflammatory Bowel Disease.
Abstract
In this study, we investigate faecal microbiota composition, in an attempt to evaluate performance of classification algorithms in identifying Inflammatory Bowel Disease (IBD) and its two types: Crohn's disease (CD) and ulcerative colitis (UC). From many investigated algorithms, a random forest (RF) classifier was selected for detailed evaluation in three-class (CD versus UC versus nonIBD) classification task and two binary (nonIBD versus IBD and CD versus UC) classification tasks. We dealt with class imbalance, performed extensive parameter search, dimensionality reduction and two-level classification. In three-class classification, our best model reaches F1 score of 91% in average, which confirms the strong connection of IBD and gastrointestinal microbiome. Among most important features in three-class classification are species Staphylococcus hominis, Porphyromonas endodontalis, Slackia piriformis and genus Bacteroidetes.
Year
DOI
Venue
2021
10.3233/SHTI210591
pHealth
Keywords
DocType
Volume
feature selection,imbalance,machine learning,microbiome
Conference
285
ISSN
Citations 
PageRank 
1879-8365
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Andrea Mihajlović100.34
Katarina Mladenović200.34
Tatjana Loncar-Turukalo3135.37
Sanja Brdar412.71