Title
Biomarker discovery and redundancy reduction towards classification using a multi-factorial MALDI-TOF MS T2DM mouse model dataset.
Abstract
Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics.We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods.The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author.
Year
DOI
Venue
2011
10.1186/1471-2105-12-140
BMC Bioinformatics
Keywords
Field
DocType
algorithms,feature selection,analysis of variance,data analysis,complex data,microarrays,proteomics,bioinformatics,experimental design,biological process,maldi tof ms,global optimization
Matrix-assisted laser desorption/ionization,Feature selection,Proteomics,Biology,Factorial,Redundancy (engineering),Bioinformatics,Biomarker discovery,DNA microarray
Journal
Volume
Issue
ISSN
12
1
1471-2105
Citations 
PageRank 
References 
8
0.34
18
Authors
12
Name
Order
Citations
PageRank
Chris Bauer1130.73
Frank Kleinjung2130.73
Celia J. Smith3221.44
Mark W Towers480.34
Ali Tiss5281.98
Alexandra Chadt6130.73
Tanja Dreja7130.73
Dieter Beule8396.46
Hadi Al-Hasani9130.73
Knut Reinert101020105.87
Johannes Schuchhardt11528.38
Rainer Cramer12413.12