Title
A Human DNA Methylation Site Predictor Based on SVM
Abstract
During gene expression, transcription factors are unable to bind to a transcription binding site (TFBS) involved in regulation if DNA methylation has occurred at the TFBS. Methyl-CpG-binding proteins may also occupy the TFBS and prevent the functioning of a transcription factor. Thus, the methylation status of CpG sites is an important issue when trying to understand gene regulation and shows strong correlation with the TFBS involved. In addition, CpG islands would seem to undergo cell-specific and tissue-specific methylation. Such differential methylation is presented at numerous genetic loci that are essential for development. Current DNA methylation site prediction tools need to be improved so that they include TFBS features and have greater accuracy in terms of the DNA region that is involved in methylation. We developed models that compare the differences across these regions and tissues. The TFBSs, DNA properties and DNA distribution were used as features for this classification. From the results, we found some TFBSs that were able to discriminate whether a sequence was methylated or not. The sensitivity, specificity and accuracy estimated using 10-fold cross validation were 90.8%, 80.54%, and 86.07%, respectively. Thus, for these four regions and twelve tissues, the performance levels (ACC) were all greater than 80%. We propose that the differential features or methylations vary between the different regions because the features common to each DNA region made up only 50% of the top 70 features. An online predictor based on EpiMeP is available at http://140.115.51.41/EpiMeP/. Supplementary file is available at http://140.115.51.41/EpiMeP/supplementary.doc.
Year
DOI
Venue
2009
10.1109/BIBE.2009.22
BIBE
Keywords
Field
DocType
cell-specific methylation,dna methylation,cellular biophysics,genetics,current dna methylation site,gene expression,proteins,human dna methylation site,differential methylation,human dna methylation site predictor,methyl-cpg-binding protein,svm,dna distribution,dna region,biochemistry,biology computing,molecular biophysics,methylation status,tfbs feature,transcription factor,biological tissues,dna,dna property,tissue-specific methylation,support vector machines,transcription binding site,computer science,data models,systems biology,cpg island,binding site,bioinformatics,predictive models,binding protein,accuracy,tin,cross validation,gene regulation
Differentially methylated regions,Transcription (biology),Biology,CpG site,Methylation,DNA methylation,DNA,Regulation of gene expression,Bioinformatics,Genetics,Locus (genetics)
Conference
ISBN
Citations 
PageRank 
978-0-7695-3656-9
0
0.34
References 
Authors
6
7
Name
Order
Citations
PageRank
Yi-Ming Sun110.70
Wei-Li Liao200.34
Hsien-Da Huang383563.83
Baw-Jhiune Liu419338.12
Cheng-Wei Chang501.69
Jorng-Tzong Horng654167.78
Li-Ching Wu7155.03