Abstract | ||
---|---|---|
The size of data gathered from various ongoing biological and clinically studies is increasing at an exponential rate. The bio-inspired data mainly comprises of genes of DNA, protein and variety of proteomics and genetic diseases. Additionally, DNA microarray data is also available for early diagnosis and prediction of various types of cancer diseases. Interestingly, this data may store very vital information about genes, their structure and important biological function. The huge volume and constant increase in the extracted bio data has opened several challenges. Many bioinformatics and machine learning models have been developed but those fail to address key challenges presents in the efficient and accurate analysis of variety of complex biologically inspired data such as genetic diseases etc. The reliable and robust process of classifying the extracted data into different classes based on the information hidden in the sample data is also a very interesting and open problem. This research work mainly focuses to overcome major challenges in the accurate protein classification keeping in view of the success of deep learning models in natural language processing since it assumes the proteins sequences as a language. The learning ability and overall classification performance of the proposed system can be validated with deep learning classification models. The proposed system can have the superior ability to accurately classify the mentioned datasets than previous approaches and shows better results. The in-depth analysis of multifaceted biological data may also help in the early diagnosis of diseases that causes due to mutation of genes and to overcome arising challenges in the development of large-scale healthcare systems. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1166/jmihi.2020.3179 | JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS |
Keywords | DocType | Volume |
Bioinformatics,Protein Sequence Classification,Deep Learning,Convolutional Neural Network,Sequence Encoding,Genes Healthcare Informatics | Journal | 10 |
Issue | ISSN | Citations |
10 | 2156-7018 | 1 |
PageRank | References | Authors |
0.35 | 0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Iftikhar Ahmad | 1 | 156 | 27.06 |
Muhammad Javed Iqbal | 2 | 5 | 2.16 |
Mohammad Basheri | 3 | 1 | 0.35 |