Title
Tlgp: A Flexible Transfer Learning Algorithm For Gene Prioritization Based On Heterogeneous Source Domain
Abstract
Background Gene prioritization (gene ranking) aims to obtain the centrality of genes, which is critical for cancer diagnosis and therapy since keys genes correspond to the biomarkers or targets of drugs. Great efforts have been devoted to the gene ranking problem by exploring the similarity between candidate and known disease-causing genes. However, when the number of disease-causing genes is limited, they are not applicable largely due to the low accuracy. Actually, the number of disease-causing genes for cancers, particularly for these rare cancers, are really limited. Therefore, there is a critical needed to design effective and efficient algorithms for gene ranking with limited prior disease-causing genes. Results In this study, we propose a transfer learning based algorithm for gene prioritization (called TLGP) in the cancer (target domain) without disease-causing genes by transferring knowledge from other cancers (source domain). The underlying assumption is that knowledge shared by similar cancers improves the accuracy of gene prioritization. Specifically, TLGP first quantifies the similarity between the target and source domain by calculating the affinity matrix for genes. Then, TLGP automatically learns a fusion network for the target cancer by fusing affinity matrix, pathogenic genes and genomic data of source cancers. Finally, genes in the target cancer are prioritized. The experimental results indicate that the learnt fusion network is more reliable than gene co-expression network, implying that transferring knowledge from other cancers improves the accuracy of network construction. Moreover, TLGP outperforms state-of-the-art approaches in terms of accuracy, improving at least 5%. Conclusion The proposed model and method provide an effective and efficient strategy for gene ranking by integrating genomic data from various cancers.
Year
DOI
Venue
2021
10.1186/s12859-021-04190-9
BMC BIOINFORMATICS
Keywords
DocType
Volume
Gene prioritizatio, Transfer learning, Gene co-expression network, Integrative analysis
Journal
22
Issue
ISSN
Citations 
SUPPL 9
1471-2105
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Yan Wang100.34
Zuheng Xia200.34
Jingjing Deng3165.83
Xianghua Xie438337.13
Maoguo Gong52676172.02
Xiaoke Ma67611.69