Please wait a minute...

中国生物工程杂志

China Biotechnology
China Biotechnology  2021, Vol. 41 Issue (11): 40-47    DOI: 10.13523/j.cb.2106027
    
Drug-target Affinity Prediction Based on Deep Learning and Multi-layered Information Fusion
TANG Yue-wei1,LIU Zhi-ping1,2,**()
1 School of Control Science and Engineering, Shandong University, Jinan 250061, China
2 Center for Intelligent Medicine, Shandong University, Jinan 250061, China
Download: HTML   PDF(748KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

Drug discovery is a very important and costly process. Computer-assisted methods for predicting drug-protein affinity can greatly speed up the process of drug discovery. The key to the prediction of drug target affinity lies in the accurate and detailed characterization of drug and protein information. In this paper, a prediction model for drug target affinity based on deep learning and multi-layered information fusion is proposed, in an attempt to obtain better prediction performance by integrating multi-layered information of drugs and proteins. Firstly, the drug is expressed as molecular graph and ECFP, GCN module and fully connected(FC) layer are used for learning, respectively. Secondly, protein sequence and K-mer feature of protein are input into CNN module and FC layer, respectively to learn potential protein features. Finally, the features learned from the four channels are concatenated and the FC layer is used for prediction. In this study, the availability of the proposed method is verified on the two benchmark datasets of drug-targets affinity and compared with other existing models. The results show that the proposed model can obtain better prediction performance than the baseline model, which indicates that the proposed strategy for predicting drug target affinity based on multi-layered information fusion of drug and protein is effective.



Key wordsDrug target affinity      Drug      Protein      Deep learning      Multi-layered information fusion     
Received: 16 June 2021      Published: 01 December 2021
ZTFLH:  Q819  
Corresponding Authors: Zhi-ping LIU     E-mail: zpliu@sdu.edu.cn
Cite this article:

TANG Yue-wei,LIU Zhi-ping. Drug-target Affinity Prediction Based on Deep Learning and Multi-layered Information Fusion. China Biotechnology, 2021, 41(11): 40-47.

URL:

https://manu60.magtech.com.cn/biotech/10.13523/j.cb.2106027     OR     https://manu60.magtech.com.cn/biotech/Y2021/V41/I11/40

Datasets Proteins Drugs Interactions Density/%
Davis 442 68 30 056 100
KIBA 229 2 111 118 254 24.4
Table 1 Davis and KIBA dataset
Fig.1 Model framework Drug and protein were numerically represented, and feature extraction was performed on their respective channels. After that, full connection layers are to predict the affinity
Method Drug Protein Evaluation
Graph ECFP PS K-mer CI MSE
Proposed model M1 0 1 0 1 0.886 0.240
M2 0 1 1 0 0.887 0.250
M3 1 0 0 1 0.868 0.287
M4 1 0 1 0 0.876 0.282
M5 0 1 1 1 0.889 0.241
M6 1 0 1 1 0.873 0.281
M7 1 1 0 1 0.884 0.245
M8 1 1 1 0 0.882 0.254
M9 1 1 1 1 0.885 0.246
Table 2 The mean CI/MSE scores on independent test sets of Davis dataset using the proposed model
Method Drug Protein Evaluation
Graph ECFP PS K-mer CI MSE
Proposed model M1 0 1 0 1 0.879 0.152
M2 0 1 1 0 0.880 0.154
M3 1 0 0 1 0.866 0.167
M4 1 0 1 0 0.869 0.165
M5 0 1 1 1 0.882 0.147
M6 1 0 1 1 0.873 0.163
M7 1 1 0 1 0.877 0.157
M8 1 1 1 0 0.880 0.152
M9 1 1 1 1 0.882 0.149
Table 3 The mean CI/MSE scores on independent test sets of KIBA dataset using the proposed model
Method Drug Protein CI MSE
KronRLS PubChem Sim S-W 0.871 0.379
SimBoost PubChem Sim S-W 0.872 0.282
DeepDTA LS PS 0.878 0.261
WideDTA LS+LMCS PS + PDM 0.886 0.262
Proposed model ECFP PS + K-mer 0.889 0.241
Table 4 Comparison of CI/MSE scores on the Davis dataset between the proposed model and the baseline
Method Drug Protein CI MSE
KronRLS PubChem Sim S-W 0.782 0.411
SimBoost PubChem Sim S-W 0.836 0.222
DeepDTA LS PS 0.863 0.194
WideDTA LS+LMCS PS+PDM 0.875 0.194
Proposed model ECFP PS+K-mer 0.882 0.147
Table 5 Comparison of CI/MSE scores on the KIBA dataset between the proposed model and the baseline
[1]   Mullard A. New drugs cost US$2.6 billion to develop. Nature Reviews Drug Discovery, 2014, 13(12):877.
[2]   Ashburn T T, Thor K B. Drug repositioning: identifying and developing new uses for existing drugs. Nature Reviews Drug Discovery, 2004, 3(8):673-683.
pmid: 15286734
[3]   Öztürk H, Ozkirimli E, Özgür A. WideDTA: prediction of drug-target binding affinity.[2019-02-04].https://arxiv.org/abs/1902.04166 .
[4]   Yamanishi Y, Kotera M, Kanehisa M, et al. Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework. Bioinformatics, 2010, 26(12):i246-i254.
doi: 10.1093/bioinformatics/btq176
[5]   Nascimento A C A, Prudêncio R B C, Costa I G. A multiple kernel learning algorithm for drug-target interaction prediction. BMC Bioinformatics, 2016, 17:46.
doi: 10.1186/s12859-016-0890-3 pmid: 26801218
[6]   Cheng Z Z, Zhou S G, Wang Y, et al. Effectively identifying compound-protein interactions by learning from positive and unlabeled examples. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2018, 15(6):1832-1843.
doi: 10.1109/TCBB.8857
[7]   Pahikkala T, Airola A, Pietilä S, et al. Toward more realistic drug-target interaction predictions. Briefings in Bioinformatics, 2015, 16(2):325-337.
doi: 10.1093/bib/bbu010 pmid: 24723570
[8]   He T, Heidemeyer M, Ban F Q, et al. SimBoost: a read-across approach for predicting drug-target binding affinities using gradient boosting machines. Journal of Cheminformatics, 2017, 9(1):24.
doi: 10.1186/s13321-017-0209-z
[9]   Wu Y F, Gao M, Zeng M, et al. BridgeDPI: a novel graph neural network for predicting drug-protein interactions.[2021-01-29].https://arxiv.org/abs/2101.12547 .
[10]   Hinton G, Deng L, Yu D, et al. Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Signal Processing Magazine, 2012, 29(6):82-97.
[11]   Ma J S, Sheridan R P, Liaw A, et al. Deep neural nets as a method for quantitative structure-activity relationships. Journal of Chemical Information and Modeling, 2015, 55(2):263-274.
doi: 10.1021/ci500747n
[12]   Öztürk H, Özgür A, Ozkirimli E. DeepDTA: deep drug-target binding affinity prediction. Bioinformatics, 2018, 34(17):i821-i829.
doi: 10.1093/bioinformatics/bty593
[13]   Abbasi K, Razzaghi P, Poso A, et al. DeepCDA: deep cross-domain compound-protein affinity prediction through LSTM and convolutional neural networks. Bioinformatics, 2020, 36(17):4633-4642.
doi: 10.1093/bioinformatics/btaa544
[14]   Hirohara M, Saito Y, Koda Y, et al. Convolutional neural network based on SMILES representation of compounds for detecting chemical motif. BMC Bioinformatics, 2018, 19(Suppl 19):526.
doi: 10.1186/s12859-018-2523-5 pmid: 30598075
[15]   Karimi M, Wu D, Wang Z Y, et al. DeepAffinity: interpretable deep learning of compound-protein affinity through unified recurrent and convolutional neural networks. Bioinformatics, 2019, 35(18):3329-3338.
doi: 10.1093/bioinformatics/btz111
[16]   Lim J, Ryu S, Park K, et al. Predicting drug-target interaction using a novel graph neural network with 3D structure-embedded graph representation. Journal of Chemical Information and Modeling, 2019, 59(9):3981-3988.
doi: 10.1021/acs.jcim.9b00387
[17]   Gomes J, Ramsundar B, Feinberg E N, et al. Atomic convolutional networks for predicting protein-ligand binding affinity.[2017-03-30].https://arxiv.org/abs/1703.10603 .
[18]   Rose P W, Prlić A, Altunkaya A, et al. The RCSB protein data bank: integrative view of protein, gene and 3D structural information. Nucleic Acids Research, 2017, 45(D1):D271-D281.
[19]   Davis M I, Hunt J P, Herrgard S, et al. Comprehensive analysis of kinase inhibitor selectivity. Nature Biotechnology, 2011, 29(11):1046-1051.
doi: 10.1038/nbt.1990
[20]   Tang J, Szwajda A, Shakyawar S, et al. Making sense of large-scale kinase inhibitor bioactivity data sets: a comparative and integrative analysis. Journal of Chemical Information and Modeling, 2014, 54(3):735-743.
doi: 10.1021/ci400709d
[21]   Rogers D, Hahn M. Extended-connectivity fingerprints. Journal of Chemical Information and Modeling, 2010, 50(5):742-754.
doi: 10.1021/ci100050t pmid: 20426451
[22]   Wu Z H, Pan S R, Chen F W, et al. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 2021, 32(1):4-24.
doi: 10.1109/TNNLS.5962385
[23]   Ramsundar B, Eastman P, Walters P, et al. Deep learning for the life sciences: applying deep learning to genomics, microscopy, drug discovery, and more. California: O’Reilly Media, 2019: 55-56.
[24]   Kipf T N, Welling M. Semi-supervised classification with graph convolutional networks.[2017-02-22].https://arxiv.org/abs/1609.02907v4 .
[25]   Gönen M, Heller G. Concordance probability and discriminatory power in proportional hazards regression. Biometrika, 2005, 92(4):965-970.
doi: 10.1093/biomet/92.4.965
[26]   Sigrist C J A, Cerutti L, de Castro E, et al. PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Research, 2010, 38(suppl_1):D161-D166.
doi: 10.1093/nar/gkp885
[1] MA Ning,WANG Han-jie. Advances of Optogenetics in the Regulation of Bacterial Production[J]. China Biotechnology, 2021, 41(9): 101-109.
[2] SUN Li-ping,XU Wan,LI Meng-wei,ZENG Ru,WENG Jian. Advances of the Physiochemical Properties of Sporopollenin and Its Biomedical Applications[J]. China Biotechnology, 2021, 41(9): 92-100.
[3] ZHAO Xiao-yu,XU Qi-ling,ZHAO Xiao-dong,AN Yun-fei. Enhancing Lentiviral Vector Transduction Efficiency for Facilitating Gene Therapy[J]. China Biotechnology, 2021, 41(8): 52-58.
[4] LI Jia-xin,ZHANG Zheng,LIU He,YANG Qing,LV Cheng-zhi,YANG Jun. Preparation and Drug Release Properties of Keratin-loaded Nanoparticles[J]. China Biotechnology, 2021, 41(8): 8-16.
[5] LIU Shao-jin,FENG Xue-jiao,WANG Jun-shu,XIAO Zheng-qiang,CHENG Ping-sheng. Market Analysis and Countermeasures of Nucleic Acid Drugs in China[J]. China Biotechnology, 2021, 41(7): 99-109.
[6] FENG Zhao,LI Jiang-hao,WANG Jia-hua. Functional Analysis of RpRPL22, a Ribosomal Protein Homologous Gene, in the Symbiotic Nodulation Process of Robinia Pseudoacacia[J]. China Biotechnology, 2021, 41(7): 10-21.
[7] MIAO Yi-nan,LI Jing-zhi,WANG Shuai,LI Chun,WANG Ying. Research Progress of Key Enzymes in Terpene Biosynthesis[J]. China Biotechnology, 2021, 41(6): 60-70.
[8] CHEN Wen-jie,MIAO Xian-feng. Domestic Research and Development Status of Antibody-drug Conjugates and Strategic Layout of Key Enterprises[J]. China Biotechnology, 2021, 41(6): 105-110.
[9] XU Ye-chun,LIU Hong,LI Jian-feng,SHEN Jing-shan,JIANG Hua-liang. Recent Progress in Drug Development against COVID-19[J]. China Biotechnology, 2021, 41(6): 111-118.
[10] HU Xuan,WANG Song,YU Xue-ling,ZHANG Xiao-peng. Construction of a Destabilized EGFP Cell Model for Gene Editing Evaluation[J]. China Biotechnology, 2021, 41(5): 17-26.
[11] LV Hui-zhong,ZHAO Chen-chen,ZHU Lian,XU Na. Progress of Using Exosome for Drug Targeted Delivery in Tumor Therapy[J]. China Biotechnology, 2021, 41(5): 79-86.
[12] HU Sheng-tao,ZHANG Er-bing,LIN Ye,ZHANG Feng,HUANG Dan,SONG Hou-pan,LIU Bin,CAI Xiong. Research Advances on the Therapy of Rheumatoid Arthritis with the Nanotechnology Based on Transdermal Drug Delivery System[J]. China Biotechnology, 2021, 41(2/3): 98-106.
[13] WU You,XIN Lin. New Drug Delivery System: Delivery of Exosomes as Drug Carriers[J]. China Biotechnology, 2020, 40(9): 28-35.
[14] GUO Guang-chao,ZHOU Yu-yong,CAO San-jie,WU Yao-min,WU Rui,ZHAO Qin,WEN Xin-tian,HUANG Xiao-bo,WEN Yi-ping. The Study on the Effect of NS2A-C60A Site Mutation of Japanese Encephalitis Virus on Its Biological Characteristics[J]. China Biotechnology, 2020, 40(9): 1-10.
[15] YANG Wei,SONG Fang-xiang,WANG Shuai,ZHANG Li,WANG Hong-xia,LI Yan. Preparation and Application of Janus Nanoparticles in Drug Delivery System[J]. China Biotechnology, 2020, 40(7): 70-81.