首页 | 本学科首页   官方微博 | 高级检索  
     


DCAMCP: A deep learning model based on capsule network and attention mechanism for molecular carcinogenicity prediction
Authors:Zhe Chen  Li Zhang  Jianqiang Sun  Rui Meng  Shuaidong Yin  Qi Zhao
Affiliation:1. School of Mathematics and Statistics, Liaoning University, Shenyang, China

Contribution: Data curation (equal), ​Investigation (equal), Methodology (equal), Writing - original draft (equal);2. School of Life Science, Liaoning University, Shenyang, China

Contribution: Conceptualization (equal), Data curation (equal), ​Investigation (equal), Methodology (equal), Writing - original draft (equal);3. School of Information Science and Engineering, Linyi University, Linyi, China

Contribution: Conceptualization (equal), Formal analysis (equal), Methodology (equal), Supervision (equal);4. School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, China

Contribution: Software (equal), Validation (equal), Writing - review & editing (equal);5. School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, China

Abstract:The carcinogenicity of drugs can have a serious impact on human health, so carcinogenicity testing of new compounds is very necessary before being put on the market. Currently, many methods have been used to predict the carcinogenicity of compounds. However, most methods have limited predictive power and there is still much room for improvement. In this study, we construct a deep learning model based on capsule network and attention mechanism named DCAMCP to discriminate between carcinogenic and non-carcinogenic compounds. We train the DCAMCP on a dataset containing 1564 different compounds through their molecular fingerprints and molecular graph features. The trained model is validated by fivefold cross-validation and external validation. DCAMCP achieves an average accuracy (ACC) of 0.718 ± 0.009, sensitivity (SE) of 0.721 ± 0.006, specificity (SP) of 0.715 ± 0.014 and area under the receiver-operating characteristic curve (AUC) of 0.793 ± 0.012. Meanwhile, comparable results can be achieved on an external validation dataset containing 100 compounds, with an ACC of 0.750, SE of 0.778, SP of 0.727 and AUC of 0.811, which demonstrate the reliability of DCAMCP. The results indicate that our model has made progress in cancer risk assessment and could be used as an efficient tool in drug design.
Keywords:capsule network  carcinogenicity  computational toxicology  deep learning  graph attention network
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号