首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites
Authors:Zhen Chen  Ningning He  Yu Huang  Wen Tao Qin  Xuhan Liu  Lei Li
Institution:1. School of Basic Medicine, Qingdao University, Qingdao 266021, China;2. School of Data Science and Software Engineering, Qingdao University, Qingdao 266021, China;3. Department of Biochemistry, Schulich School of Medicine and Dentistry, University of Western Ontario, London, Ontario N6A 5C1, Canada;4. Department of Information Technology, Beijing Oriental Yamei Gene Technology Institute Co. Ltd., Beijing 100078, China;5. Qingdao Cancer Institute, Qingdao University, Qingdao 266021, China
Abstract:As a newly-identified protein post-translational modification, malonylation is involved in a variety of biological functions. Recognizing malonylation sites in substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein malonylation. In this study, we constructed a deep learning (DL) network classifier based on long short-term memory (LSTM) with word embedding (LSTMWE) for the prediction of mammalian malonylation sites. LSTMWE performs better than traditional classifiers developed with common pre-defined feature encodings or a DL classifier based on LSTM with a one-hot vector. The performance of LSTMWE is sensitive to the size of the training set, but this limitation can be overcome by integration with a traditional machine learning (ML) classifier. Accordingly, an integrated approach called LEMP was developed, which includes LSTMWE and the random forest classifier with a novel encoding of enhanced amino acid content. LEMP performs not only better than the individual classifiers but also superior to the currently-available malonylation predictors. Additionally, it demonstrates a promising performance with a low false positive rate, which is highly useful in the prediction application. Overall, LEMP is a useful tool for easily identifying malonylation sites with high confidence. LEMP is available at http://www.bioinfogo.org/lemp.
Keywords:Deep learning  Recurrent neural network  LSTM  Malonylation  Random forest
本文献已被 CNKI ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号