1.

Predicting Protein Subcellular Localization: Past, Present, and Future (total citations: 10; self-citations: 0; citations by others: 10)

Dönnes P Höglund A 《Genomics, Proteomics & Bioinformatics》2004,2(4):209-215

2.

Prediction of Subcellular Localization of Apoptosis Protein Using Chou’s Pseudo Amino Acid Composition

Hao Lin Hao Wang Hui Ding Ying-Li Chen Qian-Zhong Li 《Acta biotheoretica》2009,57(3):321-330

Apoptosis proteins play an essential role in regulating the balance between cell proliferation and death. Successfully predicting the subcellular localization of apoptosis proteins directly from primary sequence greatly benefits the understanding of programmed cell death and drug discovery. In this paper, using Chou’s pseudo amino acid composition (PseAAC), the localization of a total of 317 apoptosis proteins is predicted by a support vector machine (SVM). Jackknife cross-validation is applied to test the predictive capability of the proposed method. The results show an overall prediction accuracy of 91.1%, which is higher than that of previous methods. Furthermore, another dataset containing 98 apoptosis proteins is examined with the proposed method. The overall prediction success rate is 92.9%.
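The jackknife (leave-one-out) evaluation mentioned in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' method: it uses plain 20-dimensional amino acid composition instead of PseAAC, and a nearest-centroid classifier stands in for the SVM. All function names are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """Return the 20-dimensional amino acid composition of a sequence."""
    counts = Counter(seq)
    total = sum(counts[a] for a in AMINO_ACIDS) or 1
    return [counts[a] / total for a in AMINO_ACIDS]


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def sq_dist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def predict(train, query_vec):
    """Nearest-centroid prediction (a stand-in for the SVM).

    train: list of (feature_vector, label) pairs.
    """
    by_label = {}
    for vec, label in train:
        by_label.setdefault(label, []).append(vec)
    centroids = {label: centroid(vs) for label, vs in by_label.items()}
    return min(centroids, key=lambda lab: sq_dist(centroids[lab], query_vec))


def jackknife_accuracy(dataset):
    """Leave-one-out accuracy over a list of (sequence, label) pairs.

    Each protein is held out in turn, the classifier is rebuilt from
    the remaining proteins, and the held-out protein is predicted.
    """
    samples = [(composition(seq), label) for seq, label in dataset]
    correct = 0
    for i, (vec, label) in enumerate(samples):
        train = samples[:i] + samples[i + 1:]
        if predict(train, vec) == label:
            correct += 1
    return correct / len(samples)
```

Because every protein is tested exactly once by a model that never saw it, the jackknife yields a single deterministic accuracy for a given dataset, which is why it is a common choice for small benchmark sets like the 317- and 98-protein sets above.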

3.

Prediction of Apoptosis Protein’s Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information

Yunyun Liang Shengli Zhang 《Acta biotheoretica》2018,66(1):61-78

4.

Automatically Improve the Quality of MS/MS Spectrum from Sequest

Wei SUN Fu-Xin LI Kai ZHAO Jue WANG De-Xian ZHENG You-He GAO 《Acta biochimica et biophysica Sinica》2004,36(1):86-86

5.

Proteotyping: A New Approach Studying Influenza Virus Evolution at the Protein Level

Wei-feng SHI Zhong ZHANG Lei PENG Yan-zhou ZHANG Bin LIU Chao-dong ZHU 《Virologica Sinica》2007,22(5):405-411

Phylogenetic methods have been widely used to detect the evolution of influenza viruses. However, previous phylogenetic studies of influenza viruses do not make full use of the genetic information at the protein level and therefore cannot distinguish the subtle differences among viral genes. Proteotyping is a new approach to study influenza virus evolution. It aimed at mining the potential genetic information of the viral gene at the protein level by visualizing unique amino acid signatures (proteotypes). Neuraminidase gene fragments of some H5N1 avian influenza viruses were used as an example to illustrate how the proteotyping method worked. Bayesian analysis confirmed that the NA gene tree was mainly divided into three lineages. The NA proteotype analysis further suggested there might be multiple proteotypes within these three lineages and even within single genotypes. At the same time, some proteotypes might even involve more than one genotype. In particular, it also discovered some amino acids of viruses of some genotypes might co-reassort. All these results proved this approach could provide additional information in contrast to results from standard phylogenetic tree analysis.

6.

Establishment of the Platform for Therapeutic Protein Pegylation and Studies on Biological Characteristics of the Pegylated Conjugates

XIE Fang HUANG Jing LIU Chen WU Zi-Rong 《Acta biochimica et biophysica Sinica》2003,35(11):1045-1045

7.

A New Amylose Derivative for the Preparation of Protein–Carbohydrate Conjugates

《Bioscience, biotechnology, and biochemistry》2013,77(12):2091-2092

N,N-Dimethylformamidase (DMFase) from Alcaligenes sp. strain KUFA-1, a bacterium that can grow on N,N-dimethylformamide (DMF) as the sole carbon and nitrogen source, catalyzes the first step of the DMF degradation. The DMFase gene dmfA1A2 was cloned in Escherichia coli, and its nucleotides were sequenced. The deduced amino acid sequence of the enzyme consisted of two α- and two β-subunits with 132 and 762 amino acids, respectively, and had little similarity to sequences in protein databases, including various amidases. The protein may be a new kind of amidase. DMFase activity was detected in E. coli cells transformed with an expression plasmid of the cloned DMFase gene. The properties of recombinant DMFase purified from E. coli were identical to those of Alcaligenes DMFase.

8.

A Multiple Functional Protein: the Herpes Simplex Virus Type 1 Tegument Protein VP22

Mei-li Li Hong Guo Qiong Ding Chun-fu Zheng 《Virologica Sinica》2009,24(3):153-161

9.

A Dual Regulatory Role for the Disordered C-Terminus of Protein Kinase Cα

Rebecca B. Berlow 《Biophysical journal》2018,114(7):1513-1514

10.

CRISPR-P: A Web Tool for Synthetic Single-Guide RNA Design of CRISPR-System in Plants (total citations: 1; self-citations: 0; citations by others: 1)

《Journal of Plant Physiology and Molecular Biology》2014,(9):1494-1496

11.

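Sequence Encoding Techniques in Protein Subcellular Localization Prediction

WANG Zhenghua ZHANG Zhenhui WANG Yongxian 《Chinese Journal of Bioinformatics》2007,5(2):82-85,89

Encoding protein sequences is one of the key techniques in subcellular localization prediction. This paper describes the existing protein sequence encoding algorithms in some detail, and points out some open problems in sequence encoding and possible directions for future development.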

12.

Prediction of protein subcellular localization (total citations: 6; self-citations: 0; citations by others: 6)

Yu CS Chen YC Lu CH Hwang JK 《Proteins》2006,64(3):643-651

Because a protein's function is usually related to its subcellular localization, the ability to predict subcellular localization directly from protein sequences will be useful for inferring protein functions. Recent years have seen a surging interest in the development of novel computational tools to predict subcellular localization. At present, these approaches, based on a wide range of algorithms, have achieved varying degrees of success for specific organisms and for certain localization categories. A number of authors have noticed that sequence similarity is useful in predicting subcellular localization. For example, Nair and Rost (Protein Sci 2002;11:2836-2847) have carried out extensive analysis of the relation between sequence similarity and identity in subcellular localization, and have found a close relationship between them above a certain similarity threshold. However, many existing benchmark data sets used for the prediction accuracy assessment contain highly homologous sequences, with some data sets comprising sequences of up to 80-90% sequence identity. Using these benchmark test data will surely lead to overestimation of the performance of the methods considered. Here, we develop an approach based on a two-level support vector machine (SVM) system: the first level comprises a number of SVM classifiers, each based on a specific type of feature vectors derived from sequences; the second level SVM classifier functions as the jury machine to generate the probability distribution of decisions for possible localizations. We compare our approach with a global sequence alignment approach and other existing approaches for two benchmark data sets, one comprising prokaryotic sequences and the other eukaryotic sequences. Furthermore, we carried out all-against-all sequence alignment for several data sets to investigate the relationship between sequence homology and subcellular localization.
Our results, which are consistent with previous studies, indicate that the homology search approach performs well down to 30% sequence identity, although its performance deteriorates considerably for sequences sharing lower sequence identity. A data set of high homology levels will undoubtedly lead to biased assessment of the performances of the predictive approaches, especially those relying on homology search or sequence annotations. Our two-level classification system based on SVM does not rely on homology search; therefore, its performance remains relatively unaffected by sequence homology. When compared with other approaches, our approach performed significantly better. Furthermore, we also develop a practical hybrid method, which combines the two-level SVM classifier and the homology search method, as a general tool for the sequence annotation of subcellular localization.
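The hybrid idea in this abstract (homology transfer when a close homolog exists, a sequence-based classifier otherwise) might look roughly like the sketch below. The abstract does not say how the two components are actually combined, so the switching rule is an assumption; the 30% cutoff is borrowed from the reported observation that homology search performs well down to 30% identity. The function names, and the choice to count identity over gap-free aligned columns, are likewise hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns where neither sequence has a gap.

    Both inputs are rows of an existing pairwise alignment ('-' marks a gap).
    Other definitions of identity (e.g. per shorter sequence) also exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b)
             if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def hybrid_predict(hits, fallback, threshold=30.0):
    """Choose between homology transfer and a sequence-based classifier.

    hits: list of (percent_identity, annotated_location) for the query's
          homologs, e.g. gathered from a database search.
    fallback: zero-argument callable returning the classifier's prediction
              (standing in for the two-level SVM).
    Returns (location, source) where source names the component used.
    """
    best = max(hits, default=None, key=lambda hit: hit[0])
    if best is not None and best[0] >= threshold:
        return best[1], "homology"
    return fallback(), "classifier"
```

For example, `hybrid_predict([(45.0, "nucleus")], lambda: "cytoplasm")` transfers the homolog's annotation, while a best hit at 22% identity falls through to the classifier.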

13.

Mimicking cellular sorting improves prediction of subcellular localization (total citations: 27; self-citations: 0; citations by others: 27)

Nair R Rost B 《Journal of molecular biology》2005,348(1):85-100

Predicting the native subcellular compartment of a protein is an important step toward elucidating its function. Here we introduce LOCtree, a hierarchical system combining support vector machines (SVMs) and other prediction methods. LOCtree predicts the subcellular compartment of a protein by mimicking the mechanism of cellular sorting and exploiting a variety of sequence and predicted structural features in its input. Currently LOCtree does not predict localization for membrane proteins, since the compositional properties of membrane proteins significantly differ from those of non-membrane proteins. While any information about function can be used by the system, we present estimates of performance that are valid when only the amino acid sequence of a protein is known. When evaluated on a non-redundant test set, LOCtree achieved sustained levels of 74% accuracy for non-plant eukaryotes, 70% for plants, and 84% for prokaryotes. We rigorously benchmarked LOCtree in comparison to the best alternative methods for localization prediction. LOCtree outperformed all other methods in nearly all benchmarks. Localization assignments using LOCtree agreed quite well with data from recent large-scale experiments. Our preliminary analysis of a few entirely sequenced organisms, namely human (Homo sapiens), yeast (Saccharomyces cerevisiae), and weed (Arabidopsis thaliana) suggested that over 35% of all non-membrane proteins are nuclear, about 20% are retained in the cytosol, and that every fifth protein in the weed resides in the chloroplast.

14.

The prediction of protein subcellular localization from sequence: a shortcut to functional genome annotation. (total citations: 2; self-citations: 0; citations by others: 2)

Rita Casadio Pier Luigi Martelli Andrea Pierleoni 《Briefings in Functional Genomics and Prot》2008,7(1):63-73

Automated sequence annotation is a major goal of the post-genomic era, with hundreds of genomes in the databases, from both prokaryotes and eukaryotes. While the number of fully sequenced chromosomes from microbial organisms has increased exponentially in the last decade to above 600, presently we know the whole DNA content of only 25 eukaryotic organisms, including Homo sapiens. However, the process of genome annotation is far from being completed. This is particularly relevant in eukaryotes, whose cells contain several subcellular compartments, or organelles, enclosed by membranes, where different relevant functions are performed. Translocation across the membrane into the organelles is a highly regulated and complex cellular process. Indeed, different proteins and/or protein isoforms, originating from genes by alternative splicing, may be conveyed to different cell compartments, depending on their specific role in the cell. During recent years the prediction of subcellular localization (SL) by computational means has been an active research area. Several methods are presently available, based on different notions and addressing different aspects of SL. This review provides a short overview of the best-performing methods described in the literature, highlighting their predictive capabilities and different applications.

15.

Fast Fourier transform-based support vector machine for subcellular localization prediction using different substitution models (total citations: 2; self-citations: 0; citations by others: 2)

Wang Z Jiang L Li M Sun L Lin R 《Acta biochimica et biophysica Sinica》2007,39(9):715-721

There are approximately 10^9 proteins in a cell. A hotspot in bioinformatics is how to identify a protein's subcellular localization, if its sequence is known. In this paper, a method using fast Fourier transform-based support vector machine is developed to predict the subcellular localization of proteins from their physicochemical properties and structural parameters. The prediction accuracies reached 83% in prokaryotic organisms and 84% in eukaryotic organisms with the substitution model of the c-p-v matrix (c, composition; p, polarity; and v, molecular volume). The overall prediction accuracy was also evaluated using the "leave-one-out" jackknife procedure. The influence of the substitution model on prediction accuracy has also been discussed in the work. The source code of the new program is available on request from the authors.
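The general idea of turning a sequence into frequency-domain features can be sketched as follows. This is only an illustration under assumptions: the c-p-v substitution values are not given in the abstract, so the Kyte-Doolittle hydropathy scale is used as a stand-in property scale, and a naive discrete Fourier transform replaces an actual FFT (it returns the same spectrum, just more slowly). The function names are hypothetical, and how the authors derive fixed-length SVM inputs from the spectrum is not specified here.

```python
import cmath

# Kyte-Doolittle hydropathy values, used only as a stand-in property scale;
# the paper's c-p-v matrix is not reproduced in the abstract.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode(seq, scale=HYDROPATHY):
    """Map each residue to a number, giving a numeric signal."""
    return [scale[residue] for residue in seq]


def power_spectrum(signal):
    """Power spectrum |X_k|^2 from a naive O(n^2) discrete Fourier transform."""
    n = len(signal)
    spectrum = []
    for k in range(n):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        spectrum.append(abs(coeff) ** 2)
    return spectrum
```

Periodic patterns in the property profile show up as peaks in the spectrum; for instance, a strictly alternating sequence such as `"AVAV"` concentrates its non-constant energy at the highest frequency bin (index 2 of 4).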

16.

Sequence conserved for subcellular localization (total citations: 6; self-citations: 0; citations by others: 6)

Nair R Rost B 《Protein science : a publication of the Protein Society》2002,11(12):2836-2847

The more proteins diverged in sequence, the more difficult it becomes for bioinformatics to infer similarities of protein function and structure from sequence. The precise thresholds used in automated genome annotations depend on the particular aspect of protein function transferred by homology. Here, we presented the first large-scale analysis of the relation between sequence similarity and identity in subcellular localization. Three results stood out: (1) The subcellular compartment is generally more conserved than what might have been expected given that short sequence motifs like nuclear localization signals can alter the native compartment; (2) the sequence conservation of localization is similar between different compartments; and (3) it is similar to the conservation of structure and enzymatic activity. In particular, we found the transition between the regions of conserved and nonconserved localization to be very sharp, although the thresholds for conservation were less well defined than for structure and enzymatic activity. We found that a simple measure for sequence similarity accounting for pairwise sequence identity and alignment length, the HSSP distance, distinguished accurately between protein pairs of identical and different localizations. In fact, BLAST expectation values outperformed the HSSP distance only for alignments in the subtwilight zone. We succeeded in slightly improving the accuracy of inferring localization through homology by fine tuning the thresholds. Finally, we applied our results to the entire SWISS-PROT database and five entirely sequenced eukaryotes.

17.

mPLR-Loc: An adaptive decision multi-label classifier based on penalized logistic regression for protein subcellular localization prediction

《Analytical biochemistry》2015

Proteins located in appropriate cellular compartments are of paramount importance to exert their biological functions. Prediction of protein subcellular localization by computational methods is required in the post-genomic era. Recent studies have been focusing on predicting not only single-location proteins but also multi-location proteins. However, most of the existing predictors are far from effective for tackling the challenges of multi-label proteins. This article proposes an efficient multi-label predictor, namely mPLR-Loc, based on penalized logistic regression and adaptive decisions for predicting both single- and multi-location proteins. Specifically, for each query protein, mPLR-Loc exploits the information from the Gene Ontology (GO) database by using its accession number (AC) or the ACs of its homologs obtained via BLAST. The frequencies of GO occurrences are used to construct feature vectors, which are then classified by an adaptive decision-based multi-label penalized logistic regression classifier. Experimental results based on two recent stringent benchmark datasets (virus and plant) show that mPLR-Loc remarkably outperforms existing state-of-the-art multi-label predictors. In addition to being able to rapidly and accurately predict subcellular localization of single- and multi-label proteins, mPLR-Loc can also provide probabilistic confidence scores for the prediction decisions. For readers’ convenience, the mPLR-Loc server is available online (http://bioinfo.eie.polyu.edu.hk/mPLRLocServer).
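The two steps named in this abstract, GO-frequency feature vectors and a multi-label decision, can be illustrated with a small sketch. It is not the mPLR-Loc implementation: the function names are hypothetical, the per-location scores are assumed to come from some already-trained classifier, and the relative-threshold rule (keep every location scoring at least `theta` times the top score) is an assumed form of "adaptive decision" that the abstract does not spell out.

```python
from collections import Counter


def go_frequency_vector(go_terms, vocabulary):
    """Count how often each GO term in the vocabulary occurs for a protein.

    go_terms: GO identifiers retrieved for the protein (or its homologs);
              repeats are allowed and contribute to the frequency.
    vocabulary: ordered list of GO identifiers defining the feature space.
    """
    counts = Counter(go_terms)
    return [counts[term] for term in vocabulary]


def adaptive_decision(scores, theta=0.5):
    """Return every location whose score is at least theta times the top score.

    scores: dict mapping location -> positive, probability-like score.
    theta: hypothetical relative threshold in (0, 1]; theta=1 reduces to
           single-label prediction of the top-scoring location(s).
    """
    top = max(scores.values())
    return sorted(loc for loc, s in scores.items() if s >= theta * top)
```

With `theta=0.5`, scores of 0.9 (nucleus), 0.6 (cytoplasm), and 0.1 (mitochondrion) yield a two-location prediction, which is how a relative threshold lets one predictor handle both single- and multi-location proteins.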

18.

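Predicting Protein Subcellular Localization Based on Multi-Level Sparse Coding

CHEN Xingjian HU Xuejiao XUE Wei 《Chinese Journal of Biotechnology》2019,35(4):687-696

This paper proposes a simple and effective method for predicting protein subcellular localization, providing a theoretical basis for further understanding the functions and properties of proteins. Sparse coding is combined with amino acid composition information to extract features from protein sequences; the features obtained with dictionaries of different sizes are integrated by multi-level pooling and fed into a support vector machine for classification. Under the jackknife test, the prediction success rates on the datasets ZD98, CH317, and Gram1253 reach 95.9%, 93.4%, and 94.7%, respectively. The experiments show that the classification algorithm based on multi-level sparse coding can significantly improve the prediction accuracy of protein subcellular localization.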

19.

Prediction of subcellular protein localization based on functional domain composition

Jia P Qian Z Zeng Z Cai Y Li Y 《Biochemical and biophysical research communications》2007,357(2):366-370

Assigning subcellular localization (SL) to proteins is one of the major tasks of functional proteomics. Despite the impressive technical advances of the past decades, it is still time-consuming and laborious to experimentally determine SL on a high throughput scale. Thus, computational predictions are the preferred method for large-scale assignment of protein SL, and if appropriate, followed up by experimental studies. In this report, using a machine learning approach, the Nearest Neighbor Algorithm (NNA), we developed a prediction system for protein SL in which we incorporated a protein functional domain profile. The overall accuracy achieved by this system is 93.96%. Furthermore, comparisons with other methods have been conducted to demonstrate the validity and efficiency of our prediction system. We also provide an implementation of our Subcellular Location Prediction System (SLPS), which is available at http://pcal.biosino.org.
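A nearest-neighbor prediction over functional domain composition could be sketched as below. The encoding (a binary presence/absence vector over a fixed domain list) and the cosine similarity measure are assumptions made for illustration; the abstract does not state which representation or distance SLPS uses, and the function names are hypothetical.

```python
import math


def domain_vector(domains, domain_index):
    """Binary vector marking which indexed functional domains a protein has."""
    present = set(domains)
    return [1 if d in present else 0 for d in domain_index]


def cosine(u, v):
    """Cosine similarity; defined as 0.0 when either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_neighbor(train, query):
    """Return the label of the training protein most similar to the query.

    train: list of (domain_vector, location) pairs.
    query: domain vector of the protein to be predicted.
    """
    _, best_label = max(train, key=lambda item: cosine(item[0], query))
    return best_label
```

A protein with no indexed domains maps to the zero vector and matches every training protein equally (similarity 0.0), so a practical system would need a fallback for such queries.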