Similar articles
 20 similar articles found
1.
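Protein structure analysis based on amino acid characteristic sequences   Cited by: 2 (self-citations: 1; citations by others: 2)
Focusing on the nucleotide composition of the amino acids in protein sequences and the associated characteristic information, this paper introduces the concept of σ-sequences and related sequences and discusses their primary and secondary characteristics. These sequences can serve as a method for qualitative and quantitative comparison of proteins, used to judge the degree of homology and similarity among species. Then, for selected all-α (helix), all-β (sheet), and αβ class sequences, the σ-, τ-, and στ-sequence concepts are used to derive the corresponding amino acid characteristic sequences of the proteins. Numerical characterizations of 18 protein sequences from the three classes are computed, plotted, and analyzed.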

2.
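Proteins are highly diverse in sequence, structure, and function. Many studies show that a protein's structure is related to the order of its amino acid sequence, and that the local sequence environment has some influence on the structure. This paper proposes a new method for predicting protein structure type based on the statistical torsion-angle preferences of 5-mer amino acids: the structure type is predicted from the torsion-angle preferences of the central amino acid of 5-mers in the PDB database. Through computer simulation, the new method can...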

3.
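To study the properties of protein sequences, a nonlinear prediction method is first used to obtain plots of the overall mean prediction error for protein sequences. Comparison with the corresponding plots for random sequences and chaotic sequences shows that protein sequences differ from random sequences and closely resemble chaotic sequences, which suggests that protein sequences have chaotic properties. To test this conjecture, protein sequences are converted into time series with a chaotic random-walk representation, and their largest Lyapunov exponents are computed. With the chosen time delay and embedding dimension, the largest Lyapunov exponent of every class of protein sequences is greater than zero, leading to the conclusion that protein sequences have chaotic properties.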

4.
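Classification of protein quaternary structure based on support vector machines and Bayesian methods   Cited by: 4 (self-citations: 2; citations by others: 4)
Support vector machines and a Bayesian method were used to classify protein quaternary structure. The support vector machine gave the best results: under 10-fold cross-validation (10CV), its overall classification accuracy, correct prediction rate for positive samples, Matthews correlation coefficient, and false positive rate were 74.2%, 84.6%, 0.474, and 38.9%, respectively. The Bayesian classifier performed less well overall, but it had the lowest 10CV false positive rate (15.9%). These results indicate that the primary sequences of homo-oligomeric proteins contain quaternary structure information, and that the feature vectors do capture basic information buried in the contact surfaces at the interaction sites of the associating subunits.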

5.
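The nucleotide sequence of an mRNA determines the amino acid sequence of the protein through triplet codons. However, differences in the usage frequency of synonymous codons, differences in the efficiency of codon-anticodon interactions, codon context, and differences in secondary structure among regions of the mRNA cause the ribosome to translate different regions of the mRNA at different speeds. Together with cotranslational folding, this means that the sequence and structure of the mRNA influence the formation of the protein's three-dimensional structure.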

6.
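This article describes some applications of computers to the analysis of the primary structure sequences of nucleic acids and proteins, including collecting and storing sequences, comparing homology between two or more sequences, locating restriction endonuclease cleavage sites, finding open reading frames in DNA sequences, back-translating protein sequences into their possible DNA sequences, and the construction and use of DNA and protein sequences.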
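Two of the tasks listed in entry 6, locating restriction sites and finding open reading frames, reduce to simple string scans. The sketch below is an illustration under stated assumptions, not the article's software: it scans only the forward strand, reports recognition-site positions rather than exact cut positions, and the function names and the `min_codons` parameter are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_sites(dna, site):
    """Return 0-based start positions of every (possibly overlapping) occurrence of site."""
    positions = []
    start = dna.find(site)
    while start != -1:
        positions.append(start)
        start = dna.find(site, start + 1)
    return positions

def find_orfs(dna, min_codons=2):
    """Return (start, end) pairs for forward-strand ORFs.

    An ORF runs from an ATG to the first in-frame stop codon;
    end is exclusive and includes the stop codon.
    """
    orfs = []
    for frame in range(3):
        i = frame
        while i + 3 <= len(dna):
            if dna[i:i + 3] == "ATG":
                for j in range(i + 3, len(dna) - 2, 3):
                    if dna[j:j + 3] in STOP_CODONS:
                        if (j - i) // 3 >= min_codons:
                            orfs.append((i, j + 3))
                        i = j  # resume scanning after this stop codon
                        break
            i += 3
    return orfs

# Toy sequence with two EcoRI recognition sites (GAATTC) around a short ORF.
dna = "CCGAATTCATGAAATTTTAGGAATTC"
print(find_sites(dna, "GAATTC"))
print([dna[s:e] for s, e in find_orfs(dna)])
```

A fuller version would also scan the reverse complement, since an ORF can lie on either strand.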

7.
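Predicting the structural class of non-homologous proteins from their primary sequences   Cited by: 7 (self-citations: 1; citations by others: 7)
Protein structural class prediction algorithms that extract features from amino acid composition, autocorrelation functions, and autocovariance functions are analyzed and compared, and prediction algorithms that combine amino acid composition with autocorrelation functions, and amino acid composition with autocovariance functions, are studied. For non-homologous proteins, the method combining amino acid composition with autocorrelation functions, using the hydrophobicity values of Miyazawa and Jernigan, achieved an overall accuracy of 95.34% in the self-consistency test on the training set, 81.92% in the Jackknife test, and 86.61% in the independent test on the test set. The method combining amino acid composition with autocovariance functions, using the hydrophobicity values of Wold et al., achieved an overall accuracy of 96.71% in the self-consistency test on the training set, 82.18% in the Jackknife test, and 86.88% in the independent test on the test set. These results show that both combined methods effectively improve structural class prediction accuracy, and that extracting more useful sequence information is the key to improving classification accuracy.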
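The feature construction described in entry 7, amino acid composition concatenated with autocorrelation terms of a hydrophobicity profile, might look roughly like the sketch below. The abstract does not give the exact form of the autocorrelation function, so the common definition r(n) = (1 / (N - n)) Σ h(i) h(i + n) is an assumption here, and `TOY_SCALE` is a made-up placeholder, not the Miyazawa and Jernigan values.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def composition(seq):
    """Fraction of each of the 20 standard amino acids in seq."""
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]

def autocorrelation(seq, scale, max_lag):
    """Autocorrelation of the per-residue property profile for lags 1..max_lag."""
    h = [scale[a] for a in seq]
    n = len(h)
    return [
        sum(h[i] * h[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]

def features(seq, scale, max_lag):
    """Concatenate the 20 composition values with max_lag autocorrelation values."""
    return composition(seq) + autocorrelation(seq, scale, max_lag)

# Hypothetical property values for three residues, for illustration only.
TOY_SCALE = {"A": 1.0, "L": 2.0, "G": 0.0}
vec = features("ALGA", TOY_SCALE, max_lag=2)
print(len(vec), vec[-2:])
```

An autocovariance variant would subtract the mean of the profile from each h(i) before forming the products.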

8.
吴琳琳  徐硕 《生物信息学》2010,8(3):187-190
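Protein structure prediction is one of the most important problems in modern computational biology, and secondary structure prediction is the foundation for predicting higher-order protein structure. Many methods for predicting protein secondary structure exist, among which SVM methods have achieved relatively high accuracy. This paper focuses on the steps for using SVM to predict protein secondary structure and on the points to note when comparing it with other methods, providing reference and inspiration for further research.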

9.
梁启浩  李阳  唐旭清 《病毒学报》2017,33(3):313-319
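Based on the classical HP model, this paper uses the discrete Fourier transform to extract protein features and hierarchical clustering to analyze the structure of protein sequences. The aim is to combine automatic signal spectrum analysis with hierarchical clustering and apply the combination to protein sequence structure analysis. Experiments on influenza virus HA and NA protein sequences show that the method gives very good classification results. This work provides a basis for automatic information extraction and structural analysis of protein sequences based on big data.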
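The feature-extraction step in entry 9, HP encoding followed by a discrete Fourier transform, can be sketched as below. This is only an illustration: the abstract does not say which residues the paper treats as hydrophobic, so the `HYDROPHOBIC` set is an assumption, and the clustering step is omitted.

```python
import cmath

# Assumed hydrophobic (H) residues; every other residue is treated as polar (P).
HYDROPHOBIC = set("AVLIMFWC")

def hp_encode(seq):
    """Map a protein sequence to a binary HP signal (1 = hydrophobic, 0 = polar)."""
    return [1 if a in HYDROPHOBIC else 0 for a in seq]

def power_spectrum(signal):
    """Squared magnitude of each DFT coefficient of the signal (naive O(n^2) DFT)."""
    n = len(signal)
    spectrum = []
    for k in range(n):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        spectrum.append(abs(coeff) ** 2)
    return spectrum

signal = hp_encode("ALKD")
spectrum = power_spectrum(signal)
print(signal, [round(p, 6) for p in spectrum])
```

Spectra of sequences with different lengths are not directly comparable, so a clustering step would need some length normalization first; the abstract does not describe how the paper handles this.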

10.
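Knowledge-based protein structure prediction   Cited by: 5 (self-citations: 0; citations by others: 5)
This review covers knowledge-based methods for predicting protein three-dimensional structure and their progress in recent years. At present there are two main classes of knowledge-based structure prediction methods. The first is homology modeling, a relatively mature technique whose models are fairly reliable but which applies only to target sequences with relatively high homology. The second is protein inverse folding, which mainly comprises the 3D profile method and methods based on potential functions; it yields the spatial fold of the target protein and is mainly useful for predicting the structures of proteins with low sequence homology.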

11.
A novel approach for evaluation of sequence relatedness via a network over the sequence space is presented. This relatedness is quantified by graph theoretical techniques. The graph is perceived as a flow network, and flow algorithms are applied. The number of independent pathways between nodes in the network is shown to reflect structural similarity of corresponding protein fragments. These results provide an appropriate parameter for quantitative estimation of such relatedness, as well as reliability of the prediction. They also demonstrate a new potential for sequence analysis and comparison by means of the flow network in the sequence space.

12.
闫化军  章毅 《生物信息学》2004,2(4):19-24,41
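Using a BP network with an added competitive layer, the prediction of domain structural class from protein secondary structure content was studied. Embedding a competitive layer in the BP network significantly improved prediction performance. With only a small training set and a simple network architecture, high prediction accuracies were obtained: self-consistency accuracy 97.62%, jack-knife test accuracy 97.62%, and average extrapolation accuracy 90.74%. Given more complete feature vectors for domain structural classes and a more representative training set, the method will provide a new classification benchmark for the field of protein domain structure classification.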

13.
Campagna A  Serrano L  Kiel C 《FEBS letters》2008,582(8):1231-1236
Determining protein interaction networks and generating models to simulate network changes in time and space are crucial for understanding a biological system and for predicting the effect of mutants found in diseases. In this review we discuss the great potential of using structural information together with computational tools towards reaching this goal: the prediction of new protein interactions, the estimation of affinities and kinetic rate constants between protein complexes, and finally the determination of which interactions are compatible with each other and which interactions are exclusive. The latter will be important for reorganizing large-scale networks into functional modular networks.

14.
艾亮  冯杰 《生物信息学》2023,21(3):179-186
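This paper proposes a new, fast, alignment-free method for protein sequence similarity and evolutionary analysis. To characterize a protein sequence, 10 physicochemical properties of the amino acids are first condensed into 6 principal components by principal component analysis, and the principal component scores are averaged using the number of each amino acid in the sequence as weights. Amino acid position information is then incorporated to form a 26-dimensional feature vector for the protein sequence. Finally, Euclidean distance is used to measure similarity and evolutionary relationships between protein sequences. Tests on 3 protein sequence datasets show that the method clusters every protein sequence correctly and is simple and fast, demonstrating its effectiveness.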
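Two steps of the pipeline in entry 14 are easy to illustrate: the count-weighted average of per-residue principal component scores, and the Euclidean distance between the resulting vectors. The sketch below assumes per-residue scores are already available; `TOY_SCORES` uses two made-up components for two residues rather than the paper's 6 components, and the positional features that bring the vector to 26 dimensions are not reproduced because the abstract does not define them.

```python
import math
from collections import Counter

def weighted_mean_scores(seq, pc_scores):
    """Average the per-residue component scores, weighted by residue counts in seq."""
    counts = Counter(seq)
    n_components = len(next(iter(pc_scores.values())))
    return [
        sum(counts[a] * pc_scores[a][k] for a in counts) / len(seq)
        for k in range(n_components)
    ]

# Hypothetical scores for illustration only (2 components, 2 residues).
TOY_SCORES = {"A": [1.0, 0.0], "K": [0.0, 2.0]}

v1 = weighted_mean_scores("AAK", TOY_SCORES)
v2 = weighted_mean_scores("AKK", TOY_SCORES)
distance = math.dist(v1, v2)  # Euclidean distance, Python 3.8+
print(v1, v2, round(distance, 6))
```

A pairwise distance matrix built this way could then be passed to any standard clustering or tree-building procedure.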

15.
Wolff K  Vendruscolo M  Porto M 《Gene》2008,422(1-2):47-51
We discuss a computational approach for reconstructing the native structures of proteins from the knowledge of a structural profile - the first eigenvector of the contact map of the native structure itself. The procedure consists in carrying out Monte Carlo simulations of a tube model of the protein structure with an energy bias towards the target structural profile. We present the reconstruction of two small proteins and address problems arising in the reconstruction of larger proteins. Our results indicate that an accurate physico-chemical energy function should be used in conjunction with the structural profile bias in order to achieve accurate reconstructions.

16.
Gao QB  Wang ZZ  Yan C  Du YH 《FEBS letters》2005,579(16):3444-3448
To understand the structure and function of a protein, an important task is to know where it occurs in the cell. Thus, a computational method for properly predicting the subcellular location of proteins would be significant in interpreting the original data produced by the large-scale genome sequencing projects. The present work tries to explore an effective method for extracting features from protein primary sequence and find a novel measurement of similarity among proteins for classifying a protein to its proper subcellular location. We considered four locations in eukaryotic cells and three locations in prokaryotic cells, which have been investigated by several groups in the past. A combined feature of primary sequence defined as a 430D (dimensional) vector was utilized to represent a protein, including 20 amino acid compositions, 400 dipeptide compositions and 10 physicochemical properties. To evaluate the prediction performance of this encoding scheme, a jackknife test based on nearest neighbor algorithm was employed. The prediction accuracies for cytoplasmic, extracellular, mitochondrial, and nuclear proteins in the former dataset were 86.3%, 89.2%, 73.5% and 89.4%, respectively, and the total prediction accuracy reached 86.3%. For cytoplasmic, extracellular, and periplasmic proteins in the latter dataset, the prediction accuracies were 97.4%, 86.0%, and 79.7%, respectively, and a total prediction accuracy of 92.5% was achieved. The results indicate that this method outperforms some existing approaches based on amino acid composition or amino acid composition and dipeptide composition.
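The encoding and evaluation scheme in entry 16 can be sketched in a few lines. The block below builds only the first 420 of the 430 dimensions (20 amino acid compositions plus 400 dipeptide compositions), since the abstract does not specify the 10 physicochemical properties, and it uses plain Euclidean distance for the nearest-neighbor jackknife (leave-one-out) test, which is an assumption because the paper describes its own similarity measure. The toy sequences and labels are hypothetical.

```python
import math
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]

def encode(seq):
    """Return a 420-dimensional vector: 20 composition + 400 dipeptide frequencies."""
    comp = [seq.count(a) / len(seq) for a in AMINO_ACIDS]
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]  # overlapping dipeptides
    dip = [pairs.count(d) / len(pairs) for d in DIPEPTIDES]
    return comp + dip

def jackknife_accuracy(vectors, labels):
    """Leave-one-out accuracy of a 1-nearest-neighbor classifier."""
    correct = 0
    for i, v in enumerate(vectors):
        others = [j for j in range(len(vectors)) if j != i]
        nearest = min(others, key=lambda j: math.dist(v, vectors[j]))
        correct += labels[nearest] == labels[i]
    return correct / len(vectors)

# Hypothetical toy data for illustration only.
sequences = ["AAAK", "AAKA", "KKKD", "KKDK"]
labels = ["x", "x", "y", "y"]
vectors = [encode(s) for s in sequences]
print(len(vectors[0]), jackknife_accuracy(vectors, labels))
```

Each protein is held out in turn and assigned the label of its closest remaining neighbor, which is what makes the jackknife estimate independent of the held-out sample.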

17.
This paper presents an essentially new method used to construct phylogenetic trees from related amino acid sequences. The method is based on a new distance measure which describes sequence relationships by means of typical steric and physicochemical properties of the amino acids and is advantageous in some essential points. The method was applied to different sets of protein sequences and the results were compared with other well-established methods.

18.
《Biochimie》2013,95(9):1741-1744
In this study, a 12-dimensional feature vector is constructed to reflect the general contents and spatial arrangements of the secondary structural elements of a given protein sequence. Among the 12 features, 6 novel features are specially designed to improve the prediction accuracies for α/β and α + β classes based on the distributions of α-helices and β-strands and the characteristics of parallel β-sheets and anti-parallel β-sheets. To evaluate our method, the jackknife cross-validation test is employed on two widely-used datasets, 25PDB and 1189 datasets with sequence similarity lower than 40% and 25%, respectively. Our method outperforms the recently reported methods in most cases, and the 6 newly-designed features have a significant positive effect on the prediction accuracies, especially for α/β and α + β classes.

19.
To find out the evolutionary relationships among different tRNA sequences of 21 amino acids, 22 networks are constructed. One is constructed from whole tRNAs, and the other 21 networks are constructed from the tRNAs which carry the same amino acids. A new method is proposed such that the alignment scores of any two amino acids groups are determined by the average degree and the average clustering coefficient of their networks. The anticodon feature of isolated tRNA and the phylogenetic trees of 21 group networks are discussed. We find that some isolated tRNA sequences in 21 networks still connect with other tRNAs outside their group, which reflects the fact that those tRNAs might evolve by intercrossing among these 21 groups. We also find that most anticodons among the same cluster are only one base different in the same sites when S ≥ 70, and they stay in the same rank in the ladder of evolutionary relationships. Those observations seem to agree that some tRNAs might mutate from the same ancestor sequences based on point mutation mechanisms.

20.
Ding S  Zhang S  Li Y  Wang T 《Biochimie》2012,94(5):1166-1171
Knowledge of structural classes plays an important role in understanding protein folding patterns. In this paper, features based on the predicted secondary structure sequence and the corresponding E–H sequence are extracted. Then, an 11-dimensional feature vector is selected based on a wrapper feature selection algorithm and a support vector machine (SVM). Among the 11 selected features, 4 novel features are newly designed to model the differences between α/β class and α + β class, and the other 7 rational features were proposed by previous researchers. To examine the performance of our method, a total of 5 datasets are used to design and test the proposed method. The results show that competitive prediction accuracies can be achieved by the proposed method compared to existing methods (SCPRED, RKS-PPSC and MODAS), and the 4 new features are shown to be essential for differentiating α/β and α + β classes. A standalone version of the proposed method is written in Java and can be downloaded from http://web.xidian.edu.cn/slzhang/paper.html.
