Similar literature
1.
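Bioinformatics methods were used to predict and analyze the homology, physicochemical properties, conserved domains, subcellular localization, signal peptide, transmembrane domains, hydrophilicity/hydrophobicity, and secondary structure of the Bactrian camel prochymosin gene and its deduced amino acid sequence. The open reading frame of the gene is 1,146 bp long and encodes 381 amino acids. The protein belongs to the pepsin A superfamily and is predicted to be a stable hydrophilic protein localized to the endoplasmic reticulum (membrane); it has a 16-amino-acid signal peptide and no transmembrane domain. Random coil is the most abundant element of its secondary structure, with α-helices and extended strands distributed throughout the protein. Active-site analysis indicated that the encoded protein has six classes of active sites. Characterizing the Bactrian camel prochymosin gene and its encoded protein provides a theoretical basis for further studies on the expression and milk-clotting properties of Bactrian camel chymosin.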
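The ORF length and residue count reported in this abstract are linked by simple arithmetic, which the minimal sketch below illustrates. It assumes the reported ORF length includes the stop codon (the abstract does not state this explicitly); the function name `residues_from_orf` is hypothetical.

```python
def residues_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an ORF of orf_bp nucleotides.

    Assumption: when includes_stop is True, the final codon is a stop codon
    and therefore contributes no residue.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# 1,146 bp / 3 = 382 codons; dropping the stop codon leaves 381 residues.
print(residues_from_orf(1146))
```

Under the same assumption, the other ORF lengths quoted in this list (for example 1,140 bp for 379 residues) follow the same relation.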

2.
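Thirteen complete casbene synthase (CS; EC 4.6.1.7) gene sequences deposited in GenBank from Jatropha curcas, leafy spurge (Euphorbia esula), castor bean (Ricinus communis), and Chinese tallow tree were analyzed with bioinformatics methods to predict their nucleotide and amino acid sequences, composition, transit peptides, signal peptides, transmembrane domains, hydrophobicity/hydrophilicity, secondary and tertiary protein structure, and functional domains. The ORFs of the 13 CS genes range from 1,647 to 1,845 bp, the protein molecular weights range from 63.0 to 70.8 kDa, the stop codon is TGA or TAA, and the theoretical isoelectric points are all below 7.0, indicating that the CS proteins are acidic. Leucine is the most abundant amino acid in every case. Nucleotide homology comparison showed that the CS genes fall into two main groups. Transit peptide prediction found that six of the CS proteins carry a transit peptide, in every case a chloroplast transit peptide. Signal peptide and transmembrane domain prediction found neither in these CS proteins, and the polypeptide chains are hydrophilic overall. The main secondary structure element of these CS proteins is the α-helix, and all contain two terpene synthase functional domains. These results provide a theoretical basis for further exploring the function of CS genes.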

3.
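In silico cloning and bioinformatics analysis of the grape alcohol dehydrogenase III gene   Cited: 1 (self-citations: 0; by others: 1)
The grape alcohol dehydrogenase III gene (ADHⅢ) was obtained by in silico cloning, and bioinformatics methods were used to predict and analyze the encoded protein in terms of amino acid composition, physicochemical properties, transmembrane domains, hydrophobicity/hydrophilicity, subcellular localization, higher-order structure, and functional domains. The grape ADHⅢ gene is 1,602 bp long and contains a 1,140 bp ORF encoding 379 amino acids. The protein has no obvious hydrophobic region and no transmembrane domain, and α-helices and random coils are the main components of its secondary structure. Grape ADHⅢ contains an ADH functional domain and is highly similar to the ADHⅢ of other plants in sequence composition, higher-order structure, and active sites.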

4.
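Bioinformatics analysis of plant ferulate 5-hydroxylase   Cited: 1 (self-citations: 0; by others: 1)
Ferulate 5-hydroxylase (F5H) is one of the key enzymes of lignin biosynthesis; it is a cytochrome P450-dependent enzyme that catalyzes hydroxylation of ferulic acid at the 5-position. Bioinformatics methods and tools were used to analyze the nucleotide sequences and deduced amino acid sequences of the F5H genes registered in GenBank for Arabidopsis thaliana, oilseed rape (Brassica napus), poplar (Populus trichocarpa), tomato (Lycopersicon esculentum), alfalfa (Medicago sativa), Camptotheca acuminata, and other plants. The analysis covered composition, post-translational modification of amino acids, transmembrane topology, hydrophobicity/hydrophilicity, and protein secondary structure and functional domains. The results indicate that plant F5H is a hydrophilic protein with a transmembrane domain that resides in the secretory pathway, including the endoplasmic reticulum. α-Helices and random coils are the main elements of its secondary structure, and it carries the characteristic domains and conserved functional domains of the cytochrome P450 family.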

5.
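In silico cloning and bioinformatics analysis of a sucrose invertase gene from oilseed rape   Cited: 2 (self-citations: 1; by others: 1)
Su Ning, Yang Wannian. 《生物信息学》 (Chinese Journal of Bioinformatics), 2013, 11(3): 224-232
The cDNA sequence of a sucrose invertase gene from oilseed rape was obtained by in silico cloning. Primers designed from this sequence were used to amplify the gene from oilseed rape cDNA as template, and the product was confirmed by sequencing. Bioinformatics methods were used to predict and analyze the encoded protein in terms of amino acid composition, basic physicochemical properties, transmembrane domains, signal and transit peptides, hydrophobicity/hydrophilicity, secondary structure, and subcellular localization. The cDNA sequence is 2,150 bp long and contains a 1,779 bp open reading frame encoding 592 amino acids; the encoded protein contains several typical conserved domains of sucrose invertase. Homology alignment showed that the deduced amino acid sequence is highly similar to the sucrose invertases of Arabidopsis and other plants, further confirming that the protein is a sucrose invertase. The results lay a foundation for experimental cloning, expression analysis, and functional characterization of the gene.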

6.
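In this study, bioinformatics methods were used to predict the promoter features, RNA structure, physicochemical properties, transit peptide, signal peptide, transmembrane domains, secondary and tertiary protein structure, and functional domains of the melon ethylene response factor gene ERFⅠ-14 (accession number: MELO3C014441) from the melon database. The open reading frame (ORF) of CmERFⅠ-14 is 645 bp long and encodes 214 amino acids. The protein has a molecular weight of about 23 kDa and a theoretical isoelectric point of 6.59; it has no transit peptide or signal peptide and no transmembrane domain. Random coil is the dominant element of its secondary structure, and it contains one AP2/ERF functional domain. Eight possible interacting proteins and multiple promoter elements were predicted, and no CpG island was found. Phylogenetic analysis showed that the protein is most closely related to the cucumber ERF3-like protein (XP_004140127.1).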

7.
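Bioinformatics methods were used to analyze the nucleotide sequence of the pectin lyase (PNL) gene and the composition, subcellular localization, hydrophobicity/hydrophilicity, and secondary and tertiary structure of its deduced amino acid sequence. The PNL of Aspergillus niger is a stable, acidic, moderately hydrophilic secreted protein with a clear signal peptide and no transmembrane region; its conserved functional domain is Pec_lyase_C. Its secondary structure consists mainly of random coil, and the proteins share a similar three-dimensional structure based on β-sheets.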

8.
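Unlike the highly similar ELOVL7 genes of humans, mice, and other species, the ELOVL7 genes of different pig breeds show low similarity. To explore the characteristics of this gene, bioinformatics methods were used to predict and analyze the homology, physicochemical properties, conserved domains, subcellular localization, signal peptide, transmembrane domains, hydrophilicity/hydrophobicity, secondary structure, function, and phosphorylation sites of the Sutai pig ELOVL7 gene and its amino acid sequence. In the Sutai pig, ELOVL7 is 2,324 bp long with a coding region of 846 bp that encodes 281 amino acids. The protein is structurally stable, has a molecular weight of 33,387.4 Da, and is positively charged and slightly basic. It is most likely located on the cell membrane, and its main predicted function is transport and binding. It is a transmembrane, non-secreted, hydrophobic protein that contains one conserved domain of the GNS1/SUR4 family and has 15 serine, 15 threonine, and 20 tyrosine potential kinase phosphorylation sites. The α-helix is the dominant element of the secondary and tertiary structure of ELOVL7. In addition, the ELOVL7 amino acid sequence is more than 90% similar to that of most species, indicating close phylogenetic relationships. Characterizing the ELOVL7 gene and its amino acid sequence provides a molecular basis for further investigating how mutations in the gene affect long-chain fatty acid phenotypes and the mechanisms of fatty acid synthesis and metabolism.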

9.
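Objective: To analyze the phytoene synthase gene (PSY) and its amino acid sequence with bioinformatics methods and to build a three-dimensional structure. Methods: Bioinformatics methods were used to predict and analyze the physicochemical properties, hydrophilicity/hydrophobicity, signal peptide, transmembrane domains, glycosylation sites, phosphorylation sites, secondary structure, functional domains, and tertiary structure of the phytoene synthase gene and its protein sequence. Results: The PSY gene contains a 1,239 bp open reading frame encoding 412 amino acids, and the product is a basic, unstable protein. Phytoene synthase is rich in Arg, Leu, Ala, Ser, and Val and is a hydrophilic protein. PSY is a non-transmembrane protein without a signal peptide; it has multiple phosphorylation sites, and α-helices and random coils are its main structural elements. Conclusion: A reasonable three-dimensional model was obtained by homology modeling, providing a theoretical basis for raising lycopene yield through bioengineering.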

10.
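The cDNA sequence of the sorghum β-1,3-glucanase gene was used as a probe to search the sugarcane EST database, and the sugarcane β-1,3-glucanase gene ScBG was then assembled by in silico cloning. Bioinformatics methods were used to predict and analyze the encoded protein in terms of amino acid composition, physicochemical properties, transmembrane domains, coiled coils, subcellular localization, signal peptide, functional domains, and higher-order structure. The ScBG gene is 1,270 bp long and contains a complete open reading frame (ORF) of 1,011 bp encoding 336 amino acids, with a molecular weight of 34.8 kDa and a theoretical isoelectric point of 4.98. The protein is very likely an extracellular, elicitor-releasing acidic glucanase and a stable secreted protein, predicted with the highest reliability class (1). It belongs to glycoside hydrolase family 17, has an N-terminal signal peptide, a transmembrane signal region at residues 7 to 29, and a glycoside hydrolase family 17 domain at residues 31 to 321, and contains two main functional domains. Homology analysis of the β-1,3-glucanase amino acid sequences of 10 species showed that the protein encoded by sugarcane ScBG is most similar to that encoded by the sorghum β-1,3-glucanase gene, at 79.82%. These results provide a basis for the subsequent molecular cloning, functional characterization, and application of ScBG.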

11.
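Construction and application of a web interface for the sequence homology analysis software BLAST   Cited: 5 (self-citations: 1; by others: 4)
A web interface for the sequence homology analysis software BLAST was built on a PC/Linux server within a local network (intranet). Every computer on the intranet can access the server through the web to query public and in-house databases. The setup is confidential, efficient, and free, and can handle large-scale, rapid data analysis for laboratories and research institutes.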

12.
We have used synthetic peptides to study a conserved RNA binding motif in yeast poly(A)-binding protein. Two peptides, 45 and 44 amino acids in length, corresponding to amino and carboxyl halves of a 90-amino acid RNA-binding domain in the protein were synthesized. While the amino-terminal peptide had no significant affinity for nucleic acids, the carboxyl-terminal peptide bound nucleic acids with characteristics similar to those of the entire 577-residue yeast poly(A)-binding protein. In 100 mM NaCl, the latter peptide retained over 50% of the intrinsic binding free energy of the protein, as well as similar RNA versus DNA binding specificity. However, shuffling of the sequence of this 44-residue peptide had surprisingly little effect on its nucleic acid binding properties, suggesting the overriding importance of amino acid composition as opposed to primary sequence. Deletion studies on the 44-residue peptide with the "correct" sequence succeeded in identifying amino acids important for conferring RNA specificity and for increasing our understanding of the molecular basis for nucleic acid binding by synthetic peptides. The shuffled peptide study, however, clearly indicates that considerable caution must be exercised before extrapolating results of structure/function studies on synthetic peptide analogues to the parent protein.

13.
Sequence and evolution of guinea pig preproinsulin DNA   Cited: 1 (self-citations: 0; by others: 1)
Guinea pig insulin exhibits an unusually high degree of divergence from the conserved insulins of other mammals. cDNA clones encoding guinea pig preproinsulin were isolated, and their nucleic acid sequences were determined. Comparisons of the nucleic acid sequence and its predicted amino acid sequence with sequences encoding insulins of other species revealed that the gene encoding guinea pig preproinsulin evolved from the same ancestral mammalian gene as other known mammalian insulin genes.

14.
The phylogenetic distribution and structural diversity of the nitric oxide synthases (NOS) remain important issues that are little understood. We present sequence information, as well as phylogenetic analysis, for three NOS cDNAs identified in two non-mammalian species: the vertebrate marine teleost fish Stenotomus chrysops (scup) and the invertebrate echinoderm Arbacia punctulata (sea urchin). Partial gene sequences containing the well-conserved calmodulin (CaM)-binding domain were amplified by RT-PCR. Identical 375-bp cDNAs were amplified from scup brain, heart, liver and spleen; this sequence shares 82% nucleic acid and 91% predicted amino acid identity with the corresponding region of human neuronal NOS. A 387-bp cDNA was amplified from sea urchin ovary and testes; this sequence shares 72% nucleic acid identity and 65% deduced amino acid identity with human neuronal NOS. A second cDNA of 381 bp was amplified from sea urchin ovary and it shares 66% nucleic acid and 57% deduced amino acid identity with the first sea urchin sequence. Together with earlier reports of neuronal and inducible NOS sequences in fish, these data indicate that multiple NOS isoforms exist in non-mammalian species. Phylogenetic analysis of these sequences confirms the conserved nature of NOS, particularly of the calmodulin-binding domains.
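Percent identity figures such as those quoted in this abstract are computed from aligned sequences. The sketch below is one common convention, not necessarily the one the authors used: it assumes the two sequences are already aligned to equal length and that gapped columns are excluded from the denominator. The function name `percent_identity` is hypothetical.

```python
def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned, equal-length sequences.

    Assumption: columns where either sequence has a gap are skipped,
    so the denominator is the number of ungapped aligned positions.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * matches / compared


# Three of four aligned positions match.
print(percent_identity("ATGC", "ATGA"))
```

Other conventions divide by the full alignment length or by the shorter sequence, so identity values are only comparable when the same convention is used.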

15.
Using PC/GENE for protein and nucleic acid analysis   Cited: 4 (self-citations: 0; by others: 4)
This paper describes a series of protein analyses using the molecular biology software package PC/GENE, which runs on an IBM or compatible microcomputer. A nucleic acid sequence was first edited and then translated into an amino acid sequence. The amino acid composition, isoelectric point, molecular weight, and other properties of the sequence were determined. Programs to predict secondary structure, alpha helix membrane associations, hydrophobic and hydrophilic regions, and surface and antigenic sites from the amino acid sequence were also used. A search was made in a data base for sequences containing a region similar to a region in the protein sequence. Sequence alignments and queries of data bases can also be performed.
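Two of the analyses listed in this abstract, amino acid composition and hydrophobic/hydrophilic region prediction, can be sketched in a few lines. This is an illustration only: the abstract does not say which hydropathy scale PC/GENE uses, so the sketch swaps in the widely used Kyte-Doolittle scale, and the function names and the window size of 9 are assumptions.

```python
from collections import Counter

# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def composition(protein: str) -> dict:
    """Fraction of each residue type in the sequence."""
    counts = Counter(protein.upper())
    total = sum(counts.values())
    return {aa: n / total for aa, n in counts.items()}


def hydropathy_profile(protein: str, window: int = 9) -> list:
    """Mean Kyte-Doolittle score over each sliding window of residues.

    Raises KeyError on residues outside the 20 standard amino acids.
    """
    scores = [KYTE_DOOLITTLE[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


print(composition("AAGG"))
print(hydropathy_profile("LLLLKKKK", window=4))
```

Windows with a high mean score suggest hydrophobic stretches, and strongly negative windows suggest hydrophilic ones.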

16.
The mRNA sequence for bovine lactoferrin expressed in the mammary gland was determined by sequencing three overlapping cDNA clones and by direct sequencing of the mRNA. The mRNA (2351 bases) codes for a 708 amino acid protein with a 19 amino acid signal peptide immediately preceding a sequence identical to the N-terminal 40 amino acids reported for bovine lactoferrin. A putative destabilizing sequence (AUUUA) was identified in the 3'-untranslated region. The nucleic acid sequence and deduced amino acid sequence are highly homologous with other transferrin family members. Lactoferrin mRNA concentrations in bovine mammary tissue were quite low two days before parturition and during lactation but were high three days after the cessation of milking, a sharp contrast from the pattern of regulation of the other milk proteins.

17.
The amino acid sequence of the small subunit of ribulose-1,5-bisphosphate carboxylase from pea consists of a single polypeptide chain of 123 residues with a calculated MW of ca 14 480. The N-terminus was 'ragged' and both methionine and glutamine were determined in residue position 1. No heterogeneity was found even though two isofocussing variants were observed. The amino acid sequence confirms the nucleic acid sequence of cDNA of mRNA determined independently.

18.
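This paper reports a series of working programs for processing nucleic acid data on the Apple II microcomputer. The programs support storage of nucleic acid data, modification of specified nucleic acid data structures, searching for restriction endonuclease recognition sites, translation of nucleic acid sequences into protein sequences, homology comparison of related nucleic acid and protein sequences, codon usage frequency statistics, and preliminary exploration of gene promoter structure.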
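Three of the tasks listed in this abstract (restriction site search, translation, and codon usage statistics) can be sketched as follows. This is a modern illustration under stated assumptions, not the original Apple II programs: it assumes the standard genetic code, the forward strand, and reading frame 0, and all function names are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def codons(dna: str) -> list:
    """Split a sequence into complete codons, starting at position 0."""
    seq = dna.upper()
    return [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]


def translate(dna: str) -> str:
    """Translate until the first stop codon; unknown codons become 'X'."""
    protein = []
    for codon in codons(dna):
        aa = CODON_TABLE.get(codon, "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def find_sites(dna: str, site: str) -> list:
    """0-based start positions of a recognition site (overlaps allowed)."""
    seq, site = dna.upper(), site.upper()
    hits = []
    pos = seq.find(site)
    while pos != -1:
        hits.append(pos)
        pos = seq.find(site, pos + 1)
    return hits


def codon_usage(dna: str) -> Counter:
    """Count how often each codon occurs in frame 0."""
    return Counter(codons(dna))


print(translate("ATGGCCTAA"))
print(find_sites("AAGAATTCGGGAATTC", "GAATTC"))  # GAATTC is the EcoRI site
print(codon_usage("ATGATGGCC"))
```

Because `find_sites` scans only the given strand, a non-palindromic recognition site would also need a search on the reverse complement.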

19.
Murine leukemia viruses contain a low molecular weight basic protein, designated p10, which binds to single-stranded nucleic acids. The complete amino acid sequence of p10 from the Rauscher strain of virus has been determined. The partial amino acid sequences of p10s from Moloney, Friend, AKR, Gross, radiation leukemia, and BALB/2 viral strains have also been determined using microsequencing techniques. Rauscher p10 is composed of 56 amino acid residues; the other p10s are similar in size but differ from Rauscher by a few conservative amino acid substitutions. The structure of Rauscher p10 was compared to the structure of a functionally homologous protein from Rous avian sarcoma virus. The comparison revealed regions of amino acid sequence homologies which indicate a phylogenetic relationship between the murine and avian viral strains. The analyses revealed a periodic placement of three Cys residues and a Gly-His sequence. A structure involving these residues is found once in the murine protein and twice in the avian protein. A similar structure is seen in the single-stranded nucleic acid binding protein of bacteriophage T4. However, in the latter case, the order of amino acid residues is inverted.

20.
The complete cDNA nucleic acid sequence of preproapolipoprotein (apo) A-II, a major protein constituent of high density lipoproteins, has been determined on clones from a human liver ds-cDNA library. Clones containing ds-cDNA for apoA-II were identified in the human liver ds-cDNA library using synthetic oligonucleotides as probes. Of 3200 clones screened, 4 reacted with the oligonucleotide probes. The DNA sequence coding for amino acids -17 to +17 of apoA-II was determined by Maxam-Gilbert sequence analysis of restriction fragments isolated from one of these clones, pMDB2049. The remainder of the cDNA sequence was established by sequence analysis of a primer extension product synthesized utilizing a restriction fragment near the 5'-end of clone pMDB2049 as primer with total liver mRNA. The apoA-II mRNA encodes a 100 amino acid protein, preproapoA-II, that has an 18 amino acid prepeptide and a 5 amino acid propeptide terminating with a basic dipeptide (Arg-Arg) at the cleavage site to mature apoA-II.
