首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Analysis of sequence requirements for protein tyrosine sulfation.   总被引:5,自引:0,他引:5       下载免费PDF全文
We analyzed sequences surrounding known tyrosine sulfation sites to determine the characteristics that distinguish these sites from those that do not undergo sulfation. Tests evaluated the number and position of acidic, basic, hydrophobic, and small amino acids, as well as disulfide and N-glycosylation (sugar) sites. We determined that composition-based tests that select close to 100% of known tyrosine sulfation sites reject 97% of the non-sulfated tyrosines. The acidic test, by far the most selective, eliminated 95% of the non-sulfated tyrosine residues and none of the sulfated tyrosines. Including the basic, hydrophobic, and disulfide tests increased the elimination rate to 97%. Whereas no position flanking the tyrosine residues had the same amino acid always present, imperfectly conserved amino acids found in some positions will improve the specificity of the tests.  相似文献   

2.
The lipocalins and fatty acid-binding proteins (FABPs) are two recently identified protein families that both function by binding small hydrophobic molecules. We have sought to clarify relationships within and between these two groups through an analysis of both structure and sequence. Within a similar overall folding pattern, we find large parts of the lipocalin and FABP structures to be quantitatively equivalent. The three largest structurally conserved regions within the lipocalin common core correspond to characteristic sequence motifs that we have used to determine the constitution of this family using an iterative sequence analysis procedure. This afforded a new interpretation of the family, which highlighted the difficulties of determining a comprehensive and coherent classification of the lipocalins. The first of the three conserved sequence motifs is also common to the FABPs and corresponds to a conserved structural element characteristic of both families. Similarities of structure and sequence within the two families suggests that they form part of a larger "structural superfamily"; we have christened this overall group the calycins to reflect the cup-shaped structure of its members.  相似文献   

3.
Does the amino acid use at the terminal positions of an α‐helix become altered depending on the context—more specifically, when there is an adjoining 310‐helix, and can a single helical cylinder encompass the resultant composite helix? An analysis of 138 and 107 cases of 310–α and α–310 composite helices, respectively, found in known protein structures indicate that the secondary structural element occurring first imposes its characteristics on the sequence of the structural element coming next. Thus, when preceded by a 310‐helix, the preference of proline to occur at the N1 position of an α‐helix is shifted to the N2 position, a typical characteristic of the C‐terminal capping of the 310‐helix. When an α‐ or a 310‐helix leads into a helix of the other type, there is a bend at the junction, especially for the 310–α composite, with the two junction residues facing inward and buried within the structure. Thus a single helical cylinder may not properly represent a composite helix, the bend providing a means for the tertiary structure to assume a globular shape, very much akin to what a proline‐induced kink does to an α‐helix. The tertiary structural context in which β–310 and 310–β composites occurs can be different, causing the angle between the secondary structural elements in the two cases to be different. Composites of 310‐helices and β‐strands are much more conserved among members in families of homologous structures than those between two types of helices; in many of the former instances, the 310‐helix constitutes the loops in β‐hairpin or β–β‐corner motifs. The overall fold of the chain may be more conserved than the actual identify of the secondary structure elements in a composite. © 2005 Wiley Periodicals, Inc. Biopolymers 78: 147–162, 2005 This article was originally published online as an accepted preprint. The “Published Online” date corresponds to the preprint version. You can request a copy of the preprint by emailing the Biopolymers editorial office at biopolymers@wiley.com  相似文献   

4.
5.
6.
We have recently developed a fast approach to comparisons of 3-dimensional structures. Our method is unique, treating protein structures as collections of unconnected points (atoms) in space. It is completely independent of the amino acid sequence order. It is unconstrained by insertions, deletions, and chain directionality. It matches single, isolated amino acids between 2 different structures strictly by their spatial positioning regardless of their relative sequential position in the amino acid chain. It automatically detects a recurring 3D motif in protein molecules. No predefinition of the motif is required. The motif can be either in the interior of the proteins or on their surfaces. In this work, we describe an enhancement over our previously developed technique, which considerably reduces the complexity of the algorithm. This results in an extremely fast technique. A typical pairwise comparison of 2 protein molecules requires less than 3 s on a workstation. We have scanned the structural database with dozens of probes, successfully detecting structures that are similar to the probe. To illustrate the power of this method, we compare the structure of a trypsin-like serine protease against the structural database. Besides detecting homologous trypsin-like proteases, we automatically obtain 3D, sequence order-independent, active-site similarities with subtilisin-like and sulfhydryl proteases. These similarities equivalence isolated residues, not conserving the linear order of the amino acids in the chains. The active-site similarities are well known and have been detected by manually inspecting the structures in a time-consuming, laborious procedure. This is the first time such equivalences are obtained automatically from the comparison of full structures. The far-reaching advantages and the implications of our novel algorithm to studies of protein folding, to evolution, and to searches for pharmacophoric patterns are discussed.  相似文献   

7.
Ribosomal protein S 13 gene has been cloned and analyzed in many organisms,but there are few documents relating to insects. In this communication, the full-length cDNA sequence of ribosomal protein S 13 gene in the diamondback moth, Plutella xylostella(Lepidoptera: Plutellidae), was determined by using PCR amplification technique. The features of the ribosomal protein S 13 gene sequence were analyzed and the deduced amino acids sequence was compared with those from other insects. The results of multi-alignment of the amino acid sequences between the diamondback moth and other insect species revealed that this gene sequence is highly conserved in insects. Based on maximum likelihood method, a phylogenetic tree was constructed from 10 different species using PHYLIP software. It showed that nematode is one separate lineage and the five insect speciesbe long to another lineage, whereas those species higher than insects form the third one. The pattern of this phylogenetic tree evidently represented the evolution of different species.  相似文献   

8.
In this paper, we present an updated classification of the ubiquitous MIP (Major Intrinsic Protein) family proteins, including 153 fully or partially sequenced members available in public databases. Presently, about 30 of these proteins have been functionally characterized, exhibiting essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol facilitators. Sequence alignments were used to predict amino acids and motifs discriminant in channel specificity. The protein sequences were also analyzed using statistical tools (comparisons of means and correspondence analysis). Five key positions were clearly identified where the residues are specific for each functional subgroup and exhibit high dissimilar physico-chemical properties. Moreover, we have found that the putative channels for small neutral solutes clearly differ from the aquaporins by the amino acid content and the length of predicted loop regions, suggesting a substrate filter function for these loops. From these results, we propose a signature pattern for water transport.  相似文献   

9.
Structural analysis of oligomycin sensitivity-conferring protein (OSCP) revealed repeating sequences (residues 1-89, 105-190) suggesting an evolution of the protein by gene duplication. In addition to the reported homology with the delta-subunit of Escherichia coli F1ATPase, OSCP also shows a certain homology with the b-subunit of E. coli F0 and the ADP/ATP carrier of mitochondria.  相似文献   

10.
The aim of this study was to develop simple sequence repeat (SSR) markers for Dendrobium varieties/species, many of which have medicinal and horticultural values. Two genomic DNA libraries of Dendrobium Sonia enriched with GA repeats and CA repeats were constructed. Fourteen polymorphic SSR markers were identified when screened against 42 popular commercial Dendrobium hybrids. The average allele number was 12.0 ± 1.9 and the observed heterozyosity was averaged at 0.70. All 42 hybrids tested, except for two tissue culture mutants, were uniquely identified with the markers used. Sibling hybrids were closely clustered. Hybrids were also closer to parents. These SSR markers can be used for molecular ecology research, genetic mapping and marker‐assisted breeding. They can also help protection for new Dendrobium varieties.  相似文献   

11.
烟实夜蛾脂肪酸结合蛋白基因的克隆、序列分析与表达   总被引:1,自引:0,他引:1  
张涛  安世恒  尹新明 《昆虫学报》2007,50(5):528-533
应用RT-PCR技术,从烟实夜蛾Helicoverpa assulta幼虫脂肪体组织和血细胞总RNA中反转录扩增脂肪酸结合蛋白(fatty-acid binding protein,FABP)基因的cDNA片段,克隆到原核表达载体pGEX-4T-2上,转化大肠杆菌BL21(DE3),用IPTG进行诱导表达并进行检测。结果表明:扩增得到的片段全长399 bp(GenBank登录号为DQ299942),编码132个氨基酸残基,预测分子量15.0 kD,等点电5.83。FABP融合了GST。原核表达后经电泳检测到约41 kD大小的外源蛋白,Western blot检测表明是目的蛋白。  相似文献   

12.
王玲  江幸福  罗礼智  胡毅  张蕾 《昆虫知识》2012,49(6):1439-1447
昆虫中肠膜上的类钙粘蛋白(cadherin-like protein)是Bt毒素的一类重要受体,它与Bt毒素对昆虫的杀虫作用机制以及昆虫对Bt毒素的抗性等密切相关。本研究根据已报道的其它昆虫的类钙粘蛋白基因的保守区设计简并引物,应用RT-PCR和RACE技术克隆了迁飞性重要害虫甜菜夜蛾Spodoptera exigua(Hübner)类钙粘蛋白基因全长cDNA序列(命名为SeCAD,GenBank登录号为HQ647122),全长5582bp,编码1739个氨基酸,推导的氨基酸序列包括1个信号肽、1个前蛋白区、11个钙粘蛋白重复、1个近膜区、1个跨膜区和1个胞质区。预测的分子量和等电点分别为196.447ku和4.47。该蛋白与同科的大螟Sesamia inferens、蛀茎夜蛾S.nonagrioides、烟芽夜蛾Heliothi svirescens、烟夜蛾Helicover paassulta亲缘关系更近,氨基酸序列一致性分别为61.28%、60.34%、60.14%、60.08%。这些结果对于揭示转Bt基因作物对非靶标、迁飞性甜菜夜蛾的杀虫作用机制以及评价其潜在的对Bt毒素抗性机制等奠定了基础。  相似文献   

13.
Pánek J  Eidhammer I  Aasland R 《Proteins》2005,58(4):923-934
Structural similarity among proteins is reflected in the distribution of hydropathicity along the amino acids in the protein sequence. Similarities in the hydropathy distributions are obvious for homologous proteins within a protein family. They also were observed for proteins with related structures, even when sequence similarities were undetectable. Here we present a novel method that employs the hydropathy distribution in proteins for identification of (sub)families in a set of (homologous) proteins. We represent proteins as points in a generalized hydropathy space, represented by vectors of specifically defined features. The features are derived from hydropathy of the individual amino acids. Projection of this space onto principal axes reveals groups of proteins with related hydropathy distributions. The groups identified correspond well to families of structurally and functionally related proteins. We found that this method accurately identifies protein families in a set of proteins, or subfamilies in a set of homologous proteins. Our results show that protein families can be identified by the analysis of hydropathy distribution, without the need for sequence alignment.  相似文献   

14.
Apidermin (APD)蛋白家族是一个新的昆虫结构性表皮蛋白家族。本研究结合生物信息学和RT-PCR扩增, 对意大利蜜蜂Apis mellifera ligustica(简称“意蜂”)的apd-1-like, apd-3-like和中华蜜蜂Apis cerena cerena(简称“中蜂”)的apd-2 等3个新的apd基因的结构特征和表达进行了分析, 并分析了昆虫APD蛋白家族的序列特征。结果显示, 在西方蜜蜂Apis mellifera(简称“西蜂”)中, apd基因家族的6个成员串联排列在基因组序列第4号连锁群上, 它们在A. m. ligustica雄蜂头部中的转录水平差异明显, 且其启动子序列所含顺式元件也不同。中蜂apd-2和意蜂apd-1-like都含有3个外显子和2个内含子, 而意蜂apd-3-like则由4个外显子和3个内含子组成。蛋白序列分析结果显示, 目前已知的10条APD蛋白序列N末端均具有相似的信号肽序列, 其成熟蛋白分子量为6.0~37.0 kD, pI为6.2~10.8。其中西蜂的APD1-3、APD-like和东方蜜蜂Apis cerena的APD-2等5条较短的多肽中疏水氨基酸残基达52%~67%, 且Ala含量最为丰富(占25%~34%); 而丽蝇蛹集金小蜂Nasonia vitripennis的APD 1-3和西蜂APD-1-like, APD-3-like等另外5条APD多肽富含Gly(21%~30%), 其序列中疏水氨基酸残基含量为35%~41%。多肽序列多重比对和系统进化分析结果显示, APD家族可划分为2个亚家族。亚家族Ⅰ含有西蜂APD 1-3和东方蜜蜂APD-2等4条较短的多肽序列, 其N末端为一个长33 aa的保守基序; 亚家族Ⅱ由另外6条相对较长的多肽序列组成, 其N末端保守基序长50 aa, C末端保守基序长16 aa。本文所描述的APD蛋白家族序列特征有助于以后从其他昆虫中鉴定新的apd基因。  相似文献   

15.
王玲  江幸福  罗礼智  胡毅  张蕾 《昆虫学报》2011,54(10):1094-1103
昆虫中肠膜类钙粘蛋白(cadherin-like protein, CLP)是苏云金芽孢杆菌Bacillus thuringiensis (Bt)毒素的重要受体之一, 与Bt毒素的杀虫作用机制以及昆虫对Bt毒素的抗性等密切相关。本研究应用RT-PCR和RACE技术, 克隆了迁飞性重要害虫粘虫Mythimna separata类钙粘蛋白基因全长cDNA序列(命名为Msclp, GenBank登录号为JF951432), 全长5 642 bp, 编码1 757个氨基酸, 推导的氨基酸序列具有昆虫类钙粘蛋白的特征结构, 包括1个信号肽、 1个前蛋白区、 12个钙粘蛋白重复、 1个近膜区、 1个跨膜区和1个胞质区。预测的分子量和等电点分别为196.786 kD和4.5。该蛋白与同科的烟夜蛾Helicoverpa assulta、 烟芽夜蛾Heliothis virescens、 棉铃虫Helicoverpa armigera及甜菜夜蛾Spodoptera exigua的类钙粘蛋白亲缘关系最近, 氨基酸序列一致性分别为61.77%, 61.66%, 61.26%和58.14%。荧光定量RT-PCR分析表明, 该类钙粘蛋白基因在不同龄期的粘虫幼虫中表达量差异极显著(P<0.01), 其中4龄幼虫相对表达量最高, 但与3龄、 6龄幼虫并没有显著性差异; 1和2龄幼虫表达量最低; 表达部位主要在粘虫中肠, 在中肠以外的虫体其他部位表达量极低。这些结果对于揭示转Bt作物对非靶标、 迁飞性粘虫的杀虫作用机制以及评价其潜在的对Bt毒素抗性机制等奠定了基础。  相似文献   

16.
17.
Restriction fragment length polymorphisms of 6 base pair recognising endonucleases are used to reconstruct the phylogeny of the endemic Canary Island lacertid, Gallotia. The division into conventional species is upheld by this molecular analysis and the western Canary Island lizard (G. galloti) and eastern Canary Island lizard (G. atlantica) are hypothesized to be sister species. A more comprehensive study of the intraspecific relationships of G. galloti, based on nineteen restriction enzymes, indicates that there are distinct southern and northern lineages within this species. The phylogenetic analysis does not uphold the conventional subspecies, but suggests an alternative arrangement with one northern (La Palma, Tenerife) and one southern (Gomera, Hierro) subspecies. The inferred timing of molecular divergence of populations of G. galloti, based on RFLP analysis, is compatible with the geological timing for island origin and fossil data. Mantel tests show that mitochondrial RFLP divergence is correlated with mitrochondrial 12s rRNA and cytochrome oxidase I sequence divergence and highly correlated with mitochondrial cytochrome b sequence divergence.  相似文献   

18.
The nucleotide sequence of the gene encoding the cellulose-binding protein A (CBPA) of Eubacterium cellulosolvens 5 was determined. The gene consists of an open reading frame of 3453 nucleotides and encodes a protein of 1151 amino acids with a molecular mass of 126408 Da. The deduced amino acid sequence of CBPA contained one domain highly similar to a catalytic domain of glycosyl hydrolases belonging to family 9, two linker-like domains and four domains of unknown function. Among the four domains of unknown function, the domains 1 and 2 region had significant homology in amino acid sequence with the cellulose-binding domains in the family 9 glycosyl hydrolases. The cloned gene was inserted into an expression vector, pBAD-TOPO, and expressed in Escherichia coli as a fused protein. The fused protein was detected by immunoblotting using antiserum against CBPA.  相似文献   

19.
 Inter-simple sequence repeat (ISSR) amplification was used to analyze microsatellite motif frequency in the rice genome and to evaluate genetic diversity among rice cultivars. A total of 32 primers, containing different simple sequence repeat (SSR) motifs, were tested for amplification on a panel of 59 varieties, representative of the diversity of cultivated rice (Oryza sativa L.). The ISSR analysis provided insights into the organization, frequency and levels of polymorphism of different simple sequence repeats in rice. The more common dinucleotide motifs were more amenable to ISSR analysis than the more infrequent tri-, tetra- and penta-nucleotide motifs. The ISSR results suggested that within the dinucleotide class, the poly(GA) motif was more common than the poly(GT) motif and that the frequency and clustering of specific tri- and tetra-nucleotide simple sequence repeats was variable and motif-specific. Furthermore, trinucleotide ISSR markers were found to be less polymorphic than either dinucleotide or certain tetranucleotide ISSR markers, suggesting which motifs would be better targets for microsatellite marker development. The ISSR amplification pattern was used to group the rice genotypes by cluster analysis. These results were compared to surveys of the same varieties for amplified fragment length polymorphism (AFLP), restriction fragment length polymorphism (RFLP) and isozyme markers. The ISSR fingerprint could be used to differentiate the genotypes belonging to either Japonica or Indica sub species of cultivated rice and to dissect finer levels of diversity within each subspecies. A higher percentage of polymorphic bands was produced with the ISSR technique than the AFLP method, based on a similar PCR reaction. Therefore, ISSR amplification proved to be a valuable method for determining genetic variability among rice varieties and for rapidly identifying cultivars. This efficient genetic fingerprinting technique would be useful for characterizing the large numbers of rice accessions held in national and international germplasm centers. Received: 25 May 1998 / Accepted: 17 September 1998  相似文献   

20.
In prokaryotic cells, 3′–5′ exonucleases can attenuate messenger RNA (mRNA) directionally from the direction of the 3′–5′ untranslated region (UTR), and thus improving the stability of mRNAs without influencing normal cell growth and metabolism is a key challenge for protein production and metabolic engineering. Herein, we significantly improved mRNA stability by using synthetic repetitive extragenic palindromic (REP) sequences as an effective mRNA stabilizer in two typical prokaryotic microbes, namely, Escherichia coli for the production of cyclodextrin glucosyltransferase (CGTase) and Corynebacterium glutamicum for the production of N-acetylglucosamine (GlcNAc). First, we performed a high-throughput screen to select 4 out of 380 REP sequences generated by randomizing 6 nonconservative bases in the REP sequence designed as the degenerate base “N.” Secondly, the REP sequence was inserted at several different positions after the stop codon of the CGTase-encoding gene. We found that mRNA stability was improved only when the space between the REP sequence and stop codon was longer than 12 base pairs (bp). Then, by reconstructing the spacer sequence and secondary structure of the REP sequence, a REP sequence with 8 bp in a stem-loop was obtained, and the CGTase activity increased from 210.6 to 291.5 U/ml. Furthermore, when this REP sequence was added to the 3′-UTR of glucosamine-6-phosphate N-acetyltransferase 1 ( GNA1), which is a gene encoding a key enzyme GNA1 in the GlcNAc synthesis pathway, the GNA1 activity was increased from 524.8 to 890.7 U/mg, and the GlcNAc titer was increased from 4.1 to 6.0 g/L in C. glutamicum. These findings suggest that the REP sequence plays an important function as an mRNA stabilizer in prokaryotic cells to stabilize its 3′-terminus of the mRNA by blocking the processing action of the 3′–5′ exonuclease. Overall, this study provides new insight for the high-efficiency overexpression of target genes and pathway fine-tuning in bacteria.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号