Similar articles
 Found 20 similar articles; search took 31 ms
1.
The entire phosphoprotein (P) and nucleocapsid (N) protein gene sequences and deduced amino acid sequences for 18 selected vesicular stomatitis virus isolates representative of the natural genetic diversity within the New Jersey serotype are reported. Phylogenetic analysis of the data using maximum parsimony allowed construction of evolutionary trees for the individual genes and the combined N, P, and glycoprotein (G) genes of these viruses. Virtually identical rates of nucleotide substitutions were found for each gene, indicating that evolution of these genes occurs at essentially the same rate. Although up to 19 and 17% sequence differences were evident in the P and N genes, respectively, no variation in gene length or evidence of recombinational rearrangements was found. However, striking evolutionary differences were observed among the amino acid sequences of vesicular stomatitis virus New Jersey N, P, and G proteins. The N protein amino acid sequence was the most highly conserved among the different isolates, indicating strong functional and structural constraints. Conversely, the P protein amino acid sequences were highly variable, indicating considerably fewer constraints or greater evolutionary pressure on the P protein. Much of the remarkable amino acid variability of the P protein resided in a hypervariable domain located between amino acids 153 and 205. The variability within this region would be consistent with it playing a structural role as a spacer to maintain correct conformational presentation of the separate active domains of this multifunctional protein. In marked contrast, the adjacent domain I of the P protein (previously thought to be under little evolutionary constraint) contained a highly conserved region. The colocalization of a short, potentially functional overlapping open reading frame to this region may explain this apparent anomaly.

2.
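Cloning, sequence analysis and comparison of the interleukin-12 cDNA genes from Chinese individuals   Total citations: 3; self-citations: 0; citations by others: 3
焦宏远, 詹美云. 《病毒学报》 (Chinese Journal of Virology), 2000, 16(4): 336-340
To characterize the IL-12 gene in Chinese individuals, reverse transcription nested polymerase chain reaction (RT-nPCR) was used to clone the cDNAs of the two subunits, P35 and P40, from cord blood mononuclear cells of Chinese donors. Both clones include the complete precursor protein coding sequence: the P35 cDNA encodes a polypeptide of 219 amino acids and the P40 cDNA a polypeptide of 328 amino acids. Comparison with published foreign sequences (NKSF, CLMF) showed that, relative to NKSF, codon 44 of the cloned P35 sequence changed from GTC (Val) to GTG (Val), but did not change ...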
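The GTC (Val) to GTG (Val) change reported for codon 44 of P35 is a synonymous substitution. A minimal Python sketch of how such a check can be made is given below. It assumes the standard genetic code; the names `CODON_TABLE` and `is_synonymous` are illustrative and do not come from the cited article.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def is_synonymous(codon_a, codon_b):
    """Return True if both codons encode the same amino acid (or both are stops)."""
    return CODON_TABLE[codon_a.upper()] == CODON_TABLE[codon_b.upper()]


# Codon 44 of P35: GTC in NKSF versus GTG in the cloned sequence.
print(CODON_TABLE["GTC"], CODON_TABLE["GTG"], is_synonymous("GTC", "GTG"))
# prints: V V True
```

Both codons map to valine, so the nucleotide difference leaves the encoded residue unchanged.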

3.
Summary: The availability of the amino acid sequence for nine different mammalian P1 family protamines and the revised amino acid sequence of the chicken protamine galline (Oliva and Dixon 1989) reveals a much closer relationship between mammalian and avian protamines than was previously thought (Nakano et al. 1976). Dot matrix analysis of all protamine genes for which genomic DNA or cDNA sequence is available reveals both marked sequence similarities in the mammalian protamine gene family and internal repeated sequences in the chicken protamine gene. The detailed alignments of the cis-acting regulatory DNA sequences show several consensus sequence patterns, particularly the conservation of a cAMP response element (CRE) in all the protamine genes and of the regions flanking the TATA box, CAP site, N-terminal coding region, and polyadenylation signal. In addition we have found a high frequency of the CA dinucleotide immediately adjacent to the CRE element of both the protamine genes and the testis transition proteins, a feature not present in other genes, which suggests the existence of an extended CRE motif involved in the coordinate expression of protamine and transition protein genes during spermatogenesis. Overall these findings suggest the existence of an avian-mammalian P1 protamine gene line and are discussed in the context of different hypotheses for protamine gene evolution and regulation.
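The dot matrix analysis mentioned above can be illustrated with a short Python sketch. This is a generic exact-match dot plot, not the procedure used by the authors: the window size, the function names `dot_matrix` and `internal_repeats`, and the toy sequences are all assumptions made for illustration.

```python
def dot_matrix(seq_a, seq_b, window=3):
    """Return (i, j) pairs where a window of seq_a exactly matches a window of seq_b.

    Each pair is one dot of the plot: i indexes seq_a, j indexes seq_b.
    """
    hits = []
    for i in range(len(seq_a) - window + 1):
        for j in range(len(seq_b) - window + 1):
            if seq_a[i:i + window] == seq_b[j:j + window]:
                hits.append((i, j))
    return hits


def internal_repeats(seq, window=3):
    """Compare a sequence with itself and keep only the off-diagonal dots.

    Dots above the main diagonal (i < j) indicate internally repeated segments.
    """
    return [(i, j) for i, j in dot_matrix(seq, seq, window) if i < j]


# Toy sequences, not real protamine data.
print(dot_matrix("ACGTACGT", "ACGT", window=4))   # prints: [(0, 0), (4, 0)]
print(internal_repeats("ACGTACGT", window=4))     # prints: [(0, 4)]
```

Comparing two different genes shows shared segments as diagonal runs of dots, while comparing a gene with itself shows internal repeats as dots off the main diagonal.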

4.
The major histocompatibility complex (MHC) genes are the most polymorphic genes found in the vertebrate genome, and they encode proteins that play an essential role in the adaptive immune response. Many songbirds (passerines) have been shown to have a large number of transcribed MHC class I genes compared to most mammals. To elucidate the reason for this large number of genes, we compared 14 MHC class I alleles (α1–α3 domains), from great reed warbler, house sparrow and tree sparrow, via phylogenetic analysis, homology modelling and in silico peptide-binding predictions to investigate their functional and genetic relationships. We found more pronounced clustering of the MHC class I allomorphs (allele specific proteins) with regard to their function (peptide-binding specificities) than with regard to their genetic relationships (amino acid sequences), indicating that the high number of alleles is of functional significance. The MHC class I allomorphs from house sparrow and tree sparrow, species that diverged 10 million years ago (MYA), had overlapping peptide-binding specificities, and these similarities across species were also confirmed in phylogenetic analyses based on amino acid sequences. Notably, there were also overlapping peptide-binding specificities in the allomorphs from house sparrow and great reed warbler, although these species diverged 30 MYA. This overlap was not found in a tree based on amino acid sequences. Our interpretation is that convergent evolution on the level of the protein function, possibly driven by selection from shared pathogens, has resulted in allomorphs with similar peptide-binding repertoires, although trans-species evolution in combination with gene conversion cannot be ruled out.

5.
Cruciferin (12S globulin) is the major seed protein in Brassica napus (oil seed rape). It is synthesized during seed development and consists of six subunit pairs. Each of these pairs is synthesized as a precursor containing one alpha and one beta chain. At least three different precursors exist (P1-3), giving rise to four different mature subunits (cru1-4). Several cruciferin clones were isolated from a seed mRNA cDNA library. Comparison of the deduced amino acid sequences of these clones to amino acid sequences of purified cruciferin chains and peptides identified them as coding for cru2/3 and cru4 subunits. From the amino acid sequences deduced from two overlapping cDNA clones, the precursor of the cru4 subunit was shown to consist of 465 amino acid residues. Comparison of cruciferin and cruciferin-related sequences from B. napus and Arabidopsis thaliana, respectively, suggested that early during evolution the Brassicaceae family only possessed two types of 11-12S globulin genes, like the present-day Fabaceae.

6.
Summary: cDNA clones encoding three photosystem I subunits of Chlamydomonas reinhardtii with apparent molecular masses 13, 5 and 3 kDa (thylakoid polypeptides 28, 35 and 37; P28, P35 and P37, respectively) were isolated using gene specific oligonucleotides as probes. The sequences of these oligonucleotides were deduced from the N-terminal amino acid sequences of the proteins. The cDNAs were sequenced and used to probe Southern and Northern blots. The Southern blot analysis indicates that the proteins are encoded by single-copy genes. The mRNA sizes of the three components are 960 (P28), 1120 (P35) and 790 (P37) nucleotides. Comparison between the open reading frames of the cDNAs and the N-terminal amino acid sequences of the proteins indicates that the nascent polypeptides possess N-terminal transit sequences that are removed to give mature proteins of 11.0 (P28), 10.0 (P35) and 8.4 (P37) kDa. Analysis of the deduced protein sequences suggests that P28 and P35 are extrinsic membrane proteins and that P37 spans the thylakoid membrane. All three proteins have short transit peptides that probably route them to the stromal side of the thylakoid membrane. Abbreviations: OEE1, 2 and 3, oxygen evolution enhancer proteins 1, 2 and 3; RuBisCO, ribulose bisphosphate carboxylase/oxygenase; PS, photosystem; P28, P35 and P37, Chlamydomonas reinhardtii thylakoid polypeptides 28, 35 and 37. The nucleotide sequences presented here will appear in the EMBL/Genbank/DDBJ Nucleotide Sequence Databases under the accession numbers X15164 (11.0 kDa subunit; P28), X15165 (10.0 kDa subunit; P35) and X15166 (8.4 kDa subunit; P37).

7.
This study assesses whether the phylogenetic relationships between SODs from different organisms could assist in elucidating the functional relationships among these enzymes from evolutionarily distinct species. Phylogenetic trees and intron positions were compared to determine the relationships among these enzymes. Alignment of Cu/ZnSOD amino acid sequences indicates high homology among plant sequences, with some features that distinguish chloroplastic from cytosolic Cu/ZnSODs. Among eukaryotes, the plant SODs group together. Alignment of the Mn and FeSOD amino acid sequences indicates a higher degree of homology within the group of MnSODs (>70%) than within FeSODs (approximately 60%). Tree topologies are similar and reflect the taxonomic classification of the corresponding species. Intron number and position in the Cu/Zn Sod genes are highly conserved in plants. Genes encoding cytosolic SODs have seven introns and genes encoding chloroplastic SODs have eight introns, except the chloroplastic maize Sod1, which has seven. In Mn Sod genes the number and position of introns are highly conserved among plant species, but not among nonplant species. The link between the phylogenetic relationships and SOD functions remains unclear. Our findings suggest that the 5' region of these genes played a pivotal role in the evolution of function of these enzymes. Nevertheless, the system of SODs is highly structured and it is critical to understand the physiological differences between the SODs in response to different stresses in order to compare their functions and evolutionary history.

8.
Protamine P1 genes have been sequenced by PCR amplification and direct DNA sequencing from 9 primates representing 5 major families, Cebidae (new world monkeys), Cercopithecidae (old world monkeys), Hylobatidae (gibbons), Pongidae (gorilla, orangutan, and chimpanzee), and Hominidae (human). In this recently diverged group of primates these genes are clearly orthologous but very variable, both at the DNA level and in their expressed amino acid sequences. The rate of variation amongst the protamine P1s indicates that they are amongst the most rapidly diverging polypeptides studied. However, some regions are conserved both in primates and generally in other placental mammals. These are the 13 N-terminal residues (including a region of alternating serine and arginine residues (the motif SRSR, res. 10–13) susceptible to Ser phosphorylation), a tract of six Arg residues (res. 24–29) in the center of the molecule, and a six-residue region (RCCRRR, res. 39–44), consisting of a pair of cysteines flanked by arginines. Detailed consideration of nearest neighbor matrices and trees based on maximum parsimony indicates that P1 genes from humans, gorillas, and chimpanzees are very similar. The amino acid and nucleotide differences between humans and gorillas are fewer than those between humans and chimpanzees. This finding is at variance with data from DNA-DNA hybridization and extensive globin and mitochondrial DNA sequences which place human and chimpanzee as closest relatives in the superfamily Hominoidea. This may be related to the fact that protamine P1s are expressed in germ line rather than somatic cells. In contrast to the variability of the exon regions of the protamine P1 genes, the sequence of the single intron is highly conserved.

9.
10.
Synaptic proteins synucleins are found in pathologic aggregates in human brain during neurodegenerative diseases and in some tumors. Normal functions of these proteins in synapses are still unclear. In the present study, we used cDNA cloning to determine amino acid sequences of synucleins in the central nervous system of river lamprey (Lampetra fluviatilis), which is used as a model organism to study molecular mechanisms of synaptic transmission. Three genes are identified. High similarity in amino acid sequences as compared to other vertebrate species is revealed. The bioinformatic analysis predicts that the river lamprey synucleins relate to the group of gamma-synucleins. High homology with human alpha-synuclein is reported. The hydrophobic region required for the formation of alpha-synuclein amyloid fibers is also present in the river lamprey synucleins. The latter suggests that this region appeared at early stages of evolution. The obtained amino acid sequences of synucleins in the river lamprey brain will allow generating novel molecular tools for dissecting physiological functions of these proteins.

11.
Convicilins, a set of seed storage proteins, differ from vicilins, a related group of seed storage proteins, mainly because of the presence of the N-terminal extension, an additional sequence of amino acids in the sequence corresponding to the first exon. Convicilins have been described only in species of the legume tribe Vicieae. One or two genes for convicilins have been identified in most species of this tribe. The genus Pisum is the main exception, since two genes have been identified in most of its species. Thirty-four new convicilin gene sequences from 29 different species (Lathyrus, Lens, Pisum, and Vicia spp.) have been analyzed here. Convicilin gene sequences are generally organized in 6 exons, but in some instances one of the internal introns (2nd or 4th) is lost. In these 29 species, the N-terminal extension is formed by a stretch of 99 to 196 amino acids particularly rich in polar and charged amino acids (on average, it contains 29.43% glutamic acid and 15.38% arginine residues). This N-terminal extension has the characteristics of an intrinsically unstructured region (IUR), one of the categories of protein "degenerate sequences". A comparative analysis indicates that the N-terminal extension sequence has evolved faster than the surrounding sequence, which is common to all vicilins, and it evolved mainly through a series of duplications of short internal sequences and triplet expansions, the predominant one being GAA. This agrees with the evolution of IURs, which is faster than the evolution of surrounding sequences and is mainly due to replication slippage and unequal crossover recombination. Alternative maximum-likelihood trees of phylogenetic relationships among the 29 Vicieae species based on the convicilin exon sequences are presented and discussed.

12.
Thirty-nine human parainfluenza type 1 (HPIV-1) genomes were sequenced from samples collected in Milwaukee, Wisconsin from 1997–2010. Following sequencing, phylogenetic analyses of these sequences plus any publicly available HPIV-1 sequences (from GenBank) were performed. Phylogenetic analysis of the whole genomes, as well as individual genes, revealed that the current HPIV-1 viruses group into three different clades. Previous evolutionary studies of HPIV-1 in Milwaukee revealed that there were two genotypes of HPIV-1 co-circulating in 1991 (previously described as HPIV-1 genotypes C and D). The current study reveals that there are still two different HPIV-1 viruses co-circulating in Milwaukee; however, both groups of HPIV-1 viruses are derived from genotype C indicating that genotype D may no longer be in circulation in Milwaukee. Analyses of genetic diversity indicate that while most of the genome is under purifying selection some regions of the genome are more tolerant of mutation. In the 40 HPIV-1 genomes sequenced in this study, the nucleotide sequence of the L gene is the most conserved while the sequence of the P gene is the most variable. Over the entire protein coding region of the genome, 81 variable amino acid residues were observed and as with nucleotide diversity, the P protein seemed to be the most tolerant of mutation (and contains the greatest proportion of non-synonymous to synonymous substitutions) while the M protein appears to be the least tolerant of amino acid substitution.

13.
Summary: Two high-molecular-weight subunit (HMWS) glutenin genes from the A and B genomes of the hexaploid bread wheat Triticum aestivum L. cv Cheyenne have been isolated and sequenced. Both of these genes are of the high Mr class (x-type) of HMW glutenins, and have not been previously reported. The entire set of six HMW genes from cultivar Cheyenne has now been isolated and characterized. An analysis of the Ax and Bx sequences shows that the Ax sequence is similar to the homoeologous gene from the D genome, while the Bx repeat structure is significantly different. The repetitive region of these proteins can be modelled as a series of interspersed copies of repeat motifs of 6, 9, and 15 amino acid residues. The evolution of these genes includes single-base substitutions over the entire coding region, plus insertion/deletions of single or blocks of repeats in the central repetitive domain.

14.
The amino acid sequences of the amidinotransferases and the nucleotide sequences of their genes or cDNA from four Streptomyces species (seven genes) and from the kidneys of rat, pig, human and human pancreas were compared. The overall amino acid and nucleotide sequences of the prokaryotes and eukaryotes were very similar and further, three regions were identified that were highly identical. Evidence is presented that there is virtually zero chance that the overall and high identity regions of the amino acid sequence similarities and the overall nucleotide sequence similarities between Streptomyces and mammals represent a random match. Both rat and lamprey amidinotransferases were able to use inosamine phosphate, the amidine group acceptor of Streptomyces. We have concluded that the structure and function of the amidinotransferases and their genes have been highly conserved through evolution from prokaryotes to eukaryotes. The evolution has occurred with: (1) a high degree of retention of nucleotide and amino acid sequences; (2) a high degree of retention of the primitive Streptomyces guanine+cytosine (G+C) third codon position composition in certain high identity regions of the eukaryote cDNA; (3) a decrease in the specificities for the amidine group acceptors; and (4) most of the mutations silent in the regions suggested to code for active sites in the enzymes.

15.
We have developed a statistical method named MAP (mutagenesis assistant program) to equip protein engineers with a tool to develop promising directed evolution strategies by comparing 19 mutagenesis methods. Instead of conventional transition/transversion bias indicators as benchmarks for comparison, we propose to use three indicators based on the subset of amino acid substitutions generated on the protein level: (1) protein structure indicator; (2) amino acid diversity indicator with a codon diversity coefficient; and (3) chemical diversity indicator. A MAP analysis for a single nucleotide substitution was performed for four genes: (1) heme domain of cytochrome P450 BM-3 from Bacillus megaterium (EC 1.14.14.1); (2) glucose oxidase from Aspergillus niger (EC 1.1.3.4); (3) arylesterase from Pseudomonas fluorescens (EC 3.1.1.2); and (4) alcohol dehydrogenase from Saccharomyces cerevisiae (EC 1.1.1.1). Based on the MAP analysis of these four genes, 19 mutagenesis methods have been evaluated and criteria for an ideal mutagenesis method have been proposed. The statistical analysis showed that existing gene mutagenesis methods are limited and highly biased. An average amino acid substitution per residue of only 3.15-7.4 can be achieved with current random mutagenesis methods. For the four investigated gene sequences, an average fraction of amino acid substitutions of 0.5-7% results in stop codons and 4.5-23.9% in glycine or proline residues. An average fraction of 16.2-44.2% of the amino acid substitutions are preserved, and 45.6% (epPCR method) are chemically different. The diversity remains low even when applying a non-biased method: an average of seven amino acid substitutions per residue, 2.9-4.7% stop codons, 11.1-16% glycine/proline residues, 21-25.8% preserved amino acids, and 55.5% are amino acids with chemically different side-chains. 
Statistical information for each mutagenesis method can further be used to investigate the mutational spectra in protein regions regarded as important for the property of interest.
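The idea behind a single-nucleotide-substitution analysis can be sketched in a few lines of Python. The sketch is not the MAP program itself. It assumes the standard genetic code and treats all nine possible single-nucleotide changes of a codon as equally likely, which corresponds to a non-biased method rather than to any of the 19 evaluated ones. The function name `single_substitution_profile` and the category keys are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G order; "*" is a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def single_substitution_profile(codon):
    """Classify the 9 codons reachable from `codon` by one nucleotide change.

    Categories (checked in this order): mutants that are stop codons, mutants
    that preserve the original amino acid, and mutants that introduce glycine
    or proline. Equal weighting of all 9 changes is an assumption.
    """
    codon = codon.upper()
    original = CODON_TABLE[codon]
    profile = {"total": 0, "stop": 0, "preserved": 0, "gly_pro": 0}
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            aa = CODON_TABLE[mutant]
            profile["total"] += 1
            if aa == "*":
                profile["stop"] += 1
            elif aa == original:
                profile["preserved"] += 1
            elif aa in "GP":
                profile["gly_pro"] += 1
    return profile


print(single_substitution_profile("TGG"))
# prints: {'total': 9, 'stop': 2, 'preserved': 0, 'gly_pro': 1}
```

Summing such per-codon profiles over a whole gene gives the fractions of stop codons, preserved residues and glycine/proline substitutions for that gene under the equal-weighting assumption. A method-specific analysis would weight each nucleotide change by the bias of the mutagenesis method.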

16.
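To investigate the genetic characteristics of the hemagglutinin-neuraminidase (HN) and phosphoprotein (P) genes of Newcastle disease virus (NDV) and the relationship between them, the HN and P genes of 12 NDV strains isolated in China between 1997 and 2005 were cloned and sequenced. Together with the published HN and P genes of 15 NDV strains isolated in China and abroad in different periods, evolutionary distances were calculated for different nucleotide and amino acid segments of the HN and P genes of all strains. Statistical software was used for analysis of variance of the evolutionary distances among segments, and for correlation analyses between HN or P nucleotide evolutionary distance and the time of strain isolation, between HN or P gene segments and the corresponding full-length gene, and between the full-length HN and P genes. The statistical analysis showed that the degree of variation differs among nucleotide and amino acid sequence segments of the NDV HN or P gene, and that, across strains, genetic variation is highly correlated between HN or P gene segments and the full-length gene and between the full-length HN and P genes, at both the nucleotide and the amino acid level. These results indicate that although the NDV HN and P genes evolve in different ways, the trend of their genetic variation is the same. Variation in the HN and P genes is related to some extent to the time of isolation.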

17.
A cDNA clone (pDH7) has been isolated which encodes the entire leader peptide and variable (V) region and most of the constant (C) region of a rabbit lambda-light chain. Although similar to amino acid sequences derived from fragments of isolated lambda-chains from several Basilea rabbits, differences in the first framework region (FR1) suggest that at least two germ-line V lambda genes are expressed. There are major differences between rabbit V lambda sequences and light chains of other species: in particular, rabbit lambda-chains have an additional four amino acids in the vicinity of the FR2-CDR2 junction. The same region also has significant homology with the human D2 germ-line mini-gene sequence, especially with a 14-nucleotide sequence previously shown to be homologous to human and rabbit heavy chain CDR2 sequences. Similar homologies in other heavy and light chain sequences suggest that D-gene segments may be derived from VH genes, perhaps by transposition. The framework regions of the rabbit lambda-chain encoded by clone pDH7 show the greatest homologies with those of human kappa- and lambda-sequences (46 to 54% homology), with that of chicken sequence (55%), and least with murine V lambda sequences (40%).

18.
19.
Structure and evolution of the apolipoprotein multigene family   Total citations: 8; self-citations: 0; citations by others: 8
We present the complementary DNA and deduced amino acid sequence of rat apolipoprotein A-II (apoA-II), and the results of a detailed statistical analysis of the nucleotide and amino acid sequences of all the apolipoprotein gene sequences published to date: namely, those of human and rat apoA-I, apoA-II and apoE, rat apoA-IV, and human apoC-I, C-II and C-III. Our results indicate that the apolipoprotein genes have very similar genomic structures, each having a total of three introns at the same locations. Using the exon/intron junctions as reference points, we have obtained an alignment of the coding regions of all the genes studied. It appears that the mature peptide regions of these genes are almost completely made up of tandem repeats of 11 codons. The part of the mature peptide region encoded by exon 3 contains a common block of 33 codons, whereas the part encoded by exon 4 contains a much more variable number of internal repeats of 11 codons. These genes have apparently evolved from a primordial gene through multiple partial (internal) and complete gene duplications. On the basis of the degree of homology of the various sequences, and the pattern of the internal repeats in these genes, we propose an evolutionary tree for the apolipoprotein genes and give rough estimates of the divergence times between these genes. Our results show that apoA-II has evolved extremely rapidly and that apoA-I and apoE also have evolved at high rates but some regions are better conserved than the others. The rate of evolution of individual regions seems to be related to the stringency of their functional requirements.

20.
The relative efficiencies of different protein-coding genes of the mitochondrial genome and different tree-building methods in recovering a known vertebrate phylogeny (two whale species, cow, rat, mouse, opossum, chicken, frog, and three bony fish species) were evaluated. The tree-building methods examined were the neighbor joining (NJ), minimum evolution (ME), maximum parsimony (MP), and maximum likelihood (ML), and both nucleotide sequences and deduced amino acid sequences were analyzed. Generally speaking, amino acid sequences were better than nucleotide sequences in obtaining the true tree (topology) or trees close to the true tree. However, when only first and second codon positions data were used, nucleotide sequences produced reasonably good trees. Among the 13 genes examined, Nd5 produced the true tree in all tree-building methods or algorithms for both amino acid and nucleotide sequence data. Genes Cytb and Nd4 also produced the correct tree in most tree-building algorithms when amino acid sequence data were used. By contrast, Co2, Nd1, and Nd4l showed a poor performance. In general, large genes produced better results, and when the entire set of genes was used, all tree-building methods generated the true tree. In each tree-building method, several distance measures or algorithms were used, but all these distance measures or algorithms produced essentially the same results. The ME method, in which many different topologies are examined, was no better than the NJ method, which generates a single final tree. Similarly, an ML method, in which many topologies are examined, was no better than the ML star decomposition algorithm that generates a single final tree. In ML the best substitution model chosen by using the Akaike information criterion produced no better results than simpler substitution models. These results question the utility of the currently used optimization principles in phylogenetic construction.
Relatively simple methods such as the NJ and ML star decomposition algorithms seem to produce as good results as those obtained by more sophisticated methods. The efficiencies of the NJ, ME, MP, and ML methods in obtaining the correct tree were nearly the same when amino acid sequence data were used. The most important factor in constructing reliable phylogenetic trees seems to be the number of amino acids or nucleotides used.
