Similar articles
20 similar articles found (search time: 15 ms)
1.
To learn more about the evolutionary origins of Escherichia coli genes, we surveyed systematically for extended sequence similarities among the 1,264 amino acid sequences encoded by chromosomal genes of E. coli K-12 in SwissProt release 26 by using the FASTA program and imposing the following criteria: (i) alignment of segments at least 100 amino acids long and (ii) at least 20% amino acid identity. Altogether, 624 extended alignments meeting the two criteria were identified, corresponding to 577 protein sequences (45.6% of the 1,264 E. coli protein sequences) that had an extended alignment with at least one other E. coli protein sequence. To exclude alignments of questionable biological significance, we imposed a high threshold on the number of gaps allowed in each of the 624 extended alignments, giving us a subset of 464 proteins. The population of 464 alignments has the following characteristics expressed as median values of the group: 254 amino acids in the alignment, representing 86% of the length of the protein, 33% of the amino acids in the alignment being identical, and 1.1 gaps introduced per 100 amino acids of alignment. Where functions are known, nearly all pairs consist of functionally related proteins. This implies that the sequence similarity we detected has biological meaning and did not arise by chance. That a major fraction of E. coli proteins form extended alignments strongly suggests the predominance of duplication and divergence of ancestral genes in the evolution of E. coli genes. The range of degrees of similarity shows that some genes originated more recently than others. There is no evidence of genome doubling in the past, since map distances between genes of sequence-related proteins show no coherent pattern of favored separations.

2.
Nucleotide sequence of Klebsiella pneumoniae lac genes.   Cited by: 10 (self-citations: 8; citations by others: 2)
The nucleotide sequences of the Klebsiella pneumoniae lacI and lacZ genes and part of the lacY gene were determined, and these genes were located and oriented relative to one another. The K. pneumoniae lac operon is divergent in that the lacI and lacZ genes are oriented head to head, and complementary strands are transcribed. Besides base substitutions, the lacZ genes of K. pneumoniae and Escherichia coli have suffered short distance shifts of reading frame caused by additions or deletions or both during evolutionary divergence from a common ancestral gene. Relative to corresponding E. coli sequences, the nucleotide sequences of the lacZ and lacY genes are 61 and 67% conserved, and the lacI genes are 49% conserved. A comparison of both nucleotide and amino acid sequences revealed that the K. pneumoniae and E. coli lacI genes and lac repressor proteins each are related to the galR gene and gal repressor of E. coli to about the same extent. In terms of evolutionary relationships, the divergence of the forerunner of the galR gene from an ancestral lac repressor gene preceded separation and differentiation of the K. pneumoniae and E. coli lac repressor genes.

3.
M. Rina, C. Savakis. Genetics, 1991, 127(4): 769-780
Four genes encoding the major egg yolk polypeptides of the Mediterranean fruit fly Ceratitis capitata, vitellogenins 1 and 2 (VG1 and VG2), were cloned, characterized and partially sequenced. The genes are located on the same region of chromosome 5 and are organized in pairs, each encoding the two polypeptides on opposite DNA strands. Restriction and nucleotide sequence analysis indicate that the gene pairs have arisen from an ancestral pair by a relatively recent duplication event. The transcribed part is very similar to that of the Drosophila melanogaster yolk protein genes Yp1, Yp2 and Yp3. The Vg1 genes have two introns at the same positions as those in D. melanogaster Yp3; the Vg2 genes have only one of the introns, as do D. melanogaster Yp1 and Yp2. Comparison of the five polypeptide sequences shows extensive homology, with 27% of the residues being invariable. The sequence similarity of the processed proteins extends in two regions separated by a nonconserved region of varying size. Secondary structure predictions suggest a highly conserved secondary structure pattern in the two regions, which probably correspond to structural and functional domains. The carboxy-end domain of the C. capitata proteins shows the same sequence similarities with triacylglycerol lipases that have been reported previously for the D. melanogaster yolk proteins. Analysis of codon usage shows significant differences between D. melanogaster and C. capitata vitellogenins, with the latter exhibiting a less biased representation of synonymous codons.

4.
Ethanolamine ammonia-lyase is a bacterial enzyme that catalyzes the adenosylcobalamin-dependent conversion of certain vicinal amino alcohols to oxo compounds and ammonia. Studies of ethanolamine ammonia-lyase from Clostridium sp. and Escherichia coli have suggested that the enzyme is a heterodimer composed of subunits of Mr approximately 55,000 and 35,000. Using a partial Sau3A Salmonella typhimurium library ligated into pBR328 and selecting by complementation of a mutant lacking ethanolamine ammonia-lyase activity, we have cloned the genes for the 2 subunits of the S. typhimurium enzyme. The genes were localized to a 6.5-kilobase fragment of S. typhimurium DNA, from which they could be expressed in E. coli under noninducing conditions. Sequencing of a 2526-base pair portion of this 6.5-kilobase DNA fragment revealed two open reading frames separated by 21 base pairs. The open reading frames encoded proteins of 452 and 286 residues whose derived N-terminal sequences were identical to the N-terminal sequences of the 2 subunits of the E. coli ethanolamine ammonia-lyase, except that residue 16 of the large subunit was asparagine in the E. coli sequence and aspartic acid in the S. typhimurium sequence.

5.
We have tried to approach the nature of the last common ancestor to Haemophilus influenzae and Escherichia coli and to determine how each bacterium could have diverged from this putative organism. The approach used was exhaustive analysis of the homologous proteins coded by genes present in these bacteria, using as criteria for sequence relatedness an alignment of at least 80 amino acid residues and a PAM distance (number of accepted point mutations per 100 residues separating two sequences) below 250. Evolutionarily significant similarities were found between 1,345 H. influenzae proteins (85% of the total genome) and 3,058 E. coli proteins (75% of the total genome), many of them belonging to families of various sizes (from 666 doublets to 35 large groups of more than 10 members). Nearly all the genes found by this approach to be duplicated in both bacteria were already duplicated in their last common ancestor. This was deduced from (1) the comparison of the respective distributions of evolutionary distances between orthologs (genes separated only by speciation events) and paralogs (genes duplicated in the same genome) and (2) the analysis of the phylogenetic trees reconstructed for each family of paralogs containing at least two members belonging to each bacterium. The distributions of the different categories of homologs show a significant loss of paralogous genes in H. influenzae (reduction proportional to the genome size), of many sequences which are still present in one copy in E. coli, and of some entire gene families. Phylogenetic trees also confirmed this recent loss of paralogous genes in H. influenzae. Thus, the genome size of the last common ancestor of these two bacteria would have been close to that of present-day E. coli, and the evolution of H. influenzae toward a parasitic life led to an important decrease in its genome size by some mechanism of streamlining. During this recent evolution, the memory of the gene order present in the last common ancestor has been blurred, but a few short conserved chromosomal fragments can still be detected in present-day E. coli and H. influenzae.

6.
We have determined the nucleotide sequence of the rat apolipoprotein (apo-) A-IV gene and analyzed its structural and evolutionary relationships to the human apolipoprotein A-I, E, and C-III genes. The rat A-IV gene is 2.4 kilobases in size and consists of three exons (142, 126, and 1157 base pairs) interrupted by two introns (277 and 673 base pairs). The 5'-nontranslated region and most of the signal peptide are encoded by the first exon. Thus, the apo-A-IV gene lacks an intron in the 5'-nontranslated region of its mRNA in contrast to all other known apolipoprotein genes. Sequences coding for amphipathic docosapeptides span both the second and third exons of the rat A-IV gene. We demonstrate that this is also true for the human apolipoprotein genes. This gene family seems to have evolved by the duplication of an ancestral minigene that resulted in the formation of two exons. Thereafter, evolution of these sequences was dominated by intraexonic amplification of repeating units coding for amphipathic peptides. Sequence divergence of these repeats resulted in the functional differentiation of the apolipoproteins. However, conservation of the fundamental amphipathic pattern allowed members of this protein family to retain their lipid-binding properties.

7.
APETALA1 (AP1) and CAULIFLOWER (CAL) are a pair of paralogous genes that were generated through the pre‐Brassicaceae whole‐genome duplication event. AP1 and CAL have both partially redundant and unique functions. Previous studies have shown that the K and C regions of their proteins are essential for the functional divergence. However, which differences in these regions are the major contributors and how the differences were accumulated remain unknown. In the present study, we compared the sequences of the two proteins and identified five gaps and 55 amino acid replacements between them. Investigation of genomic sequences further indicated that the differences in the proteins were caused by non‐synonymous substitutions and changes in exon–intron structures. Reconstruction of three‐dimensional structures revealed that the sequence divergence of AP1 and CAL has resulted in differences between the two in terms of the number, length, position and orientation of α‐helices, especially in the K and C regions. Comparisons of sequences and three‐dimensional structures of ancestral proteins with AP1 and CAL suggest that the ancestral AP1 protein experienced fewer changes, whereas the ancestral CAL protein accumulated more changes shortly after gene duplication, relative to their common ancestor. Thereafter, AP1‐like proteins experienced few mutations, whereas CAL‐like proteins were not conserved until the diversification of the Brassicaceae lineage I. This indicates that AP1‐ and CAL‐like proteins evolved asymmetrically after gene duplication. These findings provide new insights into the functional divergence of AP1 and CAL genes.

8.
9.
Duplicated genes frequently evolve at different rates. This asymmetry is evidence of natural selection's ability to discriminate between the 2 copies, subjecting them to different levels of purifying selection or even permitting adaptive evolution of one or both copies. However, if gene duplication creates pairs of protein-coding sequences that are initially identical, this raises the question of how selection tells the 2 copies apart. Here, we investigated asymmetric sequence divergence of recently duplicated genes in rodents and related this to 2 possible sources of such asymmetry: gene relocation as a consequence of duplication and retrotransposition as a mechanism of gene duplication. We found that most young rodent duplicates that have been relocated were created by retrotransposition. The degree of rate asymmetry in gene pairs where one copy has been relocated (either by retrotransposition or DNA-based duplication) is greater than in pairs formed by local DNA-based duplication events. Furthermore, by considering the direction of transposition for distant duplicates, we found a consistent tendency for retrogenes to undergo accelerated protein evolution relative to their static paralogs, whereas DNA-based transpositions showed no such tendency. Finally, we demonstrate that the faster sequence evolution of retrogenes correlates with the profound alteration of their expression pattern that is precipitated by retrotransposition.

10.
Gene duplication and divergence is widely considered to be a fundamental mechanism for generating evolutionary novelties. The Bone Morphogenetic Proteins (BMPs) are a diverse family of signalling molecules found in all metazoan genomes that have evolved by duplication and divergence from a small number of ancestral types. In the fruit fly Drosophila, there are three BMPs: Decapentaplegic (Dpp) and Glass bottom boat (Gbb), which are the orthologues of vertebrate BMP2/4 and BMP5/6/7/8, respectively, and Screw (Scw), which, at the sequence level, is equally divergent from Dpp and Gbb. It has recently been shown that Scw has arisen from a duplication of Gbb in the lineage leading to higher Diptera. We show that since this duplication event, Gbb has maintained the ancestral BMP5/6/7/8 functionality while Scw has rapidly diverged. The evolution of Scw was accompanied by duplication and divergence of a suite of extracellular regulators that continue to diverge together in the higher Diptera. In addition, Scw has become restricted in its receptor specificity: Gbb proteins can signal through the Type I receptors Thick veins (Tkv) and Saxophone (Sax), while Scw signals through Sax. Thus, in a relatively short span of evolutionary time, the duplication event that gave rise to Scw produced not only a novel ligand but also a novel signalling mode that is functionally distinct from the ancestral Gbb mode. Our results demonstrate the plasticity of the BMP pathway not only in evolving new family members and new functions but also new signalling modes by redeploying key regulators in the pathway.

11.
Evolution of sarcomeric myosin heavy chain genes: evidence from fish   Cited by: 1 (self-citations: 0; citations by others: 1)
Myosin heavy chain (MYH) is a major structural protein, integral to the function of sarcomeric muscles. We investigated both exon-intron organization and amino acid sequence of sarcomeric MYH genes to infer their evolutionary history in vertebrates. Our results were consistent with the hypothesis that a multigene family encoded MYH proteins in the ancestral chordate, one gene ancestral to human MYH16 and its homologues and another ancestral to all other vertebrate sarcomeric MYH genes. We identified teleost homologues of mammalian skeletal and cardiac MYH genes, indicating that the ancestors of those genes were present before the divergence of actinopterygians and sarcopterygians. Indeed, the ancestral skeletal genes probably duplicated at least once before the divergence of teleosts and tetrapods. Fish homologues of mammalian skeletal MYH are expressed in skeletal tissue and homologues of mammalian cardiac genes are expressed in the heart but, unlike mammals, there is overlap between these expression domains. Our analyses inferred two other ancestral vertebrate MYH genes, giving rise to human MYH14 and MYH15 and their homologues. Relative to the skeletal and cardiac genes, MYH14 and MYH15 homologues are characterized by evolution of intron position, differences in evolutionary rate between the functionally differentiated head and rod of the myosin protein, and possible evolution of function among vertebrate classes. Tandem duplication and gene conversion appear to have played major roles in the evolution of at least cardiac and skeletal MYH genes in fish. One outcome of this high level of concerted evolution is that different fish taxa have different suites of MYH genes, i.e., true orthologs do not exist.

12.
Bioluminescence in beetles is dependent upon the enzyme luciferase. It has been hypothesised that luciferase evolved from a fatty acyl-CoA synthetase gene, deriving a novel bioluminescent function (neofunctionalization) after a gene duplication event. We evaluated this hypothesis within a phylogenetic framework using independent evidence obtained from the genome of Tribolium castaneum, published luciferase genes and novel luciferase and luciferase-like sequences. This phylogenetic study provides evidence for a large gene family of luciferase and luciferase-like paralogues in bioluminescent and non-bioluminescent beetles. All luciferase sequences formed a clade supporting a protoluciferase existing prior to the divergence of the Lampyridae, Elateridae and Phengodidae (Elateroidea). Multiple luciferase genes were identified from members of the Photurinae and the Luciolinae, indicating complex gene duplication events within lampyrid genomes. The majority of luciferase residues were identified to be under purifying selection as opposed to positive selection. We conclude that beetle luciferase may have arisen from a process of subfunctionalization as opposed to neofunctionalization early on in the evolution of the Elateroidea.

13.
The core histone genes of Saccharomyces cerevisiae are arranged as duplicate nonallelic sets of specifically paired genes. The identity of structural organization between the duplicated gene pairs would have its simplest evolutionary origin in the duplication of a complete locus in a single event. In such a case, the time since the duplication of one of the genes should be identical to that since duplication of the gene adjacent to it on the chromosome. A calculation of the evolutionary distances between the coding DNA sequences of the histone genes leads to a duplication paradox: The extents of sequence divergence in the silent component of third-base positions for adjacent pairs of genes are not identical. Estimates of the evolutionary distance between the two H3-H4 noncoding intergene DNA sequences are large; the divergence between the two separate sequences is indistinguishable from the divergence between either of the regions and a randomly generated permutation of itself. These results suggest that the duplication event may have occurred much earlier than previously estimated. The potential age of the duplication, and the attractive simplicity of the duplication of both the H3-H4 and the H2A-H2B gene pairs having taken place in a single event, lead to the hypothesis that modern haploid S. cerevisiae may have evolved by diploidization or fusion of two ancient fungi.

14.
A 16-kb BamHI fragment of the lactose plasmid pNZ63 from Leuconostoc lactis NZ6009 was cloned in Escherichia coli MC1061 by using pACYC184 and was found to express a functional beta-galactosidase. Deletion and complementation analysis showed that the coding region for beta-galactosidase was located on a 5.8-kb SalI-BamHI fragment. Nucleotide sequence analysis demonstrated that this fragment contained two partially overlapping genes, lacL (1,878 bp) and lacM (963 bp), that could encode proteins with calculated sizes of 72,113 and 35,389 Da, respectively. The L. lactis beta-galactosidase was overproduced in E. coli by using a lambda pL expression system. Two new proteins with M(r)s of 75,000 and 36,000 appeared upon induction of PL. The N-terminal sequences of these proteins corresponded to those deduced from the lacL and lacM gene sequences. Mutation and deletion analysis showed that lacL expression is essential for LacM production and that both the lacL and lacM genes are required for the production of a functional beta-galactosidase in E. coli. The deduced amino acid sequences of the LacL and LacM proteins showed considerable identity with the sequences of the N- and C-terminal parts, respectively, of beta-galactosidases from other lactic acid bacteria or E. coli. DNA and protein sequence alignments suggest that the L. lactis lacL and lacM genes have been generated by an internal deletion in an ancestral beta-galactosidase gene.

15.
Overlapping recombinant clones that appear to encompass the entire renin gene, named Ren 1, have been isolated from a library of BALB/c mouse genomic DNA fragments. Based on restriction endonuclease mapping and DNA sequence analysis, Ren 1 spans 9.6 kb and contains nine exons interrupted by eight intervening sequences of highly variable size. The first exon, encoding the signal peptide of preprorenin, is separated from the eight following exons by a 3-kb intron. These eight exons are organized into two clusters of four separated by a 2-kb intron. DNA stretches encoding the aspartyl residues, which are part of the active site of renin, are located at homologous positions in both clusters. Our results show that aspartyl protease genes have arisen by duplication and fusion of an ancestral gene containing five exons. The estimated date of the duplication event of the mouse renin genes Ren 1 and Ren 2 is discussed.

16.
We report the complete sequence of the gene encoding mouse glial fibrillary acidic protein (GFAP), the intermediate filament (IF) protein specific to astrocytes. The 9.8 kb gene includes nine exons separated by introns ranging in size from 0.2 to 2.5 kb. A comparison of the organization of the GFAP gene with that of genes encoding other IF proteins reveals that the structure of IF genes is highly conserved in spite of considerable divergence at the amino acid level. Thus, most of the evolutionary events leading to the placement of introns in IF genes must have occurred prior to the duplication and subsequent divergence of IF genes from a presumptive common ancestral sequence. The conserved gene organization is unrelated to structural features of IF proteins. A curious feature of the GFAP gene is the large number of repeated sequences found in the introns. Six tracts of reiterated di- or trinucleotides are present, plus tandem repeats of two different novel sequences. One repeat is unique to the GFAP gene; the other occurs elsewhere in the mouse genome, although at relatively low frequency.

17.
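FtsZ proteins play an important role in the division of prokaryotic cells and of chloroplasts in plant cells. To study the evolution of the chloroplast division apparatus, the chloroplast division-related gene CrFtsZ3 was cloned from Chlamydomonas reinhardtii by RT-PCR. Because one ftsZ gene had already been cloned from Chlamydomonas cells, the cloning of CrFtsZ3 shows that two different types of ftsZ genes are present in Chlamydomonas, which indicates that the duplication and divergence of ftsZ genes occurred before the diversification of the green algae. Sequence analysis showed that the protein encoded by CrFtsZ3 contains the typical motifs of FtsZ proteins. Further prokaryotic expression and localization analysis showed that the CrFtsZ3-GFP fusion protein aggregated in a regular pattern into fluorescent dots or bands along the longitudinal axis of the host bacterial cells, and that overexpression of the CrFtsZ3 protein markedly interfered with normal cell division of the host bacteria. This indicates that the Chlamydomonas CrFtsZ3 protein can recognize division sites in the host cell and affect the cell division process, which provides a preliminary verification of its biological function.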

18.
The mitochondrial factor F6 has been purified by reverse-phase HPLC and the molecular weight (8500), amino acid composition and about 25% of the amino acid sequence determined. In the NH2-terminal sequence of the first 18 amino acids (NKELDPVQKLFVDKIREY), six identities with the NH2-terminal sequence of the oligomycin-sensitivity conferring protein (OSCP) are apparent, as well as less striking similarities with the OSCP related subunit delta of E. coli F1. The possibility that F6, OSCP and subunit delta of E. coli F1 could have evolved from a common ancestral gene is supported by apparent gene duplication within the OSCP and subunit delta sequences.

19.
The pairs of nitrogen fixation genes nifDK and nifEN encode the α and β subunits of nitrogenase and the two subunits of the NifNE protein complex, which is involved in the biosynthesis of the FeMo cofactor, respectively. Comparative analysis of the amino acid sequences of the four proteins NifD, NifK, NifE, and NifN in several archaeal and bacterial diazotrophs showed extensive sequence similarity between them, suggesting that their encoding genes constitute a novel paralogous gene family. We propose a two-step model to reconstruct the possible evolutionary history of the four genes. Accordingly, an ancestor gene gave rise, by an in-tandem paralogous duplication event followed by divergence, to an ancestral bicistronic operon; the latter, in turn, underwent a paralogous operon duplication event followed by evolutionary divergence leading to the ancestors of the present-day nifDK and nifEN operons. Both these paralogous duplication events very likely predated the appearance of the last universal common ancestor. The possible role of the ancestral gene and operon in nitrogen fixation is also discussed. Received: 21 June 1999 / Accepted: 1 March 2000

20.
L. F. Wu, M. H. Saier Jr. Journal of Bacteriology, 1990, 172(12): 7167-7178
The nucleotide sequence of the fruA gene, the terminal gene in the fructose operon of Rhodobacter capsulatus, is reported. This gene codes for the fructose permease (molecular weight, 58,575; 578 aminoacyl residues), the fructose enzyme II (IIFru) of the phosphoenolpyruvate-dependent phosphotransferase system. The deduced aminoacyl sequence of the encoded gene product was found to be 55% identical throughout most of its length with the fructose enzyme II of Escherichia coli, with some regions strongly conserved and others weakly conserved. Sequence comparisons revealed that the first 100 aminoacyl residues of both enzymes II were homologous to the second 100 residues, suggesting that an intragenic duplication of about 300 nucleotides had occurred during the evolution of IIFru prior to divergence of the E. coli and R. capsulatus genes. The protein contains only two cysteyl residues, and only one of these residues is conserved between the two proteins. This residue is therefore presumed to provide the active-site thiol group which may serve as the phosphorylation site. IIFru was found to exhibit regions of homology with sequenced enzymes II from other bacteria, including those specific for sucrose, beta-glucosides, mannitol, glucose, N-acetylglucosamine, and lactose. The degree of evolutionary divergence differed for different parts of the proteins, with certain transmembrane segments exhibiting high degrees of conservation. The hydrophobic domain of IIFru was also found to be similar to several uniport and antiport transporters of animals, including the human and mouse insulin-responsive glucose facilitators. These observations suggest that the mechanism of transmembrane transport may be similar for permeases catalyzing group translocation and facilitated diffusion.
