首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 201 毫秒
1.
Li Z  Liu Q  Song M  Zheng Y  Nan P  Cao Y  Chen G  Li Y  Zhong Y 《Bio Systems》2005,82(3):226-234
Physiological functions and characteristic structures of the serpin gene superfamily have been studied extensively, yet the evolution of the serpin genes remains unclear. Gene duplication in this superfamily may shed light on this issue. Two models are used to predict the preservation of duplicated genes: the classical model and the duplication-degeneration-complementation (DDC) model. In this study, we analyzed the phylogenetic relationships of 33 human serpin genes and the expression data of some members of the serpin superfamily from a DNA microarray of human leukemia U937 cells with stably inducible expression of the leukemia-related AML1-ETO gene. We then determined the utility of the DDC model by mapping serpin superfamily expression data to the phylogenetic tree. The correlation between sequence and expression divergences as measured by the Pearson correlation coefficient indicated that human serpin genes evolved under the DDC model. Our study provides a new strategy for comparative analysis of gene sequences and microarray data.  相似文献   

2.
The human CD1 proteins belong to a lipid-glycolipid antigen-presenting gene family and are related in structure and function to the MHC class I molecules. Previous mapping and DNA hybridization studies have shown that five linked genes located within a cluster on human chromosome 1q22-23 encode the CD1 protein family. We have analyzed the complete genomic sequence of the human CD1 gene cluster and found that the five active genes are distributed over 175,600 nucleotides and separated by four expanded intervening genomic regions (IGRs) ranging in length between 20 and 68 kb. The IGRs are composed mostly of retroelements including five full-length L1 PA sequences and various pseudogenes. Some L1 sequences have acted as receptors for other subtypes or families of retroelements. Alu molecular clocks that have evolved during primate history are found distributed within the HLA class I duplicated segments (duplicons) but not within the duplicons of CD1. Phylogeny of the alpha3 domain of the class I-like superfamily of proteins shows that the CD1 cluster is well separated from HLA class I by a number of superfamily members including MIC (PERB11), HFE, Zn-alpha2-GP, FcRn, and MR1. Phylogenetically, the human CD1 sequences are interspersed by CD1 sequences from other mammalian species, whereas the human HLA class I sequences cluster together and are separated from the other mammalian sequences. Genomic and phylogenetic analyses support the view that the human CD1 gene copies were duplicated prior to the evolution of primates and the bulk of the HLA class I genes found in humans. In contrast to the HLA class I genomic structure, the human CD1 duplicons are smaller in size, they lack Alu clocks, and they are interrupted by IGRs at least 4 to 14 times longer than the CD1 genes themselves. The IGRs seem to have been created as "buffer zones" to protect the CD1 genes from disruption by transposable elements.  相似文献   

3.
《Bio Systems》2006,83(3):226-234
Physiological functions and characteristic structures of the serpin gene superfamily have been studied extensively, yet the evolution of the serpin genes remains unclear. Gene duplication in this superfamily may shed light on this issue. Two models are used to predict the preservation of duplicated genes: the classical model and the duplication–degeneration–complementation (DDC) model. In this study, we analyzed the phylogenetic relationships of 33 human serpin genes and the expression data of some members of the serpin superfamily from a DNA microarray of human leukemia U937 cells with stably inducible expression of the leukemia-related AML1-ETO gene. We then determined the utility of the DDC model by mapping serpin superfamily expression data to the phylogenetic tree. The correlation between sequence and expression divergences as measured by the Pearson correlation coefficient indicated that human serpin genes evolved under the DDC model. Our study provides a new strategy for comparative analysis of gene sequences and microarray data.  相似文献   

4.
A phylogenetic analysis of the basic helix-loop-helix (bHLH) gene superfamily was performed using seven different species (human, mouse, rat, worm, fly, yeast, and plant Arabidopsis) and involving over 600 bHLH genes ( Stevens et al., 2008). All bHLH genes were identified in the genomes of the various species, including expressed sequence tags, and the entire coding sequence was used in the analysis. Nearly 15% of the gene family has been updated or added since the original publication. A super-tree involving six clades and all structural relationships was established and is now presented for four of the species. The wealth of functional data available for members of the bHLH gene superfamily provides us with the opportunity to use this exhaustive phylogenetic tree to predict potential functions of uncharacterized members of the family. This phylogenetic and genomic analysis of the bHLH gene family has revealed unique elements of the evolution and functional relationships of the different genes in the bHLH gene family.  相似文献   

5.
Structure and evolution of the lipase superfamily.   总被引:11,自引:0,他引:11  
The lipase superfamily includes three vertebrate and three invertebrate (dipteran) proteins that show significant amino acid sequence similarity to one another. The vertebrate proteins are lipoprotein lipase (LPL), hepatic lipase (HL), and pancreatic lipase (PL). The dipteran proteins are Drosophila yolk proteins 1, 2, and 3. We review the relationships among these proteins that have been established according to gene structural relatedness and introduce our findings on the phylogenetic relationships, distance relationships, and evolutionary history of the lipase gene superfamily. Drosophila yolk proteins contain a 104 amino acid residue segment that is conserved with respect to the lipases. We have used the yolk proteins as an outgroup to root a phylogeny of the lipase family. Our phylogenetic reconstruction suggests that ancestral PL diverged earlier than HL and LPL, which share a more recent root. Human and bovine LPL are shown to be more closely related to murine LPL than to guinea pig LPL. A comparison of the distance (a measure of the number of substitutions between sequences) between mammalian and avian LPL reveals that guinea pig LPL has the largest distance from the other mammals. Human, rodent, and rabbit HL show marked divergence from one another, although they have similar relative rates of amino acid substitution when compared to human LPL as an outgroup. Human and porcine PL are not as divergent as human and rat HL, suggesting that PL is more conserved than HL. However, canine PL demonstrates an unusually rapid rate of substitution with respect to the other pancreatic lipases. The lipases share several structurally conserved features. One highly conserved sequence (Gly-Xaa-Ser-Xaa-Gly) contains the active site serine. This feature, which agrees with that found in serine esterases and proteases, is found within the entire spectrum of lipases, including the evolutionarily unrelated prokaryotic lipases. We review the location and possible activity of putative lipid binding domains. We have constructed a conservation index (CI) to display conserved structural features within the lipase gene family, a CI of 1.0 signifying perfect conservation. We have found a correlation between a high CI and the position of conserved functional structures. The putative lipid-binding domains of LPL and HL, the disulfide-bridging cysteine residues, catalytic residues, and N-linked glycosylation sites of LPL, HL, and PL all lie within regions having a CI of 0.8 or higher. A number of amino acid substitutions have been identified in familial hyperchylomicronemia which result in loss of LPL function.(ABSTRACT TRUNCATED AT 400 WORDS)  相似文献   

6.
7.
8.
Isolation of a cDNA clone for human antithrombin III   总被引:20,自引:0,他引:20  
Antithrombin III (ATIII) is an important plasma protease inhibitor with a central role in the coagulation system. On the basis of its protein sequence, ATIII is one member of a "super family" of protease inhibitors that includes alpha 1-antitrypsin and chicken ovalbumin. An increased risk of thromboembolism is associated with inherited ATIII deficiency. To study the structure and expression of the human ATIII gene, we have isolated complementary (cDNA) clones for ATIII from human liver mRNA. ATIII cDNA clones were identified by hybridization to a mixture of synthetic oligodeoxynucleotides encoding amino acids 251-256 of the ATIII protein sequence. The largest cDNA clone (1.4 kilobases) included the coding region of ATIII mRNA from codon 10 through a 3'-untranslated region. Comparison of ATIII cDNA clones from two different sources revealed a sequence polymorphism at an internal PstI restriction site. Analysis of both total genomic DNAs and an ATIII gene cloned in a bacteriophage Charon 4A showed that the ATIII gene is present once per haploid genome and is distributed over 10-16 kilobases of DNA. Computer-assisted comparison of the cDNA sequence with those for baboon alpha 1-antitrypsin and chicken ovalbumin revealed homologies consistent with their inclusion in the protease inhibitor superfamily.  相似文献   

9.
The diversity of T-cell receptors is generated primarily by the variable-region gene families, each of which is composed of a large number of member genes. The entire genomic sequence of the variable region (VB) of the T- cell receptor beta chain from humans and mice has become available. To understand the evolutionary dynamics of the VB gene family, we conducted a phylogenetic analysis of all VB genes from humans and mice, as well as a detailed analysis of internal DNA duplications in the human genomic VB region. The phylogenetic tree obtained shows that human and mouse VB genes intermingle extensively rather than forming two separate clusters and that many gene duplications occurred both before and after the divergence between primates and rodents. Analyzing the genomic maps of transposable elements (e.g., LINEs and SINEs) and relic VB genes in the VB gene region, we present evidence that a 20-kb VB region duplicated tandemly four times in the human lineage during the last 32 Myr, and 6 out of the 15 VB genes in this region have become nonfunctional during this period. Our results show that the VB gene family is subject to evolution by a birth-and-death process rather than to concerted evolution.  相似文献   

10.
Enzymes of the chalcone synthase (CHS) superfamily catalyze the production of a variety of secondary metabolites in bacteria, fungi and plants. Some of these metabolites have played important roles during the early evolution of land plants by providing protection from various environmental assaults including UV irradiation. The genome of the moss, Physcomitrella patens, contains at least 17 putative CHS superfamily genes. Three of these genes (PpCHS2b, PpCHS3 and PpCHS5) exist in multiple copies and all have corresponding ESTs. PpCHS11 and probably also PpCHS9 encode non-CHS enzymes, while PpCHS10 appears to be an ortholog of plant genes encoding anther-specific CHS-like enzymes. It was inferred from the genomic locations of genes comprising it that the moss CHS superfamily expanded through tandem and segmental duplication events. Inferred exon–intron architectures and results from phylogenetic analysis of representative CHS superfamily genes of P. patens and other plants showed that intron gain and loss occurred several times during evolution of this gene superfamily. A high proportion of P. patens CHS genes (7 of 14 genes for which the full sequence is known and probably 3 additional genes) are intronless, prompting speculation that CHS gene duplication via retrotransposition has occurred at least twice in the moss lineage. Analyses of sequence similarities, catalytic motifs and EST data indicated that a surprisingly large number (as many as 13) of the moss CHS superfamily genes probably encode active CHS. EST distribution data and different light responsiveness observed with selected genes provide evidence for their differential regulation. Observed diversity within the moss CHS superfamily and amenability to gene manipulation make Physcomitrella a highly suitable model system for studying expansion and functional diversification of the plant CHS superfamily of genes.  相似文献   

11.

Background

The metzincins are a large gene superfamily of proteases characterized by the presence of a zinc protease domain, and include the ADAM, ADAMTS, BMP1/TLL, meprin and MMP genes. Metzincins are involved in the proteolysis of a wide variety of proteins, including those of the extracellular matrix. The metzincin gene superfamily comprises eighty proteins in the human genome and ninety-three in the mouse. When and how the level of complexity apparent in the vertebrate metzincin gene superfamily arose has not been determined in detail. Here we present a comprehensive analysis of vertebrate metzincins using genes from both Ciona intestinalis and Danio rerio to provide new insights into the complex evolution of this gene superfamily.

Results

We have identified 19 metzincin genes in the ciona genome and 83 in the zebrafish genome. Phylogenetic analyses reveal that the expansion of the metzincin gene superfamily in vertebrates has occurred predominantly by the simple duplication of pre-existing genes rather than by the appearance and subsequent expansion of new metzincin subtypes (the only example of which is the meprin gene family). Despite the number of zebrafish metzincin genes being relatively similar to that of tetrapods (e.g. man and mouse), the pattern of gene retention and loss within these lineages is markedly different. In addition, we have studied the evolution of the related TIMP gene family and identify a single ciona and four zebrafish TIMP genes.

Conclusion

The complexity seen in the vertebrate metzincin gene families was mainly acquired during vertebrate evolution. The metzincin gene repertoire in protostomes and invertebrate deuterostomes has remained relatively stable. The expanded metzincin gene repertoire of extant tetrapods, such as man, has resulted largely from duplication events associated with early vertebrate evolution, prior to the sarcopterygian-actinopterygian split. The teleost repertoire of metzincin genes in part parallels that of tetrapods but has been significantly modified, perhaps as a consequence of a teleost-specific duplication event.  相似文献   

12.
J J Bao  R N Sifers  V J Kidd  F D Ledley  S L Woo 《Biochemistry》1987,26(24):7755-7759
alpha 1-Antichymotrypsin belongs to a supergene family that includes alpha 1-antitrypsin, antithrombin III, ovalbumin, and angiotensinogen. The human chromosomal alpha 1-antichymotrypsin gene has been cloned and its molecular structure established. The gene is approximately 12 kb in length and contains five exons and four introns. The locations of the introns within the alpha 1-antichymotrypsin gene are identical with those of the human alpha 1-antitrypsin and angiotensinogen genes. Other members of this supergene family contain introns located at nonhomologous positions of the genes. The homologous organization of the alpha 1-antichymotrypsin and alpha 1-antitrypsin genes corresponds with the high degree of homology between their protein sequences and suggests that these loci arose by recent gene duplication. A model is presented for the evolution of both the genomic structure and the protein sequences of the serine protease inhibitor superfamily.  相似文献   

13.
Concerted evolution of the mouse immunoglobulin gamma chain genes   总被引:4,自引:1,他引:3       下载免费PDF全文
The nucleotide sequences of the immunoglobulin heavy-chain constant region genes of mouse, C gamma 3, C gamma 1, C gamma 2b and C gamma 2a, together with that of a human equivalent C gamma 4 were compared. All the six pairs of genes within the mouse C gamma gene family contain DNA segments that exhibit marked homology, whereas no such segmental homology was found in interspecies comparisons. This result indicates that the four C gamma genes of the mouse evolved concertedly by exchanging parts of their genetic information with each other either by gene conversion or by double unequal crossing-over. Another example of such concerted evolution was found in gene regions encoding membrane domains of the mouse C gamma chains. We also searched for such segmental homologies in other mammalian C gamma gene families and found at least two more examples in man and guinea-pig. In the mouse C gamma gene family, the silent positions of an exon encoding the third domain of C gamma chains show much greater divergence in sequence than other regions, indicating that the genetic information encoded by this gene region was least scrambled during recent evolution. A phylogenetic tree constructed from the nucleotide differences of this exon demonstrates that at least two C gamma genes had already existed before mammalian radiation. Based on these results, evolution of mammalian C gamma gene families is discussed.  相似文献   

14.
The nucleotide sequence of the beta globin gene cluster of the prosimian Galago crassicaudatus has been determined. A total sequence spanning 41,101 bp contains and links together previously published sequences of the five galago beta-like globin genes (5'-epsilon-gamma-psi eta-delta-beta-3'). A computer-aided search for middle interspersed repetitive sequences identified 10 LINE (L1) elements, including a 5' truncated repeat that is orthologous to the full-length L1 element found in the human epsilon-gamma intergenic region. SINE elements that were identified included one Alu type I repeat, four Alu type II repeats, and two methionine tRNA-derived Monomer (type III) elements. Alu type II and Monomer sequences are unique to the galago genome. Structural analyses of the cluster sequence reveals that it is relatively A+T rich (about 62%) and regions with high G+C content are associated primarily with globin coding regions. Comparative analyses with the beta globin cluster sequences of human, rabbit, and mouse reveal extensive sequence homologies in their genic regions, but only human, galago, and rabbit sequences share extensive intergenic sequence homologies. Divergence analyses of aligned intergenic and flanking sequences from orthologous human, galago, and rabbit sequences show a gradation in the rate of nucleotide sequence evolution along the cluster where sequences 5' of the epsilon globin gene region show the least sequence divergence and sequences just 5' of the beta globin gene region show the greatest sequence divergence.  相似文献   

15.
16.
In the Metazoa, globin proteins display an underlying unity in tertiary structure that belies an extraordinary diversity in primary structures, biochemical properties, and physiological functions. Phylogenetic reconstructions can reveal which of these functions represent novel, lineage-specific innovations, and which represent ancestral functions that are shared with homologous globin proteins in other eukaryotes and even prokaryotes. To date, our understanding of globin diversity in deuterostomes has been hindered by a dearth of genomic sequence data from the Ambulacraria (echinoderms + hemichordates), the sister group of chordates, and the phylum Xenacoelomorpha, which includes xenoturbellids, acoelomorphs, and nemertodermatids. Here, we report the results of a phylogenetic and comparative genomic analysis of the globin gene repertoire of deuterostomes. We first characterized the globin genes of the acorn worm, Saccoglossus kowalevskii, a representative of the phylum Hemichordata. We then integrated genomic sequence data from the acorn worm into a comprehensive analysis of conserved synteny and phylogenetic relationships among globin genes from representatives of the eight lineages that comprise the superphylum Deuterostomia. The primary aims were 1) to unravel the evolutionary history of the globin gene superfamily in deuterostomes and 2) to use the estimated phylogeny to gain insights into the functional evolution of deuterostome globins. Results of our analyses indicate that the deuterostome common ancestor possessed a repertoire of at least four distinct globin paralogs and that different subsets of these ancestral genes have been retained in each of the descendant organismal lineages. In each major deuterostome group, a different subset of ancestral precursor genes underwent lineage-specific expansions of functional diversity through repeated rounds of gene duplication and divergence. By integrating results of the phylogenetic analysis with available functional data, we discovered that circulating oxygen-transport hemoglobins evolved independently in several deuterostome lineages and that intracellular nerve globins evolved independently in chordates and acoelomorph worms.  相似文献   

17.
18.
We have identified the Hsp70 gene superfamily of the nematode Caenorhabditis briggsae and investigated the evolution of these genes in comparison with Hsp70 genes from C. elegans, Drosophila, and yeast. The Hsp70 genes are classified into three monophyletic groups according to their subcellular localization, namely, cytoplasm (CYT), endoplasmic reticulum (ER), and mitochondria (MT). The Hsp110 genes can be classified into the polyphyletic CYT group and the monophyletic ER group. The different Hsp70 and Hsp110 groups appeared to evolve following the model of divergent evolution. This model can also explain the evolution of the ER and MT genes. On the other hand, the CYT genes are divided into heat-inducible and constitutively expressed genes. The constitutively expressed genes have evolved more or less following the birth-and-death process, and the rates of gene birth and gene death are different between the two nematode species. By contrast, some heat-inducible genes show an intraspecies phylogenetic clustering. This suggests that they are subject to sequence homogenization resulting from gene conversion-like events. In addition, the heat-inducible genes show high levels of sequence conservation in both intra-species and inter-species comparisons, and in most cases, amino acid sequence similarity is higher than nucleotide sequence similarity. This indicates that purifying selection also plays an important role in maintaining high sequence similarity among paralogous Hsp70 genes. Therefore, we suggest that the CYT heat-inducible genes have been subjected to a combination of purifying selection, birth-and-death process, and gene conversion-like events.  相似文献   

19.
The human TCR beta-chain gene complex includes at least 57 variable (V) gene segments, a number estimated using a combination of Southern blots of conventional and pulsed field gels, sequence analysis of cDNA clones, and from the analysis of genomic cosmid and phage clones. This number includes six TCR beta-chain V genes in four new families identified here by sequence analysis of clones derived from a human TCR beta-chain specific cDNA library. Comparison of the sequences of the new V beta genes with previously reported V beta sequences reveals predicted similarities but less than 75% nucleic acid identity that establishes them as new V beta families. One of the new V beta gene families includes three genes and the other three are single member families. Identification of these six new V beta genes falling into four V beta families brings the total number of transcribed human V beta families to 24 and makes it possible to refine the estimate of the total number of human TCR V beta genes to 57.  相似文献   

20.
A family of 40 terpenoid synthase genes ( AtTPS) was discovered by genome sequence analysis in Arabidopsis thaliana. This is the largest and most diverse group of TPS genes currently known for any species. AtTPS genes cluster into five phylogenetic subfamilies of the plant TPS superfamily. Surprisingly, thirty AtTPS closely resemble, in all aspects of gene architecture, sequence relatedness and phylogenetic placement, the genes for plant monoterpene synthases, sesquiterpene synthases or diterpene synthases of secondary metabolism. Rapid evolution of these AtTPS resulted from repeated gene duplication and sequence divergence with minor changes in gene architecture. In contrast, only two AtTPS genes have known functions in basic (primary) metabolism, namely gibberellin biosynthesis. This striking difference in rates of gene diversification in primary and secondary metabolism is relevant for an understanding of the evolution of terpenoid natural product diversity. Eight AtTPS genes are interrupted and are likely to be inactive pseudogenes. The localization of AtTPS genes on all five chromosomes reflects the dynamics of the Arabidopsis genome; however, several AtTPS genes are clustered and organized in tandem repeats. Furthermore, some AtTPS genes are localized with prenyltransferase genes ( AtGGPPS, geranylgeranyl diphosphate synthase) in contiguous genomic clusters encoding consecutive steps in terpenoid biosynthesis. The clustered organization may have implications for TPS gene evolution and the evolution of pathway segments for the synthesis of terpenoid natural products. Phylogenetic analyses highlight events in the divergence of the TPS paralogs and suggest orthologous genes and a model for the evolution of the TPS gene family.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号