首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Several eukaryotic genomes have been completely sequenced and this provides an opportunity to investigate the extent and characteristics (e.g., single gene duplication, block duplication, etc.) of gene duplication in a genome. Detecting duplicate genes in a genome, however, is not a simple problem because of several complications such as domain shuffling, the existence of isoforms derived from alternative splicing, and annotational errors in the databases. We describe a method for overcoming these difficulties and the extents of gene duplication in the genomes of Drosophila melanogaster, Caenorhabditis elegans, and yeast inferred from this method. We also describe a method for detecting block duplications in a genome. Application of this method showed that block duplication is a common phenomenon in both yeast and nematode. The patterns of block duplication in the two species are, however, markedly different. Yeast shows much more extensive block duplication than nematode, with some chromosomes having more than 40% of the duplications derived from block duplications. Moreover, in yeast the majority of block duplications occurred between chromosomes, while in nematode most block duplications occurred within chromosomes.  相似文献   

2.
In this paper we present a new method for detecting block duplications in a genome. It is more stringent than previous ones in that it requires a more rigorous definition of paralogous genes and that it requires the paralogous proteins on the two blocks to be contiguous. In addition, it provides three criterion choices: (1) the same composition (i.e., having the same paralogues in the two windows), (2) the same composition and gene order, and (3) the same composition, gene order, and gene orientation. The method is completely automated, requiring no visual inspection as in previous methods. We applied it to analyze the complete genomes of S. cerevisiae and C. elegans. In yeast we detected fewer duplicated blocks than previously reported. In C. elegans, however, we detected more block duplications than previously reported, indicating that although our method has a more stringent definition of block duplication than previous ones, it may be more sensitive in detection because it considers every possible window rather than only fixed nonoverlapping windows. Our results show that block duplication is a common phenomenon in both organisms. The patterns of block duplication in the two species are, however, markedly different. The yeast shows much more extensive block duplication than the nematode, with some chromosomes having more than 40% of the duplications derived from block duplications. Moreover, in the yeast the majority of block duplications occurred between chromosomes, while in the nematode most block duplications occurred within chromosomes.  相似文献   

3.
Katju V  Lynch M 《Genetics》2003,165(4):1793-1803
The significance of gene duplication in provisioning raw materials for the evolution of genomic diversity is widely recognized, but the early evolutionary dynamics of duplicate genes remain obscure. To elucidate the structural characteristics of newly arisen gene duplicates at infancy and their subsequent evolutionary properties, we analyzed gene pairs with < or =10% divergence at synonymous sites within the genome of Caenorhabditis elegans. Structural heterogeneity between duplicate copies is present very early in their evolutionary history and is maintained over longer evolutionary timescales, suggesting that duplications across gene boundaries in conjunction with shuffling events have at least as much potential to contribute to long-term evolution as do fully redundant (complete) duplicates. The median duplication span of 1.4 kb falls short of the average gene length in C. elegans (2.5 kb), suggesting that partial gene duplications are frequent. Most gene duplicates reside close to the parent copy at inception, often as tandem inverted loci, and appear to disperse in the genome as they age, as a result of reduced survivorship of duplicates located in proximity to the ancestral copy. We propose that illegitimate recombination events leading to inverted duplications play a disproportionately large role in gene duplication within this genome in comparison with other mechanisms.  相似文献   

4.
The nucleotide sequences of the mitochondrial DNA (mtDNA) molecules of two nematodes, Caenorhabditis elegans [13,794 nucleotide pairs (ntp)], and Ascaris suum (14,284 ntp) are presented and compared. Each molecule contains the genes for two ribosomal RNAs (s-rRNA and l-rRNA), 22 transfer RNAs (tRNAs) and 12 proteins, all of which are transcribed in the same direction. The protein genes are the same as 12 of the 13 protein genes found in other metazoan mtDNAs: Cyt b, cytochrome b; COI-III, cytochrome c oxidase subunits I-III; ATPase6, Fo ATPase subunit 6; ND1-6 and 4L, NADH dehydrogenase subunits 1-6 and 4L: a gene for ATPase subunit 8, common to other metazoan mtDNAs, has not been identified in nematode mtDNAs. The C. elegans and A. suum mtDNA molecules both include an apparently noncoding sequence that contains runs of AT dinucleotides, and direct and inverted repeats (the AT region: 466 and 886 ntp, respectively). A second, apparently noncoding sequence in the C. elegans and A. suum mtDNA molecules (109 and 117 ntp, respectively) includes a single, hairpin-forming structure. There are only 38 and 89 other intergenic nucleotides in the C. elegans and A. suum mtDNAs, and no introns. Gene arrangements are identical in the C. elegans and A. suum mtDNA molecules except that the AT regions have different relative locations. However, the arrangement of genes in the two nematode mtDNAs differs extensively from gene arrangements in all other sequenced metazoan mtDNAs. Unusual features regarding nematode mitochondrial tRNA genes and mitochondrial protein gene initiation codons, previously described by us, are reviewed. In the C. elegans and A. suum mt-genetic codes, AGA and AGG specify serine, TGA specifies tryptophan and ATA specifies methionine. From considerations of amino acid and nucleotide sequence similarities it appears likely that the C. elegans and A. suum ancestral lines diverged close to the time of divergence of the cow and human ancestral lines, about 80 million years ago.  相似文献   

5.
Drosophila melanogaster is an arthropod with a much more complex anatomy and physiology than the nematode Caenorhabditis elegans. We investigated one of the protein superfamilies in the two organisms that plays a major role in development and function of cell-cell communication: the immunoglobulin superfamily (IgSF). Using hidden Markov models, we identified 142 IgSF proteins in Drosophila and 80 in C. elegans. Of these, 58 and 22, respectively, have been previously identified by experiments. On the basis of homology and the structural characterisation of the proteins, we can suggest probable types of function for most of the novel proteins. Though overall Drosophila has fewer genes than C. elegans, it has many more IgSF cell-surface and secreted proteins. Half the IgSF proteins in C. elegans and three quarters of those in Drosophila have evolved subsequent to the divergence of the two organisms. These results suggest that the expansion of this protein superfamily is one of the factors that have contributed to the formation of the more complex physiological features that are found in Drosophila.  相似文献   

6.
Xie T  Ding D 《Gene》2000,261(2):305-310
It is one of key problems for comparative genomics to accurately identify orthologous genes/proteins. Here 42 quartettes of human, yeast Saccharomyces cerevisiae, nematode Caenorhabditis elegans, and fruit fly Drosophila melanogaster candidate orthologs, defined by using similarity-based highest hit criteria (Mushegian et al., 1998 Genome Res. 8: 590-598), were reconsidered according to molecular evolutionary analysis. We found that only 14 of the 42 candidate orthologous groups can be identified to have truly one-to-one orthologous relationships, whereas other groups were characterized by one (many)-to-many orthologous relationships or even more complex scenarios involving gene duplications and/or gene losses. The result could imply that the classical one-to-one orthology might be not as common as typically accepted and automated similarity-based methods should be used with caution when accurate orthology/paralogy discrimination is required.  相似文献   

7.
Thomas JH 《Genetics》2006,172(1):127-143
An algorithm for detecting local clusters of homologous genes was applied to the genome of Caenorhabditis elegans. Clusters of two or more homologous genes are abundant, totaling 1391 clusters containing 4607 genes, over one-fifth of all genes in C. elegans. Cluster genes are distributed unevenly in the genome, with the large majority located on autosomal chromosome arms, regions characterized by higher genetic recombination and more repeat sequences than autosomal centers and the X chromosome. Cluster genes are transcribed at much lower levels than average and very few have gross phenotypes as assayed by RNAi-mediated reduction of function. The molecular identity of cluster genes is unusual, with a preponderance of nematode-specific gene families that encode putative secreted and transmembrane proteins, and enrichment for genes implicated in xenobiotic detoxification and innate immunity. Gene clustering in Drosophila melanogaster is also substantial and the molecular identity of clustered genes follows a similar pattern. I hypothesize that autosomal chromosome arms in C. elegans undergo frequent local gene duplication and that these duplications support gene diversification and rapid evolution in response to environmental challenges. Although specific gene clusters have been documented in C. elegans, their abundance, genomic distribution, and unusual molecular identities were previously unrecognized.  相似文献   

8.
The cut superclass of homeobox genes has been divided into three classes: CUX, ONECUT and SATB. Given the various completed genomes, we have now made a comprehensive survey. We find that there are only two cut domain containing genes in Drosophila, one CUX and one ONECUT type. Caenorhabditis elegans has undergone an expansion of the ONECUT subclass genes and has a gene cluster with three ONECUT class genes, one of which has lost the cut domain. Two of these genes contain a conserved sequence motif, termed OCAM, which also occurs in another gene in C. elegans this motif seems to be nematode specific. A recently uncovered C. elegans CUX gene has sequence conservation in its amino-terminus with vertebrate CUX proteins. Further, the 5' end of this gene containing the conserved region can undergo alternative splicing to give rise to a protein with a different carboxy-terminus lacking the cut- and homeodomain. This protein is conserved in its entirety with vertebrate genes termed CASP--which are also alternative splice products of the CUX genes--and with plant and fungal genes. The highly divergent SATB genes share a conserved amino terminal domain, COMPASS, with the Drosophila defective proventriculus gene and a C. elegans ORF. These two "COMPASS" family genes encode two highly divergent homeodomains, may be homologues of the SATB genes and thus probably belong to the cut superclass, too.  相似文献   

9.
10.
Gene duplication plays an important role in evolution because it is the primary source of new genes. Many recent studies showed that gene duplicability varies considerably among genes. Several considerations led us to hypothesize that less important genes have higher rates of successful duplications, where gene importance is measured by the fitness reduction caused by the deletion of the gene. Here, we test this hypothesis by comparing the importance of two groups of singleton genes in the yeast Saccharomyces cerevisiae (Sce). Group S genes did not duplicate in four other yeast species examined, whereas group D experienced duplication in these species. Consistent with our hypothesis, we found group D genes to be less important than group S genes. Specifically, 17% of group D genes are essential in Sce, compared to 28% for group S. Furthermore, deleting a group D gene in Sce reduces the fitness by 24% on average, compared to 38% for group S. Our subsequent analysis showed that less important genes have more cis-regulatory motifs, which could lead to a higher chance of subfunctionalization of duplicate genes and result in an enhanced rate of gene retention. Less important genes may also have weaker dosage imbalance effects and cause fewer genetic perturbations when duplicated. Regardless of the cause, our observation indicates that the previous finding of a less severe fitness consequence of deleting a duplicate gene than deleting a singleton gene is at least in part due to the fact that duplicate genes are intrinsically less important than singleton genes and suggests that the contribution of duplicate genes to genetic robustness has been overestimated.  相似文献   

11.
The enrichment of duplicate genes, and therefore paralogs (proteins coded by duplicate genes), in multicellular versus unicellular organisms enhances genomic functional innovation. This study quantitatively examined relationships among paralog enrichment, expression pattern diversification and multicellularity, aiming to better understand genomic basis of multicellularity. Paralog abundance in specific cells was compared with those in unicellular proteomes and the whole proteomes of multicellular organisms. The budding yeast, Saccharomyces cerevisiae and the nematode, Caenorhabditis elegans, for which the gene sets expressed in specific cells are available, were used as uni and multicellular models, respectively. Paralog count (K) distributions [P((k))] follow a power-law relationship [P((k)) ∝ k(-α)] in the whole proteomes of both species and in specific C. elegans cells. The value of the constant α can be used as a gauge of paralog abundance; the higher the value, the lower the paralog abundance. The α-value is indeed lower in the whole proteome of C. elegans (1.74) than in S. cerevisiae (2.34), quantifying the enrichment of paralogs in multicellular species. We also found that the power-law relationship applies to the proteomes of specific C. elegans cells. Strikingly, values of α in specific cells are higher and comparable to that in S. cerevisiae. Thus, paralog abundance in specific cells is lower and comparable to that in unicellular species. Furthermore, how much the expression level of a gene fluctuates across different C. elegans cells correlates positively with its paralog count, which is further confirmed by human gene-expression patterns across different tissues. Taken together, these results quantitatively and mechanistically establish enrichment of paralogs with diversifying expression patterns as genomic and evolutionary basis of multicellularity.  相似文献   

12.
A novel Drosophila melanogaster gene UBL3 was characterized and shown to be highly conserved in man and Caenorhabditis elegans (C. elegans). The human and mouse homologues were cloned and sequenced. UBL3 is a ubiquitin-like protein of unknown function with no conserved homologues in yeast. Mapping of the human and mouse UBL3 genes places them within a region of shared gene order between human and mouse chromosomes on human chromosome 13q12-13 and telomeric mouse chromosome 5 (MMU5).  相似文献   

13.
14.
15.
Collagens, modifying enzymes and their mutations in humans, flies and worms   总被引:20,自引:0,他引:20  
Collagens and proteins with collagen-like domains form large superfamilies in various species, and the numbers of known family members are increasing constantly. Vertebrates have at least 27 collagen types with 42 distinct polypeptide chains, >20 additional proteins with collagen-like domains and approximately 20 isoenzymes of various collagen-modifying enzymes. Caenorhabditis elegans has approximately 175 cuticle collagen polypeptides and two basement membrane collagens. Drosophila melanogaster has far fewer collagens than many other species but has approximately 20 polypeptides similar to the catalytic subunits of prolyl 4-hydroxylase, the key enzyme of collagen synthesis. More than 1300 mutations have so far been characterized in 23 of the 42 human collagen genes in various diseases, and many mouse models and C. elegans mutants are also available to analyse the collagen gene family and their modifying enzymes.  相似文献   

16.
17.
Exposure of the nematode Caenorhabditis elegans to a heat shock results in the induction of a number of genes not normally expressed in the animals under normal growth conditions. Among these are a family of genes encoding 16 kDa heat shock proteins (hsp16s). The major hsp16 genes have been cloned and characterized, and found to reside at two clusters in the C. elegans genome. One cluster contains two distinct genes, hsp16-1 and hsp16-48, arranged in divergent orientations separated by only 348 base pairs (bp). An identical pair, duplicated and inverted with respect to the first pair, is located 415 bp away. This cluster, located on chromosome V, therefore contains four genes as two identical pairs within less than 4 kilobases of DNA, and the pairs form the arms of a large inverted repeat. A second pair of genes, hsp16-2 and hsp16-41, constitutes a second hsp16 locus with an organization very similar to that of the hsp16-1/48 locus, except that it is not duplicated. Comparisons of the derived amino acid sequences show that hsp16-1 and hsp16-2 form a closely related pair, as do hsp16-41 and hsp16-48. These hsps show extensive sequence identity with the small hsps of Drosophila, as well as with mammalian alpha-crystallins. The coding region of each gene is interrupted by a single intron of approximately 50 bp, in a position homologous to that of the first intron in mouse alpha-crystallin gene.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

18.
Zhang P  Min W  Li WH 《Gene》2004,342(2):263-268
We studied the age distribution of duplicate genes in each of four eukaryotic genomes: human, Arabidopsis thaliana, Caenorhabditis elegans, and Drosophila melanogaster. The four distributions differ greatly from each other, contrary to the previous proposal of a universal L-shaped distribution in all eukaryotic genomes studied. Indeed, only the distribution in humans is L-shaped. The distribution in Arabidopsis is consistent with the hypothesis of an ancient genome duplication with no recent burst of duplication events, while the distribution in C. elegans is nearly uniform. We also applied a nonparametric method to the human distribution to show that the rate of loss of duplicate genes decreases over time, contrary to the proposal of an exponential decay. One possible explanation of the decreasing rate of loss of duplicate genes over time could be rapid functional divergence between duplicate genes, providing an advantage for the retention of both duplicates.  相似文献   

19.
BACKGROUND: Both animals and plants respond rapidly to pathogens by inducing the expression of defense-related genes. Whether such an inducible system of innate immunity is present in the model nematode Caenorhabditis elegans is currently an open question. Among conserved signaling pathways important for innate immunity, the Toll pathway is the best characterized. In Drosophila, this pathway also has an essential developmental role. C. elegans possesses structural homologs of components of this pathway, and this observation raises the possibility that a Toll pathway might also function in nematodes to trigger defense mechanisms or to control development. RESULTS: We have generated and characterized deletion mutants for four genes supposed to function in a nematode Toll signaling pathway. These genes are tol-1, trf-1, pik-1, and ikb-1 and are homologous to the Drosophila melanogaster Toll, dTraf, pelle, and cactus genes, respectively. Of these four genes, only tol-1 is required for nematode development. None of them are important for the resistance of C. elegans to a number of pathogens. On the other hand, C. elegans is capable of distinguishing different bacterial species and has a tendency to avoid certain pathogens, including Serratia marcescens. The tol-1 mutants are defective in their avoidance of pathogenic S. marcescens, although other chemosensory behaviors are wild type. CONCLUSIONS: In C. elegans, tol-1 is important for development and pathogen recognition, as is Toll in Drosophila, but remarkably for the latter r?le, it functions in the context of a behavioral mechanism that keeps worms away from potential danger.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号