首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
How Many Processed Pseudogenes Are Accumulated in a Gene Family?   总被引:2,自引:0,他引:2       下载免费PDF全文
James Bruce Walsh 《Genetics》1985,110(2):345-364
A simple kinetic model is developed that describes the accumulation of processed pseudogenes in a functional gene family. Insertion of new pseudogenes occurs at rate ν per gene and is countered by spontaneous deletion (at rate δ per DNA segment) of segments containing processed pseudogenes. If there are k functional genes in a gene family, the equilibrium number of processed pseudogenes is k(ν/δ), and the percentage of functional genes in the gene family at equilibrium is 1/[1 + (ν/δ)]. ν/δ values estimated for five gene families ranged from 1.7 to 15. This fairly narrow range suggests that the rates of formation and deletion of processed pseudogenes may be positively correlated for these families. If δ is sufficiently large relative to the per nucleotide mutation rate µ (δ > 20µ), processed pseudogenes will show high homology with each other, even in the absence of gene conversion between pseudogenes. We argue that formation of processed pseudogenes may share common pathways with transposable elements and retroviruses, creating the potential for correlated responses in the evolution of processed pseudogenes due to direct selection for control of transposable elements and/or retroviruses. Finally, we discuss the nature of the selective forces that may act directly or indirectly to influence the evolution of processed pseudogenes.

Anything produced by evolution is bound to be a bit of a mess—S. Brenner

  相似文献   

2.
3.
We propose a method by which the intensity of purifying selection on a functional protein-coding gene is estimated by using three aligned homologous sequences: a processed pseudogene (psi), a functional paralog from the same species (g), and a functional ortholog from a different species (o). For each such trio, we calculate the numbers of nucleotide substitutions along the branches leading to psi and g, i.e., K psi and K(g). If we assume that the mutation rates are the same in the genes and the pseudogenes and that mutations occurring in a pseudogene do not affect the fitness of the organism, we can show that the fraction of mutations that are selectively neutral, fg, is equal to the ratio K(g)/K psi. Since advantageous mutations occur only very rarely, such that they do not contribute significantly to the rate of molecular evolution, the fraction of deleterious mutations that are subject to purifying selection is 1-fg. Therefore, the K(g)/K psi ratio can be used directly to estimate the intensity of purifying selection, thereby isolating its effects on the rate of evolution from those of mutation. We compared the selection intensities of 12 orthologous protein-coding pairs from humans and murids. As expected, the fraction of mutations that are subject to purifying selection is strongest in the second codon position and weakest in the third. Interestingly, the mean fractions of effectively neutral mutations in the third codon position were only 41% and 42% for murids and humans, respectively, indicating that many synonymous mutations are subject to selective constraint. In several orthologous genes, we found that the intensity of purifying selection is very different between murid and human orthologous genes. There was no statistically significant difference in overall intensity of purifying selection between humans and murids. Thus, purifying selection does not seem to be an important factor contributing to the observed differences in the rates of evolution between these two taxa.  相似文献   

4.
Role of gene duplication in evolution   总被引:7,自引:0,他引:7  
T Ohta 《Génome》1989,31(1):304-310
It is now known that many multigene and supergene families exist in eukaryote genomes: multigene families with uniform copy members like genes for ribosomal RNA, those with variable members like immunoglobulin genes, and supergene families such as those for various growth factor and hormone receptors. Many such examples indicate that gene duplication and subsequent differentiation are extremely important for organismal evolution. In particular, gene duplication could well have been the primary mechanism for the evolution of complexity in higher organisms. Population genetic models for the origin of gene families with diverse functions are presented, in which natural selection favors those genomes with more useful mutants in duplicated genes. Since any gene has a certain probability of degenerating by mutation, success versus failure in acquiring a new gene by duplication may be expressed as the ratio of probabilities of spreading of useful versus detrimental mutations in redundant gene copies. Also examined are the effects of gene duplication on evolution by compensatory advantageous mutations. Results of the analyses show that both natural selection and random drift are important for the origin of gene families. In addition, interaction between molecular mechanisms such as unequal crossing-over and gene conversion, and selection or drift is found to have a large effect on evolution by gene duplication.  相似文献   

5.
Microarray technologies allow the identification of large numbers of expression differences within and between species. Although environmental and physiological stimuli are clearly responsible for changes in the expression levels of many genes, it is not known whether the majority of changes of gene expression fixed during evolution between species and between various tissues within a species are caused by Darwinian selection or by stochastic processes. We find the following: (1) expression differences between species accumulate approximately linearly with time; (2) gene expression variation among individuals within a species correlates positively with expression divergence between species; (3) rates of expression divergence between species do not differ significantly between intact genes and expressed pseudogenes; (4) expression differences between brain regions within a species have accumulated approximately linearly with time since these regions emerged during evolution. These results suggest that the majority of expression differences observed between species are selectively neutral or nearly neutral and likely to be of little or no functional significance. Therefore, the identification of gene expression differences between species fixed by selection should be based on null hypotheses assuming functional neutrality. Furthermore, it may be possible to apply a molecular clock based on expression differences to infer the evolutionary history of tissues.  相似文献   

6.
Selectionism and neutralism in molecular evolution   总被引:20,自引:0,他引:20  
Charles Darwin proposed that evolution occurs primarily by natural selection, but this view has been controversial from the beginning. Two of the major opposing views have been mutationism and neutralism. Early molecular studies suggested that most amino acid substitutions in proteins are neutral or nearly neutral and the functional change of proteins occurs by a few key amino acid substitutions. This suggestion generated an intense controversy over selectionism and neutralism. This controversy is partially caused by Kimura's definition of neutrality, which was too strict (|2Ns|< or =1). If we define neutral mutations as the mutations that do not change the function of gene products appreciably, many controversies disappear because slightly deleterious and slightly advantageous mutations are engulfed by neutral mutations. The ratio of the rate of nonsynonymous nucleotide substitution to that of synonymous substitution is a useful quantity to study positive Darwinian selection operating at highly variable genetic loci, but it does not necessarily detect adaptively important codons. Previously, multigene families were thought to evolve following the model of concerted evolution, but new evidence indicates that most of them evolve by a birth-and-death process of duplicate genes. It is now clear that most phenotypic characters or genetic systems such as the adaptive immune system in vertebrates are controlled by the interaction of a number of multigene families, which are often evolutionarily related and are subject to birth-and-death evolution. Therefore, it is important to study the mechanisms of gene family interaction for understanding phenotypic evolution. Because gene duplication occurs more or less at random, phenotypic evolution contains some fortuitous elements, though the environmental factors also play an important role. The randomness of phenotypic evolution is qualitatively different from allele frequency changes by random genetic drift. However, there is some similarity between phenotypic and molecular evolution with respect to functional or environmental constraints and evolutionary rate. It appears that mutation (including gene duplication and other DNA changes) is the driving force of evolution at both the genic and the phenotypic levels.  相似文献   

7.
The nonsynonymous (amino acid-altering) to synonymous (silent) substitution rate ratio (omega = d(N)/d(S)) provides a measure of natural selection at the protein level, with omega = 1, >1, and <1, indicating neutral evolution, purifying selection, and positive selection, respectively. Previous studies that used this measure to detect positive selection have often taken an approach of pairwise comparison, estimating substitution rates by averaging over all sites in the protein. As most amino acids in a functional protein are under structural and functional constraints and adaptive evolution probably affects only a few sites at a few time points, this approach of averaging rates over sites and over time has little power. Previously, we developed codon-based substitution models that allow the omega ratio to vary either among lineages or among sites. In this paper we extend previous models to allow the omega ratio to vary both among sites and among lineages and implement the new models in the likelihood framework. These models may be useful for identifying positive selection along prespecified lineages that affects only a few sites in the protein. We apply those branch-site models as well as previous branch- and site-specific models to three data sets: the lysozyme genes from primates, the tumor suppressor BRCA1 genes from primates, and the phytochrome (PHY) gene family in angiosperms. Positive selection is detected in the lysozyme and BRCA genes by both the new and the old models. However, only the new models detected positive selection acting on lineages after gene duplication in the PHY gene family. Additional tests on several data sets suggest that the new models may be useful in detecting positive selection after gene duplication in gene family evolution.  相似文献   

8.
Pseudogenes are defined as non-functional relatives of genes whose protein-coding abilities are lost and are no longer expressed within cells. They are an outcome of accumulation of mutations within a gene whose end product is not essential for survival. Proper investigation of the procedure of pseudogenization is relevant for estimating occurrence of duplications in genomes. Frankineae houses an interesting group of microorganisms, carving a niche in the microbial world. This study was undertaken with the objective of determining the abundance of pseudogenes, understanding strength of purifying selection, investigating evidence of pseudogene expression, and analysing their molecular nature, their origin, evolution and deterioration patterns amongst domain families. Investigation revealed the occurrence of 956 core pFAM families sharing common characteristics indicating co-evolution. WD40, Rve_3, DDE_Tnp_IS240 and phage integrase core domains are larger families, having more pseudogenes, signifying a probability of harmful foreign genes being disabled within transposable elements. High selective pressure depicted that gene families rapidly duplicating and evolving undoubtedly facilitated creation of a number of pseudogenes in Frankineae. Codon usage analysis between protein-coding genes and pseudogenes indicated a wide degree of variation with respect to different factors. Moreover, the majority of pseudogenes were under the effect of purifying selection. Frankineae pseudogenes were under stronger selective constraints, indicating that they were functional for a very long time and became pseudogenes abruptly. The origin and deterioration of pseudogenes has been attributed to selection and mutational pressure acting upon sequences for adapting to stressed soil environments.  相似文献   

9.
To study reductive evolutionary processes in bacterial genomes, we examine sequences in the Rickettsia genomes which are unconstrained by selection and evolve as pseudogenes, one of which is the metK gene, which codes for AdoMet synthetase. Here, we sequenced the metK gene and three surrounding genes in eight different species of the genus Rickettsia. The metK gene was found to contain a high incidence of deletions in six lineages, while the three genes in its surroundings were functionally conserved in all eight lineages. A more drastic example of gene degradation was identified in the metK downstream region, which contained an open reading frame in Rickettsia felis. Remnants of this open reading frame could be reconstructed in five additional species by eliminating sites of frameshift mutations and termination codons. A detailed examination of the two reconstructed genes revealed that deletions strongly predominate over insertions and that there is a strong transition bias for point mutations which is coupled to an excess of GC-to-AT substitutions. Since the molecular evolution of these inactive genes should reflect the rates and patterns of neutral mutations, our results strongly suggest that there is a high spontaneous rate of deletions as well as a strong mutation bias toward AT pairs in the Rickettsia genomes. This may explain the low genomic G + C content (29%), the small genome size (1.1 Mb), and the high noncoding content (24%), as well as the presence of several pseudogenes in the Rickettsia prowazekii genome.  相似文献   

10.
11.
Simulating Evolution by Gene Duplication   总被引:14,自引:5,他引:14       下载免费PDF全文
Tomoko Ohta 《Genetics》1987,115(1):207-213
By considering the recent finding that unequal crossing over and other molecular interactions are contributing to the evolution of multigene families, a model of the origin of repetitive genes was studied by Monte Carlo simulations. Starting from a single gene copy, how genetic systems evolve was examined under unequal crossing over, random drift and natural selection. Both beneficial and deteriorating mutations were incorporated, and the latter were assumed to occur ten times more frequently than the former. Positive natural selection favors those chromosomes with more beneficial mutations in redundant copies than others in the population, but accumulation of deteriorating mutations (pseudogenes) have no effect on fitness so long as there remains a functional gene. The results imply the following: Positive natural selection is needed in order to acquire gene families with new functions. Without it, too many pseudogenes accumulate before attaining a functional gene family. There is a large fluctuation in the outcome even if parameters are the same. When unequal crossing over occurs more frequently, the system evolves more rapidly. It was also shown, under realistic values of parameters, that the genetic load for acquiring a new gene is not as large as J.B.S. Haldane suggested, but not so small as in a model in which a system for selection started from already redundant genes.  相似文献   

12.
Somatic immunoglobulin diversity is generated in avian species by sequential gene conversion of variable (V) gene segments of the immunoglobulin heavy- and light-chain loci during B-cell development. The germ line pools of donor sequence information for somatic V-region gene conversion are found in families of V pseudogenes, located 5' of the single functional V gene of each locus. The sequence relationships among the pseudogenes (psi VL) and functional VL1 gene of the chicken light-chain alleles in three inbred strains were compared to determine the extent of diversity within the germ line pseudogene cluster. Numerous differences were observed. For example, compared with the previously reported CB allele and the G4 allele, the S3 allele contains two intact pseudogenes between psi VL16 and psi VL18. These two adjacent psi VL gene segments (psi VL17a and psi VL17b) could have given rise to the psi VL17 segment of the G4 and CB alleles by homologous recombination. The majority of other sequence polymorphisms among the psi VL alleles appear to be the result of meiotic gene conversion. The incidence of untemplated mutations within psi VL segments is significantly lower than the incidence of mutation within the pseudogene flanking regions. Together with the observations that most psi VL segments have open reading frames and lack stop codons, these data support the hypothesis that the psi VL cluster resembles a functional multigene family maintained by evolutionary selection for its functional role in generating somatic antibody diversity. Meiotic gene conversion events within the psi VL cluster serve both to introduce diversity by the exchange of short segments between family members and to prevent the accumulation of random mutations.  相似文献   

13.
In order to understand the origin of multigene families, Monte Carlo simulations were performed to see how a genetic system evolves under unequal crossing-over, mutation, random genetic drift and natural selection, starting from a single gene copy. Both haploid and diploid models were examined. Beneficial, neutral, and detrimental mutations were incorporated, and “positive” selection favors those chromosomes (haploid) or individuals (diploid) with more beneficial mutations than others. The same model for haploids was previously investigated with special reference to the evolution of gene organization, and the ratio of the numbers of beneficial genes to pseudogenes was found to be a rough indicator of the relative strengths of positive and negative (against deleterious alleles) natural selection (Ohta, 1987b). In the present paper, the evolution of gene organization and of sequence divergence among genes in the multigene family is examined. It is shown that positive selection accelerates the accumulation of arrays containing different beneficial mutations, but that total divergence including both neutral and beneficial mutations is not very sensitive to positive selection, under this model. The proportion of beneficial mutations in the total mutations accumulated is a better indicator of positive selection than is the total divergence. It is pointed out that various observed examples in which amino-acid substitutions are accelerated, as compared with synonymous substitutions in duplicated genes (Li, 1985), may reflect the effect of selection similar to the present scheme. The diploid model is shown to be more efficient for accumulating beneficial mutations in duplicated genes than the haploid one, and the relevance of this finding to the advantage of sexual reproduction is discussed.  相似文献   

14.
Rice DP  Townsend JP 《Genetics》2012,190(4):1533-1545
Evolutionary biologists attribute much of the phenotypic diversity observed in nature to the action of natural selection. However, for many phenotypic traits, especially quantitative phenotypic traits, it has been challenging to test for the historical action of selection. An important challenge for biologists studying quantitative traits, therefore, is to distinguish between traits that have evolved under the influence of strong selection and those that have evolved neutrally. Most existing tests for selection employ molecular data, but selection also leaves a mark on the genetic architecture underlying a trait. In particular, the distribution of quantitative trait locus (QTL) effect sizes and the distribution of mutational effects together provide information regarding the history of selection. Despite the increasing availability of QTL and mutation accumulation data, such data have not yet been effectively exploited for this purpose. We present a model of the evolution of QTL and employ it to formulate a test for historical selection. To provide a baseline for neutral evolution of the trait, we estimate the distribution of mutational effects from mutation accumulation experiments. We then apply a maximum-likelihood-based method of inference to estimate the range of selection strengths under which such a distribution of mutations could generate the observed QTL. Our test thus represents the first integration of population genetic theory and QTL data to measure the historical influence of selection.  相似文献   

15.
Buchnera, the primary bacterial endosymbiont of aphids, is known to provision essential amino acids lacking in the hosts' diet of plant sap. The recent discovery of silenced copies of genes for tryptophan biosynthesis (trpEG) in certain Buchnera lineages suggests a decay in symbiotic functions in some aphid species. However, neither the distribution of pseudogenes among lineages nor the impact of this gene silencing on amino-acid availability in hosts has been assessed. In Buchnera of the aphid Diuraphis noxia, tandem repeats of these pseudogenes have persisted in diverse lineages, and thpEG pseudogenes have originated at least twice within this aphid genus. Measures of amino-acid concentrations in Diuraphis species have shown that the presence of the pseudogene is associated with a decreased availability of tryptophan, indicating that gene silencing decreases nutrient provisioning by symbionts. In Buchnera of Diuraphis, rates of nonsynonymous substitutions are elevated in functional trpE copies, supporting the hypothesis that pseudogene origin and persistence reflect a reduced selection for symbiont biosynthetic contributions. The parallel evolution of trpEG pseudogenes in Buchnera of Diuraphis and certain other aphid hosts suggests that either selection at the host level is not effective or that fitness in these aphids is not limited by tryptophan availability.  相似文献   

16.
17.
We sequenced three argininosuccinate-synthetase-processed pseudogenes (ΨAS-A1, ΨAS-A3, ΨAS-3) and their noncoding flanking sequences in human, orangutan, baboon, and colobus. Our data showed that these pseudogenes were incorporated into the genome of the Old World monkeys after the divergence of the Old World and New World monkey lineages. These pseudogene flanking regions show variable mutation rates and patterns. The variation in the G/C to A/T mutation rate (u) can account for the unequal GC contents at equilibrium: 34.9, 36.9, and 41.7% in the pseudogene ΨAS-A1, ΨAS-A3, and ΨAS-3 flanking regions, respectively. The A/T to G/C mutation rate (v) seems stable and the u/v ratios equal 1.9, 1.7, and 1.4 in the flanking regions of ΨAS-A1, ΨAS-A3, and ΨAS-3, respectively. These ``regional' variations of the mutation rate affect the evolution of the pseudogenes, too. The ratio u/v being greater than 1.0 in each case, the overall mutation rate in the GC-rich pseudogenes is, as expected, higher than in their GC-poor flanking regions. Moreover, a ``sequence effect' has been found. In the three cases examined u and v are higher (at least 20%) in the pseudogene than in its flanking region—i.e., the pseudogene appears as mutation ``hot' spots embedded in ``cold' regions. This observation could be partly linked to the fact that the pseudogene flanking regions are long-standing unconstrained DNA sequences, whereas the pseudogenes were relieved of selection on their coding functions only around 30–40 million years ago. We suspect that relatively more mutable sites maintained unchanged during the evolution of the argininosuccinate gene are able to change in the pseudogenes, such sites being eliminated or rare in the flanking regions which have been void of strong selective constraints over a much longer period. Our results shed light on (1) the multiplicity of factors that tune the spontaneous mutation rate and (2) the impact of the genomic position of a sequence on its evolution. Received: 10 February 1997 / Accepted: 21 April 1997  相似文献   

18.
19.
Processed genes are created by retroposition from messenger RNA of expressed genes. The estimated amount of processed copies of genes in the human genome is 10,000-14,000. Some of these might be pseudogenes with the expected pattern for nonfunctional sequences, but some others might be an important source of new genes. We have studied the evolution of a Phosphoglycerate mutase processed gene (PGAM3) described in humans and believed to be a pseudogene. We sequenced PGAM3 in chimpanzee and macaque and obtained polymorphism data for human coding region. We found evidence that PGAM3 likely produces a functional protein, as an example of addressing functionality for human processed pseudogenes. First, the open reading frame was intact despite many deletions that occurred in the 3' untranslated region. Second, it appears that the gene is expressed. Finally, interspecies and intraspecies variation for PGAM3 was not consistent with the neutral model proposed for pseudogenes, suggesting that a new functional primate gene has originated. Amino acid divergence was significantly higher than synonymous divergence in PGAM3 lineage, supporting positive selection acting in this gene. This role of selection was further supported by the excess of rare alleles in a population genetic analysis. PGAM3 is located in a region of very low recombination; therefore, it is conceivable that the rapid fixation events in this newly arising gene may have contributed to a selective sweep of variation in the region.  相似文献   

20.
Studies of neutrally evolving sequences suggest that differences in eukaryotic genome sizes result from different rates of DNA loss. However, very few pseudogenes have been identified in microbial species, and the processes whereby genes and genomes deteriorate in bacteria remain largely unresolved. The typhus-causing agent, Rickettsia prowazekii, is exceptional in that as much as 24% of its 1.1-Mb genome consists of noncoding DNA and pseudogenes. To test the hypothesis that the noncoding DNA in the R. prowazekii genome represents degraded remnants of ancestral genes, we systematically examined all of the identified pseudogenes and their flanking sequences in three additional Rickettsia species. Consistent with the hypothesis, we observe sequence similarities between genes and pseudogenes in one species and intergenic DNA in another species. We show that the frequencies and average sizes of deletions are larger than insertions in neutrally evolving pseudogene sequences. Our results suggest that inactivated genetic material in the Rickettsia genomes deteriorates spontaneously due to a mutation bias for deletions and that the noncoding sequences represent DNA in the final stages of this degenerative process.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号