Similar articles
 Found 20 similar articles; search took 15 ms
1.
Gene conversion is the unidirectional transfer of genetic information between allelic (orthologous) or nonallelic (paralogous) DNA segments. Recently, there has been much interest in understanding how gene conversion shapes the nucleotide composition of the genomic landscape. A widely held hypothesis is that gene conversion is universally GC-biased. However, direct experimental evidence of this hypothesis is limited to a single study of meiotic crossovers in yeast. Although there have been a number of indirect studies of gene conversion, evidence of GC-biased replacements gathered from such studies can also be attributed to positive selection, which has the same evolutionary dynamics as biased gene conversion. Here, we apply a direct phylogenetic approach to examine nucleotide replacements produced by nonallelic gene conversion in Drosophila and primate genomes. We find no evidence for GC-biased gene conversion in either lineage, suggesting that previously observed GC biases may be due to positive selection rather than to biased gene conversion.

2.
Ancestral allele information is useful for genetics studies. Previously, the identification of ancestral alleles was primarily based on sequence alignments between species. Alternative ways to identify ancestral alleles were proposed in this study based on population sequencing data. The methods described here utilized the diversity between haplotypes harboring ancestral and newly emerged alleles. Simulations showed that these methods were reliable for identifying ancestral alleles when the variants had not aged too greatly. Application to the human genome sequencing data suggested the role of indels in maintaining the GC content in the human genome. The deletion-to-insertion ratios and GC proportions were correlated depending on the sizes of insertions and deletions in the direction of increasing GC content. There were GC-biased fixations in single base-pair insertions and AT-biased fixations in single base-pair deletions in the results based on the proposed methods. In the current study, GC-biased gene conversions in nucleotide substitutions were very slight or insignificant. In the variants of several quantitative trait loci (QTLs), slight GC-biased gene conversion was observed in nucleotide substitutions. For the QTL indels, insertions were observed more often than deletions, and deletion-biased fixation was observed, providing new insights into the evolution of functional genes.

3.
Comparative genomic analyses of primates offer considerable potential to define and understand the processes that mold, shape, and transform the human genome. However, primate taxonomy is both complex and controversial, with marginal unifying consensus of the evolutionary hierarchy of extant primate species. Here we provide new genomic sequence (~8 Mb) from 186 primates representing 61 (~90%) of the described genera, and we include outgroup species from Dermoptera, Scandentia, and Lagomorpha. The resultant phylogeny is exceptionally robust and illuminates events in primate evolution from ancient to recent, clarifying numerous taxonomic controversies and providing new data on human evolution. Ongoing speciation, reticulate evolution, ancient relic lineages, unequal rates of evolution, and disparate distributions of insertions/deletions among the reconstructed primate lineages are uncovered. Our resolution of the primate phylogeny provides an essential evolutionary framework with far-reaching applications including: human selection and adaptation, global emergence of zoonotic diseases, mammalian comparative genomics, primate taxonomy, and conservation of endangered species.

4.

Background

Gene duplication is a source of molecular innovation throughout evolution. However, even with massive amounts of genome sequence data, correlating gene duplication with speciation and other events in natural history can be difficult. This is especially true in its most interesting cases, where rapid and multiple duplications are likely to reflect adaptation to rapidly changing environments and life styles. This may be so for Class I of alcohol dehydrogenases (ADH1s), where multiple duplications occurred in primate lineages in Old and New World monkeys (OWMs and NWMs) and hominoids.

Methodology/Principal Findings

To build a preferred model for the natural history of ADH1s, we determined the sequences of nine new ADH1 genes, finding for the first time multiple paralogs in various prosimians (lemurs, strepsirhines). Database mining then identified novel ADH1 paralogs in both macaque (an OWM) and marmoset (a NWM). These were used with the previously identified human paralogs to resolve controversies relating to dates of duplication and gene conversion in the ADH1 family. Central to these controversies are differences in the topologies of trees generated from exonic (coding) sequences and intronic sequences.

Conclusions/Significance

We provide evidence that gene conversions are the primary source of difference, using molecular clock dating of duplications and analyses of microinsertions and deletions (micro-indels). The tree topology inferred from intron sequences appears to more correctly represent the natural history of ADH1s, with the ADH1 paralogs in platyrrhines (NWMs) and catarrhines (OWMs and hominoids) having arisen by duplications shortly predating the divergence of OWMs and NWMs. We also conclude that paralogs in lemurs arose independently. Finally, we identify errors in database interpretation as the source of controversies concerning gene conversion. These analyses provide a model for the natural history of ADH1s that posits four ADH1 paralogs in the ancestor of Catarrhine and Platyrrhine primates, followed by the loss of an ADH1 paralog in the human lineage.

5.
Recombination between homologous loci is accompanied by formation of heteroduplexes. Repairing mismatches in heteroduplexes often leads to single nucleotide substitutions in a process known as gene conversion. Gene conversion was shown to be GC‐biased in different organisms; that is, a W(A or T)→S(G or C) substitution is more likely in this process than an S→W substitution. Here, we show that the insertion/deletion ratio for short noncoding indels that reach fixation between species is positively correlated with the recombination rate in Drosophila melanogaster, Homo sapiens, and Saccharomyces cerevisiae. This correlation is due both to an increase in the fixation rate of insertions and to a decrease in the fixation rate of deletions in regions of high recombination. Whole‐genome data on indel polymorphism and divergence in D. melanogaster rule out mutation biases and selection as the cause of this trend, pointing to insertion‐biased gene conversion as the most likely explanation. The bias toward insertions is the strongest for single‐nucleotide indels, and decreases with indel length. In regions of high recombination rate this bias leads to an up to ~5‐fold excess of fixed short insertions over deletions, and substantially affects the evolution of DNA segments.
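
The W→S / S→W distinction used in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline; the function names `classify_substitution` and `ws_fraction` and the input format (pairs of ancestral and derived bases) are assumptions made purely for illustration.

```python
from collections import Counter

WEAK = {"A", "T"}    # "W" bases: two hydrogen bonds
STRONG = {"G", "C"}  # "S" bases: three hydrogen bonds


def classify_substitution(ancestral, derived):
    """Label a single-nucleotide substitution as 'W>S', 'S>W', or 'neutral'."""
    a, d = ancestral.upper(), derived.upper()
    if a not in WEAK | STRONG or d not in WEAK | STRONG:
        raise ValueError("bases must be one of A, C, G, T")
    if a == d:
        raise ValueError("ancestral and derived bases are identical")
    if a in WEAK and d in STRONG:
        return "W>S"
    if a in STRONG and d in WEAK:
        return "S>W"
    return "neutral"  # A<->T or G<->C: GC content is unchanged


def ws_fraction(pairs):
    """Fraction of GC-changing substitutions that are W>S (None if there are none)."""
    counts = Counter(classify_substitution(a, d) for a, d in pairs)
    informative = counts["W>S"] + counts["S>W"]
    return counts["W>S"] / informative if informative else None
```

Under this sketch, a fraction well above 0.5 among substitutions attributed to conversion would be consistent with a GC bias, although, as other abstracts in this list point out, selection can leave a similar pattern.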

6.
Reconstructing the histories of complex adaptations and identifying the evolutionary mechanisms underlying their origins are two of the primary goals of evolutionary biology. Taricha newts, which contain high concentrations of the deadly toxin tetrodotoxin (TTX) as an antipredator defense, have evolved resistance to self-intoxication, which is a complex adaptation requiring changes in six paralogs of the voltage-gated sodium channel (Nav) gene family, the physiological target of TTX. Here, we reconstruct the origins of TTX self-resistance by sequencing the entire Nav gene family in newts and related salamanders. We show that moderate TTX resistance evolved early in the salamander lineage in three of the six Nav paralogs, preceding the proposed appearance of tetrodotoxic newts by ∼100 My. TTX-bearing newts possess additional unique substitutions across the entire Nav gene family that provide physiological TTX resistance. These substitutions coincide with signatures of positive selection and relaxed purifying selection, as well as gene conversion events, that together likely facilitated their evolution. We also identify a novel exon duplication within Nav1.4 encoding an expressed TTX-binding site. Two resistance-conferring changes within newts appear to have spread via nonallelic gene conversion: in one case, one codon was copied between paralogs, and in the second, multiple substitutions were homogenized between the duplicate exons of Nav1.4. Our results demonstrate that gene conversion can accelerate the coordinated evolution of gene families in response to a common selection pressure.

7.
Previous studies have shown that recombination between allelic sequences can cause likelihood-based methods for detecting positive selection to produce many false-positive results. In this article, we use simulations to study the impact of nonallelic gene conversion on the specificity of PAML to detect positive selection among gene duplicates. Our results show that, as expected, gene conversion leads to higher rates of false-positive results, although only moderately. These rates increase with the genetic distance between sequences, the length of converted tracts, and when no outgroup sequences are included in the analysis. We also find that branch-site models will incorrectly identify unconverted sequences as the targets of positive selection when their close paralogs are converted. Bayesian prediction of sites undergoing adaptive evolution implemented in PAML is affected by conversion, albeit in a less straightforward way. Our work suggests that particular attention should be devoted to the evolutionary analysis of recent duplicates that may have experienced gene conversion because they may provide false signals of positive selection. Fortunately, these results also imply that those cases most susceptible to false-positive results (i.e., high divergence between paralogs, long conversion tracts) are also the cases where detecting gene conversion is the easiest.

8.
Alu elements belonging to the previously identified "young" subfamilies are thought to have inserted in the human genome after the divergence of humans from non-human primates and therefore should not be present in non-human primate genomes. Polymerase chain reaction (PCR) based screening of over 500 Alu insertion loci resulted in the recovery of a few "young" Alu elements that also resided at orthologous positions in non-human primate genomes. Sequence analysis demonstrated these "young" Alu insertions represented gene conversion events of pre-existing ancient Alu elements or independent parallel insertions of older Alu elements in the same genomic region. The level of gene conversion between Alu elements suggests that it may have a significant influence on the single nucleotide diversity within the genome. All the instances of multiple independent Alu insertions within the same small genomic regions were recovered from the owl monkey genome, indicating a higher Alu amplification rate in owl monkeys relative to many other primates. This study suggests that the majority of Alu insertions in primate genomes are the products of unique evolutionary events.

9.
Insights into the origins of structural variation and the mutational mechanisms underlying genomic disorders would be greatly improved by a genomewide map of hotspots of nonallelic homologous recombination (NAHR). Moreover, our understanding of sequence variation within the duplicated sequences that are substrates for NAHR lags far behind that of sequence variation within the single-copy portion of the genome. Perhaps the best-characterized NAHR hotspot lies within the 24-kb-long Charcot-Marie-Tooth disease type 1A (CMT1A)-repeats (REPs) that sponsor deletions and duplications that cause peripheral neuropathies. We investigated structural and sequence diversity within the CMT1A-REPs, both within and between species. We discovered a high frequency of retroelement insertions, accelerated sequence evolution after duplication, extensive paralogous gene conversion, and a greater than twofold enrichment of SNPs in humans relative to the genome average. We identified an allelic recombination hotspot underlying the known NAHR hotspot, which suggests that the two processes are intimately related. Finally, we used our data to develop a novel method for inferring the location of an NAHR hotspot from sequence variation within segmental duplications and applied it to identify a putative NAHR hotspot within the LCR22 repeats that sponsor velocardiofacial syndrome deletions. We propose that a large-scale project to map sequence variation within segmental duplications would reveal a wealth of novel chromosomal-rearrangement hotspots.

10.
On an evolutionary time scale, polymorphic alleles are believed to have a short life, persisting at most tens of millions of years even under long-term balancing selection. Here, we report highly diverged trans-species dimorphism of the proteasome subunit beta type 8 (PSMB8) gene, which encodes a catalytic subunit of the immunoproteasome responsible for the generation of peptides presented by major histocompatibility complex (MHC) class I molecules, in lower teleosts including Cypriniformes (zebrafish and loach) and Salmoniformes (trout and salmon), whose last common ancestor dates to 300 Ma. Moreover, phylogenetic analyses indicated that these dimorphic alleles share lineages with two shark paralogous genes, suggesting that these two lineages have been maintained for more than 500 My either as alleles or as paralogs, and that conversion between alleles and paralogs has occurred at least once during vertebrate evolution. Two lineages termed PSMB8A and PSMB8F show an A(31)F substitution that would probably affect their cleaving specificity, and whereas the PSMB8A lineage has been retained by all analyzed jawed vertebrates, the PSMB8F lineage has been lost by most jawed vertebrates except for cartilaginous fish and basal teleosts. However, a possible functional equivalent of the PSMB8F lineage has been revived as alleles within the PSMB8A lineage at least twice during vertebrate evolution in the amphibian Xenopus and teleostean Oryzias species. Dynamic evolution of the PSMB8 polymorphism through long-term persistence, loss, and regaining of dimorphism and conversion between alleles and paralogs implies the presence of strong selective pressure for functional polymorphism of this gene.

11.
12.
To study reductive evolutionary processes in bacterial genomes, we examine sequences in the Rickettsia genomes which are unconstrained by selection and evolve as pseudogenes, one of which is the metK gene, which codes for AdoMet synthetase. Here, we sequenced the metK gene and three surrounding genes in eight different species of the genus Rickettsia. The metK gene was found to contain a high incidence of deletions in six lineages, while the three genes in its surroundings were functionally conserved in all eight lineages. A more drastic example of gene degradation was identified in the metK downstream region, which contained an open reading frame in Rickettsia felis. Remnants of this open reading frame could be reconstructed in five additional species by eliminating sites of frameshift mutations and termination codons. A detailed examination of the two reconstructed genes revealed that deletions strongly predominate over insertions and that there is a strong transition bias for point mutations which is coupled to an excess of GC-to-AT substitutions. Since the molecular evolution of these inactive genes should reflect the rates and patterns of neutral mutations, our results strongly suggest that there is a high spontaneous rate of deletions as well as a strong mutation bias toward AT pairs in the Rickettsia genomes. This may explain the low genomic G + C content (29%), the small genome size (1.1 Mb), and the high noncoding content (24%), as well as the presence of several pseudogenes in the Rickettsia prowazekii genome.

13.
Understanding the prevailing mutational mechanisms responsible for human genome structural variation requires uniformity in the discovery of allelic variants and precision in terms of breakpoint delineation. We develop a resource based on capillary end sequencing of 13.8 million fosmid clones from 17 human genomes and characterize the complete sequence of 1054 large structural variants corresponding to 589 deletions, 384 insertions, and 81 inversions. We analyze the 2081 breakpoint junctions and infer potential mechanisms of origin. Three mechanisms account for the bulk of germline structural variation: microhomology-mediated processes involving short (2-20 bp) stretches of sequence (28%), nonallelic homologous recombination (22%), and L1 retrotransposition (19%). The high quality and long-range continuity of the sequence reveal more complex mutational mechanisms, including repeat-mediated inversions and gene conversion, that are most often missed by other methods, such as comparative genomic hybridization, single nucleotide polymorphism microarrays, and next-generation sequencing.

14.
Recently integrated Alu elements and human genomic diversity   Total citations: 8; self-citations: 0; citations by others: 8
A comprehensive analysis of two Alu Y lineage subfamilies was undertaken to assess Alu-associated genomic diversity and identify new Alu insertion polymorphisms for the study of human population genetics. Recently integrated Alu elements (283) from the Yg6 and Yi6 subfamilies were analyzed by polymerase chain reaction (PCR), and 25 of the loci analyzed were polymorphic for insertion presence/absence within the genomes of a diverse array of human populations. These newly identified Alu insertion polymorphisms will be useful tools for the study of human genomic diversity. Our screening of the Alu insertion loci also resulted in the recovery of several "young" Alu elements that resided at orthologous positions in nonhuman primate genomes. Sequence analysis demonstrated these "young" Alu insertions were the products of gene conversion events of older, preexisting Alu elements or independent parallel forward insertions of older Alu elements in the same short genomic region. The level of gene conversion between Alu elements suggests that it may have an influence on the single nucleotide polymorphism within Alu elements in the genome. We have also identified two genomic deletions associated with the retroposition and insertion of Alu Y lineage elements into the human genome. This type of Alu retroposition-mediated genomic deletion is a novel source of lineage-specific evolution within primate genomes.

15.
A. R. Godwin, R. M. Liskay. Genetics 1994, 136(2): 607-617
We examined the effects of insertion mutations on intrachromosomal recombination. A series of mouse L cell lines carrying mutant herpes simplex virus thymidine kinase (tk) heteroalleles was generated; these lines differed in the nature of their insertion mutations. In direct repeat lines with different large insertions in each gene, there was a 20-fold drop in gene conversion rate and only a five-fold drop in crossover rate relative to the analogous rates in lines with small insertions in each gene. Surprisingly, in direct repeat lines carrying the same large insertion in each gene, there was a larger drop in both types of recombination. When intrachromosomal recombination between inverted repeat tk genes with different large insertions was examined, we found that the rate of gene conversion dropped five-fold relative to small insertions, while the rate of crossing over was unaffected. The differential effects on conversion and crossing over imply that gene conversion is more sensitive to insertion mutation size. Finally, the fraction of gene conversions associated with a crossover increased from 2% for inverted repeats with small insertions to 18% for inverted repeats with large insertions. One interpretation of this finding is that during intrachromosomal recombination in mouse cells long conversion tracts are more often associated with crossing over.

16.
Ye C, Li Y, Shi P, Zhang YP. Gene 2005, 350(2): 183-192
Growth hormone is a classic molecule in the study of the molecular clock hypothesis as it exhibits a relatively constant rate of evolution in most mammalian orders except primates and artiodactyls, where a dramatically enhanced rate of evolution (25–50-fold) has been reported. The rapid evolution of primate growth hormone occurred after the divergence of tarsiers and simians, but before the separation of old world monkeys (OWM) from new world monkeys (NWM). Interestingly, this event of rapid sequence evolution coincided with multiple duplications of the growth hormone gene, suggesting gene duplication as a possible cause of the accelerated sequence evolution. Here we determined 21 different GH-like sequences from four species of OWM and hominoids. Combined with published sequences from OWM and hominoids, our analysis demonstrates that multiple gene duplications and several gene conversion events both occurred in the evolutionary history of this gene family in OWM/hominoids. The episode of recent duplications of CSH-like genes in gibbon is accompanied by rapid sequence evolution likely resulting from relaxation of purifying selection. GHN genes in both hominoids and OWM are under strong purifying selection. In contrast, CSH genes in both lineages are probably not. GHV genes in OWM and hominoids evolved at different evolutionary rates and underwent different selective constraints. Our results disclosed the complex history of the primate growth hormone gene family and raised intriguing questions on the consequences of these evolutionary events.

17.
Eukaryotes and archaea both possess multiple genes coding for family B DNA polymerases. In animals and fungi, three family B DNA polymerases, alpha, delta, and epsilon, are responsible for replication of nuclear DNA. We used a PCR-based approach to amplify and sequence phylogenetically conserved regions of these three DNA polymerases from Giardia intestinalis and Trichomonas vaginalis, representatives of early-diverging eukaryotic lineages. Phylogenetic analysis of eukaryotic and archaeal paralogs suggests that the gene duplications that gave rise to the three replicative paralogs occurred before the divergence of the earliest eukaryotic lineages, and that all eukaryotes are likely to possess these paralogs. One eukaryotic paralog, epsilon, consistently branches within archaeal sequences to the exclusion of other eukaryotic paralogs, suggesting that an epsilon-like family B DNA polymerase was ancestral to both archaea and eukaryotes. Because crenarchaeote and euryarchaeote paralogs do not form monophyletic groups in phylogenetic analysis, it is possible that archaeal family B paralogs themselves evolved by a series of gene duplications independent of the gene duplications that gave rise to eukaryotic paralogs.

18.
Examination of polymorphisms in the Plasmodium falciparum gene for falcipain 2 revealed that this gene is one of two paralogs separated by 10.8 kb in chromosome 11. We designate the annotated gene denoted chr11.gen_424 as encoding falcipain 2A and the annotated gene denoted chr11.gen_427 as encoding falcipain 2B. The paralogs are 96% identical at the nucleotide level and 93% identical at the amino acid level. The consensus sequences differ in 31/309 synonymous sites and 45/1140 nonsynonymous sites, including three amino acid replacements (V393I, A400P, and Q414E) that are near the catalytic site and that may affect substrate affinity or specificity. In six reference isolates, among 36 synonymous sites and 46 nonsynonymous sites that are polymorphic in the gene for falcipain 2A, falcipain 2B, or both, significant spatial clustering is observed. All but one of the polymorphisms appear to result from gene conversion between the paralogs. The estimated rate of gene conversion between the paralogs may be as many as 1,400 to 1,700 times greater than the rate of mutation. Owing to gene conversion, one of the falcipain 2A alleles is more similar to the falcipain 2B alleles than it is to other falcipain 2A alleles. Divergence among the synonymous sites suggests that the paralogous genes last shared a common ancestor 15.2 MYA, with a range of 8.8 to 20.6 MYA. During this period, the paralogs have acquired 0.10 synonymous substitutions per synonymous site in the coding region. The 5' and 3' flanking regions differ in 47.7% and 39.8% of the nucleotide sites, respectively. Hence synonymous sites and flanking regions are not conserved in sequence in spite of their high AT content and T skew.
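
The site counts quoted in this abstract (31/309 synonymous, 45/1140 nonsynonymous) can be turned into per-site divergences with a few lines of Python. This is a rough sketch, not the method used in the paper: the Jukes-Cantor correction is an assumption added here for illustration, and the function names are hypothetical.

```python
import math


def p_distance(differences, sites):
    """Uncorrected proportion of sites that differ between two sequences."""
    if sites <= 0 or not 0 <= differences <= sites:
        raise ValueError("need 0 <= differences <= sites and sites > 0")
    return differences / sites


def jukes_cantor(p):
    """Jukes-Cantor distance for an observed proportion p of differing sites.

    Assumption for illustration only: the abstract does not state which
    substitution model, if any, the authors applied.
    """
    if not 0 <= p < 0.75:
        raise ValueError("Jukes-Cantor correction requires 0 <= p < 0.75")
    return -0.75 * math.log(1 - 4 * p / 3)


# Counts taken from the abstract above.
synonymous = p_distance(31, 309)
nonsynonymous = p_distance(45, 1140)

print(f"synonymous p-distance:    {synonymous:.3f}")
print(f"nonsynonymous p-distance: {nonsynonymous:.3f}")
print(f"JC-corrected synonymous:  {jukes_cantor(synonymous):.3f}")
```

The uncorrected synonymous proportion comes out close to the 0.10 substitutions per synonymous site reported in the abstract; the corrected value is slightly higher because the correction accounts for multiple hits at the same site.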

19.
R Ollo, F Rougeon. Cell 1983, 32(2): 515-523
We have determined the complete nucleotide sequence of the C57BL/6 allele of the mouse immunoglobulin gamma 2a chain gene. A comparison with the BALB/c gamma 2a gene for 1912 nucleotides reveals that the two alleles exhibit extensive divergence, since there are 138 single-base-pair differences and 8 insertions or deletions. We have compared the two gamma 2a alleles with the two corresponding gamma 2b alleles, which differ in only 12 positions. It appears that among the 134 differences between the two gamma 2a alleles, 70 are at positions where gamma 2a and gamma 2b are identical in the BALB/c haplotype and 54 are at positions where gamma 2a and gamma 2b are identical in the C57BL/6 haplotype. All these results suggest that nonreciprocal gene conversion between nonallelic genes can introduce sequence homogeneity in linked genes and can generate extensive divergence and polymorphism in allelic genes. We suggest that the gamma 2a and gamma 2b gene ancestors freely diverged after duplication, and that the conversion events were promoted by a deletion shortening the distance between the two loci.

20.
Wang X, Tang H, Bowers JE, Feltus FA, Paterson AH. Genetics 2007, 177(3): 1753-1763
Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the approximately 0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, approximately 8% of japonica paralogs produced 5-7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while approximately 70-MY-old "paleologs" resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice-sorghum divergence approximately 41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity--that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5-7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization.
