首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Brandström M  Ellegren H 《Genetics》2007,176(3):1691-1701
It is increasingly recognized that insertions and deletions (indels) are an important source of genetic as well as phenotypic divergence and diversity. We analyzed length polymorphisms identified through partial (0.25x) shotgun sequencing of three breeds of domestic chicken made by the International Chicken Polymorphism Map Consortium. A data set of 140,484 short indel polymorphisms in unique DNA was identified after filtering for microsatellite structures. There was a significant excess of tandem duplicates at indel sites, with deletions of a duplicate motif outnumbering the generation of duplicates through insertion. Indel density was lower in microchromosomes than in macrochromosomes, in the Z chromosome than in autosomes, and in 100 bp of upstream sequence, 5'-UTR, and first introns than in intergenic DNA and in other introns. Indel density was highly correlated with single nucleotide polymorphism (SNP) density. The mean density of indels in pairwise sequence comparisons was 1.9 x 10(-4) indel events/bp, approximately 5% the density of SNPs segregating in the chicken genome. The great majority of indels involved a limited number of nucleotides (median 1 bp), with A-rich motifs being overrepresented at indel sites. The overrepresentation of deletions at tandem duplicates indicates that replication slippage in duplicate sequences is a common mechanism behind indel mutation. The correlation between indel and SNP density indicates common effects of mutation and/or selection on the occurrence of indels and point mutations.  相似文献   

2.
Nam K  Ellegren H 《PLoS genetics》2012,8(5):e1002680
Selective and/or neutral processes may govern variation in DNA content and, ultimately, genome size. The observation in several organisms of a negative correlation between recombination rate and intron size could be compatible with a neutral model in which recombination is mutagenic for length changes. We used whole-genome data on small insertions and deletions within transposable elements from chicken and zebra finch to demonstrate clear links between recombination rate and a number of attributes of reduced DNA content. Recombination rate was negatively correlated with the length of introns, transposable elements, and intergenic spacer and with the rate of short insertions. Importantly, it was positively correlated with gene density, the rate of short deletions, the deletion bias, and the net change in sequence length. All these observations point at a pattern of more condensed genome structure in regions of high recombination. Based on the observed rates of small insertions and deletions and assuming that these rates are representative for the whole genome, we estimate that the genome of the most recent common ancestor of birds and lizards has lost nearly 20% of its DNA content up until the present. Expansion of transposable elements can counteract the effect of deletions in an equilibrium mutation model; however, since the activity of transposable elements has been low in the avian lineage, the deletion bias is likely to have had a significant effect on genome size evolution in dinosaurs and birds, contributing to the maintenance of a small genome. We also demonstrate that most of the observed correlations between recombination rate and genome contraction parameters are seen in the human genome, including for segregating indel polymorphisms. Our data are compatible with a neutral model in which recombination drives vertebrate genome size evolution and gives no direct support for a role of natural selection in this process.  相似文献   

3.
Selective constraints on intron evolution in Drosophila   总被引:5,自引:0,他引:5  
Parsch J 《Genetics》2003,165(4):1843-1851
Intron sizes show an asymmetrical distribution in a number of organisms, with a large number of "short" introns clustered around a minimal intron length and a much broader distribution of longer introns. In Drosophila melanogaster, the short intron class is centered around 61 bp. The narrow length distribution suggests that natural selection may play a role in maintaining intron size. A comparison of 15 orthologous introns among species of the D. melanogaster subgroup indicates that, in general, short introns are not under greater DNA sequence or length constraints than long introns. There is a bias toward deletions in all introns (deletion/insertion ratio is 1.66), and the vast majority of indels are of short length (<10 bp). Indels occurring on the internal branches of the phylogenetic tree are significantly longer than those occurring on the terminal branches. These results are consistent with a compensatory model of intron length evolution in which slightly deleterious short deletions are frequently fixed within species by genetic drift, and relatively rare larger insertions that restore intron length are fixed by positive selection. A comparison of paralogous introns shared among duplicated genes suggests that length constraints differ between introns within the same gene. The janusA, janusB, and ocnus genes share two short introns derived from a common ancestor. The first of these introns shows significantly fewer indels than the second intron, although the two introns show a comparable number of substitutions. This indicates that intron-specific selective constraints have been maintained following gene duplication, which preceded the divergence of the D. melanogaster species subgroup.  相似文献   

4.
I present data on the evolution of intron lengths among 3 closely related Drosophila species, D. melanogaster, Drosophila simulans, and Drosophila yakuba. Using D. yakuba as an outgroup, I mapped insertion and deletion mutations in 148 introns (spanning approximately 30 kb) to the D. melanogaster and D. simulans lineages. Intron length evolution in the 2 sister species has been different: in D. melanogaster, X-linked introns have increased slightly in size, whereas autosomal ones have decreased slightly in size; in D. simulans, both X-linked and autosomal introns have decreased in size. To understand the possible evolutionary causes of these lineage- and chromosome-specific patterns of intron evolution, I studied insertion-deletion (indel) polymorphism and divergence in D. melanogaster. Small insertion mutations segregate at elevated frequencies and enjoy elevated probabilities of fixation, particularly on the X chromosome. In contrast, there is no detectable X chromosome effect on fixations in D. simulans. These findings suggest X chromosome-specific selection or biased gene conversion-gap repair favoring insertions in D. melanogaster but not in D. simulans. These chromosome- and lineage-specific patterns of indel substitution are not easily explained by existing general population genetic models of intron length evolution. Genomic data from D. melanogaster further suggest that the forces described here affect introns and intergenic regions similarly.  相似文献   

5.
Comeron JM  Kreitman M 《Genetics》2000,156(3):1175-1190
Intron length is negatively correlated with recombination in both Drosophila melanogaster and humans. This correlation is not likely to be the result of mutational processes alone: evolutionary analysis of intron length polymorphism in D. melanogaster reveals equivalent ratios of deletion to insertion in regions of high and low recombination. The polymorphism data do reveal, however, an excess of deletions relative to insertions (i.e., a deletion bias), with an overall deletion-to-insertion events ratio of 1.35. We propose two types of selection favoring longer intron lengths. First, the natural mutational bias toward deletion must be opposed by strong selection in very short introns to maintain the minimum intron length needed for the intron splicing reaction. Second, selection will favor insertions in introns that increase recombination between mutations under the influence of selection in adjacent exons. Mutations that increase recombination, even slightly, will be selectively favored because they reduce interference among selected mutations. Interference selection acting on intron length mutations must be very weak, as indicated by frequency spectrum analysis of Drosophila intron length polymorphism, making the equilibrium for intron length sensitive to changes in the recombinational environment and population size. One consequence of this sensitivity is that the advantage of longer introns is expected to decrease inversely with the rate of recombination, thus leading to a negative correlation between intron length and recombination rate. Also in accord with this model, intron length differs between closely related Drosophila species, with the longest variant present more often in D. melanogaster than in D. simulans. We suggest that the study of the proposed dynamic model, taking into account interference among selected sites, might shed light on many aspects of the comparative biology of genome sizes including the C value paradox.  相似文献   

6.
Recombination between homologous loci is accompanied by formation of heteroduplexes. Repairing mismatches in heteroduplexes often leads to single nucleotide substitutions in a process known as gene conversion. Gene conversion was shown to be GC‐biased in different organisms; that is, a W(A or T)→S(G or C) substitution is more likely in this process than a S→W substitution. Here, we show that the insertion/deletion ratio for short noncoding indels that reach fixation between species is positively correlated with the recombination rate in Drosophila melanogaster, Homo sapiens, and Saccharomyces cerevisiae. This correlation is both due to an increase of the fixation rate of insertions and decrease of the fixation rate of deletions in regions of high recombination. Whole‐genome data on indel polymorphism and divergence in D. melanogaster rule out mutation biases and selection as the cause of this trend, pointing to insertion‐biased gene conversion as the most likely explanation. The bias toward insertions is the strongest for single‐nucleotide indels, and decreases with indel length. In regions of high recombination rate this bias leads to an up to ~5‐fold excess of fixed short insertions over deletions, and substantially affects the evolution of DNA segments.  相似文献   

7.
We use a multigene data set (the mitochondrial locus and nine nuclear gene regions) to test phylogenetic relationships in the South American "lava lizards" (genus Microlophus) and describe a strategy for aligning noncoding sequences that accounts for differences in tempo and class of mutational events. We focus on seven nuclear introns that vary in size and frequency of multibase length mutations (i.e., indels) and present a manual alignment strategy that incorporates insertions and deletions (indels) for each intron. Our method is based on mechanistic explanations of intron evolution that does not require a guide tree. We also use a progressive alignment algorithm (Probabilistic Alignment Kit; PRANK) and distinguishes insertions from deletions and avoids the "gapcost" conundrum. We describe an approach to selecting a guide tree purged of ambiguously aligned regions and use this to refine PRANK performance. We show that although manual alignment is successful in finding repeat motifs and the most obvious indels, some regions can only be subjectively aligned, and there are limits to the size and complexity of a data matrix for which this approach can be taken. PRANK alignments identified more parsimony-informative indels while simultaneously increasing nucleotide identity in conserved sequence blocks flanking the indel regions. When comparing manual and PRANK with two widely used methods (CLUSTAL, MUSCLE) for the alignment of the most length-variable intron, only PRANK recovered a tree congruent at deeper nodes with the combined data tree inferred from all nuclear gene regions. We take this concordance as an objective function of alignment quality and present a strongly supported phylogenetic hypothesis for Microlophus relationships. From this hypothesis we show that (1) a coded indel data partition derived from the PRANK alignment contributed significantly to nodal support and (2) the indel data set permitted detection of significant conflict between mitochondrial and nuclear data partitions, which we hypothesize arose from secondary contact of distantly related taxa, followed by hybridization and mtDNA introgression.  相似文献   

8.
Nuclear DNA intron sequences are increasingly used to investigate evolutionary relationships among closely related organisms. The phylogenetic usefulness of intron sequences at higher taxonomic levels has, however, not been firmly established and very few studies have used these markers to address evolutionary questions above the family level. In addition, the mechanisms driving intron evolution are not well understood. We compared DNA sequence data derived from three presumably independently segregating introns (THY, PRKC I and MGF) across 158 mammalian species. All currently recognized extant eutherian mammalian orders were included with the exception of Cingulata, Dermoptera and Scandentia. The total aligned length of the data was 6366 base pairs (bp); after the exclusion of autapomorphic insertions, 1511 bp were analyzed. In many instances the Bayesian and parsimony analyses were complementary and gave significant posterior probability and bootstrap support (>80) for the monophyly of Afrotheria, Euarchontoglires, Laurasiatheria and Boreoeutheria. Apart from finding congruent support when using these methods, the intron data also provided several indels longer than 3 bp that support, among others, the monophyly of Afrotheria, Paenungulata, Ferae and Boreoeutheria. A quantitative analysis of insertions and deletions suggested that there was a 75% bias towards deletions. The average insertion size in the mammalian data set was 16.49 bp +/- 57.70 while the average deletion was much smaller (4.47 bp +/- 14.17). The tendency towards large insertions and small deletions is highlighted by the observation that out of a total of 17 indels larger than 100 bp, 15 were insertions. The majority of indels (>60% of all events) were 1 or 2 bp changes. Although the average overall indel substitution rate of 0.00559 per site is comparable to that previously reported for rodents and primates, individual analyses among different evolutionary lineages provide evidence for differences in the formation rate of indels among the different mammalian groups.  相似文献   

9.
10.

Background

Introns comprise a large fraction of eukaryotic genomes, yet little is known about their functional significance. Regulatory elements have been mapped to some introns, though these are believed to account for only a small fraction of genome wide intronic DNA. No consistent patterns have emerged from studies that have investigated general levels of evolutionary constraint in introns.

Results

We examine the relationship between intron length and levels of evolutionary constraint by analyzing inter-specific divergence at 225 intron fragments in Drosophila melanogaster and Drosophila simulans, sampled from a broad distribution of intron lengths. We document a strongly negative correlation between intron length and divergence. Interestingly, we also find that divergence in introns is negatively correlated with GC content. This relationship does not account for the correlation between intron length and divergence, however, and may simply reflect local variation in mutational rates or biases.

Conclusion

Short introns make up only a small fraction of total intronic DNA in the genome. Our finding that long introns evolve more slowly than average implies that, while the majority of introns in the Drosophila genome may experience little or no selective constraint, most intronic DNA in the genome is likely to be evolving under considerable constraint. Our results suggest that functional elements may be ubiquitous within longer introns and that these introns may have a more general role in regulating gene expression than previously appreciated. Our finding that GC content and divergence are negatively correlated in introns has important implications for the interpretation of the correlation between divergence and levels of codon bias observed in Drosophila.  相似文献   

11.
Nucleotide insertions and deletions (indels) are responsible for gaps in the sequence alignments. Indel is one of the major sources of evolutionary change at the molecular level. We have examined the patterns of insertions and deletions in the 19 mammalian genomes, and found that deletion events are more common than insertions in the mammalian genomes. Both the number of insertions and deletions decrease rapidly when the gap length increases and single nucleotide indel is the most frequent in all indel events. The frequencies of both insertions and deletions can be described well by power law.Key Words: Insertion, deletion, gap, indel, mammalian genome.  相似文献   

12.
13.
Ptak SE  Petrov DA 《Genetics》2002,162(3):1233-1244
Studies of "dead-on-arrival" transposable elements in Drosophila melanogaster found that deletions outnumber insertions approximately 8:1 with a median size for deletions of approximately 10 bp. These results are consistent with the deletion and insertion profiles found in most other Drosophila pseudogenes. In contrast, a recent study of D. melanogaster introns found a deletion/insertion ratio of 1.35:1, with 84% of deletions being shorter than 10 bp. This discrepancy could be explained if deletions, especially long deletions, are more frequently strongly deleterious than insertions and are eliminated disproportionately from intron sequences. To test this possibility, we use analysis and simulations to examine how deletions and insertions of different lengths affect different components of splicing and determine the distribution of deletions and insertions that preserve the original exons. We find that, consistent with our predictions, longer deletions affect splicing at a much higher rate compared to insertions and short deletions. We also explore other potential constraints in introns and show that most of these also disproportionately affect large deletions. Altogether we demonstrate that constraints in introns may explain much of the difference in the pattern of deletions and insertions observed in Drosophila introns and pseudogenes.  相似文献   

14.
One of the introns of the phosphofructokinase gene was sequenced and analyzed in seven species of endemic Baikalian gastropods of the family Baicaliidae. The intron length differed even between sister species. Along with relatively few single nucleotide substitutions, extended deletions and insertions accumulated during the evolution of the intron. The number of deletions/insertions did not correlate with the genetic distance estimated for the species by comparing the sequences of a mitochondrial DNA fragment. The deduced secondary structure markedly differed even between sister species, although a consensus motif was detectable. Since long RNA stems that appeared and disappeared in the course of evolution may contain regulatory elements, the evolution of the intron may be nonneutral.  相似文献   

15.
A phylogenetic analysis of indel dynamics in the cotton genus   总被引:2,自引:0,他引:2  
Genome size evolution is a dynamic process involving counterbalancing mechanisms whose actions vary across lineages and over time. Whereas the primary mechanism of expansion, transposable element (TE) amplification, has been widely documented, the evolutionary dynamics of genome contraction have been less thoroughly explored. To evaluate the relative impact and evolutionary stability of the mechanisms that affect genome size, we conducted a phylogenetic analysis of indel rates for 2 genomic regions in 4 Gossypium genomes: the 2 coresident genomes (A(T) and D(T)) of tetraploid cotton and its model diploid progenitors, Gossypium arboreum (A) and Gossypium raimondii (D). We determined the rates of sequence gain or loss along each branch, partitioned by mechanism, and how these changed during species divergence. In general, there has been a propensity toward growth of the diploid genomes and contraction in the polyploid. Most of the size difference between the diploid species occurred prior to polyploid divergence and was largely attributable to TE amplification in the A/A(T) genome. After separating from the true parents of the polyploid genomes, both diploid genomes experienced slower sequence gain than in the ancestor, due to fewer TE insertions in the A genome and a combination of increased deletions and decreased TE insertions in the D genome. Both genomes of the polyploid displayed increased rates of deletion and decreased rates of insertion, leading to a rate of near stasis in D(T) and overall contraction in A(T) resulting in polyploid genome contraction. As expected, TE insertions contributed significantly to the genome size differences; however, intrastrand homologous recombination, although rare, had the most significant impact on the rate of deletion. Small indel data for the diploids suggest the possibility of a bias as the smaller genomes add less or delete more sequence through small indels than do the larger genomes, whereas data for the polyploid suggest increased sequence turnover in general (both as small deletions and small insertions). Illegitimate recombination, although not demonstrated to be a dominant mechanism of genome size change, was biased in the polyploid toward deletions, which may provide a partial explanation of polyploid genomic downsizing.  相似文献   

16.
The plant mitochondrial rps3 intron was analyzed for substitution and indel rate variation among 15 monocot and dicot angiosperms from 10 genera, including perennial and annual taxa. Overall, the intron sequence was very conserved among angiosperms. Based on length polymorphism, 10 different alleles were identified among the 10 genera. These allelic differences were mainly attributable to large indels. An insertion of 133 nucleotides, observed in the Alnus intron was partially or completely absent in the other lineages of the family Betulaceae. This insertion was located within domain IV of the secondary-structure model of this group IIA intron. A mobile element of 47 nucleotides that showed homology to sequences located in rice rps3 intron and in intergenic plant mitochondrial genomes was found within this insertion. Both substitution and indel rates were low among the Betulaceae sequences, but substitution rates were increasingly larger than indel rates in comparisons involving more distantly related taxa. From a secondary-structure model, regions involved in helical structures were shown to be well preserved from indels as compared to substitutions, but compensatory changes were not observed among the angiosperm sequences analyzed. Using approximate divergence times based on the fossil record, substitution and indel rate heterogeneity was observed between different pairs of annual and perennial taxa. In particular, the annual petunia and primrose evolved more than 15 and 10 times faster, for substitution and indel rates respectively, than the perennial birch and alder. This is the first demonstration of an evolutionary rate difference between perennial and annual forms in noncoding DNA, lending support to neutral causes such as the generation time, population size, and speciation rate effects to explain such rate heterogeneity. Surprisingly, the sequence from the rps3 intron had a high identity with the sequence of intron 1 from the angiosperm mitochondrial nad5 gene, suggesting a common origin of these two group IIA introns.  相似文献   

17.
The waxy (wx) gene of Oryza glaberrima was cloned, and its nucleotide sequence was determined. A waxy mutant of O. glaberrima showing a glutinous phenotype was found to contain a substitution mutation generating a termination codon in the coding region of the wx gene. The Wx sequence of O. glaberrima was different from that of Oryza sativa by substitutions and insertions/deletions, among which only a few substitutions occurred in several exons not to severely alter the amino acid sequence of the Wx protein. The most striking difference observed in introns was a 139-bp deletion (or insertion) in intron 10 of O. glaberrima (or O. sativa). In O. sativa, 125 bp of the 139-bp sequence was flanked by direct repeats of a 14-bp sequence. A sequence homologous to the 125-bp sequence was found in the region preceding exon 2; this sequence was also flanked by direct repeats of another 14-bp sequence. This result and the observation that the 125-bp sequence was interspersed in rice genomes indicate that they are SINEs (short interspersed elements) in the plant system. We also identified a DNA sequence with long terminal inverted repeats in intron 13 of both O. glaberrima and O. sativa. This sequence was present in multiple copies in rice genomes, suggesting that it is a transposable element. These results obtained suggest that mobile DNA elements have diversified the rice Waxy gene by inserting into introns, each of which may originally have a length of about 100 bp.  相似文献   

18.
Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that "resets" of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures.  相似文献   

19.
We examined the gene structure of a set of 2563 Arabidopsis thaliana paralogous pairs that were duplicated simultaneously 20-60 MYA by tetraploidy. Out of a total of 23,164 introns in these genes, we found that 10,004 pairs have been conserved and 578 introns have been inserted or deleted in the time since the duplication event. This intron insertion/deletion rate of 2.7 x 10(-3) to 9.1 x 10(-4) per site per million years is high in comparison to previous studies. At least 56 introns were gained and 39 lost based on parsimony analysis of the phylogenetic distribution of these introns. We found weak evidence that genes undergoing intron gain and loss are biased with respect to gene ontology terms. Gene pairs that experienced at least 2 intron insertions or deletions show evidence of enrichment for membrane location and transport and transporter activity function. We do not find any relationship of intron flux to expression level or G + C content of the gene. Detection of a bias in the location of intron gains and losses within a gene depends on the method of measurement: an intragene method indicates that events (specifically intron losses) are biased toward the 3' end of the gene. Despite the relatively recent acquisition of these introns, we found only one case where we could identify the mechanism of intron origin--the TOUCH3 gene has experienced 2 tandem, partial, internal gene duplications that duplicated a preexisting intron and also created a novel, alternatively spliced intron that makes use of a duplicated pair of cryptic splice sites.  相似文献   

20.
Stearoyl-acyl carrier protein desaturase (SAD, EC 1.14.99.6) produces oleic acid (18:1 9) by desaturating 18:0. SAD genes have been targets for breeding and engineering oilseed crops with increased stearic acid (18:0). Our aim was to clone, describe, and develop genetic markers for the SAD genes of sunflower (Helianthus annuus L.). Nineteen SAD cDNA clones were partially sequenced and found to belong to two groups. Full-length cDNAs from each group (SAD6 and SAD17) were completely sequenced. The amino acid identity of SAD6 and SAD17 was 89%. Both genes were strongly expressed in developing seeds, moderately expressed in leaves and flowers, and weakly expressed in cotyledons, roots, and stems. One intron was found in SAD6 and two introns were found in SAD17. The SAD introns from two inbred lines (HA370 and HA372) were sequenced and found to vary in length and nucleotide sequence. The length variants were caused by monomeric repeat length differences, insertions, and deletions. Three long poly-T repeats (T9 to T39) were found in one of the SAD17 introns. Three short adjacent CA repeats were found in the 5-untranslated region of SAD6. DNA fragment length polymorphism (DFLP), single-strand conformational polymorphism (SSCP), and simple sequence repeat (SSR) markers were developed for SAD6 and SAD17 by developing primers to flank introns or the CA repeats. Two of six DFLP, four of six SSCP, and one of two SSR markers were polymorphic among eight elite inbred lines. The polymorphic information contents for DFLP, SSCP, and SSR markers were 0.18, 0.37, and 0.30, respectively. Most of the polymorphisms were caused by intron fragment length polymorphisms. Introns may be an excellent source of hypervariable markers in sunflower and other crop plants.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号