首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 890 毫秒
1.
The rapid generation of new shapes observed in the living world is the result of genetic variation, especially in "morphological" developmental genes. Many of these genes contain coding tandem repeats. Fondon and Garner have shown that expansions and contractions of these repeats are associated with the great diversity of morphologies observed in the domestic dog, Canis familiaris. In particular, they found that the repeat variations in two genes were significantly associated with changes in limb and skull morphology. These results open the possibility that such a mechanism contributes to the diversity of life.  相似文献   

2.
Mutation patterns of amino acid tandem repeats in the human proteome   总被引:1,自引:0,他引:1  

Background

Amino acid tandem repeats are found in nearly one-fifth of human proteins. Abnormal expansion of these regions is associated with several human disorders. To gain further insight into the mutational mechanisms that operate in this type of sequence, we have analyzed a large number of mutation variants derived from human expressed sequence tags (ESTs).

Results

We identified 137 polymorphic variants in 115 different amino acid tandem repeats. Of these, 77 contained amino acid substitutions and 60 contained gaps (expansions or contractions of the repeat unit). The analysis showed that at least about 21% of the repeats might be polymorphic in humans. We compared the mutations found in different types of amino acid repeats and in adjacent regions. Overall, repeats showed a five-fold increase in the number of gap mutations compared to adjacent regions, reflecting the action of slippage within the repetitive structures. Gap and substitution mutations were very differently distributed between different amino acid repeat types. Among repeats containing gap variants we identified several disease and candidate disease genes.

Conclusion

This is the first report at a genome-wide scale of the types of mutations occurring in the amino acid repeat component of the human proteome. We show that the mutational dynamics of different amino acid repeat types are very diverse. We provide a list of loci with highly variable repeat structures, some of which may be potentially involved in disease.  相似文献   

3.
A Phylogenetic Perspective on Sequence Evolution in Microsatellite Loci   总被引:9,自引:0,他引:9  
We examined the evolution of the repeat regions of three noncoding microsatellite loci in 58 species of the Polistinae, a subfamily of wasps that diverged over 140 million years ago. A phylogenetic approach allows two new kinds of approaches to studying microsatellite evolution: character mapping and comparative analysis. The basic repeat structure of the loci was highly conserved, but was often punctuated with imperfections that appear to be phylogenetically informative. Repeat numbers evolved more rapidly than other changes in the repeat region. Changes in number of repeats among species seem consistent with the stepwise mutation model, which is based on slippage during replication as the main source of mutations. Changes in repeat numbers can occur even when there are very few tandem repeats but longer repeats, especially perfect repeats led to greater rates of evolutionary change. Species phylogenetically closer to the one from which we identified the loci had longer stretches of uninterrupted repeats and more different motifs, but not longer total repeat regions. The number of perfect repeats increased more often than it decreased. However, there was no evidence that some species have consistently greater numbers of repeats across loci than other species have, once ascertainment bias is eliminated. We also found no evidence for a population size effect posited by one form of the directionality hypothesis. Overall, phylogenetic variation in repeat regions can be explained by adding neutral evolution to what is already known about the mutation process. The life cycle of microsatellites appears to reflect a balance between growth by slippage and degradation by an essentially irreversible accumulation of imperfections. Received: 13 April 1999 / Accepted: 8 September 1999  相似文献   

4.
Minisatellites are DNA tandem repeats exhibiting size polymorphism among individuals of a population. This polymorphism is generated by two different mechanisms, both in human and yeast cells, "replication slippage" during S-phase DNA synthesis and "repair slippage" associated to meiotic gene conversion. The Saccharomyces cerevisiae genome contains numerous natural minisatellites. They are located on all chromosomes without any obvious distribution bias. Minisatellites found in protein-coding genes have longer repeat units and on the average more repeat units than minisatellites in noncoding regions. They show an excess of cytosines on the coding strand, as compared to guanines (negative GC skew). They are always multiples of three, encode serine- and threonine-rich amino acid repeats, and are found preferably within genes encoding cell wall proteins, suggesting that they are positively selected in this particular class of genes. Genome-wide, there is no statistically significant association between minisatellites and meiotic recombination hot spots. In addition, minisatellites that are located in the vicinity of a meiotic hot spot are not more polymorphic than minisatellites located far from any hot spot. This suggests that minisatellites, in S. cerevisiae, evolve probably by strand slippage during replication or mitotic recombination. Finally, evolution of minisatellites among hemiascomycetous yeasts shows that even though many minisatellite-containing genes are conserved, most of the time the minisatellite itself is not conserved. The diversity of minisatellite sequences found in orthologous genes of different species suggests that minisatellites are differentially acquired and lost during evolution of hemiascomycetous yeasts at a pace faster than the genes containing them.  相似文献   

5.
Genome variation studies in Plasmodium falciparum have focused on SNPs and, more recently, large-scale copy number polymorphisms and ectopic rearrangements. Here, we examine another source of variation: variable number tandem repeats (VNTRs). Interspersed low complexity features, including the well-studied P. falciparum microsatellite sequences, are commonly classified as VNTRs; however, this study is focused on longer coding VNTR polymorphisms, a small class of copy number variations. Selection against frameshift mutation is a main constraint on tandem repeats (TRs) in coding regions, while limited propagation of TRs longer than 975 nt total length is a minor restriction in coding regions. Comparative analysis of three P. falciparum genomes reveals that more than 9% of all P. falciparum ORFs harbor VNTRs, much more than has been reported for any other species. Moreover, genotyping of VNTR loci in a drug-selected line, progeny of a genetic cross, and 334 field isolates demonstrates broad variability in these sequences. Functional enrichment analysis of ORFs harboring VNTRs identifies stress and DNA damage responses along with chromatin modification activities, suggesting an influence on genome mutability and functional variation. Analysis of the repeat units and their flanking regions in both P. falciparum and Plasmodium reichenowi sequences implicates a replication slippage mechanism in the generation of TRs from an initially unrepeated sequence. VNTRs can contribute to rapid adaptation by localized sequence duplication. They also can confound SNP-typing microarrays or mapping short-sequence reads and therefore must be accounted for in such analyses.  相似文献   

6.
Short tandem repeats (STRs) are units of 1–6 bp that repeat in a tandem fashion in DNA. Along with single nucleotide polymorphisms and large structural variations, they are among the major genomic variants underlying genetic, and likely phenotypic, divergence. STRs experience mutation rates that are orders of magnitude higher than other well-studied genotypic variants. Frequent copy number changes result in a wide range of alleles, and provide unique opportunities for modulating complex phenotypes through variation in repeat length. While classical studies have identified key roles of individual STR loci, the advent of improved sequencing technology, high-quality genome assemblies for diverse species, and bioinformatics methods for genome-wide STR analysis now enable more systematic study of STR variation across wide evolutionary ranges. In this review, we explore mutation and selection processes that affect STR copy number evolution, and how these processes give rise to varying STR patterns both within and across species. Finally, we review recent examples of functional and adaptive changes linked to STRs.  相似文献   

7.
Genes containing multiple coding mini- and microsatellite repeats are highly dynamic components of genomes. Frequent recombination events within these tandem repeats lead to changes in repeat numbers, which in turn alters the amino acid sequence of the corresponding protein. In bacteria and yeasts, the expansion of such coding repeats in cell wall proteins is associated with alterations in immunogenicity, adhesion, and pathogenesis. We hypothesized that identification of repeat-containing putative cell wall proteins in the human pathogen Aspergillus fumigatus may reveal novel pathogenesis-related elements. Here, we report that the genome of A. fumigatus contains as many as 292 genes with internal repeats. Fourteen of 30 selected genes showed size variation of their repeat-containing regions among 11 clinical A. fumigatus isolates. Four of these genes, Afu3g08990, Afu2g05150 (MP-2), Afu4g09600, and Afu6g14090, encode putative cell wall proteins containing a leader sequence and a glycosylphosphatidylinositol anchor motif. All four genes are expressed and produce variable-size mRNA encoding a discrete number of repeat amino acid units. Their expression was altered during development and in response to cell wall-disrupting agents. Deletion of one of these genes, Afu3g08990, resulted in a phenotype characterized by rapid conidial germination and reduced adherence to extracellular matrix suggestive of an alteration in cell wall characteristics. The Afu3g08990 protein was localized to the cell walls of dormant and germinating conidia. Our findings suggest that a subset of the A. fumigatus cell surface proteins may be hypervariable due to recombination events in their internal tandem repeats. This variation may provide the functional diversity in cell surface antigens which allows rapid adaptation to the environment and/or elusion of the host immune system.  相似文献   

8.
Microsatellites (MSs) are short tandem DNA repeats with the repetitive motif of two to six nucleotides, forming tracts up to hundreds of nucleotides long. Notwithstanding the active use of MSs in genetic studies of various biological problems, the reasons for their wide occurrence in the genome, their possible functions, and mutational behavior are still unclear. The mutation rate in MS repeats is on average several orders of magnitude higher than in the remaining DNA, which allows for direct estimation of evolutionary transformation rate in nucleotide sequences of the genome. Mutation process in MSs is highly heterogeneous, with distinct differences between species; furthermore, within a species it differs among loci with different repeat size, among alleles of one locus, and among individuals of different sex and age. Most MS mutations are caused by DNA slippage during replication but the probability of this event depends on the locus. In this review, a number of models of MS evolution are discussed, which account for the relationship between mutation rate and allele size, different mutation direction in alleles of different size, and the appearance of point mutations within repeat tracts restricting allele size. The MS evolution is considered mainly in the context of selective neutrality, although there is evidence showing functional significance of some variants of tandem repeats and thus their possible selective value.  相似文献   

9.
Human microsatellites: mutation and evolution   总被引:1,自引:0,他引:1  
Nikitina TV  Nazarenko SA 《Genetika》2004,40(10):1301-1318
Microsatellites (MSs) are short tandem DNA repeats with the repetitive motif of two to six nucleotides, forming tracts up to hundreds of nucleotides long. Notwithstanding the active use of MSs in genetic studies of various biological problems, the reasons for their wide occurrence in the genome, their possible functions, and mutational behavior are still unclear. The mutation rate in MS repeats is on average several orders of magnitude higher than in the remaining DNA, which allows for direct estimation of evolutionary transformation rate in nucleotide sequences of the genome. Mutation process in MSs is species-specific; furthermore, within a species it differs among loci with different repeat size, among alleles of one locus, and among individuals of different sex and age. Most MS mutations are caused by DNA slippage during replication but the probability of this event depends on the locus. In this review, a number of models of MS evolution are discussed, which account for the relationship between mutation rate and allele size, different mutation direction in alleles of different size, and the appearance of point mutations within repeat tracts restricting allele size. The MS evolution is considered mainly in the context of selective neutrality, although there is evidence showing functional significance of some variants of tandem repeats and thus their possible selective value.  相似文献   

10.

Background

Ancestral reconstructions of mammalian genomes have revealed that evolutionary breakpoint regions are clustered in regions that are more prone to break and reorganize. What is still unclear to evolutionary biologists is whether these regions are physically unstable due solely to sequence composition and/or genome organization, or do they represent genomic areas where the selection against breakpoints is minimal.

Methodology and Principal Findings

Here we present a comprehensive study of the distribution of tandem repeats in great apes. We analyzed the distribution of tandem repeats in relation to the localization of evolutionary breakpoint regions in the human, chimpanzee, orangutan and macaque genomes. We observed an accumulation of tandem repeats in the genomic regions implicated in chromosomal reorganizations. In the case of the human genome our analyses revealed that evolutionary breakpoint regions contained more base pairs implicated in tandem repeats compared to synteny blocks, being the AAAT motif the most frequently involved in evolutionary regions. We found that those AAAT repeats located in evolutionary regions were preferentially associated with Alu elements.

Significance

Our observations provide evidence for the role of tandem repeats in shaping mammalian genome architecture. We hypothesize that an accumulation of specific tandem repeats in evolutionary regions can promote genome instability by altering the state of the chromatin conformation or by promoting the insertion of transposable elements.  相似文献   

11.
The mechanism of recombination of tandem repeats in the chromosome of Escherichia coli was investigated by genetic means. Tandem repeats 624 bp long were introduced into the lacZ gene of E. coli and the efficiency of deletion of one repeat was compared in different recombination mutants. No effects of the recA , recBC , recF , ruvA or ruvA recG mutations were detected. Hence, tandem repeat deletion appears to not proceed via the RecBCD or RecF homologous recombination pathways. A new mutant in which RecA-independent recombination is increased 15-fold was isolated. The mutation lies in the dnaE gene coding for the alpha subunit of polymerase III: it is a Gly to Asp change at codon 133. Another dnaE mutation, dnaE486 , was tested and also shown to stimulate RecA-independent recombination. It is proposed that tandem-repeat recombination occurs by a replication slippage mechanism. RecA-independent recombination is also enhanced in a rep mutant, in which chromosomal replication is slowed down by the absence of the Rep helicase, suggesting that replication pausing may facilitate slippage.  相似文献   

12.

Background

A fundamental question in comparative genomics concerns the identification of mechanisms that underpin chromosomal change. In an attempt to shed light on the dynamics of mammalian genome evolution, we analyzed the distribution of syntenic blocks, evolutionary breakpoint regions, and evolutionary breakpoints taken from public databases available for seven eutherian species (mouse, rat, cattle, dog, pig, cat, and horse) and the chicken, and examined these for correspondence with human fragile sites and tandem repeats.

Results

Our results confirm previous investigations that showed the presence of chromosomal regions in the human genome that have been repeatedly used as illustrated by a high breakpoint accumulation in certain chromosomes and chromosomal bands. We show, however, that there is a striking correspondence between fragile site location, the positions of evolutionary breakpoints, and the distribution of tandem repeats throughout the human genome, which similarly reflect a non-uniform pattern of occurrence.

Conclusion

These observations provide further evidence that certain chromosomal regions in the human genome have been repeatedly used in the evolutionary process. As a consequence, the genome is a composite of fragile regions prone to reorganization that have been conserved in different lineages, and genomic tracts that do not exhibit the same levels of evolutionary plasticity.  相似文献   

13.
Haemophilus influenzae (Hi), an obligate upper respiratory tract commensal/pathogen, uses phase variation (PV) to adapt to host environment changes. Switching occurs by slippage of nucleotide repeats (microsatellites) within genes coding for virulence molecules. Most such microsatellites in Hi are tetranucleotide repeats, but an exception is the dinucleotide repeats in the pilin locus. To investigate the effects on PV rates of mutations in genes for mismatch repair (MMR), insertion/deletion mutations of mutS, mutL, mutH, dam, polI, uvrD, mfd and recA were constructed in Hi strain Rd. Only inactivation of polI destabilized tetranucleotide (5'AGTC) repeat tracts of chromosomally located reporter constructs, whereas inactivation of mutS, but not polI, destabilized dinucleotide (5'AT) repeats. Deletions of repeats were predominant in polI mutants, which we propose are due to end-joining occurring without DNA polymerization during polI-deficient Okazaki fragment processing. The high prevalence of tetranucleotides mediating PV is an exceptional feature of the Hi genome. The refractoriness to MMR of hypermutation in Hi tetranucleotides facilitates adaptive switching without the deleterious increase in global mutation rates that accompanies a mutator genotype.  相似文献   

14.
After the dog genome was sequenced, an increasing number of studies involving genetic research of dogs have been conducted to understand gene functions and mammalian evolution. To study the genetic diversity in dogs and other mammals, genetic markers linked to function and conserved in wide lineages are necessary. Thus far, few polymorphic markers have been used in dogs. In this study, we surveyed the entire dog genome and predicted a total of 109 tandem repeats (TRs) located on the protein coding region that may be polymorphic by our prediction model. We selected 10 TRs that may be related to neurophysiology and neural developments, and tested them in 167 individuals of 8 dog breeds: 5 European dog breeds (Beagle, Golden Retriever, Labrador Retriever, German Shepherd, and Toy Poodle) and 3 Japanese dog breeds (Japanese Spitz, Shiba, and Shikoku). Among the tested TRs, nine were polymorphic indicating that 90% of the TRs were successfully predicted to be polymorphic. PCR fragments of the TRs were amplified from dog brain cDNA, showing their expression in the dog brain. Our results provide abundant opportunities for the study of phenotypic variations in dogs, and our prediction method for variable number of tandem repeats (VNTRs) can be applied to any other animal genome sequences for the survey of functional and polymorphic markers.  相似文献   

15.
High levels of genetic diversity are generated in Haemophilus influenzae populations through DNA repeat-mediated phase variation and recombination with DNA fragments acquired by uptake from the external milieu. Conversely, multiple pathways for maintenance of the genome sequence are encoded in H. influenzae genomes. In Escherichia coli, mutations in single-stranded-DNA exonucleases destabilise tandem DNA repeats whilst inactivation of recG can stabilise repeat tracts. These enzymes also have varying effects on recombination. Deletion mutations were constructed in H. influenzae genes encoding homologs of ExoI, RecJ and RecG whilst ExoVII was refractory to mutation. Inactivation of RecJ and RecG, but not ExoI, increased sensitivity to irradiation with ultraviolet light. An increase in spontaneous mutation rate was not observed in single mutants but only when both RecJ and ExoI were mutated. None of the single- or double-mutations increased or decreased the rates of slippage in tetranucleotide repeat tracts. Furthermore, the exonuclease mutants did not exhibit significant defects in horizontal gene transfer. We conclude that RecJ, ExoI and RecG are required for maintenance of the H. influenzae genome but none of these enzymes influence the generation of genetic diversity through mutations in the tetranucleotide repeat tracts of this species.  相似文献   

16.
We describe the isolation of two chromosomal DNA fragments from Plasmodium falciparum. These fragments encode the antigenically distinct S antigens of two different P. falciparum isolates, namely FC27 from Papua New Guinea and NF7 from Ghana. The complete nucleotide sequences of both fragments are presented. The fragments are homologous over most of their lengths, including the entire regions flanking the protein coding sequences. Whereas the N- and C-terminal portions of sequences encoding the S antigens are homologous, major portions of the coding sequences are not. The nonhomologous regions are comprised of tandemly repeated sequences, of 33 bp in FC27 and predominantly of 24 bp in NF7. The 33 bp tandem repeats encoded by the FC27 S-antigen gene could not be detected in the NF7 genome. Conversely, the 24 bp tandem repeats encoded by the NF7 S-antigen gene could not be detected in the FC27 genome. The pattern of sequence variation within the repeats of both genes suggests a mechanism for the generation of S-antigen diversity.  相似文献   

17.
Simple sequence repeats are predominantly found in most organisms. They play a major role in studies of genetic diversity, and are useful as diagnostic markers for many diseases. The simple sequence repeats database (SSRD) for the human genome was created for easy access to such repeats, for analysis, and to be used to understand their biological significance. The data includes the abundance and distribution of SSRs in the coding and non-coding regions of the genome, as well as their association with the UTRs of genes. The exact locations of repeats with respect to genomic regions (such as UTRs, exons, introns or intergenic regions) and their association with STS markers are also highlighted. The resource will facilitate repeat sequence analysis in the human genome and the understanding of the functional and evolutionary significance of simple sequence repeats. SSRD is available through two websites, http://www.ccmb.res.in/ssr and http://www.ingenovis.com/ssr.  相似文献   

18.
Mularoni L  Veitia RA  Albà MM 《Genomics》2007,89(3):316-325
Single-amino-acid tandem repeats are very common in mammalian proteins but their function and evolution are still poorly understood. Here we investigate how the variability and prevalence of amino acid repeats are related to the evolutionary constraints operating on the proteins. We find a significant positive correlation between repeat size difference and protein nonsynonymous substitution rate in human and mouse orthologous genes. This association is observed for all the common amino acid repeat types and indicates that rapid diversification of repeat structures, involving both trinucleotide slippage and nucleotide substitutions, preferentially occurs in proteins subject to low selective constraints. However, strikingly, we also observe a significant negative correlation between the number of repeats in a protein and the gene nonsynonymous substitution rate, particularly for glutamine, glycine, and alanine repeats. This implies that proteins subject to strong selective constraints tend to contain an unexpectedly high number of repeats, which tend to be well conserved between the two species. This is consistent with a role for selection in the maintenance of a significant number of repeats. Analysis of the codon structure of the sequences encoding the repeats shows that codon purity is associated with high repeat size interspecific variability. Interestingly, polyalanine and polyglutamine repeats associated with disease show very distinctive features regarding the degree of repeat conservation and the protein sequence selective constraints.  相似文献   

19.
串联重复序列的物种差异及其生物功能   总被引:13,自引:0,他引:13  
高焕  孔杰 《动物学研究》2005,26(5):555-564
串联重复序列是指1-200个碱基左右的核心重复单位,以头尾相串联的方式重复多次所组成的重 复序列。它广泛存在于真核生物和一些原核生物的基因组中,并表现出种属、碱基组成等的特异性。在基因组 整体水平上,各种优势的重复序列类型不同。即使在同一重复序列类型内部,不同重复拷贝类别(如AT、AC 等)在基因组中的存在也表现出很大的差异。同时,这些重复序列类型和各重复拷贝类别在同一物种的不同染 色体间,以及基因的编码区和非编码区间也表现种属和碱基组成差异。这些差异显示了重复序列起源和进化的 复杂性,可能涉及到多种机制和因素,并与生物功能密切相关。另外,由于重复序列分析软件和统计标准还存 在算法、重复长度、完美性等问题,需要进一步探讨。此外,串联重复序列的自身进化关系、全基因组水平上 的进化地位、在基因组中的生物功能、重复序列数据库建立和应用研究等,将是今后研究的主要课题。  相似文献   

20.
Jo YD  Park J  Kim J  Song W  Hur CG  Lee YH  Kang BC 《Plant cell reports》2011,30(2):217-229
Plants in the family Solanaceae are used as model systems in comparative and evolutionary genomics. The complete chloroplast genomes of seven solanaceous species have been sequenced, including tobacco, potato and tomato, but not peppers. We analyzed the complete chloroplast genome sequence of the hot pepper, Capsicum annuum. The pepper chloroplast genome was 156,781 bp in length, including a pair of inverted repeats (IR) of 25,783 bp. The content and the order of 133 genes in the pepper chloroplast genome were identical to those of other solanaceous plastomes. To characterize pepper plastome sequence, we performed comparative analysis using complete plastome sequences of pepper and seven solanaceous plastomes. Frequency and contents of large indels and tandem repeat sequences and distribution pattern of genome-wide sequence variations were investigated. In addition, a phylogenetic analysis using concatenated alignments of coding sequences was performed to determine evolutionary position of pepper in Solanaceae. Our results revealed two distinct features of pepper plastome compared to other solanaceous plastomes. Firstly, large indels, including insertions on accD and rpl20 gene sequences, were predominantly detected in the pepper plastome compared to other solanaceous plastomes. Secondly, tandem repeat sequences were particularly frequent in the pepper plastome. Taken together, our study represents unique features of evolution of pepper plastome among solanaceous plastomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号