首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Proteins with homopolymeric repeat tracts are very common in the human proteome. Intriguingly, some but not all repeat tracts show length variation in the population and, in a few, the expansion of repeat tract beyond the normal length is associated with neurodegenerative and developmental disorders. In this study we have addressed questions such as why some amino acid residues are favored in longer repeat tracts and why repeat tracts show terminal bias. Using cell biological assays for repeat tracts fused to green fluorescent protein we show here that homopolymeric repeats that are beyond their naturally occurring length in the proteome are cytotoxic in nature. This toxicity is further modulated by the length of the peptide that bears the repeat and the spatial location of the repeat within the peptide. Thus, the cellular toxicity appears to be one of the selective processes that regulate the evolution of homopolymeric repeats in the proteome.  相似文献   

2.
A census of protein repeats.   总被引:20,自引:0,他引:20  
In this study, we analyzed all known protein sequences for repeating amino acid segments. Although duplicated sequence segments occur in 14 % of all proteins, eukaryotic proteins are three times more likely to have internal repeats than prokaryotic proteins. After clustering the repetitive sequence segments into families, we find repeats from eukaryotic proteins have little similarity with prokaryotic repeats, suggesting most repeats arose after the prokaryotic and eukaryotic lineages diverged. Consequently, protein classes with the highest incidence of repetitive sequences perform functions unique to eukaryotes. The frequency distribution of the repeating units shows only weak length dependence, implicating recombination rather than duplex melting or DNA hairpin formation as the limiting mechanism underlying repeat formation. The mechanism favors additional repeats once an initial duplication has been incorporated. Finally, we show that repetitive sequences are favored that contain small and relatively water-soluble residues. We propose that error-prone repeat expansion allows repetitive proteins to evolve more quickly than non-repeat-containing proteins.  相似文献   

3.
4.
Instability of short tandem repeats (STRs) in DNA during replication is observed in all organisms examined, and is causatively involved in various human diseases. We explore the mechanisms involved in instability by examining length changes occurring during the replication of [(CA)(20)TA](n) and [(CAG)(20)TAG](n), in human cells. We show that the majority of alterations consist of an insertion or deletion of one repeat unit, and base substitutions or length changes involving many repeat units are rare. We also show that length changes of two-tract STRs are biased toward the 3'-end of the repeat tract, in reference to lagging strand synthesis. There are some differences between our observations and previous observations in microbes, e.g. the orientation effect was not observed in this study. The results of this study are discussed in terms of the molecular mechanisms leading to alterations in repeat tracts.  相似文献   

5.
Complete chromosome/genome sequences available from humans, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, and Saccharomyces cerevisiae were analyzed for the occurrence of mono-, di-, tri-, and tetranucleotide repeats. In all of the genomes studied, dinucleotide repeat stretches tended to be longer than other repeats. Additionally, tetranucleotide repeats in humans and trinucleotide repeats in Drosophila also seemed to be longer. Although the trends for different repeats are similar between different chromosomes within a genome, the density of repeats may vary between different chromosomes of the same species. The abundance or rarity of various di- and trinucleotide repeats in different genomes cannot be explained by nucleotide composition of a sequence or potential of repeated motifs to form alternative DNA structures. This suggests that in addition to nucleotide composition of repeat motifs, characteristic DNA replication/repair/recombination machinery might play an important role in the genesis of repeats. Moreover, analysis of complete genome coding DNA sequences of Drosophila, C. elegans, and yeast indicated that expansions of codon repeats corresponding to small hydrophilic amino acids are tolerated more, while strong selection pressures probably eliminate codon repeats encoding hydrophobic and basic amino acids. The locations and sequences of all of the repeat loci detected in genome sequences and coding DNA sequences are available at http://www.ncl-india.org/ssr and could be useful for further studies.  相似文献   

6.
Evolution of N-terminal sequences of the vertebrate HOXA13 protein   总被引:8,自引:0,他引:8  
While the the role of the homeodomain in HOX function has been evaluated extensively, little attention has been given to the non-homeodomain portions of the HOX proteins. To investigate the evolution of the HOXA13 protein and to identify conserved residues in the N-terminal region of the protein with potential functional significance, N-terminal Hoxa13 coding sequences were PCR-amplified from fish, amphibian, reptile, chicken, and marsupial and eutherian mammal genomic DNA. Compared with fish HOXA13, the mammalian protein has increased in size by 35% primarily owing to the accumulation of alanine repeats and flanking segments rich in proline, glycine, or serine within the first 215 amino acids. Certain residues and amino acid motifs were strongly conserved, and several HOXA13 N-terminal domains were also shared in the paralogous HOXB13 and HOXD13 genes; however, other conserved regions appear to be unique to HOXA13. Two domains highly conserved in HOXA13 orthologs are shared with Drosophila AbdB and other vertebrate AbdB-like proteins. Marsupial and eutherian mammalian HOXA13 proteins have three large homopolymeric alanine repeats of 14, 12, and 17–18 residues that are absent in reptiles, birds, and fish. Thus, the repeats arose after the divergence of reptiles from the lineage that would give rise to the mammals. In contrast, other short homopolymeric alanine repeats in mammalian HOXA13 have remained virtually the same length, suggesting that forces driving or limiting repeat expansion are context dependent. Consecutive stretches of identical third-base usage in alanine codons within the large repeats were found, supporting replication slippage as a mechanism for their generation. However, numerous species-specific base substitutions affecting third-base alanine repeat codon positions were observed, particularly in the largest repeat. Therefore, if the large alanine repeats were present prior to eutherian mammal development as is suggested by the opossum data, then a dynamic process of recurring replication slippage and point mutation within alanine repeat codons must be considered to reconcile these observations. This model might also explain why the alanine repeats are flanked by proline, serine, and glycine-rich sequences, and it reveals a biological mechanism that promotes increases in protein size and, potentially, acquisition of new functions. Received: 8 June 1999 / Accepted: 23 September 1999  相似文献   

7.
Instability of simple sequence DNA in Saccharomyces cerevisiae.   总被引:20,自引:5,他引:15       下载免费PDF全文
All eukaryotic genomes thus far examined contain simple sequence repeats. A particularly common simple sequence in many organisms (including humans) consists of tracts of alternating GT residues on one strand. Allelic poly(GT) tracts are often of different lengths in different individuals, indicating that they are likely to be unstable. We examined the instability of poly(GT) and poly(G) tracts in the yeast Saccharomyces cerevisiae. We found that these tracts were dramatically unstable, altering length at a minimal rate of 10(-4) events per division. Most of the changes involved one or two repeat unit additions or deletions, although one alteration involved an interaction with the yeast telomeres.  相似文献   

8.
The recombinational properties of long GAA.TTC repeating sequences were analyzed in Escherichia coli to gain further insights into the molecular mechanisms of the genetic instability of this tract as possibly related to the etiology of Friedreich's ataxia. Intramolecular and intermolecular recombination studies showed that the frequency of recombination between the GAA.TTC tracts was as much as 15 times higher than the non-repeating control sequences. Homologous, intramolecular recombination between GAA.TTC tracts and GAAGGA.TCCTTC repeats also occurred with a very high frequency (approximately 0.8%). Biochemical analyses of the recombination products demonstrated the expansions and deletions of the GAA.TTC repeats. These results, together with our previous studies on the CTG.CAG sequences, suggest that the recombinational hot spot characteristics may be a common feature of all triplet repeat sequences. Unexpectedly, we found that the recombination properties of the GAA.TTC tracts were unique, compared with CTG.CAG repeats, because they depended on the DNA secondary structure polymorphism. Increasing the length of the GAA.TTC repeats decreased the intramolecular recombination frequency between these tracts. Also, a correlation was found between the propensity of the GAA.TTC tracts to adopt the sticky DNA conformation and the inhibition of intramolecular recombination. The use of novobiocin to modulate the intracellular DNA topology, i.e. the lowering of the negative superhelical density, repressed the formation of the sticky DNA structure, thereby restoring the expected positive correlation between the length of the GAA.TTC tracts and the frequency of intramolecular recombination. Hence, our results demonstrate that sticky DNA exists and functions in E. coli.  相似文献   

9.
Alterations in the length (instability) of gene-specific microsatellites and minisatellites are associated with at least 35 human diseases. This review will discuss the various cis-elements that contribute to repeat instability, primarily through examination of the most abundant disease-associated repetitive element, trinucleotide repeats. For the purpose of this review, we define cis-elements to include the sequence of the repeat units, the length and purity of the repeat tracts, the sequences flanking the repeat, as well as the surrounding epigenetic environment, including DNA methylation and chromatin structure. Gender-, tissue-, developmental- and locus-specific cis-elements in conjunction with trans-factors may facilitate instability through the processes of DNA replication, repair and/or recombination. Here we review the available human data that supports the involvement of cis-elements in repeat instability with limited reference to model systems. In diverse tissues at different developmental times and at specific loci, repetitive elements display variable levels of instability, suggesting vastly different mechanisms may be responsible for repeat instability amongst the disease loci and between various tissues.  相似文献   

10.
Over 20 unstable microsatellite repeats have been identified as the cause of neurological disease in humans. The repeat nucleotide sequences, their location within the genes, the ranges of normal and disease‐causing repeat length and the clinical outcomes differ. Unstable repeats can be located in the coding or the non‐coding region of a gene. Different pathogenic mechanisms that are hypothesised to underlie the diseases are discussed. Evidence is given both from studies in simple model systems and from studies on human material and in animal models. Since somatic instability might affect the clinical outcome, this is briefly touched on. Available data and theories on the timing and mechanisms of the repeat instability itself are discussed, along with factors that have been observed to affect instability. Finally, the question of why the often harmful unstable repeats have been maintained throughout evolution is addressed.  相似文献   

11.
Expansion of trinucleotide repeat tracts has been shown to be associated with numerous human diseases. The mechanism and timing of the expansion events are poorly understood, however. We show that CTG repeats, associated with the human DMPK gene and implanted in two homologous yeast artificial chromosomes (YACs), are very unstable. The instability is 6 to 10 times more pronounced in meiosis than during mitotic division. The influence of meiosis on instability is 4.4 times greater when the second YAC with a repeat tract is not present. Most of the changes we observed in trinucleotide repeat tracts are large contractions of 21 to 50 repeats. The orientation of the insert with the repeats has no effect on the frequency and distribution of the contractions. In our experiments, expansions were found almost exclusively during gametogenesis. Genetic analysis of segregating markers among meiotic progeny excluded unequal crossover as the mechanism for instability. These unique patterns have novel implications for possible mechanisms of repeat instability.  相似文献   

12.
Homologous recombination was shown to enable the expansion of CTG.CAG repeat sequences. Other prior investigations revealed the involvement of replication and DNA repair in these genetic instabilities. Here we used a genetic assay to measure the frequency of homologous intermolecular recombination between two CTG.CAG tracts. When compared with non-repeating sequences of similar lengths, long (CTG.CAG)(n) repeats apparently recombine with an approximately 60-fold higher frequency. Sequence polymorphisms that interrupt the homogeneity of the CTG.CAG repeat tracts reduce the apparent recombination frequency as compared with the pure uninterrupted repeats. The orientation of the repeats relative to the origin of replication strongly influenced the apparent frequency of recombination. This suggests the involvement of DNA replication in the recombination process of triplet repeats. We propose that DNA polymerases stall within the CTG.CAG repeat tracts causing nicks or double-strand breaks that stimulate homologous recombination. The recombination process is RecA-dependent.  相似文献   

13.
14.
Abstract The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum , was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated M r value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of M r 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.  相似文献   

15.
The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated M(r) value of 57,190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27-61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein is synthesized as a 557-amino acid precursor and processed to produce a mature protein of M(r) 54,505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.  相似文献   

16.
Trinucleotide repeat (TNR) instability is of interest because of its central role in human diseases such as Huntington’s and its unique genetic features. One distinctive characteristic of TNR instability is a threshold, defined as a minimal repeat length that confers frequent mutations. While thresholds are well established, important risk determinants for disease-causing mutations, their mechanistic analysis has been delayed by the lack of suitably tractable experimental systems. In this study, we directly compared for the first time three DNA elements—TNR sequence, purity and flanking sequence—all of which are suggested in the literature to contribute to thresholds. In a yeast model system, we find that CAG repeats require a substantially longer threshold to contract than CTG tracts, indicating that the lagging template repeat sequence helps determine the threshold. In contrast, ATG interruptions within a CTG run do not inhibit contractions via a threshold mechanism, but by altering the likelihood of forming a hairpin intermediate. The presence of a GC-rich flanking sequence, similar to a haplotype found in some Huntington’s patients, does not detectably alter expansions of Okazaki fragment CTG tracts, suggesting no role for this flanking sequence on thresholds. Together these results help better define TNR thresholds by delineating sequence elements that modulate instability.  相似文献   

17.
To examine the chromosomal stability of repetitions of the trinucleotide CAG, we have cloned CAG repeat tracts onto the 3' end of the Saccharomyces cerevisiae ADE2 gene and placed the appended gene into the ARO2 locus of chromosome VII. Examination of chromosomal DNA from sibling colonies arising from clonal expansion of strains harboring repeat tracts showed that repeat tracts often change in length. Most changes in tract length are decreases, but rare increases also occur. Longer tracts are more unstable than smaller tracts. The most unstable tracts, of 80 to 90 repeats, undergo changes at rates as high as 3 x 10(-2) changes per cell per generation. To examine whether repeat orientation or adjacent sequences alter repeat stability, we constructed strains with repeat tracts in both orientations, either with or without sequences 5' to ADE2 harboring an autonomously replicating sequence (ARS; replication origin). When CAG is in the ADE2 coding strand of strains harboring the ARS, the repeat tract is relatively stable regardless of the orientation of ADE2. When CTG is in the ADE2 coding strand of strains harboring the ARS, the repeat tract is relatively unstable regardless of the orientation of ADE2. Removal of the ARS as well as other sequences adjacent to the 5' end of ADE2 alters the orientation dependence such that stability now depends on the orientation of ADE2 in the chromosome. These results suggest that the proximity of an ARS or another sequence has a profound effect on repeat stability.  相似文献   

18.
19.
DNA rearrangement (recombination) mediated by direct repeats is a major cause of genome instability. InEscherichia coli, direct repeats in close proximity can mediate efficientrecA-independent intramolecular recombi nation, which produces multiple products. Using plasmid substrates, three basic forms of products have been revealed: the monomeric deletion product and two dimeric products. The frequency of recombination has been shown to be affected by structural factors such as the length of the repeat and the distance between the repeats. We show here that these factors also affect the relative abundance of each form of product. Recombination between very short tandem repeats yields exclusively the monomeric product. Lengthening the repeats increases the abundance of the dimeric products. Increasing the distance separating the repeats sharply reduces the formation of the monomeric product. These results can be explained by a model for DNA rearrangement (recombination) involving DNA replication. We propose that misalignment of the repeats at the replication fork creates a recombinogenic intermediate that can be differentially processed to form the three basic products. The proposed sister-strand recombination mediated by direct repeats might be a general mechanism for deletion and/or amplification of repeated sequences in both prokaryotic and eukaryotic genomes.  相似文献   

20.
At least nine dominant neurodegenerative diseases are caused by expansion of CAG repeats in coding regions of specific genes that result in abnormal elongation of polyglutamine (polyQ) tracts in the corresponding gene products. When above a threshold that is specific for each disease the expanded polyQ repeats promote protein aggregation, misfolding and neuronal cell death. The length of the polyQ tract inversely correlates with the age at disease onset. It has been observed that interruption of the CAG tract by silent (CAA) or missense (CAT) mutations may strongly modulate the effect of the expansion and delay the onset age. We have carried out an extensive study in which we have complemented DNA sequence determination with cellular and biophysical models. By sequencing cloned normal and expanded SCA1 alleles taken from our cohort of ataxia patients we have determined sequence variations not detected by allele sizing and observed for the first time that repeat instability can occur even in the presence of CAG interruptions. We show that histidine interrupted pathogenic alleles occur with relatively high frequency (11%) and that the age at onset inversely correlates linearly with the longer uninterrupted CAG stretch. This could be reproduced in a cellular model to support the hypothesis of a linear behaviour of polyQ. We clarified by in vitro studies the mechanism by which polyQ interruption slows down aggregation. Our study contributes to the understanding of the role of polyQ interruption in the SCA1 phenotype with regards to age at disease onset, prognosis and transmission.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号