首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 343 毫秒
1.
B. Liefshitz  A. Parket  R. Maya    M. Kupiec 《Genetics》1995,140(4):1199-1211
The presence of repeated sequences in the genome represents a potential source of karyotypic instability. Genetic control of recombination is thus important to preserve the integrity of the genome. To investigate the genetic control of recombination between repeated sequences, we have created a series of isogenic strains in which we could assess the role of genes involved in DNA repair in two types of recombination: direct repeat recombination and ectopic gene conversion. Naturally occurring (Ty elements) and artificially constructed repeats could be compared in the same cell population. We have found that direct repeat recombination and gene conversion have different genetic requirements. The role of the RAD51, RAD52, RAD54, RAD55, and RAD57 genes, which are involved in recombinational repair, was investigated. Based on the phenotypes of single and double mutants, these genes can be divided into three functional subgroups: one composed of RAD52, a second one composed of RAD51 and RAD54, and a third one that includes the RAD55 and RAD57 genes. Among seven genes involved in excision repair tested, only RAD1 and RAD10 played a role in the types of recombination studied. We did not detect a differential effect of any rad mutation on Ty elements as compared to artificially constructed repeats.  相似文献   

2.
Tandem repeats are common in eukaryotic genomes, but due to difficulties in assaying them remain poorly studied. Here, we demonstrate the utility of Nanostring technology as a targeted approach to perform accurate measurement of tandem repeats even at extremely high copy number, and apply this technology to genotype 165 HapMap samples from three different populations and five species of non-human primates. We observed extreme variability in copy number of tandemly repeated genes, with many loci showing 5–10 fold variation in copy number among humans. Many of these loci show hallmarks of genome assembly errors, and the true copy number of many large tandem repeats is significantly under-represented even in the high quality ‘finished’ human reference assembly. Importantly, we demonstrate that most large tandem repeat variations are not tagged by nearby SNPs, and are therefore essentially invisible to SNP-based GWAS approaches. Using association analysis we identify many cis correlations of large tandem repeat variants with nearby gene expression and DNA methylation levels, indicating that variations of tandem repeat length are associated with functional effects on the local genomic environment. This includes an example where expansion of a macrosatellite repeat is associated with increased DNA methylation and suppression of nearby gene expression, suggesting a mechanism termed “repeat induced gene silencing”, which has previously been observed only in transgenic organisms. We also observed multiple signatures consistent with altered selective pressures at tandemly repeated loci, suggesting important biological functions. Our studies show that tandemly repeated loci represent a highly variable fraction of the genome that have been systematically ignored by most previous studies, copy number variation of which can exert functionally significant effects. We suggest that future studies of tandem repeat loci will lead to many novel insights into their role in modulating both genomic and phenotypic diversity.  相似文献   

3.
Gene conversion is a recombinatorial mechanism which transfers genetic information from a donor into a recipient gene. A case of gene conversion between immunoglobulin VH region genes was analysed and palindromic sequences were found to be located near to the left recombinatorial breakpoint, which also is flanked by a direct repeat sequence. We performed a computer search for palindromes and direct repeats in the published sequences of eucaryotic genes which had been involved in gene conversion. In these sequences, the palindrome with the best or second best quality is located near to a breakpoint of recombination. A correlation of recombination breakpoints with direct repeats was not observed. This suggests that gene conversion is promoted by palindromic sequences.  相似文献   

4.
Analysis of evolution of paralogous genes in a genome is central to our understanding of genome evolution. Comparison of closely related bacterial genomes, which has provided clues as to how genome sequences evolve under natural conditions, would help in such an analysis. With species Staphylococcus aureus, whole-genome sequences have been decoded for seven strains. We compared their DNA sequences to detect large genome polymorphisms and to deduce mechanisms of genome rearrangements that have formed each of them. We first compared strains N315 and Mu50, which make one of the most closely related strain pairs, at the single-nucleotide resolution to catalogue all the middle-sized (more than 10 bp) to large genome polymorphisms such as indels and substitutions. These polymorphisms include two paralogous gene sets, one in a tandem paralogue gene cluster for toxins in a genomic island and the other in a ribosomal RNA operon. We also focused on two other tandem paralogue gene clusters and type I restriction-modification (RM) genes on the genomic islands. Then we reconstructed rearrangement events responsible for these polymorphisms, in the paralogous genes and the others, with reference to the other five genomes. For the tandem paralogue gene clusters, we were able to infer sequences for homologous recombination generating the change in the repeat number. These sequences were conserved among the repeated paralogous units likely because of their functional importance. The sequence specificity (S) subunit of type I RM systems showed recombination, likely at the homology of a conserved region, between the two variable regions for sequence specificity. We also noticed novel alleles in the ribosomal RNA operons and suggested a role for illegitimate recombination in their formation. These results revealed importance of recombination involving long conserved sequence in the evolution of paralogous genes in the genome.  相似文献   

5.
MOTIVATION: Tandem repeats are associated with disease genes, play an important role in evolution and are important in genomic organization and function. Although much research has been done on short perfect patterns of repeats, there has been less focus on imperfect repeats. Thus, there is an acute need for a tandem repeats database that provides reliable and up to date information on both perfect and imperfect tandem repeats in the human genome and relates these to disease genes. RESULTS: This paper presents a web-accessible relational tandem repeats database that relates tandem repeats to gene locations and disease genes of the human genome. In contrast to other available databases, this database identifies both perfect and imperfect repeats of 1-2000 bp unit lengths. The utility of this database has been illustrated by analysing these repeats for their distribution and frequencies across chromosomes and genomic locations and between protein-coding and non-coding regions. The applicability of this database to identify diseases associated with previously uncharacterized tandem repeats is demonstrated.  相似文献   

6.
Immunoglobulin heavy-chain switching is effected by recombination events between sites associated with tandemly repeated switch sequences located 5' to immunoglobulin heavy-chain genes. Using the band mobility shift assay, we have identified two distinct sites 5' to the alpha heavy-chain switch sequence with affinity for a single B-cell-specific DNA-binding protein, S alpha-BP. S alpha-BP was present in nuclear extracts from pre-B and B cells but was not detected in extracts from plasmacytomas, B-cell hybridomas, T-cell lymphomas, or a macrophage cell line. It was also not detectable in other nonlymphoid cells tested. Evidence suggests there are S alpha-BP-binding sites near other immunoglobulin switch sequences. As with the S alpha sites, these sites appear to be distinct from the consensus tandem repeats characteristic of immunoglobulin switch sequences. The possible functions of S alpha-BP on contacting its binding sites are discussed in the context of immunoglobulin heavy-chain switch recombination.  相似文献   

7.
DNA rearrangement (recombination) mediated by direct repeats is a major cause of genome instability. InEscherichia coli, direct repeats in close proximity can mediate efficientrecA-independent intramolecular recombi nation, which produces multiple products. Using plasmid substrates, three basic forms of products have been revealed: the monomeric deletion product and two dimeric products. The frequency of recombination has been shown to be affected by structural factors such as the length of the repeat and the distance between the repeats. We show here that these factors also affect the relative abundance of each form of product. Recombination between very short tandem repeats yields exclusively the monomeric product. Lengthening the repeats increases the abundance of the dimeric products. Increasing the distance separating the repeats sharply reduces the formation of the monomeric product. These results can be explained by a model for DNA rearrangement (recombination) involving DNA replication. We propose that misalignment of the repeats at the replication fork creates a recombinogenic intermediate that can be differentially processed to form the three basic products. The proposed sister-strand recombination mediated by direct repeats might be a general mechanism for deletion and/or amplification of repeated sequences in both prokaryotic and eukaryotic genomes.  相似文献   

8.
The pangenomic diversity in Burkholderia pseudomallei is high, with approximately 5.8% of the genome consisting of genomic islands. Genomic islands are known hotspots for recombination driven primarily by site-specific recombination associated with tRNAs. However, recombination rates in other portions of the genome are also high, a feature we expected to disrupt gene order. We analyzed the pangenome of 37 isolates of B. pseudomallei and demonstrate that the pangenome is ‘open’, with approximately 136 new genes identified with each new genome sequenced, and that the global core genome consists of 4568±16 homologs. Genes associated with metabolism were statistically overrepresented in the core genome, and genes associated with mobile elements, disease, and motility were primarily associated with accessory portions of the pangenome. The frequency distribution of genes present in between 1 and 37 of the genomes analyzed matches well with a model of genome evolution in which 96% of the genome has very low recombination rates but 4% of the genome recombines readily. Using homologous genes among pairs of genomes, we found that gene order was highly conserved among strains, despite the high recombination rates previously observed. High rates of gene transfer and recombination are incompatible with retaining gene order unless these processes are either highly localized to specific sites within the genome, or are characterized by symmetrical gene gain and loss. Our results demonstrate that both processes occur: localized recombination introduces many new genes at relatively few sites, and recombination throughout the genome generates the novel multi-locus sequence types previously observed while preserving gene order.  相似文献   

9.
10.
J P Murnane  B R Young 《Gene》1989,84(1):201-205
Characterization of human cell clones containing a promoterless selectable gene (neo), integrated at various locations in the genome, demonstrated that one of the integration sites had a high rate of spontaneous tandem duplications. Other investigators have suggested that specific sequences, such as short repeats, found near an integration site, could be responsible for this kind of instability. To learn more about this process, we sequenced the DNA at the recombination site in two independently derived subclones, and compared these sequences with those found in the parental cell DNA. The results demonstrate that specific sequences are not required at the recombination site. In one G418-resistant subclone, recombination occurred between an Alu retroposon in the cellular DNA and integrated pBR322 sequences sharing 3 bp of similarity at the recombination site. In the other subclone, recombination occurred between single-copy cellular DNA and integrated simian virus 40 sequences sharing a single bp of similarity at the recombination site. This heterogeneity at the recombination site indicates a general enhancement of the rate of recombination within the entire region, with little if any sequence specificity or similarity required.  相似文献   

11.
Sublimons are substoichiometric DNA molecules which are generated by recombinations across short repeats, located in main mitochondrial genome of plants. Since short repeats are believed to recombine irreversibly and to be usually inactive, it is unknown how substoichiometric sequences are maintained. Occasionally, sublimons are amplified during substoichiometric shifting (SSS) and take the role of the main genome. Using the Phaseolus vulgaris system, we have addressed the questions concerning accumulation of sublimons, the role of recombination in their maintenance and selective amplification during SSS. We found that sublimons accompanied by parental recombination sequences were maintained by constant recombination across a short 314-bp repeat. The abundance of these sublimons was three orders of magnitude higher than accumulation of those which could not be maintained by recombination because their parental forms were absent from the main genome. As expected for active recombination, two recombination-derived sublimons were equimolar and so were their parental forms. One parental and one substoichiometric form shared the A/C polymorphism indicating their frequent inter-conversion. Only the C variant of the sublimon was amplified during substoichiometric shift implying strong selection of DNA molecules operating during SSS.  相似文献   

12.
Small heat-shock proteins (sHSP) are the most abundant heat stress-induced proteins in plants. In rice, there are at least seven members of class-I sHSP. A 1.6-kb DNA fragment was isolated from the EcoRI-digested rice genomic library probed with the cDNA pTS1 encoding a 16.9-kDa class-I sHSP. This fragment was composed of 365-bp tandem direct repeats (DRs) and 441-bp near perfect long terminal inverted repeats (LTIRs). The DRs contain 123-bp regions with 99% nucleotide identity to the 5' coding region of the Oshsp16.9B gene. Two putative pseudogenes were deduced from the DRs. Using the LTIR as a specific probe, Southern-blotting analysis showed that there was a single copy of this 1.6-kb DNA fragment in the rice genome. By genomic walking, we located this fragment in proximity 5'-upstream of the Oshsp16.9B gene that was mapped on chromosome 1 with other two class-I sHSP genes, Oshsp16.9A and Oshsp16.9C. By comparative analysis of the nucleotide sequences of class-I sHSP genes clustered on chromosome 1 between Tainung No. 67 and Nipponbare cultivars, we confirmed our mapping results of these genes and only the promoter region of Oshsp16.9B was different. However, we found that the expression profile of Oshsp16.9B upon different heat stresses in Nipponbare was not significantly different relative to that in Tainung No. 67.  相似文献   

13.
Gene-containing regions of wheat and the other grass genomes   总被引:18,自引:0,他引:18  
Sandhu D  Gill KS 《Plant physiology》2002,128(3):803-811
Deletion line-based high-density physical maps revealed that the wheat (Triticum aestivum) genome is partitioned into gene-rich and -poor compartments. Available deletion lines have bracketed the gene-containing regions to about 10% of the genome. Emerging sequence data suggest that these may further be partitioned into "mini" gene-rich and gene-poor regions. An average of about 10% of each gene-rich region seem to contain genes. Sequence analyses in various species suggest that uneven distribution of genes may be a characteristic of all grasses and perhaps all higher organisms. Comparison of the physical maps with genetic linkage maps showed that recombination in wheat and barley (Hordeum vulgare) is confined to the gene-containing regions. Number of genes, gene density, and the extent of recombination vary greatly among the gene-rich regions. The gene order, relative region size, and recombination are highly conserved within the tribe Triticeae and moderately conserved within the family. Gene-poor regions are composed of retrotransposon-like non-transcribing repeats and pseudogenes. Direct comparisons of orthologous regions indicated that gene density in wheat is about one-half compared with rice (Oryza sativa). Genome size difference between wheat and rice is, therefore, mainly because of amplification of the gene-poor regions. Presence of species-, genera-, and family-specific repeats reveal a repeated invasion of the genomes by different retrotransposons over time. Preferential transposition to adjacent locations and presence of vital genes flanking a gene-rich region may have restricted retrotransposon amplification to gene-poor regions, resulting into tandem blocks of non-transcribing repeats. Insertional inactivation by adjoining retro-elements and selection seem to have played a major role in stabilizing genomes.  相似文献   

14.
Herpes simplex virus type 1 (HSV-1) is a ubiquitous human pathogen which establishes lifelong infections. In the present study, we determined the sequence diversity of the complete genes coding for glycoproteins G (gG), I (gI), and E (gE), comprising 2.3% of the HSV-1 genome and located within the unique short (US) region, for 28 clinical HSV-1 isolates inducing oral lesions, genital lesions, or encephalitis. Laboratory strains F and KOS321 were sequenced in parallel. Phylogenetic analysis, including analysis of laboratory strain 17 (GenBank), revealed that the sequences were separated into three genetic groups. The identification of different genogroups facilitated the detection of recombinant viruses by using specific nucleotide substitutions as recombination markers. Seven of the isolates and strain 17 displayed sequences consistent with intergenic recombination, and at least four isolates were intragenic recombinants. The observed frequency of recombination based on an analysis of a short stretch of the US region suggests that most full-length HSV-1 genomes consist of a mosaic of segments from different genetic groups. Polymorphic tandem repeat regions, consisting of two to eight blocks of 21 nucleotides in the gI gene and seven to eight repeats of 3 nucleotides in the gG gene, were also detected. Laboratory strain KOS321 displayed a frameshift mutation in the gI gene with a subsequent alteration of the deduced intracellular portion of the protein. The presence of polymorphic tandem repeat regions and the different genogroup identities can be used for molecular epidemiology studies and for further detection of recombination in the HSV-1 genome.  相似文献   

15.
The a sequences of herpes simplex virus type 1 are believed to be the cis sites for inversion events that generate four isomeric forms of the viral genome. Using an assay that measures deletion of a beta-galactosidase gene positioned between two directly repeated sequences in plasmids transiently maintained in Vero cells, we had found that the a sequence is more recombinogenic than another sequence of similar size. To investigate the basis for the enhanced recombination mediated by the a sequence, we examined plasmids containing direct repeats of approximately 350 bp from a variety of sources and with a wide range of G+C content. We observed that all of these plasmids show similar recombination frequencies (3 to 4%) in herpes simplex virus type 1-infected cells. However, recombination between directly repeated a sequences occurs at twice this frequency (6 to 10%). In addition, we find that insertion of a cleavage site for an a-sequence-specific endonuclease into the repeated sequences does not appreciably increase the frequency of recombination, indicating that the presence of endonuclease cleavage sites within the a sequence does not account for its recombinogenicity. Finally, by replacing segments of the a sequence with DNA fragments of similar length, we have determined that only the 95-bp Uc-DR1 segment is indispensable for high-level a-sequence-mediated recombination.  相似文献   

16.
Activation-induced cytidine deaminase (AID) is required for initiation of Ig class switch recombination (CSR) and somatic hypermutation (SHM) of antibody genes during immune responses. AID has also been shown to induce chromosomal translocations, mutations, and DNA double-strand breaks (DSBs) involving non-Ig genes in activated B cells. To determine what makes a DNA site a target for AID-induced DSBs, we identify off-target DSBs induced by AID by performing chromatin immunoprecipitation (ChIP) for Nbs1, a protein that binds DSBs, followed by deep sequencing (ChIP-Seq). We detect and characterize hundreds of off-target AID-dependent DSBs. Two types of tandem repeats are highly enriched within the Nbs1-binding sites: long CA repeats, which can form Z-DNA, and tandem pentamers containing the AID target hotspot WGCW. These tandem repeats are not nearly as enriched at AID-independent DSBs, which we also identified. Msh2, a component of the mismatch repair pathway and important for genome stability, increases off-target DSBs, similar to its effect on Ig switch region DSBs, which are required intermediates during CSR. Most of the off-target DSBs are two-ended, consistent with generation during G1 phase, similar to DSBs in Ig switch regions. However, a minority are one-ended, presumably due to conversion of single-strand breaks to DSBs during replication. One-ended DSBs are repaired by processes involving homologous recombination, including break-induced replication repair, which can lead to genome instability. Off-target DSBs, especially those present during S phase, can lead to chromosomal translocations, deletions and gene amplifications, resulting in the high frequency of B cell lymphomas derived from cells that express or have expressed AID.  相似文献   

17.
It has previously been shown that recombination between tandem repeats is not significantly affected by a recA mutation in Escherichia coli . Here, we describe the activation of a RecA-dependent recombination pathway in a hyper-recombination mutant. In order to analyse how tandem repeat deletion may proceed, we searched for mutants that affect this process. Three hyper-recombination clones were characterized and shown to be mutated in the uvrD gene. Two of the mutations were identified as opal mutations at codons 130 and 438. A uvrD  ::Tn 5 mutation was used to investigate the mechanism of deletion formation in these mutants. The uvrD -mediated stimulation of deletion was abolished by a lexAind3 mutation or by inactivation of either the recA , recF , recQ or ruvA genes. We conclude that (i) this stimulation requires SOS induction and (ii) tandem repeat recombination in uvrD mutants occurs via the RecF pathway. In uvrD + cells, constitutive expression of SOS genes is not sufficient to stimulate deletion formation. This suggests that the RecF recombination pathway activated by SOS induction is antagonized by the UvrD protein. Paradoxically, we observed that the overproduction of UvrD from a plasmid also stimulates tandem repeat deletion. However, this stimulation is RecA independent, as is deletion in a wild-type strain. We propose that the presence of an excess of the UvrD helicase favours replication slippage. This work suggests that the UvrD helicase controls a balance between different routes of tandem repeat deletion.  相似文献   

18.
The human genome has linkage disequilibrium (LD) blocks, within which single-nucleotide polymorphisms show strong association with each other. We examined data from the International HapMap Project to define LD blocks and to detect DNA sequence features inside of them. We used permutation tests to determine the empirical significance of the association of LD blocks with genes and Alu repeats. Very large LD blocks (>200 kb) have significantly higher gene coverage and Alu frequency than the outcome obtained from permutation-based simulation, whereas there was no significant positive correlation between gene density and block size. We also observed a reduced frequency of Alu repeats at the gaps between large LD blocks, indicating that their enrichment in large LD blocks does not introduce recombination hotspots that would cause these gaps.  相似文献   

19.

Background

Ancestral reconstructions of mammalian genomes have revealed that evolutionary breakpoint regions are clustered in regions that are more prone to break and reorganize. What is still unclear to evolutionary biologists is whether these regions are physically unstable due solely to sequence composition and/or genome organization, or do they represent genomic areas where the selection against breakpoints is minimal.

Methodology and Principal Findings

Here we present a comprehensive study of the distribution of tandem repeats in great apes. We analyzed the distribution of tandem repeats in relation to the localization of evolutionary breakpoint regions in the human, chimpanzee, orangutan and macaque genomes. We observed an accumulation of tandem repeats in the genomic regions implicated in chromosomal reorganizations. In the case of the human genome our analyses revealed that evolutionary breakpoint regions contained more base pairs implicated in tandem repeats compared to synteny blocks, being the AAAT motif the most frequently involved in evolutionary regions. We found that those AAAT repeats located in evolutionary regions were preferentially associated with Alu elements.

Significance

Our observations provide evidence for the role of tandem repeats in shaping mammalian genome architecture. We hypothesize that an accumulation of specific tandem repeats in evolutionary regions can promote genome instability by altering the state of the chromatin conformation or by promoting the insertion of transposable elements.  相似文献   

20.
Transposable elements (TEs) play a fundamental role in the evolution of genomes. In Drosophila they are disproportionately represented in regions of low recombination, such as in heterochromatin. This pattern has been attributed to selection against repeated elements in regions of normal recombination, owing to either (1) the slightly deleterious position effects of TE insertions near or into genes, or (2) strong selection against chromosomal abnormalities arising from ectopic exchange between TE repeats. We have used defective non-long-terminal repeat (LTR) TEs that are "dead-on-arrival" (DOA) and unable to transpose in order to estimate spontaneous deletion rates in different constituents of chromatin. These elements have previously provided evidence for an extremely high rate of spontaneous deletion in Drosophila as compared with mammals, potentially explaining at least part of the differences in the genome sizes in these organisms. However, rates of deletion could be overestimated due to positive selection for a smaller likelihood of ectopic exchange. In this article, we show that rates of spontaneous deletion in DOA repeats are as high in heterochromatin and regions of euchromatin with low recombination as they are in regions of euchromatin with normal recombination. We have also examined the age distribution of five non-LTR families throughout the genome. We show that there is substantial variation in the historical pattern of transposition of these TEs. The overrepresentation of TEs in the heterochromatin is primarily due to their longer retention time in heterochromatin, as evidenced by the average time since insertion. Fragments inserted recently are much more evenly distributed in the genome. This contrast demonstrates that the accumulation of TEs in heterochromatin and in euchromatic regions of low recombination is not due to biased transposition but by greater probabilities of fixation in these regions relative to regions of normal recombination.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号