首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
MOTIVATION: Tandem repeats (TRs) are associated with human disease, play a role in evolution and are important in regulatory processes. Despite their importance, locating and characterizing these patterns within anonymous DNA sequences remains a challenge. In part, the difficulty is due to imperfect conservation of patterns and complex pattern structures. We study recognition algorithms for two complex pattern structures: variable length tandem repeats (VLTRs) and multi-period tandem repeats (MPTRs). RESULTS: We extend previous algorithmic research to a class of regular tandem repeats (RegTRs). We formally define RegTRs, as well as two important subclasses: VLTRs and MPTRs. We present algorithms for identification of TRs in these classes. Furthermore, our algorithms identify degenerate VLTRs and MPTRs: repeats containing substitutions, insertions and deletions. To illustrate our work, we present results of our analysis for two difficult regions in cattle and human data which reflect practical occurrences of these subclasses in GenBank sequence data. In addition, we show the applicability of our algorithmic techniques for identifying Alu sequences, gene clusters and other distant regions of similarity. We illustrate this with an example from yeast chromosome I.  相似文献   

2.
Xu J  Fonseca DM 《Mitochondrial DNA》2011,22(5-6):155-158
Repetitive DNA sequences not only exist abundantly in eukaryotic nuclear genomes, but also occur as tandem repeats in many animal mitochondrial DNA (mtDNA) control regions. Due to concerted evolution, these repetitive sequences are highly similar or even identical within a genome. When long repetitive regions are the targets of amplification for the purpose of sequencing, multiple amplicons may result if one primer has to be located inside the repeats. Here, we show that, without separating these amplicons by gel purification or cloning, directly sequencing the mitochondrial repeats with the primer outside repetitive region is feasible and efficient. We exemplify it by sequencing the mtDNA control region of the mosquito Aedes albopictus, which harbors typical large tandem DNA repeats. This one-way sequencing strategy is optimal for population surveys.  相似文献   

3.
Microsatellites (SSR--simple sequence repeats, STR--short tandem repeats, SSLP--simple sequence length polymorphism, VNTR--variable number of tandem repeats) are the class of repetitive DNA sequences present in all living organisms. Particular characteristics of microsatellites, such as their presence in the genomes of all living organisms, high level of allelic variation, co-dominant mode of inheritance and potential for automated analysis make them an excellent tool for a number of approaches like genotyping, mapping and positional cloning of genes. The three most popular types of markers containing microsatellite sequences that are presently used are: (1) SSR (simple sequence repeats), generated by amplifying in a PCR reaction with the use of primers complementary to flanking regions; (2) ISSR (inter-simple sequence repeats), based on the amplification of regions between inversely oriented closely spaced microsatellites; and (3) SAMPL (selective amplification of microsatellite polymorphic loci), which utilises AFLP (amplified fragment-length polymorphism) methodology, with one exception--for the second amplification, one of the starters is complementary to the microsatellite sequence. The usefulness of the three above-mentioned markers for numerous purposes has been well documented for plants.  相似文献   

4.
The two regions of the Epstein-Barr virus genome (DSL and DSR) carrying homologous sequences at distant parts of the long unique region are described. Cleavage of cloned DNA containing the DSR region with restriction endonucleases revealed a so far unrecognized small tandem repeat of approximately 120 base pairs present in approximately 20 copies. Heteroduplexes of the DNA of two clones containing DSL and DSR respectively, visualized in the electron microscope by cytochrome c spreading, revealed that the region of homology is approximately 2.5 kb long, involves small tandem repeats, and has the same orientation in the viral genome. Mica adsorption of the heteroduplex showed, that the homologous region consists of approximately 1.5 kb with only partial homology including the small internal repeats and 0.9 kb with well-matched duplexes. When DNA containing the DSL region reanneals, it can give rise to two single-stranded loops of the same size at different positions suggesting the presence of a row of tandem repeats also in this region.  相似文献   

5.
Bacteriophage lambda clones containing Theileria parva genomic DNA derived from two different telomeres were isolated and the nucleotide sequences of the telomeric repeats and adjacent telomere-associated (TAS) DNA were determined. The T.parva telomeric repeat sequences, a tandem array of TTTTAGGG or TTTAGGG interspersed with a few variant copies, showed a high degree of sequence identity to those of the photosynthetic algae Chlamydomonas reinhardtii (97% identity) and Chlorella vulgaris (87.7% identity) and the angiosperm Arabidopsis thaliana (84.4% identity). Unlike most organisms which have been studied, no significant repetitive sequences were found in the nucleotide sequences of TAS DNA located centromere-proximal to the telomeric repeats. Restriction mapping and hybridisation analysis of lambda EMBL3 clones containing 16 kilobases of TAS DNA derived from one telomere suggested that they did not contain long regions of repetitive DNA. The cloned TAS DNAs were mapped to T.parva Muguga genomic SfiI fragments 8 and 20, which are located at opposite ends of the largest T.parva chromosome. A 126 bp sequence located directly centromere-proximal to the telomeric repeats was 94% identical between the two cloned telomeres. The conserved 126 bp sequence was present on all T.parva Muguga telomeric SfiI fragments.  相似文献   

6.
A plasmid (pRR983) was constructed which has a gene coding for neomycin and kanamycin resistance flanked by direct repeats of regions of homology which contain no known insertion sequences. pRR983 does not have any homologous IS1 sequences. Growth of Proteus mirabilis harboring pRR983 in medium containing high concentration of neomycin resulted in cells which were highly resistant to both neomycin and kanamycin. Plasmid DNA was analyzed by using restriction endonucleases. In most cases the neomycin resistance gene had been tandemly duplicated by using the homologous DNA sequences flanking the resistance gene as recombination sites. This is analogous to tandem duplication of drug resistance genes on NR1 using the two direct repeats of IS1 as recombination sites. The amplified plasmid DNA returned to its original structure by the deletion of amplified neomycin resistance determinants when the host cells were cultured without selection for high resistance to neomycin.  相似文献   

7.
BAC2, a rice BAC clone containing (TTTAGGG)n homologous sequences, was analyzed by Southern hybridization and DNA sequencing of its subclones. It was disclosed that there were many tandem repeated satellite DNA sequences, called TA352, as well as simple tandem repeats consisting of TTTAGGG or its variant within the BAC2 insert. A 0. 8 kb (TTTAGGG) n-containing fragment in BAC2 was mapped in the telomere regions of at least 5 pairs of rice chromosomes by using fluorescence in situ hybridization (FISH). By RFLP analysis of low copy sequences the BAC2 clone was localized in one terminal region of chromosome 6. All the results strongly suggest that the telomeric DNA sequences of rice are TTTAGGG or its variant, and the linked satellite DNA TA352 sequences belong to telomere-associated sequences.  相似文献   

8.
A total of 36 clones were randomly selected from a recombinant DNA library of small polydisperse circular DNA (spcDNA) molecules from HeLa cells and were shown to contain repetitive sequences of different reiteration frequencies that ranged from several hundred to several hundred thousand per genome. Sequencing of representative clones revealed tandem repeats of alphoid (alpha) satellite DNA, clustered repeats of the Alu family, KpnI family sequences, tandem repeats of an alpha satellite DNA specific to the X chromosome (alpha X), and A + T-rich segments carrying short stretches of poly(A) or poly(T). DNA rearrangement was frequently found in the repetitive sequences enriched in these spcDNA clones. Short regions of homology that were patchy and inverted were often found, especially at the novel joint where spcDNA sequences are circularized. The presence of these inverted repeats suggests that HeLa spcDNAs are formed by a mechanism that involves looping out of the spcDNA region and joining of the flanking DNA by illegitimate recombination.  相似文献   

9.
Zhang D  Yang Q  Ding Y  Cao X  Xue Y  Cheng Z 《Genomics》2008,92(2):107-114
Tandem repetitive sequences are DNA motifs common in the genomes of eukaryotic species and are often embedded in heterochromatic regions. In most eukaryotes, ribosomal genes, as well as centromeres and telomeres or subtelomeres, are associated with abundant tandem arrays of repetitive sequences and typically represent the final barriers to completion of whole-genome sequencing. The nature of these repeats makes it difficult to estimate their actual sizes. In this study, combining the two cytological techniques DNA fiber-FISH and pachytene chromosome FISH allowed us to characterize the tandem repeats distributed genome wide in Antirrhinum majus and identify four types of tandem repeats, 45S rDNA, 5S rDNA, CentA1, and CentA2, representing the major tandem repetitive components, which were estimated to have a total length of 18.50 Mb and account for 3.59% of the A. majus genome. FISH examination revealed that all the tandem repeats correspond to heterochromatic knobs along the pachytene chromosomes. Moreover, the methylation status of the tandem repeats was investigated in both somatic cells and pollen mother cells from anther tissues using an antibody against 5-methylcytosine combined with sequential FISH analyses. Our results showed that these repeats were hypomethylated in anther tissues, especially in the pollen mother cells at pachytene stage.  相似文献   

10.
Phase variation of lipopolysaccharide epitopes of an Haemophilus influenzae serotype b strain (strain RM.7004) occurs through a mechanism which depends on multiple tandem repeats of the DNA sequence 5'-CAAT-3' situated within the chromosomal locus lic1. We report here that the same tetranucleotide repeats are also found in two other genomic loci (lic2 and lic3) of RM.7004. Similar to lic1, there are multiple tandem repeats of 5'-CAAT-3' present at the 5' ends of long open reading frames in lic2 and lic3. Variation in the number of repeats of CAAT, by shifting the upstream initiation codons in or out of phase with the remainder of the open reading frame, could switch on or off the translation of downstream genes. Similar to previously reported findings for lic1, site-directed mutations in the open reading frame downstream (3') from the repeats of CAAT in lic2 abolished phase variation and identified DNA sequences required for the expression of additional oligosaccharide epitopes. When we used an oligonucleotide comprising five repeats of CAAT or DNA sequences specific for lic1, lic2, and lic3 as probes, a survey of other encapsulated H. influenzae strains (serotypes a through f) and nontypable H. influenzae strains (including biotype aegyptius) showed that the chromosome of H. influenzae can have from two to five regions which contain multiple tandem repeats of CAAT in addition to other sequences which hybridize to lic1 and lic2.  相似文献   

11.
12.
BAC2, a rice BAC clone containing (TTTAGGG)n homologous sequences, was analyzed by Southern hybridization and DNA sequencing of its subclones. It was disclosed that there were many tandem repeated satellite DNA sequences, called TA352, as well as simple tandem repeats consisting of TTTAGGG or its variant within the BAC2 insert. A 0. 8 kb (TTTAGGG)n-containing fragment in BAC2 was mapped in the telomere regions of at least 5 pairs of rice chromosomes by using fluorescencein situ hybridization (FISH). By RFLP analysis of low copy sequences the BAC2 clone was localized in one terminal region of chromosome 6. All the results strongly suggest that the telomeric DNA sequences of rice are TTTAGGG or its variant, and the linked satellite DNA TA352 sequences belong to telomere-associated sequences.  相似文献   

13.
BAC2, a rice BAC clone containing (TTTAGGG)n homologous sequences, was analyzed by Southern hybridization and DNA sequencing of its subclones. It was disclosed that there were many tandem repeated satellite DNA sequences, called TA352, as well as simple tandem repeats consisting of TTTAGGG or its variant within the BAC2 insert. A 0. 8 kb (TTTAGGG)n-containing fragment in BAC2 was mapped in the telomere regions of at least 5 pairs of rice chromosomes by using fluorescencein situ hybridization (FISH). By RFLP analysis of low copy sequences the BAC2 clone was localized in one terminal region of chromosome 6. All the results strongly suggest that the telomeric DNA sequences of rice are TTTAGGG or its variant, and the linked satellite DNA TA352 sequences belong to telomere-associated sequences.  相似文献   

14.
15.
A new rice repetitive DNA shows sequence homology to both 5S RNA and tRNA.   总被引:2,自引:0,他引:2  
T Y Wu  R Wu 《Nucleic acids research》1987,15(15):5913-5923
Moderately repetitive DNA sequences are found in the genomes of all eucaryotes that have been examined. We now report the discovery of a novel, transcribed, moderately repetitive DNA sequence in a higher plant which is different from any of the known repetitive DNA sequences from any organism. We isolated a rice cDNA clone which hybridizes to multiple bands on genomic blot analysis. The sequence of this 352 bp cDNA contains four regions of homology to the wheat phenylalanine tRNA, including the polymerase III-type promoter. Unexpectedly, two regions of the same 352 bp sequence also show homology to the wheat 5S RNA sequence. Using the cDNA as a probe, we have isolated six genomic clones which contain long tandem repeats of 355 bp sequence, and have sequenced nine repeat units. Our findings suggest that the rice repetitive sequence may be an amplified pseudogene with sequence homology to both 5S RNA and tRNA, but organized as long tandem repeats resembling 5S RNA genes. This is the first example showing homology between the sequences of a moderately repetitive DNA with unknown function and 5S RNA.  相似文献   

16.
Tandem repeats of single short peptide sequences are useful for many purposes. Here we describe a method called ligation-PCR to construct DNA sequences encoding numerous tandem peptide repeats that can stably produce such repeats in both prokaryotic and eukaryotic cells. The method employs double-strand target monomers consisting of a short peptide coding sequences. These sequences contain 3-bp cohesive overhangs to ensure correct repeat orientation and reading frame during ligation. The ligation products are PCR amplified and directly cloned into a new TA-cloning vector, pZeroT. Constructs containing tandem 10-amino-acid myc-tag peptide coding sequence repeats that ranged from approximately 0.45–1.2 kb, representing 15–40 copies of the corresponding peptide, were successfully obtained by this method. When one of the constructs was subcloned into prokaryotic vector pET-28 c (+) and eukaryotic vector rGHpcDNA3.0, and introduced into E. Coli and COS-7 cells, respectively, proteins containing tandem myc-tag peptide repeats were expressed with expected molecular weights. Purified proteins from E. Coli could successfully stimulate a peptide specific immune response. This method provides a means to manipulate peptides at the nucleic acid level, and can serve as the basis for biological peptide synthesis, epitope-specific antibody production, and epitope-based DNA vaccine construction.  相似文献   

17.
Chromosome breakage in germline and somatic genomes gives rise to copy number variation (CNV) responsible for genomic disorders and tumorigenesis. DNA sequence is known to play an important role in breakage at chromosome fragile sites; however, the sequences susceptible to double-strand breaks (DSBs) underlying CNV formation are largely unknown. Here we analyze 140 germline CNV breakpoints from 116 individuals to identify DNA sequences enriched at breakpoint loci compared to 2800 simulated control regions. We find that, overall, CNV breakpoints are enriched in tandem repeats and sequences predicted to form G-quadruplexes. G-rich repeats are overrepresented at terminal deletion breakpoints, which may be important for the addition of a new telomere. Interstitial deletions and duplication breakpoints are enriched in Alu repeats that in some cases mediate non-allelic homologous recombination (NAHR) between the two sides of the rearrangement. CNV breakpoints are enriched in certain classes of repeats that may play a role in DNA secondary structure, DSB susceptibility and/or DNA replication errors.  相似文献   

18.
19.
The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of individual maize chromosomes via the isolation and characterization of chromosome-specific cosmid clones. Restriction fragment fingerprinting, sequencing, and in situ hybridization were applied to discover a new family of knob associated tandem repeats, the TR1, which are capable of forming fold-back DNA segments, as well as a new family of centromeric tandem repeats, CentC. Analysis of knob and centromeric DNA segments revealed a complex organization in which blocks of tandemly arranged repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration/association of certain retrotransposable elements into knobs or centromere regions as well as for integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. DNA hybridization to a blot panel of eight individual maize chromosome addition lines revealed that CentC, TR1, and 180-bp tandem repeats are found in each of these maize chromosomes, but the copy number of each can vary significantly from about 100 to 25,000. In situ hybridization revealed variation among the maize chromosomes in the size of centromeric tandem repeats as well as in the size and composition of knob regions. It was found that knobs may be composed of either 180-bp or TR1, or both repeats, and in addition to large knobs these repeated elements may form micro clusters which are detectable only with the help of in situ hybridization. The association of the fold-back elements with knobs, knob polymorphism and complex structure suggest that maize knob may be consider as megatransposable elements. The discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes.  相似文献   

20.
Molecular evolutionary processes modify DNA over time, creating both newly derived substitutions shared by related descendant lineages (phylogenetic signal) and “false” similarities which confound phylogenetic reconstruction (homoplasy). However, some types of DNA regions, for example those containing tandem duplicate repeats, are preferentially subject to homoplasy-inducing processes such as sporadically occurring concerted evolution and DNA insertion/deletion. This added level of homoplasic “noise” can make DNA regions with repeats less reliable in phylogenetic reconstruction than those without repeats. Most molecular datasets which distinguish among African hominoids support a human-chimpanzee clade; the most notable exception is from the involucrin gene. However, phylogenetic resolution supporting a chimpanzee-gorilla clade is based entirely on involucrin DNA repeat regions. This is problematic because (1) involucrin repeats are difficult to align, and published alignments are contradictory; (2) involucrin repeats are subject to DNA insertion/deletion; (3) gorillas are polymorphic in that some do not have repeats reported to be synapomorphies linking chimpanzees and gorillas. Gene tree/species tree conflicts can occur due to the sorting of ancestrally polymorphic alleles during speciation. Because hominoid females transfer between groups, mitochondrial and nuclear gene flow occur to the same extent, and the probability of conflict between mitochondrial and nuclear gene trees is theoretically low. When hominoid intraspecific mitochondrial variability is taken into account [based on cytochrome oxidase subunit II (COII) gene sequences], humans and chimpanzees are most closely related, showing the same relative degree of separation from gorillas as when single individuals representing species are analyzed. Conflicting molecular phylogenies can be explained in terms of molecular evolutionary processes and sorting of ancient polymorphisms. This perspective can enhance our understanding of hominoid molecular phylogenies. © 1994 Wiley-Liss, Inc.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号