首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We have sequenced the long terminal direct repeats (and adjacent DNA) of two members of the 412 family of transposable elements of Drosophila melanogaster cloned on fragments of DNA from strain Oregon R. The repeats of the first element are identical and 481 base-pairs long; the repeats of the second are also identical but are 571 base-pairs long. The first 482 base-pairs of the 571 base-pair sequence correspond to the 481 base-pair repeat differing by five base substitutions and one addition/deletion. The 571 base-pair repeats are rare. Each of these 412 elements is flanked by a four base-pair direct repeat, suggesting that insertion of a 412 element is associated with duplication of four base-pairs. Analysis of the “empty site” from strain Canton S corresponding to one of these elements supports this conclusion. The sequence of 481 base-pair repeats and of 412 DNA immediately adjacent to them show striking similarities to corresponding regions of vertebrate proviruses and we discuss the implications this may have for the mechanism of transposition.  相似文献   

2.
3.
Tandem repeats finder: a program to analyze DNA sequences.   总被引:66,自引:3,他引:63       下载免费PDF全文
A tandem repeat in DNA is two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats have been shown to cause human disease, may play a variety of regulatory and evolutionary roles and are important laboratory and analytic tools. Extensive knowledge about pattern size, copy number, mutational history, etc. for tandem repeats has been limited by the inability to easily detect them in genomic sequence data. In this paper, we present a new algorithm for finding tandem repeats which works without the need to specify either the pattern or pattern size. We model tandem repeats by percent identity and frequency of indels between adjacent pattern copies and use statistically based recognition criteria. We demonstrate the algorithm's speed and its ability to detect tandem repeats that have undergone extensive mutational change by analyzing four sequences: the human frataxin gene, the human beta T cellreceptor locus sequence and two yeast chromosomes. These sequences range in size from 3 kb up to 700 kb. A World Wide Web server interface atc3.biomath.mssm.edu/trf.html has been established for automated use of the program.  相似文献   

4.
5.
6.
G. S. Wilkinson  F. Mayer  G. Kerth    B. Petri 《Genetics》1997,146(3):1035-1048
Analysis of mitochondrial DNA control region sequences from 41 species of bats representing 11 families revealed that repeated sequence arrays near the tRNA-Pro gene are present in all vespertilionine bats. Across 18 species tandem repeats varied in size from 78 to 85 bp and contained two to nine repeats. Heteroplasmy ranged from 15% to 63%. Fewer repeats among heteroplasmic than homoplasmic individuals in a species with up to nine repeats indicates selection may act against long arrays. A lower limit of two repeats and more repeats among heteroplasmic than homoplasmic individuals in two species with few repeats suggests length mutations are biased. Significant regressions of heteroplasmy, θ and π, on repeat number further suggest that repeat duplication rate increases with repeat number. Comparison of vespertilionine bat consensus repeats to mammal control region sequences revealed that tandem repeats of similar size, sequence and number also occur in shrews, cats and bighorn sheep. The presence of two conserved protein-binding sequences in all repeat units indicates that convergent evolution has occurred by duplication of functional units. We speculate that D-loop region tandem repeats may provide signal redundancy and a primitive repair mechanism in the event of somatic mutations to these binding sites.  相似文献   

7.
DNA sequence data for a DNA repeated sequence, found largely in centromeres of specific human chromosomes is presented. The sequence consists of two tandem 169 and 171 base-pair units that show 27% base variation with each other. In contrast the dimer is more faithfully copied in longer tandem repeats, such as the sequenced 680 base-pair tetramer. In the major sequence of the tetramer, base variation of the order of only 1%, in comparison to the complete dimer is seen. A minimum of two steps in the formation of this sequence is proposed, consisting of evolution of a tandem dimer of two 170 base-pair variant units of a related family within the human genome, and later saltation or amplification of this dimer. No evidence that these sequences contained or evolved from a simpler 6 to 20 base-pair repeat was found, and no homology with known simpler human satellites could be discerned. In reviewing and comparing the literature on repeated DNAs it appears that overall length and tandem repetition are the critical features, rather than individual unit repeat length or secondary structural potential, in defining these sequences as a class and their special centromeric functions and higher chromosome order. The possibility that such sequences arise from a reservoir of interspersed sequences that are common to at least several species is discussed.  相似文献   

8.
A highly polymorphic locus associated with the variable tandem repetition of a 35 bp consensus sequence was mapped to chromosome 10, band q26. Examination of leukocyte DNA from a cancer patient revealed the twenty-fold amplification of one allelic fragment of this locus, while the other allelic fragment demonstrated a normal copy number. In another patient, Southern blotting of leukocyte DNA detected the deletion of the 3'-flanking region from one tandem repeat allele. These results indicate that variable tandem repeats may mark highly unstable regions of DNA in the human genome which can be altered by changes more extensive than simple tandem repeat variation.  相似文献   

9.
Polymers of random 14 mer oligonucleotides are shown to detect discrete loci in the human genome. Eighteen different synthetic tandem repeats of random 14 base-pair units (STRs) have been generated and all of them turn out to detect polymorphic loci on southern blots of human DNA samples, presumably corresponding to a variable number of tandem repeats (VNTR). This finding suggests that minisatellites are a major component of the human genome and are strongly associated with the generation of genetic variability. In addition, it should open new strategies to make new polymorphic probes available.  相似文献   

10.
White spot syndrome virus (WSSV) presently causes the most serious losses to shrimp farmers worldwide. Earlier reports of high DNA sequence homology among isolates from widely separated geographical regions suggested that a single virus was the cause. However, we have found surprisingly high variation in the number of 54 bp DNA repeats in ORF94 (GenBank AF369029) from 55 shrimp ponds (65 shrimp samples) experiencing WSSV outbreaks in Thailand in 2000 and 2002. These were detected by PCR amplification using primers ORF94-F and ORF94-R flanking the repeat region. Altogether, 12 different repeat groups were found (from 6 to 20 repeats) with 8 repeats being most frequent (about 32%). Extracts prepared from individual shrimp in the same outbreak pond belonged to the same repeat group while those collected at the same time from separate WSSV outbreak ponds, or from the same ponds at different times, usually belonged to different repeat groups. This suggested that different outbreaks were caused by different WSSV isolates. In contrast to the highly variable numbers of repeats, sequence variation within the repeat region was confined to either T or G at Position 36. These variations may be useful for epidemiological studies on the local and global movement of WSSV, since there is high variation in the number of repeats (good for local studies) but little sequence change (good for global studies).  相似文献   

11.
Summary In dipteran insects the most distal telomere-associated DNA known to exist consists of long, complex tandem repeats. We have classified the 340-bp tandemly arranged repeats in Chironomus pallidivittatus. The repeats are distributed in a small number of subfamilies. One type of the repeat has the character of a master unit from which other main units can be derived usually by simple changes. The derived subfamilies contain segments that are degenerate versions of the corresponding segment in the master sequence. Such segments can also occur together in one and the same repeat unit in different combinations. There is a complete absence of subfamily-specific base variants in regions lying outside of the degenerate segments. Homogenization takes place between DNA sequences that are often smaller than a whole repeat unit. The mosaic structure of the repeat arrays suggests that gene conversion is an important force in the generation and maintenance of this family of repeats.Offprint requests to: M. Cohn  相似文献   

12.
We have shown that the mRNAs for apopolysialoglycoproteins (apoPSGP) of rainbow trout contain various numbers of a repetitive sequence of 39 base-pairs encoding mature apoPSGP, and that this sequence is bordered by highly homologous 5' and 3' regions encoding pre-, pro- and telopeptides. These mRNAs are thought to be transcribed from different genes that constitute a large multiple gene family (more than 100 members). Here, we have determined the structures of several members of the apoPSGP gene family. The results show that two of three genomic DNA fragments contain two independent apoPSGP genes in the same orientation with unrelated sequences intervening. Five characterized genes have essentially the same organization and sequence. Each gene has four exons, and CAAT and TATA sequences were found in the 5'-flanking regions. However, two noteworthy differences were observed among the five genes; a diversity in the number of the 39 base-pair repeats, also observed among the cDNA clones, and a one-base polymorphism in the 39 base-pair repeat, which causes an amino acid change. This polymorphism was not detected among the cDNA clones obtained. The boundary positions of the genes are various and contain no transposon-like structures. The variation in the number of repeats and the absence of a rule for bordering positions of the genes suggest that apoPSGP genes may have been amplified by gene duplications, unequal recombination, and selection of chromosomes having larger numbers of apoPSGP genes.  相似文献   

13.
Interactions between the termini of adeno-associated virus DNA   总被引:10,自引:0,他引:10  
  相似文献   

14.
Hsu FC  Wang CJ  Chen CM  Hu HY  Chen CC 《Genetics》2003,164(3):1087-1097
Two families of tandem repeats, 180-bp and TR-1, have been found in the knobs of maize. In this study, we isolated 59 clones belonging to the TR-1 family from maize and teosinte. Southern hybridization and sequence analysis revealed that members of this family are composed of three basic sequences, A (67 bp); B (184 bp) or its variants B' (184 bp), 2/3B (115 bp), 2/3B' (115 bp); and C (108 bp), which are arranged in various combinations to produce repeat units that are multiples of approximately 180 bp. The molecular structure of TR-1 elements suggests that: (1) the B component may evolve from the 180-bp knob repeat as a result of mutations during evolution; (2) B' may originate from B through lateral amplification accompanied by base-pair changes; (3) C plus A may be a single sequence that is added to B and B', probably via nonhomologous recombination; and (4) 69 bp at the 3' end of B or B', and the entire sequence of C can be removed from the elements by an unknown mechanism. Sequence comparisons showed partial homologies between TR-1 elements and two centromeric sequences (B repeats) of the supernumerary B chromosome. This result, together with the finding of other investigators that the B repeat is also fragmentarily homologous to the 180-bp repeat, suggests that the B repeat is derived from knob repeats in A chromosomes, which subsequently become structurally modified. Fluorescence in situ hybridization localized the B repeat to the B centromere and the 180-bp and TR-1 repeats to the proximal heterochromatin knob on the B chromosome.  相似文献   

15.
A 1565-base segment of the antibiotic resistance plasmid R6K carries sufficient information to replicate as a plasmid in Escherichia coli. This segment contains a functional origin of replication and the structural gene pir for a protein, designated π, that is required for the initiation of R6K DNA replication. The nucleotide sequence of this 1565 base-pair replicon was determined. From the sequence and the analysis of proteins produced by minicells of E. coli strains carrying the wild-type pir gene and a deletion of the pir gene, it can be concluded that the π structural gene encodes for a protein of a molecular weight of approximately 35,000.On the basis of the nucleotide sequence, the π protein is the only protein containing more than 50 amino acid residues that is encoded by this 1583 base-pair replicon. The nucleotide sequence also contains eight 22 base-pair direct repeats. Seven of the direct repeats are in tandem and located in the origin region, while the eighth repeat is near or part of the promoter for the π structural gene. This eighth repeat may play a role in the autoregulated expression of the π structural gene.  相似文献   

16.
We have constructed and characterized two related human chromosome 12-specific cosmid libraries. DNA from flow-sorted chromosomes from a somatic cell hybrid was cloned into a cosmid vector. Approximately 61% of the cosmids in the nearly 26,200 member arrayed libraries (LLt2NC01 and LLt2NC02) contain human DNA inserts, and 31% of the cosmids derived from human DNA contain CA repeats. One hundred and fifty-two cosmids isolated from the libraries have been mapped by fluorescence in situ hybridization (FISH). Cosmids containing human DNA inserts were localized by FISH exclusively to chromosome 12, confirming the chromosomal specificity of the libraries. The cosmids have been localized to all parts of this chromosome, although some regions are more highly represented than others. Partial sequence information was obtained from 44 mapped cosmids, and oligonucleotide primer pairs were synthesized that define unique sequence tagged sites (STSs). These mapped cosmids, and unique STSs derived from them, provide a set of useful clones and primer pairs for screening YAC libraries and developing contigs centered on regions of interest within chromosome 12. In addition, 120 of the mapped cosmids contain CA repeats, and thus they also provide a useful resource for defining highly polymorphic simple tandem repeat elements that serve as genetic markers for linkage analysis and disease gene localization.  相似文献   

17.
A highly repetitive long interspersed sequence from rat DNA has been isolated and partly characterized. This sequence comprises at least a 1300 base-pair and a 2400 base-pair EcoRI fragment and probably additional elements. The 2400 base-pair segment has been analyzed in detail. It appears to be part of the chromosomal DNA in rat cells. The 2400 base-pair repeat is likely to be distributed over several regions in the rat genome. The 2400 base-pair segment has been cloned, mapped for restriction sites, and part of its nucleotide sequence has been determined. The 2400 base-pair sequence is a member of a typical highly repetitive long interspersed sequence with high copy number and restriction site polymorphism. There are sequence homologies to mouse and human DNA. A striking homology has been detected to the flanking sequences of a repetitive mouse DNA sequence that has been described to be located adjacent to one of the kappa-immunoglobulin variable genes. Elements in the 2400 base-pair rat repeat are transcribed in cells from most rat organs and from several continuous rat cell lines. This RNA from rat cell lines was found polyadenylated or not polyadenylated. The nucleotide sequence of parts of the 2400 base-pair DNA segment revealed open reading frames for polypeptide sequences. Such open reading frames have been detected in two different segments of the 2400 base-pair DNA repeat. Open reading frames exist in the two complementary strands in the same DNA segment. The hypothetical polypeptide whose sequence has been determined in toto has a length of 190 amino acid residues and is enriched in hydrophobic amino acids, reminiscent of the amino acid composition in membrane proteins. Hence, it is conceivable that the 2400 base-pair repeat sequence from rat DNA, at least in part, encodes messenger RNAs that might be translated into functional proteins.  相似文献   

18.

Background

Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data.

Results

Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution.

Conclusions

While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.  相似文献   

19.
This study shows for the first time that the tandemly repeated icosapeptide of human MUC1 underlies a genetic sequence polymorphism at three positions (underlined): PDTRPAPGSTAPPAHGVTSA. The concerted replacement DT-->ES (sequence variation 1) and the single replacements P-->Q (sequence variation 2), P-->A (sequence variation 3), and P-->T (sequence variation 4) were identified by sequencing of polymerase chain reaction products and studied by minisatellite variant repeat analysis for their incidence and topology in the 5' and 3' peripheral regions of the variable number of tandem repeats domain. Minisatellite variant repeat analyses were performed with 27 individual samples of genomic DNA from human cells and tissues covering 30-60% of the domain. Within the peripheral regions, sequence variations 1-4 occur at high incidence and show a nearly constant repeat topology in all individual normal and tumor samples. Also, individuals who were non-Caucasian or of different ethnic background were found to have the same set of replacements with identical topology. The repeat variant 1 replacing the established tumor target motif DTR with ESR was found in all individuals and appears predominantly in repeat clusters (diads and triads). The largely constant topology of variant repeats is interpreted by the assumption that the variable number of tandem repeats domain has evolved as a recent expansion of sequence variable super-repeats.  相似文献   

20.
Mycobacterial interspersed repetitive units (MIRUs) are 40-100 bp DNA elements often found as tandem repeats and dispersed in intergenic regions of the Mycobacterium tuberculosis complex genomes. The M. tuberculosis H37Rv chromosome contains 41 MIRU loci. After polymerase chain reaction (PCR) and sequence analyses of these loci in 31 M. tuberculosis complex strains, 12 of them were found to display variations in tandem repeat copy numbers and, in most cases, sequence variations between repeat units as well. These features are reminiscent of those of certain human variable minisatellites. Of the 12 variable loci, only one was found to vary among genealogically distant BCG substrains, suggesting that these interspersed bacterial minisatellite-like structures evolve slowly in mycobacterial populations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号