Similar literature
Found 20 similar documents (search time: 31 ms)
1.
A 2.76 kb segment of the 12 kb divergent region of the Leishmania tarentolae kinetoplast maxicircle DNA consists almost entirely of repeated sequences. The repeats can be grouped into six families, some of which are present throughout the remainder of the divergent region. The repeats are oriented in a head-to-tail fashion with the three simplest repeats clustered into large arrays. A 47 bp palindrome and two copies of a "supercluster" of three different types of repeats are also present in the sequenced region. A sequence change in the divergent region is described for a clonal strain of L. tarentolae which was passaged continuously for several years. The repetitive sequences found in the divergent region appear to be appropriate substrates for the presumed deletion/insertion/recombination events occurring in this rapidly evolving portion of the maxicircle.

2.
3.
4.
Canrep is a heterogeneous, tandemly repeated, 176 bp nucleotide sequence that contains a single HindIII site and is present in high copy numbers in the genomes of many Brassica species. Complete clusters of repeats of this DNA were cloned from the nuclear DNA of Brassica juncea. Restriction-fragment dimers and higher multimers of the 176 bp sequence have arisen by mutations within the HindIII recognition sequence. Adjacent repeats from within the same cluster usually have different nucleotide sequences with features indicating that diversity is generated by a mechanism that causes site-specific base substitutions. While most of the units of canrep DNA are clustered in long arrays of tandem repeats, some are dispersed throughout the genome as isolated copies or in small clusters. Regardless of the size of the arrays, each cluster begins and ends with a variable-length, truncated repeat and is flanked by inverted copies of the sequence 5'-ATCTCAT-3', which is not part of the basic sequence of the canrep family of DNAs. Furthermore, some clusters are located close to nucleotide sequences related to those of known plant transposons. Thus, canrep elements may be dispersed by transposition. There are two distinct subfamilies of canrep sequences in B. juncea, and one of these is closely related to one of the two subfamilies of this type of DNA from B. napus, indicating that it originated from B. campestris, the common diploid ancestor of both amphidiploid species. Neither the repetitive DNA nor nucleotide sequences flanking canrep clusters are transcribed in seedlings, suggesting that even small arrays of repeats are located in heterochromatic regions and might be involved in chromatin condensation and/or chromosome segregation.

5.
Sohanpal B, Wasawo D, Bishop R. Gene 2000, 255(2): 401-409
Telomere-associated (TA) DNA sequences of the intracellular protozoan parasite Theileria parva were isolated by a novel strategy using a modified version of single-specific-primer polymerase chain reaction (SSP-PCR). Nucleotide sequences of non-coding TA DNA from three telomeres (6017 bp, 2435 bp and 4859 bp) contained no extensive tracts of repetitive DNA. Long open reading frames (ORFs) were present at the centromeric ends of two of the TA sequences, the 3' ends of the closest ORFs being only 2670 bp and 2719 bp from the telomeric repeats. There were regions of significant similarity between the nucleotide sequences of the non-coding regions of different telomeres. The longest region of similarity was a virtually identical 1650 bp domain, located directly adjacent to the telomeric repeats of two separate telomeres. Comparison of the telomere proximal sequences defined in this study and two additional T. parva telomeres, whose sequences were determined previously, resulted in identification of a single copy 141 bp conserved sequence directly adjacent to the telomeric repeats. The conserved sequence is present at all five T. parva telomeres that have been characterised. The only organism currently known to have a single copy conserved sequence located adjacent to the telomeric repeats is another intracellular protozoan, Leishmania braziliensis.

6.
The histidine tRNA genes of yeast   Total citations: 9 (self-citations: 0; citations by others: 9)
Yeast has at least seven nuclear histidine tRNA genes although there is a single tRNAHis. We have sequenced three of the histidine tRNA genes. The genes have identical coding sequences and the DNA anti-codon sequence GTG corresponds to the GUG anti-codon in tRNAHis. None of the three yeast histidine tRNA genes has an intervening sequence. Two of the three genes contain repeated DNA elements in the region adjacent to the 5' end of the histidine tRNA gene. One of the elements, sigma, is 18 base pairs (bp) from the 5' end of each of these genes; sigma elements are highly conserved and flanked by 5-bp repeats. The other element, delta, is at variable distances from the tRNA gene; one is 439 bp from a histidine tRNA gene and the other is 52 bp from a histidine tRNA gene. These solo delta elements are quite divergent when compared with deltas associated with transposon yeast elements and are not flanked by 5-bp repeats.

7.
The origin of transfer (oriT) is the sequence within which conjugal transfer of plasmid DNA is initiated, and is absolutely required in cis for plasmid mobilization. We have cloned oriT from the 52 kb IncN plasmid R46 on a 600 bp fragment, and mapped the limits of the relevant sequence by deletion analysis and transposon mutagenesis. The nucleotide sequence of the oriT region contains 13 direct repeats of an 11 bp consensus sequence, 3 different pairs of 10 bp inverted repeats, and a segment that is extremely A-T rich. The direct repeats are within a region required for high frequency transfer and their sequence is such that their periodic alignment along the helix may induce curvature of the DNA. Analysis of Tn1725 insertions within the sequenced fragment of R46 revealed that, unlike most other transposons, transposition of Tn1725 can cause target sequence duplications of three different sizes.

8.
Forty per cent of patients with mitochondrial myopathies, a diverse group of multisystem diseases predominantly affecting skeletal muscle and the brain, have large deletions of a proportion of muscle mitochondrial DNA (mtDNA). These appeared to be identical in 13 of 28 cases, contained within the region 8286-13595 bp. Analysis of the deletion junction in two cases showed a 13 nucleotide sequence which occurred in the normal genome as a direct repeat flanking the region deleted in the mutant mtDNAs. mtDNA deletions may arise from recombination or slippage between short sequence repeats during replication.

9.
Two members of the human salivary proline-rich protein (PRP) multigene family have been isolated and completely sequenced. These PRP genes, PRH1 and PRH2, are of the HaeIII-type subfamily and code for acidic PRP proteins. Both genes are approximately 3.5 kilobase pairs (kb) in length and contain four exons. Exon 3 encodes the proline-rich part of the protein and includes five 63-base pair (bp) repeats. CAT and ATA boxes and several possible enhancer sequences occur in a 1-kb region 5' to exon 1. Two sets of repeats occur in the sequenced region in addition to the 63-bp repeats: one pair of about 140 bp flanks 500 bp of DNA in the first intervening sequence, and the other pair of 72 bp is tandemly repeated 1.4 kb 5' to the PRH1 gene. The 4-kb region of sequenced DNA from PRH1 differs by an average of 8.7% from the same region in PRH2, but the nucleotide sequences of the exon 3 of the two genes differ by only 0.2%. This result suggests the occurrence of a recent gene conversion event. The regions containing the 5-fold repeated sequences of 63 bp are identical in the two genes, PRH1 and PRH2. A comparison of the human HaeIII and BstNI subfamily repeats and a comparison of the human, mouse, and rat repeats suggest that the individual repeats have evolved in a concerted fashion within each gene and within the PRP gene family as a whole.

10.
An S1-hypersensitive site was found at the 60 bp direct repeats of the cis-acting, stability and/or copy number control region of the yeast 2 micron DNA in the supercoiled hybrid plasmid pDB248'. It was retained in a different plasmid, pYK2121, consisting of pBR322 and the 300 bp long repeated DNA. Analyses of 5'-end-labeled fragments and nucleotide sequence determination showed that the S1-cleavage site was at the central part of an AT-rich 19 bp palindrome present in the repeats. Two other homologous palindromes (21 and 15 bp) containing the 12 bp consensus sequences were not cleaved. The nucleotide sequences at the base of the stem and/or loop may determine the efficiency of the cruciform extrusion.

11.
The nature of the ends of the vaccinia virus genome was determined by nucleotide sequencing. Our finding of terminal hairpins indicated that the linear double-stranded DNA molecule consists of a single continuous polynucleotide chain. The 104 nucleotide apex of the hairpin contains predominantly A and T residues and is incompletely base-paired. These loops exist in two forms, which when inverted with respect to each other are complementary in sequence. Both forms of the 104 nucleotide loop are present in nearly equimolar amounts at each end of the genome. A set of 13 tandem 70 bp repeats begins 87 bp from the proximal segment of the terminal loop, followed by a unique sequence of 325 bp, and then by a second set of 18 tandem 70 bp repeats. The sequence of the 70 bp repeats reveals a 13 bp internal redundancy. Self-priming and de novo start replication models, which involve a site-specific nick in one DNA strand proximal to the 104 nucleotide loop, account for the observed sequence inversions and incomplete base-pairing. Similar mechanisms may be involved in replication of the ends of the eucaryotic chromosome.

12.
The complete nucleotide sequence of the gene for chain c of hemoglobin of the earthworm Lumbricus terrestris has been determined. The sequence of 4037 base pairs (bp) includes about 310 bp of 5'-flanking sequence and 110 bp 3' to the poly(A) site. Comparison of cDNA and genomic sequences shows four silent differences in codons that suggest the presence of at least two genes. The coding sequence is split by two introns of 1344 and 1169 bp at highly conserved positions (Jhiang, S. M., Garey, J. R., and Riggs, A. F. (1988) Science 240, 334-336). The first intron possesses the unusual 5' splice junction sequence GC instead of GT. Many tandem triplet repeats based on (GAT) and (CCT) are present in the first intron. The second intron has nine tandem repeats based on the consensus sequence AAGGAAGGAGGTC. Each intron has several exact inverted repeats of 9-10 bp that might result in loops of 78-140 nucleotides in the RNA prior to splicing. The sequences in the second intron, at positions 2423-2644, are about 65% identical with parts of several genes found in yeast mitochondria and in DNA from several other organisms.

13.
Howard R F, Ardeshir F, Reese R T. Gene 1986, 46(2-3): 197-205
Complementary DNA (cDNA) clones for GP185, a major antigenically diverse glycoprotein of Plasmodium falciparum, were isolated from a cDNA library of the Honduras I/CDC (Honduras I) isolate, and 1052 bp were sequenced. The expression of cDNA fragments in Escherichia coli using the vector pCQV2 allowed verification of the reading frame. This GP185 cDNA sequence, like the cDNA sequence for a homologous gene of the K1 isolate [Hall et al., Nature 311 (1984) 379-382], codes for a polypeptide which is truncated due to multiple, in-frame stop codons. This polypeptide corresponds to the N-terminal 15% of the proposed coding region of the GP185 gene [Holder et al., Nature 317 (1985) 270-273]. Comparison of the nucleotide sequences for the GP185 gene of Honduras I and five other isolates indicated that there are two areas of conserved DNA sequence, one of 310 bp (beginning 181 bp upstream from the proposed initiation codon) and the other of ≥360 bp (located entirely within the coding region), separated by a region encoding isolate-specific tandem amino acid repeats. Rat antiserum was raised to a fusion protein derived from the conserved regions and the intervening repeat region of this Honduras I protein. This antiserum bound GP185 on immunoblots of the homologous Honduras I isolate and the heterologous K1 isolate, which has different tandem repeats. Serum from owl monkeys and humans previously infected with P. falciparum reacted with the fusion protein on immunoblots, demonstrating that determinants in the N-terminal 15% of GP185 were immunogenic in infected individuals and suggesting that some of these sites are conserved among isolates. (ABSTRACT TRUNCATED AT 250 WORDS)

14.
The expression of incompatibility properties between the IncX plasmids R6K and R485 of Escherichia coli was examined. For small autonomously replicating derivatives of both plasmid elements, the requirements for incompatibility expression include a functional R485 replicon and an active R6K beta-origin region. Functional R6K alpha and gamma origins are not directly involved in incompatibility expression between R6K and R485. A trans-acting replication system was constructed for plasmid R485. It consists of a 3.2-kb DNA fragment of R485 that specifies a product(s) in trans which supports replication from an R485 origin plasmid. A minimal R485 origin region of 591 bp was derived utilizing this trans-acting replication system, and the nucleotide sequence of this origin region was determined. The most striking feature of the sequence is the presence of six tandem 22-bp nucleotide sequence direct repeats.

15.
The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 × 10^5 to 4 × 10^5, and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.

16.
The complete nucleotide sequence of the rhsA locus and selected portions of other members of the rhs multigene family of Escherichia coli K-12 have been determined. A definition of the limits of the rhsA and rhsC loci was established by comparing sequences from E. coli K-12 with sequences from an independent E. coli isolate whose DNA contains no homology to the rhs core. This comparison showed that rhsA comprises 8,249 base pairs (bp) in strain K-12 and that the Rhs0 strain, instead, contains an unrelated 32-bp sequence. Similarly, the K-12 rhsC locus is 9.6 kilobases in length and a 10-bp sequence resides at its location in the Rhs0 strain. The rhsA core, the highly conserved portion shared by all rhs loci, comprises a single open reading frame (ORF) 3,714 bp in length. The nucleotide sequence of the core ORF predicts an extremely hydrophilic 141-kilodalton peptide containing 28 repeats of a motif whose consensus is GxxxRYxYDxxGRL(I or T). One of the most novel aspects of the rhs family is the extension of the core ORF into the divergent adjacent region. Core extensions of rhsA, rhsB, rhsC, and rhsD add 139, 173, 159, and 177 codons to the carboxy termini of the respective core ORFs. For rhsA, the extended core protein would have a molecular mass of 156 kilodaltons. Core extensions of rhsB and rhsD are related, exhibiting 50.3% conservation of the predicted amino acid sequence. However, comparison of the core extensions of rhsA and rhsC at both the nucleotide and the predicted amino acid level reveals that each is highly divergent from the other three rhs loci. The highly divergent portion of the core extension is joined to the highly conserved core by a nine-codon segment of intermediate conservation. The rhsA and rhsC loci both contain partial repetitions of the core downstream from their primary cores. The question of whether the rhs loci should be considered accessory genetic elements is discussed but not resolved.

17.
The ribosomal DNA (rDNA) repeats of Saccharomyces cerevisiae contain an autonomously replicating sequence (ARS) that colocalizes with a chromosomal origin of replication. We show that a minimal sequence necessary for full ARS function corresponds to a 107-bp rDNA fragment which contains three 10-of-11-bp matches to the ARS consensus sequence. Point mutations in only one of the 10-of-11-bp matches, GTTTATGTTTT, inactivate the rDNA ARS, indicating that this consensus sequence is essential. A perfect match to a revised ARS consensus is present but not essential. Sequences up to 9 bp 5' from the essential consensus are dispensable. A broad DNA region directly 3' to the essential consensus is required and is easily unwound as indicated by: (i) hypersensitivity to nicking of an approximately 100-bp region by mung bean nuclease in a negatively supercoiled plasmid and (ii) helical instability determined by thermodynamic analysis of the nucleotide sequence. A correlation between DNA helical instability and replication efficiency of wild-type and mutated ribosomal ARS derivatives suggests that a broad region 3' to the essential ARS consensus functions as a DNA unwinding element. Certain point mutations that do not stabilize the DNA helix in the 3' region but reduce ARS efficiency reveal an element distinct from, but overlapping, the DNA unwinding element. The nucleotide sequence of the functionally important constituents in the ARS appears to be conserved among the rDNA repeats in the chromosome.
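
The "10-of-11-bp match" criterion in the abstract above can be illustrated with a small sketch. The abstract does not spell out the consensus, so the 11-bp pattern WTTTAYRTTTW used here is an assumption (the commonly cited ARS consensus in IUPAC notation), and the function names `count_matches` and `scan` are hypothetical helpers, not anything from the paper. The sketch scores only the strand it is given.

```python
# IUPAC codes needed for the assumed consensus (W = A/T, Y = C/T, R = A/G).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "AT", "Y": "CT", "R": "AG"}

# Assumption: the 11-bp ARS consensus; the abstract itself does not state it.
ARS_CONSENSUS = "WTTTAYRTTTW"


def count_matches(site, consensus=ARS_CONSENSUS):
    """Count positions at which `site` agrees with the degenerate consensus."""
    if len(site) != len(consensus):
        raise ValueError("site and consensus must have the same length")
    return sum(base in IUPAC[code] for base, code in zip(site.upper(), consensus))


def scan(sequence, consensus=ARS_CONSENSUS, min_matches=10):
    """Return (offset, window, score) for every window scoring >= min_matches."""
    k = len(consensus)
    hits = []
    for i in range(len(sequence) - k + 1):
        window = sequence[i:i + k]
        score = count_matches(window, consensus)
        if score >= min_matches:
            hits.append((i, window, score))
    return hits


# The essential element quoted in the abstract scores 10 of 11 under this
# assumed consensus (only the leading G disagrees with W).
print(count_matches("GTTTATGTTTT"))
```

A real search would also scan the reverse complement, since a match can lie on either strand; that step is left out here to keep the sketch short.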

18.
The nucleotide sequences of the Streptomyces transposable element IS110 and its insertion site in the DNA of a derivative of the temperate phage φC31 were determined. The element is inserted about 460 bp from the right-hand end of φC31 DNA, in a region of apparently non-coding DNA. The target site (in a run of seven C residues) is within an 11 bp sequence homologous with one end of IS110. The inserted element is flanked by runs of 11 and 15 C residues which form part of more extensive regions of homology between the left and right junction regions. Imperfect inverted repeats (10 matches out of 15 bp) are present near (but not at) the ends of IS110. The whole IS110 element contains about 1550 bp of which 71% are G-C bp. One major potentially protein-coding region (ORF 1215) was detected, of 1215 bp, the product of which, a presumptively soluble protein of Mr 43,563, was not overtly related to any entry in a protein sequence database. A smaller open reading frame (ORF 330) was tentatively identified in the opposite strand of the ORF 1215 region.

19.
The long (4.6-kb) A+T region of Drosophila melanogaster mitochondrial DNA has been cloned and sequenced. The A+T region is organized in two large arrays of tandemly repeated DNA sequence elements, with nonrepetitive intervening and flanking sequences comprising only 22% of its length. The first repeat array consists of five repeats of 338-373 bp. The second consists of four intact 464-bp repeats and a fifth partial repeat of 137 bp. Three DNA sequence elements are found to be highly conserved in D. melanogaster and in several Drosophila species with short A+T regions. These include a 300-bp DNA sequence element that overlaps the DNA replication origin and two thymidylate stretches identified on opposite DNA strands. We conclude that the length heterogeneity observed in the A+T regulatory region in mitochondrial DNAs from the genus Drosophila results from the expansion (and contraction) of the number of repeated DNA sequence elements. We also propose that the 300-bp conserved DNA sequence element, in conjunction with another primary sequence determinant, perhaps the adjacent thymidylate stretch, functions in the regulation of mitochondrial DNA replication.

20.
The nucleotide sequence of the entire beta-like globin gene cluster of rabbits has been determined. This sequence of a continuous stretch of 44.5 × 10^3 base-pairs (bp) starts about 6 × 10^3 bp upstream from epsilon (the 5'-most gene) and ends about 12 × 10^3 bp downstream from beta (the 3'-most gene). Analysis of the sequence reveals that: (1) the sequence is relatively A + T rich (about 60%); (2) regions with high G + C content are associated with OcC repeats, a short interspersed repeated DNA in rabbits; (3) the distribution of polypurines, polypyrimidines and alternating purine/pyrimidine tracts is not random within the cluster; (4) most open reading frames are associated with known globin coding regions, OcC repeats or long interspersed repeats (L1 repeats); (5) the most prominent open reading frames are found in the L1 repeats; (6) different strand asymmetries in base composition are associated with embryonic and adult genes as well as the tandem L1 repeats at the 3' end of the cluster; and (7) essentially all the repeats appear to have been inserted by a transposon mechanism. A comparison of the sequence with itself by a dot-plot analysis has revealed nine new members of the OcC family of repeats in addition to the six previously reported. The OcC repeats tend to be clustered, particularly in the epsilon-gamma and gamma-psi delta intergenic regions. Dot-plot comparisons between the rabbit and the human clusters have revealed extensive sequence matches. Homology starts about 6 × 10^3 bp 5' to epsilon or as far upstream as the rabbit sequence is available. It continues throughout the entire cluster and stops about 0.7 × 10^3 bp 3' to beta, at which point several repeats have inserted in both rabbits and humans. Throughout the gene cluster, the homology is interrupted mainly by insertions or deletions in either the rabbit or the human genome. Almost all of the insertions are of known short or long repeated DNAs. The positions of the insertions are different in the two gene clusters, which indicates that both short and long repeats have been transposing throughout the genome for the time since the mammalian radiation. An alignment of rabbit and human sequences allows the calculation of the substitution rate around epsilon. Sequences far removed from the gene are evolving at a rate equivalent to the pseudogene rate, although some short regions show an apparently higher rate. (ABSTRACT TRUNCATED AT 400 WORDS)
