首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
51 polypeptides of known 3-dimensional structures have been submitted to a search for internal similarities. It is shown that the frequency of proteins displaying significant amounts of internal similarities is higher than predicted by chance. A non-negligible part of those similarities probably occurs in connection with the existence of ordered secondary structures. Indeed, similarity occurs at a much more important rate when analyses are restricted to protein subsequences corresponding to alpha helices or beta pleated sheets. Furthermore, the correlation existing between the rates at which linear and inverted repeats occur inside protein subregions of ordered secondary structures suggests that a significant part of short similarities are analogies rather than homologies. An hypothesis is put forward suggesting that the regular alternations of hydrophobicity which characterize most of alpha helices and beta strands could provoke the occurrence of significant amounts of similarities inside protein sequences.  相似文献   

2.
The termini of the avian chick embryo lethal orphan (CELO) virus DNA have been sequenced. The results revealed a 63-bp-long inverted terminal repetition (ITR) which shared the sequence ATAATA with all adenovirus termini, thus far analyzed. The CELO virus ITR differed from those of the mammalian adenoviruses in two major aspects: (i) it is not a perfect duplication; (ii) it begins with a 5'-guanylic acid residue instead of the cytidylic acid normally observed in adenoviruses.  相似文献   

3.
4.
We have constructed a RP4 KorB overproducing strain and purified the protein to near homogeneity. KorB is a DNA binding protein recognizing defined palindromic 13-bp sequences (TTTAGCSGCTAAA). Inverted sequence repetitions of this type, designated OB, are present on RP4 12 times. OB-sequences are localized in replication and maintenance regions as well as in the regions Tra1 and Tra2 essential for conjugative transfer. All sites found in Tra regions by computer search act as targets for specific binding of KorB protein. KorB-DNA complexes were detected by DNA fragment retardation assay using polyacrylamide gels. The 13-bp symmetric arrangement of the consensus OB-sequence constitutes the core for binding KorB protein since any truncation of this sequence prevents complex assembly or leads to a considerable destabilization of the KorB-DNA complexes. A hydroxyl radical footprint analysis demonstrated complex formation of KorB with the OB-sequence directly and suggests the presence of an unusual DNA structure within the nucleoprotein complex.  相似文献   

5.
Biological weighted sequences are used extensively in molecular biology as profiles for protein families, in the representation of binding sites and often for the representation of sequences produced by a shotgun sequencing strategy. In this paper, we address three fundamental problems in the area of biologically weighted sequences: (i) computation of repetitions, (ii) pattern matching, and (iii) computation of regularities. Our algorithms can be used as basic building blocks for more sophisticated algorithms applied on weighted sequences.  相似文献   

6.
We have established the nucleotide sequence for the inverted terminal repetition of human adenovirus type 3, a subgroup B adenovirus. The repetition, which is 136 bp long, shows a high degree of homology with the known sequence for the inverted repetition of adenovirus type 5 (Steenbergh et al., 1977) a subgroup C adenovirus. Partial sequence information convering 120 bp of the inverted terminal repetitions of human serotype 12, a subgroup A member, and of simian adenovirus type 7 has also been obtained. A comparison of the established sequences shows that the terminal repetitions, in particular the first 50 bp from the ends, contain sequences that have been well conserved in adenovirus evolution. For instance, only six mismatched base pairs were detected among the first 50 bp in the repetitions of simian adenovirus type 7 and human adenovirus type 5, although the homology between simian adenovirus 7 and human subgroup C adenoviruses was estimated to be only 30%. A 14 bp sequence located 9-22 nucleotides from the ends is present in DNAs from all the human serotypes examined as well as in simian adenovirus 7 DNA. Furthermore, the simian adenovirus 7 repetition contains a 21 bp sequence which is present in SV40 DNA, close to the origin of DNA replication.  相似文献   

7.
The terminal structure of the linear mitochondrial DNA (mtDNA) from the yeast Candida parapsilosis was investigated. This mtDNA, 30 kb long, has symmetrical ends forming inverted terminal repeats. These repeats are made up of a variable number of tandemly repeating units of 738 by each; the terminal nucleotide corresponds to a precise position within the last repeat unit sequence. The ends had an open structure accessible to enzymes, with a 5 single-stranded extension of about 110 nucleotides. No circular forms were detected in the DNA preparations. Two other unrelated species, Pichia philodendra and Candida salmanticensis also appear to have a linear mtDNA of similar organization. These linear DNAs (which we name Type 2 linear mtDNAs) are distinct from the previously described linear mtDNAs of yeasts whose termini are formed by a closed hairpin loop (Type 1 linear mtDNA). The terminal structure of C. parapsilosis mtDNA is reminiscent of the linear mitochondrial genomes of the ciliate Tetrahymena although, in the latter, the telomeric tandem repeat unit is considerably shorter.  相似文献   

8.
D G Ross  J Swan  N Kleckner 《Cell》1979,16(4):721-731
We report here the physical structures of deletions and inversions promoted by the translocatable tetracycline-resistance element Tn10. DNA/DNA heteroduplex and restriction enzyme analyses of alterations in the genome of bacteriophage lambda suggest that both types of DNA alterations almost always originate at the internal termini of the 1400 bp terminal inverted repetitions of Tn10. Tn10-promoted deletions remove a single contiguous DNA segment beginning at one such terminus; Tn10-promoted inversions are more complex, and involve both an inversion and a specific deletion of Tn10 DNA.  相似文献   

9.
The inverted repeated sequences of the chromatin-eliminating nematode Ascaris lumbricoides var. suum have been examined by electron microscopy and by hydroxyapatite chromatography, both in the germ-line and in the somatic DNA. 38% of the inverted repeats of the germ-line DNA analysed in the electron microscope have a single-stranded loop, in comparison to about 50% of looped structures in the somatic DNA. The loops are on average 2.3 X 10(3) base pairs (bp) long. The rest of the foldback DNA consists of simple hairpins. The average length of looped and unlooped inverted repeats is of the order of 300-400 bp in the germ-line and in the somatic DNA. The content of S1-resistant foldback duplexes isolated by hydroxyapatite chromatography amounts to 1.3% in spermatids, with an average length of 350 bp, and to 1.1% in intestinal or larval cell nuclei, with a length of about 320 bp. We estimate by two different methods that there exist approximately 12500 inverted repeats per haploid germ-line genome and approximately 8000 in the haploid somatic genome. A statistical analysis of the data indicates that the great majority of the foldback sequences are randomly distributed in the Ascaris genome, with a spacing of about (40-80) X 10(3) bp, both in the germ-line and in the somatic DNA.  相似文献   

10.
We have analysed the structure of the mtDNAs of six amicronucleate Tetrahymena pyriformis strains, belonging to at least four phenosets, as defined by Borden et al. (Borden, D., Whitt, G.S. and Nanney, D.L. (1973) J. Protozool. 20, 693--700). 2. The mtDNAs of all strains are linear, but they differ in size, in their fragmentation by endonuclease EcoRI and in overall sequence; less than 20% sequence homology was found by DNA-DNA hybridization in all combinations tested, except for the mtDNAs from strains T and ST which are indistinguishable. 3. In spite of these marked sequence differences the mtDNAs of all strains share two structural peculiarities: ragged (gnawed) duplex ends and a duplication-inversion, which varies in length between 0.3 and 1.2 micrometer, depending on the strain. In four strains the duplication-inversion is terminal, allowing formation of single-stranded DNA circles with a duplex tail; in two strains it is subterminal. 4. The ragged ends and sub-terminal position of the duplication-inversion in some of the Tetrahymena mtDNAs do not fit any of the current models for the replication of linear mtDNAs.  相似文献   

11.
The properties of inverted repeated sequences in wheat nuclear DNA have been studied by HAP(1) chromatography, nuclease S1 digestion and electron microscopy. Inverted repeated sequences comprise 1.7% of wheat genome. The HAP studies show that the amount of "foldback HAP bound DNA" depends on DNA length. Inverted repeats appear to be clustered with an average intercluster distance of 25 kb. It is estimated that there are approximately 3 x 10(6) inverted repeats per haploid wheat genome. The sequences around inverted repeats involve all families of repetition frequencies. Inverted repeats are observed as hairpins in electron microscopy. 20% of hairpins are terminated by a single-stranded spacer ranging from 0.3 to 1.5 kb in length. Duplex regions of the inverted repeats range from 0.1 to 0.45 kb with number average values of 0.24 kb and 0.18 kb for unlooped and looped hairpin respectively. Thermal denaturations and nuclease S1 digestions have revealed a length of about 100 bases for duplex regions. The methods used to study inverted repeated sequences are compared and discussed.  相似文献   

12.
Inverted repeated DNA sequences have been isolated from sheared Vicia faba DNA by hydroxylapatite column chromatography, treated with nuclease S1, tritiated by the nick translation method and hybridized in situ on squashes of Vicia faba root tips. Silver grains appear grouped in a rather limited portion of interphase nuclei and form a sort of band across them. The central regions of metaphase chromosomes are preferentially labeled, labeling being excluded from telomeres, centromeres and secondary constrictions. These results are briefly discussed in relation to those obtained in other species and the functional significance of inverted repeats.  相似文献   

13.
14.
Hamster cell nuclear DNA is shown to contain inverted repeat (foldback) sequences, in some respects similar to the foldback fraction in DNA from other animal cell types. Using electron microscopy the majority of foldback duplexes are shown to be located in simple hairpin-like DNA structures, formed from individual pairs of complementary inverted repeated sequences 50--1000 nucleotides in length, in some cases arranged in tandem, and in other cases separated by intervening sequences, up to 16000 nucleotide residues long. In addition, a novel class of foldback structure, referred to as 'bubbled hairpins' is reported, which appear to be formed from clusters of inverted repeat sequences that are separated from adjacent clusters of complementary inverted repeats by large intervening sequences which vary in length from 5000 to over 20000 nucleotide residues. Due to the special pattern of distribution of these latter inverted repeat sequences, 'bubbled hairpins' are observed only in long foldback DNA. Evidence is presented that the distribution of foldback sequences in hamster cell DNA is highly ordered. The lengths of the intervening single chains in foldback structures appear to vary non-randomly. This gives rise to a localised periodic pattern of organisation that is believed to be a consequence of regular alternating arrangements of foldback and non-foldback sequences in the segments of DNA from which foldback structures are derived.  相似文献   

15.
The Wassilewskija (WS) strain of Arabidopsis has four PAI genes at three sites: an inverted repeat at one locus plus singlet genes at two unlinked loci. These four genes are methylated over their regions of DNA identity. In contrast, the Columbia (Col) strain has three singlet PAI genes with no methylation. To test the hypothesis that the WS inverted repeat locus triggers methylation of unlinked identical sequences, we introduced this locus into the Col background by genetic crosses. The inverted repeat induced de novo methylation of all three unmethylated Col PAI genes, with methylation efficiency varying with the position of the target locus. These results, plus results with inverted repeat transgenes, show that methylation is communicated by a DNA/DNA pairing mechanism.  相似文献   

16.
Electron microscope studies on self-annealed intact single strands and on partially denatured molecules show that herpes simplex virus 1 DNA consists of two unequal regions, each bounded by inverted redundant sequences. Thus the region L (70 percent of the contour length of the DNA) separates the left terminal region a1b from its inverted repeat b'a'1, each of which comprises 6 percent of the DNA. The region S (9.4 percent of DNA) separates the right terminal region cas (4.3 percent of the DNA) from its inverted repeat a'sc'. The regions of the two termini which are inverted and repeated itnernally differ in topology. Thus, cas is guanine plus cytosine rich, whereas only the terminal 1 percent of the a1b region, designated as subregion a1, is guanine plus cytosine rich.  相似文献   

17.
The correlation between various amino acid residues (either same or different), along the polypeptide chain have been studied using a large data base. A table of preference values for pairs having strong correlations has been constructed, which can be used to study any sequence and by calculating the weight of these sequences based on these preference values, a rough distinction between a “natural” and a “random” sequence can be made, One can further comment on the evolutionary status of proteins based on these weights.  相似文献   

18.
Fliess A  Motro B  Unger R 《Proteins》2002,48(2):377-387
An important question in protein evolution is to what extent proteins may have undergone swaps (switches of domain or fragment order) during evolution. Such events might have occurred in several forms: Swaps of short fragments, swaps of structural and functional motifs, or recombination of domains in multidomain proteins. This question is important for the theoretical understanding of the evolution of proteins, and has practical implications for using swaps as a design tool in protein engineering. In order to analyze the question systematically, we conducted a large scale survey of possible swaps and permutations among all pairs of protein from the Swissport database. A swap is defined as a specific kind of sequence mutation between two proteins in which two fragments that appear in both sequences have different relative order in the two sequences. For example, aXbYc and dYeXf are defined as a swap, where X and Y represent sequence fragments that switched their order. Identifying such swaps is difficult using standard sequence comparison packages. One of the main problems in the analysis stems from the fact that many sequences contain repeats, which may be identified as false-positive swaps. We have used two different approaches to detect pairs of proteins with swaps. The first approach is based on the predefined list of domains in Pfam. We identified all the proteins that share at least two domains and analyzed their relative order, looking for pairs in which the order of these domains was switched. We designed an algorithm to distinguish between real swaps and duplications. In the second approach, we used Blast to detect pairs of proteins that share several fragments. Then, we used an automatic procedure to select pairs that are likely to contain swaps. Those pairs were analyzed visually, using a graphical tool, to eliminate duplications. Combining these approaches, about 140 different cases of swaps in the Swissprot database were found (after eliminating multiple pairs within the same family). Some of the cases have been described in the literature, but many are novel examples. Although each new example identified may be interesting to analyze, our main conclusion is that cases of swaps are rare in protein evolution. This observation is at odds with the common view that proteins are very modular to the point that modules (e.g., domains) can be shuffled between proteins with minimal constraints. Our study suggests that sequential constraints, i.e., the relative order between domains, are highly conserved.  相似文献   

19.
P McCaldon  P Argos 《Proteins》1988,4(2):99-122
We have examined oligopeptides with lengths ranging from 2 to 11 residues in protein sequences that show no obvious evolutionary relationship. All sequences in the Protein Identification Resource database were carefully classified by sensitive homology searches into superfamilies to obtain unbiased oligopeptide counts. The results, contrary to previous studies, show clear prejudices in protein sequences. The oligopeptide preferences were used to help decide the significance of sequence homologies and to improve the more general methods for detecting protein coding regions within nucleotide sequences.  相似文献   

20.
Inverted repeat sequences, capable of forming stable intra-chain foldback duplexes, are shown using electron microscopy to be located in over 90% of fragments of nuclear DNA from Physarum polycephalum. A statistical treatment of the data indicates that, on average, foldback sequence foci are spaced every 7,000 nucleotides and that they are distributed uniformly amongst the DNA chains. The majority of inverted repeat sequences give rise to the simple types of foldback structure observed in DNA from other eukaryotic species, but a significant proportion of the DNA fragments also contain novel foldback structures with a more complex appearance, referred to as 'bubbled' hairpins. The latter structures appear to be formed by the annealing of several distinct segments of homologous inverted repeat sequence, each separated by interspersed non-foldback sequences of variable sizes up to 15,000 nucleotides in length. The size, both of the foldback duplexes and of the intervening single-chain segments of DNA, are not random. Instead, they appear to form a regular, arithmetic series of lengths. These observations suggest that the different segments of Physarum DNA from which foldback structures are derived contain nucleotide sequences that share a highly ordered and unform pattern of structural organisation. These regular units of organisation in Physarum DNA in some cases extend over distances up to 50,000 nucleotides in length.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号