首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 22 毫秒
1.
The human Pur factor binds strongly to a sequence element repeated within zones of initiation of DNA replication in several eukaryotic cells. The protein binds preferentially to the purine-rich single strand of this element, PUR. We report here the cloning and sequencing of a cDNA encoding a protein with strong affinity for the PUR element. Analysis with a series of mutated oligonucleotides defines a minimal single-stranded DNA Pur-binding element. The expressed Pur open reading frame encodes a protein of 322 amino acids. This protein, Pur alpha, contains three repeats of a consensus motif of 23 amino acids and two repeats of a second consensus motif of 26 amino acids. Near its carboxy terminus, the protein possesses an amphipathic alpha-helix and a glutamine-rich domain. The repeat region of Pur cDNA is homologous to multiple mRNA species in each of several human cell lines and tissues. The HeLa cDNA library also includes a clone encoding a related gene, Pur beta, containing a version of the 23-amino-acid consensus motif similar, but not identical, to those in Pur alpha. Results indicate a novel type of modular protein with capacity to bind repeated elements in single-stranded DNA.  相似文献   

2.
A region of the IncHI plasmid R27 has been found to share very close nucleotide sequence homology with the RepFIA replicon of F. This region has been located on a 1.6 kb segment of R27 plasmid DNA, and corresponds to ori-2 and the E gene of F. The incC repeat sequence region shows reduced homology, with the F repeats being an imperfect subset of a larger repeated sequence found in R27. The E gene homologue of R27 is able to initiate replication from the F ori-2 sequence and to repress the E gene promoter of F. The results are consistent with the observed incompatibility behaviour of R27, and have a bearing on the specificity of interaction of E protein with its DNA-binding sites.  相似文献   

3.
The plasmids R401 and Rtsl belong to the same incompatibility group, IncT. The nucleotide sequence of the basic replicon of R401 consisting of 1,857 base pairs was determined and compared with that of mini-Rtsl previously reported. The mini-R401 was found to be composed of two clusters of direct repeated sequences flanking a large open reading frame that could encode a 33,000 Mr protein (RepA protein) consisting of 288 amino acids. This structure of mini-R401 is quite similar to that of mini-Rtsl. Furthermore, the nucleotide sequence of mini-R401 is identical to that of mini-Rtsl except for eleven nucleotides; three are located near the carboxyl terminus portion of the RepA coding region (repA) and four are in the repeated sequences (incI) located downstream from repA. Incompatibility study showed that mini-R401 plasmid coexisted stably with the cloned incI repeats of mini-Rtsl, suggesting that mini-R401 RepA protein binds to incI repeats of mini-Rtsl less efficiently than does mini-Rtsl RepA protein.  相似文献   

4.
The DNA sequence located between mecA, the gene that codes for penicillin-binding protein PBP2', and insertion sequence-like element IS431mec has been termed hypervariable because of its length polymorphism among different staphylococcal isolates. We sequenced and characterized the hypervariable region of the methicillin resistance determinant (mec) isolated from Staphylococcus aureus BB270. Within the 2,040-bp hypervariable region, we identified an unusual accumulation of long direct repeats. Analysis of the DNA sequence revealed a minimal direct repeat unit (dru) of 40 bp which was repeated 10 times within 500 bp. The dru sequences are responsible for the length polymorphism of mec. Moreover, we identified an open reading frame that codes for 145 amino acids (ORF145), whose deduced amino acid sequence showed 57% amino acid sequence similarity to the N terminus of the glycerophosphoryl diester phosphodiesterase (UgpQ) of Escherichia coli.  相似文献   

5.
I van Die  H Bergmans 《Gene》1984,32(1-2):83-90
The cloned DNA fragment encoding the F72 fimbrial subunit from the uropathogenic Escherichia coli strain AD110 has been identified. The nucleotide sequence of the structural gene and of 196 bp of the noncoding region preceding the gene was determined. The structural gene codes for a polypeptide of 188 amino acid residues, including a 21-residue N-terminal signal sequence. The nucleotide sequence and the deduced amino acid sequence of the F72 gene were compared with the reported sequences of the papA gene (B?ga et al., 1984). Both genes code for subunits of fimbriae that are involved in mannose-resistant hemagglutination (MRHA) of human erythrocytes. The available data show that there is absolute homology between the noncoding regions preceding both genes over 129 bp. The two proteins are homologous at the N terminus and C terminus; there is less, but significant, homology in the region between the N and C termini.  相似文献   

6.
Four polymorphic variants of the platelet receptor for von Willebrand factor, glycoprotein Ib, have been described that differ in molecular weight on sodium dodecyl sulfate-polyacrylamide gels (Moroi, M., Jung, S. M., and Yoshida, N. (1984) Blood 64, 622-629). A recent report localized the polymorphic site to the heavily O-glycosylated region of the glycoprotein Ib alpha-chain known as the macroglycopeptide (Meyer, M., and Schellenberg, I. (1990) Thromb. Res. 58, 233-242). This region contains several tandem repeats of a mucin-like sequence, which appeared to be a likely site for polymorphic variation. We amplified genomic DNA corresponding to the macroglycopeptide from 206 individuals from four ethnic groups and identified three length variants based on the migration of the amplified DNA on denaturing polyacrylamide gels. DNA sequencing revealed that the three variants represented four alleles, two of which varied by only one base pair, a difference that did not result in an amino acid change. The three length variants differed in the number of tandem repeats of a 39-base pair sequence that results in perfect duplication of a 13-amino acid sequence that originated within a region flanked by Glu-396 and Thr-411. The smallest isoform contained one such sequence; the next largest, two repeats; and the largest, three repeats. The DNA sequence containing the tandem repeats was flanked by direct repeats typical of the target site duplications found flanking transposed DNA, suggesting a mechanism for acquisition of this region by the primordial glycoprotein Ib alpha precursor. The amino acid sequence of the repeated element that accounts for the polymorphism contained five sites for potential O-glycosylation, which together with the repeated amino acids would result in incremental differences in molecular weight of approximately 6,000 between the different isoforms. The addition of repeats to the macroglycopeptide is predicted to increase the length of this elongated glycosylated region and extend the distance between the ligand-binding domain of glycoprotein Ib and the platelet plasma membrane, an effect that would project the ligand-binding domain farther into the bloodstream. Such a change may alter the susceptibility of platelets to shear-induced activation, a process that requires an interaction between glycoprotein Ib and von Willebrand factor.  相似文献   

7.
Plasmodium berghei: cloning of the circumsporozoite protein gene   总被引:6,自引:0,他引:6  
A DNA fragment encoding the carboxy terminal 80% of the Plasmodium berghei circumsporozoite protein was selected from a genomic DNA expression library. Sequencing revealed that the P. berghei circumsporozoite protein was similar in overall structure to circumsporozoite proteins from other malaria species, although the central repeat region was unique in comprising two different blocks of tandem peptide repeats: 11 eight amino acid repeats with predominant sequence DPAPPNAN were followed by 16 two amino repeats, predominantly PQ. The P. berghei circumsporozoite protein exhibited limited, but about equal amino acid homology to circumsporozoite proteins from P. knowlesi, P. vivax, and P. falciparum, indicating that P. berghei is not closely related to any of these other malaria species. Cloning of the P. berghei circumsporozoite protein gene will allow direct testing of sporozoite vaccines in mice.  相似文献   

8.
We have examined parameters that affect sequence-specific interactions of the mouse c-myb protein with DNA oligomers containing the Myb-binding motif (CA/CGTTPu). Complexes formed between these oligomers and in vitro translated c-myb proteins were analysed by electrophoresis on non-denaturing polyacrylamide gels using the mobility-shift assay. By progressive truncation of c-myb coding sequences it was demonstrated that amino acids downstream of a region of three imperfect 51-52 residue repeats (designated R1, R2 and R3), which are located close to the amino terminus of the protein, had no qualitative or quantitative effect on the ability to interact specifically with this DNA motif. However, removal of only five amino acids of the R3 repeat completely abolished this activity. The contribution of individual DNA-binding domain repeats to this interaction was investigated by precisely deleting each individually: it was demonstrated that a combination of R2 and R3 was absolutely required for complex formation while the R1 repeat was completely dispensible. c-myb proteins showed quantitatively greater interaction with oligomers containing duplicated rather than single Myb-binding motif, in particular where these were arranged in tandem. Moreover, it was observed that c-myb protein interacted with these tandem motifs as a monomer. These findings imply that a single protein subunit straddles adjacent binding sites and the implications for c-myb activity are discussed.  相似文献   

9.
We have identified p10 as a fifth gag protein of avian sarcoma and leukemia viruses. Amino-terminal protein sequencing of this polypeptide purified from the Prague C strain of Rous sarcoma virus and from avian myeloblastosis virus implies that it is encoded within a stretch of 64 amino acid residues between p19 and p27 on the gag precursor polypeptide. For p10 from the Prague C strain of Rous sarcoma virus the first 30 residues were found to be identical with the predicted amino acid sequence from the Prague C strain of Rous sarcoma virus DNA sequence, whereas for p10 from avian myeloblastosis virus the protein sequence for the same region showed two amino acid substitutions. Amino acid composition data indicate that there are no gross composition changes beyond the region sequenced. The amino terminus of p10 is located two amino acid residues past the carboxy terminus of p19, whereas its carboxy terminus probably is located immediately adjacent to the first amino acid residue of p27.  相似文献   

10.
The malF gene product is an inner membrane component of the maltose transport system in Escherichia coli. Some gene fusions between malF and lacZ (encoding the normally cytoplasmic enzyme beta-galactosidase) produce hybrid proteins which are membrane-bound while other fusions produce hybrid proteins which are cytoplasmic (Silhavy, T. J., Casadaban, M. J., Shuman, H. A., and Beckwith, J. R. (1976) Proc. Natl. Acad. Sci. U. S. A. 73, 3423-3427). To further analyze the localization properties of the different classes of fusion proteins and of the intact MalF protein, we have obtained the DNA sequence of 5 malF-lacZ fusions and the wild type malF gene. From the predicted amino acid sequence, MalF protein contains 514 amino acids and has a molecular weight of 56,947. Analysis of the hydropathic character of MalF using the Kyte-Doolittle assignments (Kyte, J., and Doolittle, R. F. (1982) J. Mol. Biol. 157, 105-132), indicates that the protein may have 2 or 3 amino-terminal membrane-spanning segments and 4 or 5 carboxy-terminal membrane-spanning segments separated by a region of 181 hydrophilic residues. Localization properties of the different fusion proteins correspond with degree of hydrophobicity. By sequencing upstream from malF, the malE-malF intercistronic region was found to be 153 base pairs in length and to contain inverted repeats, homologous to intercistronic repeats of many other operons. Further analysis of this region may help in understanding the observed step-down in synthesis of the MalF protein.  相似文献   

11.
The coding region of the alpha-amylase inhibitor (HaimII) gene from the producing strain Streptomyces griseosporeus YM-25 was localized on an 800-base-pair DNA segment. The nucleotide sequence of a 1,191-base-pair region including the HaimII gene was determined by the dideoxy-chain termination method. The nucleotide sequence data predicted an open reading frame of 363 base pairs starting with an ATG initiation codon and ending with a TGA translational stop codon. The amino acid sequence deduced from the nucleotide sequence indicated that the presumptive pre-HaimII protein extends 37 amino acids to the amino terminus and 6 amino acids to the carboxyl terminus of the mature HaimII protein. The pre-HaimII protein is believed to be processed both during and after secretion. Two forms of the inhibitor, which have a higher molecular weight than that of the HaimII protein isolated from S. griseosporeus, were partially purified from the culture filtrate of Streptomyces lividans containing the cloned HaimII gene.  相似文献   

12.
Cloned DNA from the replication terminus region of Bacillus subtilis 168 was used to identify and construct a restriction map of the homologous region in B. subtilis W23. With this information, DNA from the terminus region of W23 was cloned and the sequence was determined for a 1,499-base-pair segment spanning the expected terC site. The position of the site was then located more precisely. Use of the cloned DNA from strain W23 as a probe for digests of DNA from exponentially growing cells of the same strain established the presence of the slowly migrating replication termination intermediate (forked DNA). The orientation and dimensions of the forked molecule were consistent with arrest of the clockwise fork at the terC site in W23, as has been shown to occur in strain 168. Thus, despite significant differences between the two strains, the same termination mechanism appears to be used. The DNA sequences spanning the terC site in strains 168 and W23 showed a high level of homology (90.2%) close to the site but very little at a distance of approximately 250 base pairs from the site in one particular direction. The overall sequence comparison emphasised the importance of the open reading frame for a 122-amino-acid protein adjacent to terC. Although there were 22 base differences in the open reading frames between the strains, the amino acid sequence of the encoded protein was completely conserved. It is suggested that the amino acid sequence conservation reflects a role for the protein in the clockwise fork arrest mechanism as proposed earlier (M.T. Smith and R.G. Wake, J. Bacteriol. 170:4083-4090, 1988).  相似文献   

13.
The identification of a region of sequence variability among individual isolates of Bacillus anthracis as well as the two closely related species, Bacillus cereus and Bacillus mycoides, has made a sequence-based approach for the rapid differentiation among members of this group possible. We have identified this region of sequence divergence by comparison of arbitrarily primed (AP)-PCR "fingerprints" generated by an M13 bacteriophage-derived primer and sequencing the respective forms of the only polymorphic fragment observed. The 1,480-bp fragment derived from genomic DNA of the Sterne strain of B. anthracis contained four consecutive repeats of CAATATCAACAA. The same fragment from the Vollum strain was identical except that two of these repeats were deleted. The Ames strain of B. anthracis differed from the Sterne strain by a single-nucleotide deletion. More than 150 nucleotide differences separated B. cereus and B. mycoides from B. anthracis in pairwise comparisons. The nucleotide sequence of the variable fragment from each species contained one complete open reading frame (ORF) (designated vrrA, for variable region with repetitive sequence), encoding a potential 30-kDa protein located between the carboxy terminus of an upstream ORF (designated orf1) and the amino terminus of a downstream ORF (designated lytB). The sequence variation was primarily in vrrA, which was glutamine- and proline-rich (30% of total) and contained repetitive regions. A large proportion of the nucleotide substitutions between species were synonymous. vrrA has 35% identity with the microfilarial sheath protein shp2 of the parasitic worm Litomosoides carinii.  相似文献   

14.
Two forms of small, interstitial proteoglycans have been isolated from bovine articular cartilage and have different core proteins, based on NH2-terminal analysis and peptide mapping (Choi, H. U., Johnson, T. L., Pal, S., Tang, L-H., Rosenberg, L. C., and Neame, P. J. (1989) J. Biol. Chem. 264, 2876-2884). These proteoglycans have been called PG I and PG II. Since they were first described, they have also been called "biglycan" (PG I), "decorin," and "DS-PG" (PG II). This report describes the primary structure of PG I from bovine articular cartilage. The protein core consists of 331 amino acids with a molecular mass of 37,280 Da. The amino acid sequence shows 55% identity to the cDNA-derived sequence of PG II from bovine bone. There are four discrete domains in the amino acid sequence. Domain 1, at the NH2 terminus (approximately 23 amino acids), contains two sites of attachment of dermatan sulfate, both of which match the consensus sequence of Asp/Glu-X-X-Ser-Gly-hydrophobic. Neither of these sites is substituted to 100% with glycosaminoglycan in native PG I. Domain 2, near the NH2 terminus and containing approximately 28 amino acids, has a cysteine pattern similar to a domain near the COOH terminus of mouse metallothionein and contains at least one disulfide bond (between the first and fourth cysteine residues). The majority of the core protein of PG I (domain 3) is a leucine-rich domain containing ten repeating units (approximately 231 amino acids). Patthy [1987) J. Mol. Biol. 198, 567-577) has shown that for PG II, the majority of domain 3 shows considerable similarity to leucine-rich alpha 2-glycoprotein (LRG) from serum. Domain 2 of PG I or PG II also has an analog in LRG, in that it has two cysteines in a similar place. The major motif in the PG I described here, in PG II and in LRG, is a series of leucine-rich repeats. PG I and PG II both contain 10 leucine-rich repeats which are 14 amino acids long and which are somewhat irregularly spaced, while LRG contains 9 leucine-rich repeats spaced 10 amino acids apart. Other proteins which contain leucine repeats are the platelet glycoprotein Ib, which is involved in platelet adherence to subendothelium (eight repeats in the alpha chain and two in the beta chain), the protein encoded by the Toll gene (involved in lateral and ventral spatial organization in Drosophila) and chaoptin (a protein involved in Drosophila photoreceptor morphogenesis).(ABSTRACT TRUNCATED AT 400 WORDS)  相似文献   

15.
DNA obtained from the Sheila Smith strain of Rickettsia rickettsii was digested to completion with the restriction endonucleases BamHI and SalI and ligated with the plasmid vector pUC19. The ligation mixture was used to transform Escherichia coli. A total of 465 bacterial clones were screened for antigen production with hyperimmune rabbit serum. One of the reactive clones, containing a recombinant plasmid designated pSS124, was solubilized and subjected to immunoblot analysis and revealed expression of a 17-kilodalton protein reactive with anti-R. rickettsii serum that comigrated with an antigen from R. rickettsii. A 1.6-kilobase PstI-BamHI fragment from pSS124 was subcloned and continued to direct synthesis of the 17-kilodalton antigen. The nucleotide sequence was determined for this 1.6-kilobase subclone, which encompassed the gene encoding the polypeptide as well as flanking regions containing potential regulatory sequences. The open reading frame consisted of 477 nucleotides that specified a 159-amino-acid protein with a calculated molecular weight of 16,840. The deduced amino acid sequence contained a hydrophobic sequence near the amino terminus that resembled signal peptides described for E. coli. The carboxy terminus was hydrophilic in nature and probably contained the exposed epitopes.  相似文献   

16.
17.
The nucleotide sequence of the promoter-distal region of the tra operon of R100 was determined. There are five open reading frames in the region between traT and finO, and their protein products were identified. Nucleotide sequences of plasmid F corresponding to the junction regions among the open reading frames seen in R100 were also determined. Comparison of these nucleotide sequences revealed strong homology in the regions containing traD, traI and an open reading frame (named orfD). The TraD protein (83,899 Da) contains three hydrophobic regions, of which two are located near the amino-terminal region. This protein also contains a possible ATP-binding consensus sequence at the amino-terminal region and a characteristic repeated peptide sequence (Gln-Gln-Pro)10 at the carboxy-terminal region. The TraI protein (191,679 Da) contains the sequence motif conserved in an ATP-dependent DNA helicase superfamily in its carboxy-terminal region. The protein product of orfD, which is probably a new tra gene (named traX), contains 65% hydrophobic amino acids, especially rich in alanine and leucine. There exist non-homologous regions between R100 and F that could be represented as four I-D (insertion or deletion) loops in heteroduplex molecules. Assignment of each loop to the strand of R100 or F was , however, found to be the reverse from that previously assumed. The three I-D loops that were located between traT and traD, between traD and traI, and between traI and finO had no terminal inverted repeat sequences nor had they any homology with known insertion sequences, while the fourth was IS3, located within the finO gene of F. The sequences in the I-D loops, except IS3, may also code for proteins that are, however, likely to be nonessential for transfer of plasmids.  相似文献   

18.
Human mammary cells present on the cell surface a polymorphic epithelial mucin (PEM) which is developmentally regulated and aberrantly expressed in tumors. PEM carries tumor-associated epitopes recognized by the monoclonal antibodies HMFG-1, HMFG-2, and SM-3. Previously isolated partial cDNA clones revealed that the core protein contained a large domain consisting of variable numbers of 20-amino acid repeat units. We now report the full sequence for PEM, as deduced from cDNA sequences. The encoded protein consists of three distinct regions: the amino terminus consisting of a putative signal peptide and degenerate repeats; the major portion of the protein which is the tandem repeat region; the carboxyl terminus consisting of degenerate tandem repeats and a unique sequence containing a transmembrane sequence and a cytoplasmic tail. Potential O-glycosylation sites (serines or threonines) make up more than one-fourth of the amino acids. Length variations in the tandem repeat result in PEM being an expressed variable number tandem repeat locus. Tandem repeats appear to be a general characteristic of mucin core proteins.  相似文献   

19.
Laminin (Mr = 800,000) is a glycoprotein consisting of three chains, A, B1, and B2, and has diverse biological activities. Previously we reported the complete primary structure of the B1 and B2 chains of mouse laminin deduced from cDNA sequence (Sasaki, M., Kohno, K., Kato, S., Martin, G. R., and Yamada, Y. (1987) Proc. Natl. Acad. Sci. U.S.A. 84, 935-939; Sasaki, M., and Yamada, Y. (1988) J. Biol. Chem. 262, 17111-17117). Here we describe the isolation, characterization, and sequence of cDNA clones spanning 9,520 bases which encode the entire A chain of mouse laminin. The nucleotide sequence of the clones contains an open reading frame of 3,084 amino acids including 24 amino acids of a signal peptide. The A chain contains some eight distinct domains including alpha-helices, cysteine-rich repeats and globules. There is considerable sequence and structural homology between the A chain and the B1 and B2 chains. However, the A chain has a unique globular structure containing homologous repeats at the carboxyl terminus and constituting one third of the molecular mass of the chain. Furthermore, the A chain contains three globules and three cysteine-rich domains at the amino terminus, whereas the B1 and B2 chains have only two each of such domains. The A chain shows homology to the basement membrane heparan sulfate proteoglycan core protein and the extracellular domain of the Drosophila neurogenic protein Notch. There is an RGD (Arg-Gly-Asp) sequence in one of the cysteine-rich domains of the A chain. This potential cell binding sequence could be active as another adhesion signal in addition to the previously identified cell binding sequence YIGSR (Tyr-Ile-Gly-Ser-Arg) of the B1 chain.  相似文献   

20.
Campylobacter jejuni strains 81116 and NCTC 11392 were shown to contain a region of DNA that was not present in other strains. This region was cloned from the chromosome of strain 81116 and the nucleotide sequence determined. It was found to contain an insert of 742 bp which was flanked by direct repeats of 45 bp. Although the nature of this sequence is unknown at present, it is clear that it only presents in certain strains of Camp. jejuni and it may therefore be useful as an epidemiological tool.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号