首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We investigated protein sequence/structure correlation by constructing a space of protein sequences, based on methods developed previously for constructing a space of protein structures. The space is constructed by using a representation of the amino acids as vectors of 10 property factors that encode almost all of their physical properties. Each sequence is represented by a distribution of overlapping sequence fragments. A distance between any two sequences can be calculated. By attaching a weight to each factor, intersequence distances can be varied. We optimize the correlation between corresponding distances in the sequence and structure spaces. The optimal correlation between the sequence and structure spaces is significantly better than that which results from correlating randomly generated sequences, having the overall composition of the data base, with the structure space. However, sets of randomly generated sequences, each of which approximates the composition of the real sequence it replaces, produce correlations with the structure space that are as good as that observed for the actual protein sequences. A connection is proposed with previous studies of the protein folding code. It is shown that the most important property factors for the correlation of the sequence and structure spaces are related to helix/bend preference, side chain bulk, and beta-structure preference.  相似文献   

2.
The cyanogen bromide fragments of S-carboxymethylated fructose-bisphosphatase were purified. The amino acid sequences of the small fragments were determined by the dansyl-Edman method. The large fragments were subjected to proteolytic digestion to give smaller peptides more amenable for purification and sequencing by similar methods. Enzyme digests of the S-carboxymethylated enzyme gave overlap peptides containing the methionine residues. In conjunction with the amino acid sequence of the 60-residue N-terminal fragment previously determined on the S-peptide released by limited proteolysis with subtilisin the complete sequence of 336 residues was deduced. The sequence has been compared with the 335 residue sequence of pig kidney fructose-bisphosphatase and some areas of sequence for rabbit liver enzyme. The strong homology previously noted for the S-peptide sequence is maintained for the complete enzyme with only 34 changes in 336 residues when comparing the pig and sheep enzymes.  相似文献   

3.
Hydroxyindole-O-methyltransferase (HIOMT) catalyzes the final step in melatonin synthesis. The nucleotide and deduced amino acid sequences of bovine HIOMT have been reported. Our laboratory recently isolated a cDNA clone encoding human HIOMT. Comparison of the human and bovine nucleotide sequences revealed several discrepancies which prevented perfect alignment and produced defined regions of virtually no homology in the deduced amino acid sequence. Consequently, we repeated sequence analysis of the original bovine HIOMT cDNA clone, the results of which are reported here. The revised nucleotide sequence includes 23 differences from the published sequence. This completely changes the deduced amino acid sequence in two regions, encompassing a total of 96 residues, or 28% of the protein. The revised deduced amino acid sequence predicts different post-translational modifications as compared to that of the original deduced sequence. This information will make it possible in future investigations of HIOMT to design improved polymerase chain reaction primers, peptides for the generation of antisera, and probes for various types of analysis and screening of libraries.  相似文献   

4.
Nucleotide sequence of mouse satellite DNA.   总被引:33,自引:20,他引:13       下载免费PDF全文
The nucleotide sequence of uncloned mouse satellite DNA has been determined by analyzing Sau96I restriction fragments that correspond to the repeat unit of the satellite DNA. An unambiguous sequence of 234 bp has been obtained. The sequence of the first 250 bases from dimeric satellite fragments present in Sau96I limit digests corresponds almost exactly to two tandemly arranged monomer sequences including a complete Sau96I site in the center. This is in agreement with the hypothesis that a low level of divergence which cannot be detected in sequence analyses of uncloned DNA is responsible for the appearance of dimeric fragments. Most of the sequence of the 5% fraction of Sau96 monomers that are susceptible to TaqI has also been determined and has been found to agree completely with the prototype sequence. The monomer sequence is internally repetitious being composed of eight diverged subrepeats. The divergence pattern has interesting implications for theories on the evolution of mouse satellite DNA.  相似文献   

5.
Hybrid molecules containing DNA sequences complementary to bovine pituitary mRNA were constructed in the Pst I site of pBR322 by the dC . dG tailing technique. Recombinant plasmids containing bovine prolactin (bPRL) sequences were amplified in bacteria and identified by hybridization to purified [32P]bPRL cDNA sequences. Nucleotide sequence analysis was performed on the inserts from two of the positive clones. One clone, pBPRL72, contained a 982-base pair insert that included 67 nucleotides of the 5'-untranslated region, the complete coding region of the preprolactin protein (690 nucleotides), and the entire 3'-untranslated region (150 nucleotides) of bPRL mRNA. The nucleotide sequence analysis of clone pBPRL72 predicted the sequence of a 30-amino acid signal peptide and confirmed the published amino acid sequence of the protein with one exception. A comparison of the pBPRL72 cDNA sequence with a second bPRL clone, pBPRL4, revealed four silent nucleotide differences. Three of the base changes occurred in the third position of amino acid codons, and one occurred in the 3'-noncoding region. The sequence polymorphism suggests the existence of alleles or multiple loci for bPRL that do not alter the protein structure.  相似文献   

6.
The amino acid sequence of beta-galactosidase has been determined. The monomer contains 1,021 amino acid residues in a single polypeptide chain and has a molecular weight of 116,349. All 80 tryptic peptides as well as all 24 CNBr peptides have been isolated in pure form. Evidence is presented for the ordering of the CNBr peptides. The sequence determination was aided by analysis of cyanogen bromide peptides obtained from a polypeptide fragment produced by a lacZ termination mutant strain.  相似文献   

7.
A plasmid containing the transposon gamma delta sequence was immune to further insertion of gamma delta (transposition immunity). Plasmids carrying a fragment containing either 0.2 kilobase pairs of the gamma end or 0.4 kilobase pairs of the delta end of the gamma delta sequence were immune, and other parts of the gamma delta sequence did not confer immunity. The terminal 38-base-pair (bp) sequence of the delta end of the gamma delta was sufficient to confer immunity, the 38-bp sequence of the gamma end conferred only moderate immunity, and the terminal 35-bp sequence, which was completely identical at both the gamma and delta ends, was insufficient to confer immunity.  相似文献   

8.
Dispersed repetitive DNA sequence of Mucor racemosus.   总被引:1,自引:0,他引:1       下载免费PDF全文
A dispersed repetitive DNA sequence has been identified within the genome of the fungus Mucor racemosus. Recombinant phage clones, as well as a plasmid harboring the sequence, have been isolated. Examination of cloned fragments comprising part of the repetitive sequence has led to a partial characterization of the element. The sequence has been detected in other Mucor species, and although the apparent number and chromosomal position of the repetitive sequence vary from strain to strain, it is clear that at least portions of the element have been conserved.  相似文献   

9.
Nucleotide sequence of bacteriophage f1 DNA.   总被引:30,自引:2,他引:28       下载免费PDF全文
The nucleotide sequence of the DNA of the filamentous coliphage f1 has been determined. In agreement with earlier conclusions, the genome was found to comprise 6,407 nucleotides, 1 less than that of the related phage fd. Phage f1 DNA differs from that of phage M13 by 52 nucleotide changes, which lead to 5 amino acid substitutions in the corresponding proteins of the two phages, and from phage fd DNA by 186 nucleotide changes (including the single-nucleotide deletion), which lead to 12 amino acid differences between the proteins of phages f1 and fd. More than one-half of the nucleotide changes in each case are found in the sequence of 1,786 nucleotides comprising gene IV and the major intergenic region between gene IV and gene II. The sequence of this intergenic region (nucleotides 5501 to 6005) of phage f1 differs from the sequence reported by others through the inclusion of additional single nucleotides in eight positions and of a run of 13 nucleotides between positions 5885 and 5897, a point of uncertainty in the earlier published sequence. The differences between the sequence of bacteriophage f1 DNA now presented and a complete sequence for the DNA previously published by others are discussed, and the f1 DNA sequence is compared with those of bacteriophages M13 and fd.  相似文献   

10.
11.
The nucleotide sequence of the gene (tnpA) which codes for the transposase of transposon Tn501 has been determined. It contains an open reading frame for a polypeptide of Mr = 111,500, which terminates within the inverted repeat sequence of the transposon. The reading frame would be transcribed in the same direction as the mercury-resistance genes and the tnpR gene. The amino acid sequence predicted from this reading frame shows 32% identity with that of the transposase of the related transposon Tn3. The C-terminal regions of these two polypeptides show slightly greater homology than the N-terminal regions when conservative amino acid substitutions are considered. With this sequence determination, the nucleotide sequence of Tn501 is fully defined. The main features of the sequence are briefly presented.  相似文献   

12.
High sequence specificity of micrococcal nuclease.   总被引:58,自引:31,他引:27       下载免费PDF全文
The substrate specificity of micrococcal nuclease (EC 3.1.4.7.) has been studied. The enzyme recognises features of nucleotide composition, nucleotide sequence and tertiary structure of DNA. Kinetic analysis indicates that the rate of cleavage is 30 times greater at the 5' side of A or T than at G or C. Digestion of end-labelled linear DNA molecules of known sequence revealed that only a limited number of sites are cut, generating a highly specific pattern of fragments. The frequency of cleavage at each site has been determined and it may reflect the poor base overlap in the 5' T-A 3' stack as well as the length of contiguous A and T residues. The same sequence preferences are found when DNA is assembled into nucleosomes. Deoxyribonuclease 1 (EC 3.1.4.5.) recognises many of the same sequence features. Micrococcal nuclease also mimics nuclease S1 selectively cleaving an inverted repeat in supercoiled pBR322. The value of micrococcal nuclease as a "non-specific" enzymatic probe for studying nucleosome phasing is questioned.  相似文献   

13.
At the ends of bacteriophage λ DNA, the 5′-terminated strands are 12 nucleotides longer than the 3′-terminated strands. The complete sequence of deoxynucleotides in both the protruding 5′-terminated single strands of λ DNA has been determined by partial repair and by complete repair followed by sequencing of isolated oligonucleotides. Starting from the 5′-end of the left-hand cohesive end, the 12 nucleotides are in the sequence dpGpGpGpCpGpGpCpGpApCpCpT. The sequence from the right-hand cohesive end is exactly complementary to that from the left-hand end.  相似文献   

14.
Nucleotide sequence of a chicken delta-crystallin gene.   总被引:12,自引:2,他引:10       下载免费PDF全文
We have determined the complete nucleotide sequence of one of the two non-allelic delta-crystallin genes in the chicken, arbitrarily designated delta-gene 1, using a genomic clone (lambda g delta 106) containing the entire gene sequence. By comparison of the genomic sequence and the delta-crystallin cDNA sequence previously determined, we have identified exon sequences in the genomic sequence. Thus, the presence of 17 exons and 16 introns in the gene has been clarified. The delta-crystallin polypeptide deduced from the exon sequences consists of 465 amino acids which is larger, by 19 amino acid residues, than the polypeptide deduced from the cDNA sequence previously reported. Re-examination of the cDNA sequence using the same cDNA clone previously used shows that the present exon sequences are correct and the molecular weight of the deduced delta-crystallin polypeptide is 50,615 daltons instead of the previously reported value of 48,447 daltons. In addition, some structural features of the delta-crystallin gene including putative expression signals are discussed.  相似文献   

15.
The sequence specificity of a mammalian DNA methylase.   总被引:4,自引:4,他引:0       下载免费PDF全文
The sequence specificity of an extensively purified DNA methylase preparation from Krebs II mouse ascites cells has been examined. The enzyme appears to be highly sequence dependent. Moreover the sequence distribution of cytosine residues that are methylated, bears a very close resemblance to the sequence distribution of 5'-methyl cytosine found in vivo in a wide range of vertebrate cells and is consistent with methylation of cytosines in the sequence R-Yn-C-R.  相似文献   

16.
The base sequence of the cohesive ends of bacteriophage φ80 DNA has been shown to be identical to the base sequence of the cohesive ends of bacteriophage lambda DNA.  相似文献   

17.
Complete sequence of IS3.   总被引:35,自引:4,他引:31       下载免费PDF全文
  相似文献   

18.
Multiple sequence alignment by consensus.   总被引:5,自引:3,他引:2       下载免费PDF全文
An algorithm for multiple sequence alignment is given that matches words of length and degree of mismatch chosen by the user. The alignment maximizes an alignment scoring function. The method is based on a novel extension of our consensus sequence methods. The algorithm works for both DNA and protein sequences, and from earlier work on consensus sequences, it is possible to estimate statistical significance.  相似文献   

19.
During macronuclear development in the ciliated protozoan Tetrahymena thermophila, sequence reorganization including sequence loss occurs. Addressing questions about the organization and nucleotide sequence of micronucleus limited regions can lead to insights about mechanisms of DNA rearrangements during macronuclear development as well as mechanisms for the maintenance of the stability of micronucleus-limited sequence families. We have previously identified a moderately repetitive micronucleus-limited sequence family called X-H (family members hybridize to an approximately 450 bp Xbal-HindIII restriction fragment), completely absent from macronuclear DNA. The first member of this family which we isolated is associated with terminal sequences characteristic of a Tel-1 element, a putative micronuclear transposable element. Two additional family members have been isolated which are not closely associated with Tel-1 terminal sequences. We have nucleotide sequence data for three cloned members of the X-H family. This analysis has demonstrated that the longest cloned members of the X-H family share a region of homology of approximately 2,400 bp and are highly conserved, differing only by small insertions or deletions of 100 bp or less. The sequences from one of the sequenced family members flanking the region of homology are themselves mostly micronucleus-limited.  相似文献   

20.
DNA sequence classification is the activity of determining whether or not an unlabeled sequence S belongs to an existing class C. This paper proposes two new techniques for DNA sequence classification. The first technique works by comparing the unlabeled sequence S with a group of active motifs discovered from the elements of C and by distinction with elements outside of C. The second technique generates and matches gapped fingerprints of S with elements of C. Experimental results obtained by running these algorithms on long and well conserved Alu sequences demonstrate the good performance of the presented methods compared with FASTA. When applied to less conserved and relatively short functional sites such as splice-junctions, a variation of the second technique combining fingerprinting with consensus sequence analysis gives better results than the current classifiers employing text compression and machine learning algorithms.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号