首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The primary structure of a nuclear-encoded subunit of the respiratory chain NADH:ubiquinone reductase (complex I) from Neurospora crassa was determined by sequencing cDNA, genomic DNA and the N-terminus of the protein. The sequence correlates to a protein of 200 amino acids and a molecular mass of 21349 Da. The protein is synthesized without a cleavable presequence. It contains two alpha-helices predicted to traverse the bilayer and is a constituent of the membrane part of complex I.  相似文献   

2.
Hernandez P  Gras R  Frey J  Appel RD 《Proteomics》2003,3(6):870-878
In recent years, proteomics research has gained importance due to increasingly powerful techniques in protein purification, mass spectrometry and identification, and due to the development of extensive protein and DNA databases from various organisms. Nevertheless, current identification methods from spectrometric data have difficulties in handling modifications or mutations in the source peptide. Moreover, they have low performance when run on large databases (such as genomic databases), or with low quality data, for example due to bad calibration or low fragmentation of the source peptide. We present a new algorithm dedicated to automated protein identification from tandem mass spectrometry (MS/MS) data by searching a peptide sequence database. Our identification approach shows promising properties for solving the specific difficulties enumerated above. It consists of matching theoretical peptide sequences issued from a database with a structured representation of the source MS/MS spectrum. The representation is similar to the spectrum graphs commonly used by de novo sequencing software. The identification process involves the parsing of the graph in order to emphasize relevant sections for each theoretical sequence, and leads to a list of peptides ranked by a correlation score. The parsing of the graph, which can be a highly combinatorial task, is performed by a bio-inspired algorithm called Ant Colony Optimization algorithm.  相似文献   

3.
Exploring the proteome of Plasmodium   总被引:2,自引:0,他引:2  
With the entire genomic sequence of several species of Plasmodium soon to be available, researchers are now focusing on methods to study gene and protein expression at the whole organism level. Traditional methods of characterising and identifying large numbers of proteins from a complex protein mixture have relied predominantly on two-dimensional gel electrophoresis combined with N-terminal sequencing or mass spectrometry of individually prepared proteins. New proteomics methods are now available that are based on resolving small peptides derived from complex protein mixtures by high-resolution liquid chromatography and directly identifying them by tandem mass spectrometry (LC/LC/MS/MS) and sophisticated computer search algorithms against whole genome sequence databases. These newer proteomic methods have the potential to accelerate the reproducible identification of large numbers of proteins from various life cycle stages of Plasmodium and may help to better understand parasite biology and lead to the identification of new targets of vaccines and drugs.  相似文献   

4.
The primary structure of the iron-sulfur subunit of ubiquinol-cytochrome c reductase from Neurospora mitochondria was determined by cDNA and genomic DNA sequencing. A first cDNA was identified from a cDNA bank cloned in Escherichia coli by hybridization selection of mRNA, cell-free protein synthesis and immunoadsorption. Further cDNA and geonomic DNA were identified by colony filter hybridization. The N-terminal sequence of the mature protein was determined by automated Edman degradation. From the sequence a molecular mass of 24749 Da results for the precursor protein and of 21556 Da for the mature protein. The presequence consists of 32 amino acids with four arginines as the only charged residues. The mature protein consists of 199 amino acids. It is characterized by a small N-terminal hydrophilic part of 29 residues, a hydrophobic stretch of 25 residues and a large C-terminal hydrophilic domain of 145 residues. The only four cysteines of the protein, which are assumed to bind the 2 Fe-2S cluster, are located in a moderate hydrophobic region of this large domain. Cysteines 3 and 4 are unusually arranged in that they are separated by only one proline. From sequence data the arrangement of the subunit in the membrane is deduced.  相似文献   

5.
6.
Delahunty CM  Yates JR 《BioTechniques》2007,43(5):563, 565, 567 passim
Large-scale biology emerged out of the efforts to sequence genomes of important organisms. Based on resources created by whole genome sequencing, large-scale analyses of messenger RNA (mRNA) and protein expression are now possible. With the availability of large amounts of genomic sequence information, a convenient method for the identification and analysis of proteins based on proteolytic digestion into peptides emerged. Processes to fragment peptides using collision-activated dissociation (CAD) in tandem mass spectrometers and computer algorithms to match the tandem mass spectra of peptides to sequences in databases enable rapid identification of amino acid sequences, and hence proteins, present in mixtures. The inherent complexity of the peptide mixtures has necessitated improvements in methodology for mass spectrometry (MS) analysis of peptides.  相似文献   

7.
Second generation sequencing has been widely used to sequence whole genomes. Though various paired-end sequencing methods have been developed to construct the long scaffold from contigs derived from shotgun sequencing, the classical paired-end sequencing of the Bacteria Artificial Chromosome (BAC) or fosmid libraries by the Sanger method still plays an important role in genome assembly. However, sequencing libraries with the Sanger method is expensive and time-consuming. Here we report a new strategy to sequence the paired-ends of genomic libraries with parallel pyrosequencing, using a Chinese amphioxus (Branchiostoma belcheri) BAC library as an example. In total, approximately 12,670 non-redundant paired-end sequences were generated. Mapping them to the primary scaffolds of Chinese amphioxus, we obtained 413 ultra-scaffolds from 1,182 primary scaffolds, and the N50 scaffold length was increased approximately 55 kb, which is about a 10% improvement. We provide a universal and cost-effective method for sequencing the ultra-long paired-ends of genomic libraries. This method can be very easily implemented in other second generation sequencing platforms.  相似文献   

8.
The complete primary structure of the minor beta-lactoglobulin II component from donkey milk is presented. It has been established by amino-acid sequencing and mass-spectrometry analysis of intact protein and peptides obtained after enzymatic and chemical cleavages. The molecular mass and the pI of the protein are calculated to be 18,261 Da and 4.5 respectively. Despite the close structural similarity of the donkey and horse major beta-lactoglobulin I components, their minor beta-lactoglobulin II components show substantial differences in sequence. Most observed exchanges are clustered at residues 78-106 where only 6 amino-acid residues are conserved. The primary structure of donkey beta-lactoglobulin II reveals some unusual features of minor beta-lactoglobulins II and gives new light to the evolution of beta-lactoglobulins and other lipocalins involved in retinol binding or reproductive functions.  相似文献   

9.
Highly active cytochrome b(6)f complexes from spinach and the cyanobacterium Mastigocladus laminosus have been analyzed by liquid chromatography with electrospray ionization mass spectrometry (LCMS+). Both size-exclusion and reverse-phase separations were used to separate protein subunits allowing measurement of their molecular masses to an accuracy exceeding 0.01% (+/-3 Da at 30,000 Da). The products of petA, petB, petC, petD, petG, petL, petM, and petN were detected in complexes from both spinach and M. laminosus, while the spinach complex also contained ferredoxin-NADP(+) oxidoreductase (Zhang, H., Whitelegge, J. P., and Cramer, W. A. (2001) Flavonucleotide:ferredoxin reductase is a subunit of the plant cytochrome b(6)f complex. J. Biol. Chem. 276, 38159-38165). While the measured masses of PetC and PetD (18935.8 and 17311.8 Da, respectively) from spinach are consistent with the published primary structure, the measured masses of cytochrome f (31934.7 Da, PetA) and cytochrome b (24886.9 Da, PetB) modestly deviate from values calculated based upon genomic sequence and known post-translational modifications. The low molecular weight protein subunits have been sequenced using tandem mass spectrometry (MSMS) without prior cleavage. Sequences derived from the MSMS spectra of these intact membrane proteins in the range of 3.2-4.2 kDa were compared with translations of genomic DNA sequence where available. Products of the spinach chloroplast genome, PetG, PetL, and PetN, all retained their initiating formylmethionine, while the nuclear encoded PetM was cleaved after import from the cytoplasm. While the sequences of PetG and PetN revealed no discrepancy with translations of the spinach chloroplast genome, Phe was detected at position 2 of PetL. The spinach chloroplast genome reports a codon for Ser at position 2 implying the presence of a DNA sequencing error or a previously undiscovered RNA editing event. Clearly, complete annotation of genomic data requires detailed expression measurements of primary structure by mass spectrometry. Full subunit coverage of an oligomeric intrinsic membrane protein complex by LCMS+ presents a new facet to intact mass proteomics.  相似文献   

10.
Due to the limited applicability of conventional protein identification methods to the proteomes of organisms with unsequenced genomes, researchers have developed approaches to identify proteins using mass spectrometry and sequence similarity database searches. Both the integration of mass spectrometry with bioinformatics and genomic sequencing drive the expanding organismal scope of proteomics.  相似文献   

11.
Mitochondrial ribosomal proteins (MRPs) are required for the translation of all 13 mitochondrial encoded genes in humans. It has been speculated that mutations and polymorphisms in the human MRPs may be a primary cause of some oxidative phosphorylation disorders or modulate the severity and tissue specificity of pathogenic mitochondrial DNA mutations. Although the sequences of most of the yeast MRPs are known, only very few mammalian and nearly no human MRPs have been completely characterized. MRPs differ greatly in sequence, and sometimes biochemical properties, between different species, not allowing easy recognition by sequence homology. Therefore, the Mammalian Mitochondrial Ribosomal Consortium is using a direct approach of purifying individual mammalian (bovine) MRPs, determining their N-terminal and/or internal peptide sequences using different protein sequencing techniques, and using the resulting sequence information for screening expressed sequence tags and genomic data bases to determine human, mouse, and rat homologues of the bovine proteins. Two proteins of the large and three proteins of the small ribosomal subunit have been analyzed in this manner. Three of them represent "new," i.e. formerly unknown mammalian mitochondrial ribosomal protein classes. Only one of these three different MRPs shows significant sequence similarities to known ribosomal proteins. In one case, the corresponding human genomic DNA sequences were found in the data bases, and the exon/intron structure was determined.  相似文献   

12.
Precise manipulations with genetic material, typical for modern experiments in molecular biology and in new biotechnology, require a capability to determine DNA base sequence. This capability enables today to exploit specific genetic knowledge for the dissection of complex cell processes and for modulation of cell metabolism in transgenic organisms. The review focuses on such DNA sequencing technologies that are widespread in general laboratory practice. They can safely be called, with the availability of commercial reagents, industrial techniques. Modern DNA sequencing requires recurrent breakdown of large genomic DNA into smaller pieces, that are then amplified, sequenced and the initial long stretch reconstructed via overlap of small pieces. The DNA sequencing process has several steps: a DNA fragment is obtained in sufficient quantity and purity, it is converted to a form suitable for a particular sequencing method, a sequencing reaction is performed and its products fractionated; and finally the resultant data are interpreted (i.e. an autoradiograph is read into a computer memory) and a long sequence in reconstructed via overlap of short stretches. These steps are considered in separate parts; an accent is made on sequencing strategies with respect to their biological task. In the last part, possibilities for automation of sequencing experiment are considered, followed by a discussion of domestic problems in DNA sequencing.  相似文献   

13.
《TARGETS》2003,2(6):245-252
Comparative genomic sequencing and analysis offers new wealth of information for target selection and the development of therapeutics. This article focuses on the following two key innovations in mapping and sequencing: first, shotgun sequencing of clone pools to combine the benefits of whole-genome shotgun and clone-by-clone strategies, and second, the leveraging of newly available assembled genomic sequences to improve the effectiveness of new sequencing projects through comparative mapping and comparative sequence assembly. The following specific sequencing and mapping methods are discussed in detail: clone-array pooled shotgun sequencing (CAPSS); transversal shotgun pooling designs; clone-array pooled shotgun mapping (CAPS-MAP); pooled genomic indexing (PGI); short-tag pooled genomic indexing (ST-PGI); and comparative sequence assembly (the CSA™ method). The methods can be implemented with only modest modifications of current large-scale sequencing pipelines and are highly synergistic with the next generation of sequencing technologies.  相似文献   

14.
The nucleotide sequence containing the complete structural information for a glucan branching enzyme was isolated from a Neisseria denitrificans genomic library. The gene was expressed in Escherichia coli and the active recombinant protein was purified. The deduced protein of 762 amino acids with a calculated molecular weight of 86313 Da shows similarity to the primary protein sequences of other known glucan branching enzymes. Amino acid sequencing of the isolated protein by Edman degradation confirmed the deduced start codon of the structural gene of the glucan branching enzyme. The purified glucan branching enzyme has a stimulating effect on the Neisseria amylosucrase activity.  相似文献   

15.
16.
De novo peptide sequencing by mass spectrometry (MS) can determine the amino acid sequence of an unknown peptide without reference to a protein database. MS-based de novo sequencing assumes special importance in focused studies of families of biologically active peptides and proteins, such as hormones, toxins, and antibodies, for which amino acid sequences may be difficult to obtain through genomic methods. These protein families often exhibit sequence homology or characteristic amino acid content; yet, current de novo sequencing approaches do not take advantage of this prior knowledge and, hence, search an unnecessarily large space of possible sequences. Here, we describe an algorithm for de novo sequencing that incorporates sequence constraints into the core graph algorithm and thereby reduces the search space by many orders of magnitude. We demonstrate our algorithm in a study of cysteine-rich toxins from two cone snail species (Conus textile and Conus stercusmuscarum) and report 13 de novo and about 60 total toxins.  相似文献   

17.
The accelerated rate of genomic sequencing has led to an abundance of completely sequenced genomes. Annotation of the open reading frames (ORFs) (i.e., gene prediction) in these genomes is an important task and is most often performed computationally based on features in the nucleic acid sequence. Using recent advances in proteomics, we set out to predict the set of ORFs for an organism based principally on expressed protein-based evidence. Using a novel search strategy, we mapped peptides detected in a whole-cell lysate of Mycoplasma pneumoniae onto a genomic scaffold and extended these "hits" into ORFs bound by traditional genetic signals to generate a "proteogenomic map". We were able to generate an ORF model for M. pneumoniae strain FH using proteomic data with a high correlation to models based on sequence features. Ultimately, we detected over 81% of the genomically predicted ORFs in M. pneumoniae strain M129 (the originally sequenced strain). We were also able to detect several new ORFs not originally predicted by genomic methods, various N-terminal extensions, and some evidence that would suggest that certain predicted ORFs are bogus. Some of these differences may be a result of the strain analyzed but demonstrate the robustness of protein analysis across closely related genomes. This technique is a cost-effective means to add value to genome annotation, and a prerequisite for proteome quantitation and in vivo interaction measures.  相似文献   

18.
The primary structure of recombinant hepatitis B surface antigen protein produced in yeast has been confirmed by mass spectrometric peptide mapping. These studies corroborate more than 85% of the amino acid sequence derived by sequencing of the gene and identified the presence of an acetyl moiety on approximately 70% of the NH2-terminal methionine residues. Prior to the present work, direct structural analysis was largely prevented by the insolubility of this integral membrane protein and its primary degradation fragments in aqueous buffers and by partial blockage of the NH2 terminus. These difficulties were overcome by preparative isolation using electroelution of the monomeric 226 amino acid protein from a polyacrylamide electrophoretic gel in the presence of sodium dodecyl sulfate. Chymotryptic digestion of the reduced and carboxymethylated monomer produced a large number of small, predominantly hydrophobic peptides ideally suited for peptide mapping by fast atom bombardment mass spectrometry. The percentage of NH2-terminal methionine blocked by acetyl was determined by a new strategy involving cyanogen bromide cleavage, permethylation, and gas chromatography/mass spectrometry identification and quantitation of the N-methyl-N-acetylhomoserine produced.  相似文献   

19.
The amino acid sequence of the iron-sulfur-flavoprotein, trimethylamine dehydrogenase, isolated from the bacterium W3A1 has been deduced from the x-ray diffraction pattern obtained at 2.4-A resolution. This sequence has been compared to portions of the primary sequence derived by gas-phase sequencing of isolated peptides obtained from cyanogen bromide and endoprotease Arg-C and Asp-N digestions of the purified enzyme. A consensus sequence has resulted and is comprised of 729 amino acids with Ala at both NH2- and COOH-terminal positions. The consensus sequence contains 13 cysteine residues. Approximately 80% of the sequence has been confirmed by direct sequencing with approximately 81% agreement with the x-ray deduced sequence. The calculated subunit molecular mass of the apoenzyme is 78,899 Da, in good agreement with published values of approximately 83,000. The anomalous scattering map from the native protein has also been shown to provide accurate information about the positions of most of the weak anomalous scattering centers such as sulfur or phosphorus atoms and to complement x-ray or chemical sequencing methods.  相似文献   

20.
This paper describes the isolation and primary structure analysis of a new phospholipase A2 with platelet-aggregation-inhibiting activity from the venom of Bothrops jararaca. The protein, named BJ-PLA2, was isolated by means of ammonium sulfate precipitation and anion-exchange and reversed-phase chromatographies and behaved as a homogeneous single-chain protein on SDS-PAGE. Its amino acid sequence was determined by N-terminal sequencing and analysis of overlapped chemical and proteolytic fragments by automated Edman degradation and mass spectometry determination. BJ-PLA2 consists of 124 amino acid residues and has the structural features of snake venom class II phospholipases A2. Chemical modification with p-bromophenacylbromide caused complete loss of enzymatic activity and partially affected the platelet-aggregation-inhibiting activity of BJ-PLA2.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号