首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
ANTHEPROT: a package for protein sequence analysis using a microcomputer   总被引:2,自引:0,他引:2  
A simple microcomputer package is described to make the theoreticalanalysis of protein sequences. Several methods designed to comparetwo sequences, to model proteolytic reactions and to predictthe secondary structure, the hydro-phobic/hydrophilic regionsand the potential antigenic sites of proteins have been includedin an Apple II microcomputer software. The package comprises21 programs as well as the secondary structure database of Kabschand Sander (1983). Received on November 24, 1987; accepted on March 8, 1988  相似文献   

2.
An algorithm for multiple sequence comparison was implementedin FORTRAN 77 for VAX/VMS in GCG-atible format. The MULTICOMPprogram package includes several procedures with which one querysequence can be compared simultaneously to several DNA, RNAor amino acid sequences. The same technique was also introducedfor comparing propensities of secondary structural features,which can be predicted on the basis of amino acid sequences.The technique has been applied to a wide range of sequence andstructural analyses.  相似文献   

3.
RnaViz, a program for the visualisation of RNA secondary structure.   总被引:13,自引:3,他引:10       下载免费PDF全文
RnaViz is a user-friendly, portable, windows-type program for producing publication-quality secondary structure drawings of RNA molecules. Drawings can be created starting from DCSE alignment files if they incorporate structure information or from mfold ct files. The layout of a structure can be changed easily. Display of special structural elements such as pseudo-knots or unformatted areas is possible. Sequences can be automatically numbered, and several other types of labels can be used to annotate particular bases or areas. Although the program does not try to produce an initially non-overlapping drawing, the layout of a properly positioned structure drawing can be applied to a newly created drawing using skeleton files. In this way a range of similar structures can be drawn with a minimum of effort. Skeletons for several types of RNA molecule are included with the program.  相似文献   

4.
Analysis of phylogenetically conserved secondary structure has been important in the development of models for the secondary structure of structural RNAs. In this paper, we apply this type of analysis to several families of informational RNAs to evaluate its usefulness in developing secondary structure models for mRNAs and mRNA precursors. We observed many conserved helices in all mRNA groups analyzed. Three criteria were used to identify potential helices which were not conserved solely because of coding sequence constraints, and may therefore be important for the structure and function of the RNA. These results suggest that this approach will be useful in deriving secondary structure models for informational RNAs when used in conjunction with other complementary techniques, and in designing experiments to determine the functional significance of conserved base pairing interactions.  相似文献   

5.
Summary: The conventional approach to calculating biomolecularstructures from nuclear magnetic resonance (NMR) data is oftenviewed as subjective due to its dependence on rules of thumbfor deriving geometric constraints and suitable values for theoryparameters from noisy experimental data. As a result, it canbe difficult to judge the precision of an NMR structure in anobjective manner. The inferential Structure determination (ISD)framework, which has been introduced recently, addresses thisproblem by using Bayesian inference to derive a probabilitydistribution that represents both the unknown structure andits uncertainty. It also determines additional unknowns, suchas theory parameters, that normally need to be chosen empirically.Here we give an overview of the ISD software package, whichimplements this methodology. Availability: http://www.bioc.cam.ac.uk/isd Contact: wolfgang.rieping{at}bioc.cam.ac.uk, michael.habeck{at}tuebingen.mpg.de Associate Editor: Alfonso Valencia  相似文献   

6.
7.
BACKGROUND: HDX mass spectrometry is a powerful platform to probe protein structure dynamics during ligand binding, protein folding, enzyme catalysis, and such. HDX mass spectrometry analysis derives the protein structure dynamics based on the mass increase of a protein of which the backbone protons exchanged with solvent deuterium. Coupled with enzyme digestion and MS/MS analysis, HDX mass spectrometry can be used to study the regional dynamics of protein based on the m/z value or percentage of deuterium incorporation for the digested peptides in the HDX experiments. Various software packages have been developed to analyze HDX mass spectrometry data. Despite the progresses, proper and explicit statistical treatment is still lacking in most of the current HDX mass spectrometry software. In order to address this issue, we have developed the HDXanalyzer for the statistical analysis of HDX mass spectrometry data using R, Python, and RPY2. IMPLEMENTATION AND RESULTS: HDXanalyzer package contains three major modules, the data processing module, the statistical analysis module, and the user interface. RPY2 is employed to enable the connection of these three components, where the data processing module is implemented using Python and the statistical analysis module is implemented with R. RPY2 creates a low-level interface for R and allows the effective integration of statistical module for data processing. The data processing module generates the centroid for the peptides in form of m/z value, and the differences of centroids between the peptides derived from apo and ligand-bound protein allow us to evaluate whether the regions have significant changes in structure dynamics or not. Another option of the software is to calculate the deuterium incorporation rate for the comparison. The two types of statistical analyses are Paired Student's t-test and the linear combination of the intercept for multiple regression and ANCOVA model. The user interface is implemented with wxpython to facilitate the data visualization in graphs and the statistical analysis output presentation. In order to evaluate the software, a previously published xylanase HDX mass spectrometry analysis dataset is processed and presented. The results from the different statistical analysis methods are compared and shown to be similar. The statistical analysis results are overlaid with the three dimensional structure of the protein to highlight the regional structure dynamics changes in the xylanase enzyme. CONCLUSION: Statistical analysis provides crucial evaluation of whether a protein region is significantly protected or unprotected during the HDX mass spectrometry studies. Although there are several other available software programs to process HDX experimental data, HDXanalyzer is the first software program to offer multiple statistical methods to evaluate the changes in protein structure dynamics based on HDX mass spectrometry analysis. Moreover, the statistical analysis can be carried out for both m/z value and deuterium incorporation rate. In addition, the software package can be used for the data generated from a wide range of mass spectrometry instruments.  相似文献   

8.
Lee W  Chen SL 《BioTechniques》2002,33(6):1334-1341
Genome-tools is a Perl module, a set of programs, and a user interface that facilitates access to genome sequence information. The package is flexible, extensible, and designed to be accessible and useful to both nonprogrammers and programmers. Any relatively well-annotated genome available with standard GenBank genome files may be used with genome-tools. A simple Web-based front end permits searching any available genome with an intuitive interface. Flexible design choices also make it simple to handle revised versions of genome annotation files as they change. In addition, programmers can develop cross-genomic tools and analyses with minimal additional overhead by combining genome-tools modules with newly written modules. Genome-tools runs on any computer platform for which Perl is available, including Unix, Microsoft Windows, and Mac OS. By simplifying the access to large amounts of genomic data, genome-tools may be especially useful for molecular biologists looking at newly sequenced genomes, for which few informatics tools are available. The genome-tools Web interface is accessible at http://genome-tools.sourceforge.net, and the source code is available at http://sourceforge.net/projects/genome-tools.  相似文献   

9.
RNAstructure: software for RNA secondary structure prediction and analysis   总被引:1,自引:0,他引:1  

Background  

To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence.  相似文献   

10.
Isoprenyl diphosphate synthases are ubiquitous enzymes that catalyze the basic chain-elongation reaction in the isoprene biosynthetic pathway. Pairwise sequence comparisons were made for 6 farnesyl diphosphate synthases, 6 geranylgeranyl diphosphate synthases, and a hexaprenyl diphosphate synthase. Five regions with highly conserved residues, two of which contain aspartate-rich DDXX(XX)D motifs found in many prenyltransferases, were identified. A consensus secondary structure for the group, consisting mostly of alpha-helices, was predicted for the multiply aligned sequences from amino acid compositions, computer assignments of local structure, and hydropathy indices. Progressive sequence alignments suggest that the 13 isoprenyl diphosphate synthases evolved from a common ancestor into 3 distinct clusters. The most distant separation is between yeast hexaprenyl diphosphate synthetase and the other enzymes. Except for the chromoplastic geranylgeranyl diphosphate synthase from Capsicum annuum, the remaining farnesyl and geranylgeranyl diphosphate synthases segregate into prokaryotic/archaebacterial and eukaryotic families.  相似文献   

11.
Zhang CT  Zhang R 《Biopolymers》2000,53(7):539-549
A secondary structure sequence is a symbolic string composed of three kinds of letters, indicating the helix, strand, and coil (including turns), respectively. A graphic representation for this abstract symbolic sequence is proposed here, called the S curve. The S curve is the unique representation for a given secondary structure sequence in the sense that the sequence and the S curve can be uniquely determined from the other. Therefore, the S curve contains all the information that the secondary structure sequence contains. Different geometrical properties of the S curve are studied in details, which reflect the basic characteristics of the secondary structure sequences. The S curves are used to display, analyze, and compare the secondary structure sequences. Detailed application examples are presented. One advantage of the S curve methodology is that the main patterns of a given secondary structure sequence can be grasped quickly in a perceivable form. This is particularly useful in the cases in which longer sequences are involved and structures of proteins are unknown.  相似文献   

12.
Predict7, a program for protein structure prediction   总被引:4,自引:0,他引:4  
We describe a program for protein sequence analysis which runs in IBM PC computers. Protein sequences are loaded from files in Mount-Conrad and Lipman-Pearson format. Seven features are analyzed: hydrophilicity, hydropathy, surface probability, side chain flexibility, antigenicity, secondary structure and N-glycosylation sites. Numeric results can be shown, printed or stored in files exportable to other programs. Graphics of up to four predictions can be displayed on the screen, printed out or plotted, with several definable options. This program has been designed to be fast, user-friendly and to be shared with the scientific community.  相似文献   

13.
ADSP-a new package for computational sequence analysis   总被引:3,自引:0,他引:3  
A new protein sequence analysis package, ADSP, is described,of which the SOMAP Screen–Oriented Multiple AlignmentProcedure forms an integral part. ADSP (Algorithms and DataStructures for Protein sequence analysis) incorporates facilitiesto generate potent pattern-recognition discriminators and offersfour algorithms with which to scan any NBRF format sequencedatabase: the package has been designed, in particular, to interfacewith the OWL composite sequence database, one of the largest,distributed non-redundant sources of sequence data of its kind.The system incorporates a powerful method for compound featureanalysis, which provides the basis for characterizing and predictingthe occurrence of complete protein superfamilies and for pinpointingthe emergence of related subfamilies. Used iteratively, theapproach allows diagnostic performance to be rigorously refinedand its efficacy to be assessed both qualitatively and quantitatively,and results in the generation of refined structural or functionalfeatures suitable for entry into a database: this compilationof characteristic signatures is distinct from, but complementaryto, widely used compendia of pattern templates such as PROSUE.  相似文献   

14.
Portable microcomputer software for nucleotide sequence analysis.   总被引:17,自引:10,他引:17       下载免费PDF全文
B Fristensky  J Lis    R Wu 《Nucleic acids research》1982,10(20):6451-6463
The most common types of nucleotide sequence data analyses and handling can be done more conveniently and inexpensively on microcomputers than on large time-sharing systems. We present a package of computer programs for the analysis of DNA and RNA sequence data which overcomes many of the limitations imposed by microcomputers, while offering most of the features of programs commonly available on large computers, including sequence numbering and translation, restriction site and homology searches with dot-matrix plots, nucleotide distribution analysis, and graphic display of data. Most of the programs were written in Standard Pascal (on an Apple II computer) to facilitate portability to other micro-, mini-, and and mainframe computers.  相似文献   

15.
Spectroscopic methods for analysis of protein secondary structure   总被引:2,自引:0,他引:2  
Several methods for determination of the secondary structure of proteins by spectroscopic measurements are reviewed. Circular dichroism (CD) spectroscopy provides rapid determinations of protein secondary structure with dilute solutions and a way to rapidly assess conformational changes resulting from addition of ligands. Both CD and Raman spectroscopies are particularly useful for measurements over a range of temperatures. Infrared (IR) and Raman spectroscopy require only small volumes of protein solution. The frequencies of amide bands are analyzed to determine the distribution of secondary structures in proteins. NMR chemical shifts may also be used to determine the positions of secondary structure within the primary sequence of a protein. However, the chemical shifts must first be assigned to particular residues, making the technique considerably slower than the optical methods. These data, together with sophisticated molecular modeling techniques, allow for refinement of protein structural models as well as rapid assessment of conformational changes resulting from ligand binding or macromolecular interactions. A selected number of examples are given to illustrate the power of the techniques in applications of biological interest.  相似文献   

16.
With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet.  相似文献   

17.
The conformational parametersP k for each amino acid species (j=1–20) of sequential peptides in proteins are presented as the product ofP i,k , wherei is the number of the sequential residues in thekth conformational state (k=-helix,-sheet,-turn, or unordered structure). Since the average parameter for ann-residue segment is related to the average probability of finding the segment in the kth state, it becomes a geometric mean of (P k )av=(P i,k ) 1/n with amino acid residuei increasing from 1 ton. We then used ln(Pk)av to convert a multiplicative process to a summation, i.e., ln(P k ) av =(1/n)P i,k (i=1 ton) for ease of operation. However, this is unlike the popular Chou-Fasman algorithm, which has the flaw of using the arithmetic mean for relative probabilities. The Chou-Fasman algorithm happens to be close to our calculations in many cases mainly because the difference between theirP k and our InP k is nearly constant for about one-half of the 20 amino acids. When stronger conformation formers and breakers exist, the difference become larger and the prediction at the N- and C-terminal-helix or-sheet could differ. If the average conformational parameters of the overlapping segments of any two states are too close for a unique solution, our calculations could lead to a different prediction.  相似文献   

18.
The staden sequence analysis package   总被引:31,自引:0,他引:31  
I describe the current version of the sequence analysis package developed at the MRC Laboratory of Molecular Biology, which has come to be known as the “Staden Package.” The package covers most of the standard sequence analysis tasks such as restriction site searching, translation, pattern searching, comparison, gene finding, and secondary structure prediction, and provides powerful tools for DNA sequence determination. Currently the programs are only available for computers running the UNIX operating system. Detailed information about the package is available from our WWW site: http://www.mrc-lmb.cam.ac.uk/pubseq/.  相似文献   

19.
Programs for the calculation, storage and analysis of restriction maps derived from the analysis of partial digestion products from end labelled DNA (1,2,3) and their correlation with digestion - and hybridisation patterns in total digestions and Southern blot experiments are described. These programs allow direct input of gel patterns from partial or complete digestion experiments using a digitizer tablet, calculation of molecular weights and restriction maps, plotting of maps and actual or predicted fragment patterns and automated identification of overlapping cosmids from partial restriction mapping results. Programs are written in PASCAL and have been implemented on a VAX/VMS system, with a HP-7221T plotter and a digitizing tablet.  相似文献   

20.

Background  

The automated annotation of biological sequences (protein, DNA) relies on the computation of hits (predicted features) on the sequences using various algorithms. Public databases of biological sequences provide a wealth of biological "knowledge", for example manually validated annotations (features) that are located on the sequences, but mining the sequence annotations and especially the predicted and curated features requires dedicated tools. Due to the heterogeneity and diversity of the biological information, it is difficult to handle redundancy, frequent updates, taxonomic information and "private" data together with computational algorithms in a common workflow.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号