共查询到20条相似文献,搜索用时 9 毫秒
1.
TM or not TM: transmembrane protein prediction with low false positive rate using DAS-TMfilter 总被引:2,自引:0,他引:2
Web-based servers implementing the DAS-TMfilter algorithm have been launched at three mirror sites and their usage is described. The underlying computer program is an upgraded and modified version of the DAS-prediction method. The new server is (approximately 1 among 100 unrelated queries) while the high efficiency of the original algorithm locating TM segments in queries is preserved (sensitivity of approximately 95% among documented proteins with helical TM regions). AVAILABILITY: The server operates at three mirror sites: http://mendel.imp.univie.ac.at/sat/DAS/DAS.html, http://wooster.bip.bham.ac.uk/DAS.html and http://www.enzim.hu/DAS/DAS.html. The program is available on request. 相似文献
2.
The ability ofYarrowia lipolytica to produce ammonia from urea was found variable on some media. The colour change of the indicator in Christensen's urea agar was not due to the urease activity of this species but was a non-specific alkalization reaction. Rapid urea broth was reliable giving no false positive results. It was found thatY. lipolytica is a urease negative yeast species. 相似文献
3.
Transmembrane prediction methods are generally benchmarked on a set of proteins with experimentally verified topology. We have investigated if the accuracy measured on such datasets can be expected in an unbiased genomic analysis, or if there is a bias towards 'easily predictable' proteins in the benchmark datasets. As a measurement of accuracy, the concordance of the results from five different prediction methods was used (TMHMM, PHD, HMMTOP, MEMSAT, and TOPPRED). The benchmark dataset showed significantly higher levels (up to five times) of agreement between different methods than in 10 tested genomes. We have also analyzed which programs are most prone to make mispredictions by measuring the frequency of one-out-of-five disagreeing predictions. 相似文献
4.
Data preprocessing including proper normalization and adequate quality control before complex data mining is crucial for studies using the cDNA microarray technology. We have developed a simple procedure that integrates data filtering and normalization with quantitative quality control of microarray experiments. Previously we have shown that data variability in a microarray experiment can be very well captured by a quality score q(com) that is defined for every spot, and the ratio distribution depends on q(com). Utilizing this knowledge, our data-filtering scheme allows the investigator to decide on the filtering stringency according to desired data variability, and our normalization procedure corrects the q(com)-dependent dye biases in terms of both the location and the spread of the ratio distribution. In addition, we propose a statistical model for false positive rate determination based on the design and the quality of a microarray experiment. The model predicts that a lower limit of 0.5 for the replicate concordance rate is needed in order to be certain of true positives. Our work demonstrates the importance and advantages of having a quantitative quality control scheme for microarrays. 相似文献
5.
6.
L H Oliver R S Poulsen G T Toussaint 《The journal of histochemistry and cytochemistry》1977,25(7):696-701
The performance of a cell recognition system on unknown data is often estimated in terms of its error rates on a test set. This paper investigates methods for producing estimates of error rates in cervical cell classification. Classification performance curves calculated using these methods are given for several classification schemes used to classify 1500 cervical cells. 相似文献
7.
8.
Protein structural flexibility is important for catalysis, binding, and allostery. Flexibility has been predicted from amino acid sequence with a sliding window averaging technique and applied primarily to epitope search. New prediction parameters were derived from 92 refined protein structures in an unbiased selection of the Protein Data Bank by developing further the method of Karplus and Schulz (Naturwissenschaften 72:212–213, 1985). The accuracy of four flexibility prediction techniques was studied by comparing atomic temperature factors of known three-dimensional protein structures to predictions by using correlation coefficients. The size of the prediction window was optimized for each method. Predictions made with our new parameters, using an optimized window size of 9 residues in the prediction window, were giving the best results. The difference from another previously used technique was small, whereas two other methods were much poorer. Applicability of the predictions was also tested by searching for known epitopes from amino acid sequences. The best techniques predicted correctly 20 of 31 continuous epitopes in seven proteins. Flexibility parameters have previously been used for calculating protein average flexibility indices which are inversely correlated to protein stability. Indices with the new parameters showed better correlation to protein stability than those used previously; furthermore they had relationship even when the old parameters failed. © 1994 Wiley-Liss, Inc. 相似文献
9.
10.
We have explored the possibility that consensus predictions of membrane protein topology might provide a means to estimate the reliability of a predicted topology. Using five current topology prediction methods and a test set of 60 Escherichia coli inner membrane proteins with experimentally determined topologies, we find that prediction performance varies strongly with the number of methods that agree, and that the topology of nearly half of all E. coli inner membrane proteins can be predicted with high reliability (>90% correct predictions) by a simple majority-vote approach. 相似文献
11.
Grishin NV 《Cell》2012,149(7):1424-1425
A daring experiment is performed. Using sequence alignments to predict contacts between residues in protein spatial structures, Hopf et al. are publishing untested de novo structure models for 11 transmembrane protein families. Will their models stand the test of time and hold up to experimentation? The prospects are excellent. 相似文献
12.
13.
Estimation of extinction thresholds arising from Allee effects (Allee thresholds) and related probabilities of population extinction is notoriously difficult. One way is to analyze adequately parameterized population models. Traditionally, a point estimate is substituted for the Allee effect strength in such models. However, each point estimate entails an underlying uncertainty. We explore how accounting for this uncertainty affects the probability of population extinction, and show that this probability decreases sigmoidally with increasing population density, even in the absence of any stochasticity. Deviations from when only a point estimate of the Allee effect strength is used can be significant, unless stochasticity is added and the stochastic noise intensity is high. Significant deviations from when only a point estimate is used also occur when the Allee threshold and the environmental carrying capacity of the species are close enough one to another. We also show that the impact of the uncertainty in the Allee effect strength estimate increases as the Allee effect strength itself increases and decreases as the species recovery potential increases. This is not a good news, since we would like to preferentially and efficiently manage slowly recovering populations prone to strong Allee effects. Still, there is a way to come up with relatively good Allee threshold estimates. Besides an obvious option of collecting as many data as possible, the impact of the uncertainty can be mitigated by diversifying Allee effect experiments such that we put more emphasis on larger size groups. This is somewhat surprising, given that frequent complaints on the (im)possibility of detecting Allee effects concern difficulties in locating, observing and experimenting on rare populations. Our results extend current theory surrounding Allee effects and have broad ramifications for applied ecology. 相似文献
14.
A model has been developed that permits the prediction of mRNA nucleic acid sequence from the sequences of the translated proteins. The model relies on the information obtained from the comparison of protein sequences in related species to reduce the number of possible codons for those amino acids where mutations are observed. The predictions so obtained have been tested by applying the model to proteins whose mRNA sequences are known. The model's predictions have been found to be 100% accurate if three or more different amino acids are known at a given position and if the protein sequences are restricted to relatively closely related species (within the same class). The use of this model may permit a reduction of the mRNA sequence degeneracy and therefore be helpful in the synthesis of cDNA probes or for the prediction of restriction endonuclease sites. Computer programs have been developed to ease the use of the model. 相似文献
15.
The accurate distribution and recycling of transmembrane proteins amongst the membrane-bound organelles of the cell is vital to ensure its correct functioning. Transmembrane protein cargo destined for clathrin-mediated endocytosis and transport along the endocytic pathway is sorted into transport vesicles by interactions with adaptors, which simultaneously link clathrin to the membrane. Clathrin adaptors recognize a variety of signals present in the cytoplasmic portions of cargo proteins; recent structural, biophysical and cell biological studies have elucidated new types of cargo-adaptor interactions and probed the molecular mechanisms regulating cargo selection and vesicle maturation. Here, we review this recent progress in the context of our existing knowledge of endocytic sorting mechanisms. 相似文献
16.
Sommer I Rahnenführer J Domingues FS de Lichtenberg U Lengauer T 《Bioinformatics (Oxford, England)》2004,20(5):770-776
MOTIVATION: We introduce a new approach to using the information contained in sequence-to-function prediction data in order to recognize protein template classes, a critical step in predicting protein structure. The data on which our method is based comprise probabilities of functional categories; for given query sequences these probabilities are obtained by a neural net that has previously been trained on a variety of functionally important features. On a training set of sequences we assess the relevance of individual functional categories for identifying a given structural family. Using a combination of the most relevant categories, the likelihood of a query sequence to belong to a specific family can be estimated. RESULTS: The performance of the method is evaluated using cross-validation. For a fixed structural family and for every sequence, a score is calculated that measures the evidence for family membership. Even for structural families of small size, family members receive significantly higher scores. For some examples, we show that the relevant functional features identified by this method are biologically meaningful. The proposed approach can be used to improve existing sequence-to-structure prediction methods. AVAILABILITY: Matlab code is available on request from the authors. The data are available at http://www.mpisb.mpg.de/~sommer/Fun2Struc/ 相似文献
17.
Background
We introduce a computational protocol for effective predictions of the supramolecular organization of integral transmembrane proteins, starting from the monomer. Despite the demonstrated constitutive and functional importance of supramolecular assemblies of transmembrane subunits or proteins, effective tools for structure predictions of such assemblies are still lacking. Our computational approach consists in rigid-body docking samplings, starting from the docking of two identical copies of a given monomer. Each docking run is followed by membrane topology filtering and cluster analysis. Prediction of the native oligomer is therefore accomplished by a number of progressive growing steps, each made of one docking run, filtering and cluster analysis. With this approach, knowledge about the oligomerization status of the protein is required neither for improving sampling nor for the filtering step. Furthermore, there are no size-limitations in the systems under study, which are not limited to the transmembrane domains but include also the water-soluble portions. 相似文献18.
False positive reduction in protein-protein interaction predictions using gene ontology annotations 总被引:1,自引:0,他引:1
Background
Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. 相似文献19.
20.
植物跨膜蛋白研究进展 总被引:4,自引:0,他引:4
跨膜蛋白是一类结构独特,在植物细胞中广泛存在,并发挥重要生理功能的蛋白质。综述了植物跨膜蛋白的理化性质、蛋白质结构预测的方法,以及其生理功能的研究进展。随着对植物跨膜蛋白的深入研究,将有助于揭示跨膜蛋白对植物生长发育调控的重要分子机制。 相似文献
