首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Interpreting consensus sequences based on plurality rule.   总被引:1,自引:0,他引:1  
Our goal is to help researchers interpret the results of a function, based on the concept of plurality rule, that calculates a consensus of a profile of molecular bases. By expressing the plurality rule function as a composition of simpler functions, we obtain both an algorithm to calculate the consensus result and an upper bound on the number of nonequivalent results. Consequently, when used to analyze molecular sequences such as DNA or RNA, the plurality rule function yields at most 48 nonequivalent consensus results. For problems of reasonable size, we describe an algorithm to calculate the probability that each consensus result would occur if the bases were equally likely to appear at every position of the plurality rule function's input profile.  相似文献   

2.
Run charts are widely used in healthcare improvement, but there is little consensus on how to interpret them. The primary aim of this study was to evaluate and compare the diagnostic properties of different sets of run chart rules. A run chart is a line graph of a quality measure over time. The main purpose of the run chart is to detect process improvement or process degradation, which will turn up as non-random patterns in the distribution of data points around the median. Non-random variation may be identified by simple statistical tests including the presence of unusually long runs of data points on one side of the median or if the graph crosses the median unusually few times. However, there is no general agreement on what defines “unusually long” or “unusually few”. Other tests of questionable value are frequently used as well. Three sets of run chart rules (Anhoej, Perla, and Carey rules) have been published in peer reviewed healthcare journals, but these sets differ significantly in their sensitivity and specificity to non-random variation. In this study I investigate the diagnostic values expressed by likelihood ratios of three sets of run chart rules for detection of shifts in process performance using random data series. The study concludes that the Anhoej rules have good diagnostic properties and are superior to the Perla and the Carey rules.  相似文献   

3.

Background  

Dekapentagonal maps depict the phylogenetic relationships of five genomes in a visually appealing diagram and can be viewed as an alternative to a single evolutionary consensus tree. In particular, the generated maps focus attention on those gene families that significantly deviate from the consensus or plurality phylogeny. PentaPlot is a software tool that computes such dekapentagonal maps given an appropriate probability support matrix.  相似文献   

4.
Although molecular biologists often calculate consensus sequencesfrom aligned DNA or protein sequences, relatively little isknown about the properties of many of the consensus methodsbeing used. Consequently, we wrote a program, CONSENSUS, toanalyze and compare methods of calculating a consensus result(a base, an ambiguity code or a subset of codes) at a positionin an aligned set of molecular sequences. The program supportsalphabets of up to four symbols (e.g. (R, Y) or A, C, G, T).The program's output makes it suitable for exploratory dataanalysis or for selecting values of thresholds or confidencelevels in consensus methods having such parameters.  相似文献   

5.
The methods presented here summarize phylogenetic relationships of genomes in visually appealing and informative figures. Dekapentagonal maps depict phylogenetic information for orthologous genes present in five genomes, and provide a pre-screen for putatively horizontally transferred genes. If the majority of individual gene phylogenies are unresolved, bipartition histograms provide a means of uncovering and analyzing the plurality consensus. Analyses of genomes representing five photosynthetic bacterial phyla and of the prokaryotic contributions to the eukaryotic cell illustrate the utility of the methods.  相似文献   

6.
We describe a fast computer algorithm for identifying consensuspatterns in DNA sequences. The method requires no prior assumptionsabout the consensus pattern other than its length. In particularno previous knowledge of the frequency or spacing of consensuspatterns is required. However, a priori information about theshape of the consensus pattern, or invariability of individualpositions, or the overall conservation level, can be utilizedto enhance the selectivity and sensitivity of search. As thenumber of all possible consensus words increases very rapidlywith length, comprehensive searches have usually been restrictedto a maximum of 10–12 nucleotides, even when large mainframesare used. Our algorithm enables searching for consensus patternsof this order on current mid-range and powerful microcomputers.Searches may be conducted on single, long sequences or a setof possibly aligned shorter sequences. We give examples of identifiedconsensus patterns in both prokaryotic and eukaryotic DNA sequences,along with some typical program timings. Received on January 14, 1991; accepted on March 5, 1991  相似文献   

7.
Given the concern about biodiversity loss, there are a number of arguments used for biodiversity conservation ranging from those emphasising the intrinsic value of biodiversity to those on the direct use value of ecosystems. Yet arguing the case for biodiversity conservation effectively requires an understanding of why people value biodiversity. We used Q methodology to explore and understand how different conservation practitioners (social and natural science researchers, environmental non-Governmental organisations and decision-makers) in nine European countries argue for conservation. We found that there was a plurality of views about biodiversity and its conservation. A moral argument and some arguments around the intrinsic and ecological value of biodiversity were held by all stakeholder groups. They also shared the view that species valuation does not justify the destruction of nature. However, there were also some differences within and between the groups, which primarily reflected the espousal of either ecocentric or anthropocentric viewpoints. Our findings suggest that moral arguments and those around biodiversity’s intrinsic and ecological value could potentially serve as a starting point for building consensus among conservation practitioners.  相似文献   

8.
MOTIVATION: Given inputs extracted from an aligned column of DNA bases and the underlying Perkin Elmer Applied Biosystems (ABI) fluorescent traces, our goal is to train a neural network to determine correctly the consensus base for the column. Choosing an appropriate network input representation is critical to success in this task. We empirically compare five representations; one uses only base calls and the others include trace information. RESULTS: We attained the most accurate results from networks that incorporate trace information into their input representations. Based on estimates derived from using 10-fold cross-validation, the best network topology produces consensus accuracies ranging from 99.26% to >99.98% for coverages from two to six aligned sequences. With a coverage of six, it makes only three errors in 20 000 consensus calls. In contrast, the network that only uses base calls in its input representation has over double that error rate: eight errors in 20 000 consensus calls. CONTACT: allex@cs.wisc.edu  相似文献   

9.
The secondary structure of the retrovirus integration protein (IN) was predicted from seven inferred retrovirus IN sequences. The IN sequences were aligned by computer and the phylogenetic relationships between them were determined. The secondary structure of the aligned IN sequences was predicted by two consensus prediction methods. The predicted secondary structural patterns from the two consensus prediction schemes were compared with and superimposed on a composite structural profile of hydropathic/chain flexibility/amphipathic indexes with each index profile being calculated independently for the aligned IN sequences. The use of this composite structural profile not only enhanced the prediction accuracy but also helped in defining the surface loop regions which would be otherwise unpredictable by the use of consensus prediction methods alone. An amphipathic helix was identified by these united structural prediction-chain property profiles. Helical wheel analysis gave the amphipathic helix a coiled-coil like pattern which was similar to the leucine zipper discovered for some eukaryotic gene regulatory proteins. The proposed amphipathic helix may play an essential role in defining the biological properties of IN.  相似文献   

10.
11.
12.
We aligned 14 5'-leading sequences of small subunit ribulose-1,5-bisphosphate carboxylase (rbcS) genes. A strong consensus sequence ("CCTTATCAT") was located directly upstream of the TATA-box. The occurrence of this motif in other light dependent phytochrome regulated plant genes led to the calculation of two consensus matrices. With these two matrices we are able to distinguish almost all known light induced plant genes which are phytochrome regulated from non-light induced plant genes indicating, that all these genes share a common light-responsive element (LRE). The results obtained by computer analysis are discussed with regard to experimental data.  相似文献   

13.
A genetic map is an ordering of genetic markers calculated from a population of known lineage.While traditionally a map has been generated from a single population for each species, recently researchers have created maps from multiple populations. In the face of these new data, we address the need to find a consensus map--a map that combines the information from multiple partial and possibly inconsistent input maps. We model each input map as a partial order and formulate the consensus problem as finding a median partial order. Finding the median of multiple total orders (preferences or rankings)is a well studied problem in social choice. We choose to find the median using the weighted symmetric difference distance, a more general version of both the symmetric difference distance and the Kemeny distance. Finding a median order using this distance is NP-hard. We show that for our chosen weight assignment, a median order satisfies the positive responsiveness, extended Condorcet,and unanimity criteria. Our solution involves finding the maximum acyclic subgraph of a weighted directed graph.We present a method that dynamically switches between an exact branch and bound algorithm and a heuristic algorithm, and show that for real data from closely related organisms, an exact median can often be found.We present experimental results using seven populations of the crop plant Zea mays.  相似文献   

14.
RNA editing factors of the pentatricopeptide repeat (PPR) family show a very high degree of sequence specificity in the recognition of their target sites. A molecular basis for target recognition by editing factors has been proposed based on statistical correlations but has not been tested experimentally. To achieve this, we systematically mutated the pentatricopeptide motifs in the Arabidopsis thaliana RNA editing factor CLB19 to investigate their individual contribution to RNA recognition. We find that the motifs contributing significantly to the specificity of binding follow the previously proposed recognition rules, distinguishing primarily between purines and pyrimidines. Our results are consistent with proposals that each motif recognizes one nucleotide in the RNA target with the protein aligned parallel to the RNA and contiguous motifs aligned with contiguous nucleotides such that the final PPR motif aligns four nucleotides upstream of the edited cytidine. By altering S motifs in CLB19 and another editing factor, OTP82, and using the modified proteins to attempt to complement the respective mutants, we demonstrate that we can predictably alter the specificity of these factors in vivo.  相似文献   

15.
The sequences of the peptide binding domains of 33 70 kd heat shock proteins (hsp70) have been aligned and a consensus secondary structure has been deduced. Individual members showed no significant deviation from the consensus, which showed a beta 4 alpha motif repeated twice, followed by two further helices and a terminus rich in Pro and Gly. The repeated motif could be aligned with the secondary structure of the functionally equivalent peptide binding domain of human leucocyte antigen (HLA) class I maintaining equivalent residues in structurally important positions in the two families and a model was built based on this alignment. The interaction of this domain with the ATP domain is considered. The overall model is shown to be consistent with the properties of products of chymotryptic cleavage.  相似文献   

16.
C A Fields  D L Grady  R K Moyzis 《Genomics》1992,13(2):431-436
Fifteen examples of the transposon-like human element (THE) LTR and thirteen examples of the MstII interspersed repeat are aligned to generate new consensus sequences for these human repetitive elements. The consensus sequences of these elements are very similar, indicating that they compose subfamilies of a single human interspersed repetitive sequence family. Members of this highly polymorphic repeat family have been mapped to at least 11 chromosomes. Seven examples of the THE internal sequence are also aligned to generate a new consensus sequence for this element. Estimates of the abundance of this repetitive sequence family, derived from both hybridization analysis and frequency of occurrence in GenBank, indicate that THE-LTR/MstII sequences are present every 100-3000 kb in human DNA. The widespread occurrence of members of this family makes them useful landmarks, like Alu, L1, and (GT)n repeats, for physical and genetic mapping of human DNA.  相似文献   

17.
Dengue virus (DENV 1-4) represents the major emerging arthropod-borne viral infection in the world. Currently, there is neither an available vaccine nor a specific treatment. Hence, there is a need of antiviral drugs for these viral infections; we describe the prediction of short interfering RNA (siRNA) as potential therapeutic agents against the four DENV serotypes. Our strategy was to carry out a series of multiple alignments using ClustalX program to find conserved sequences among the four DENV serotype genomes to obtain a consensus sequence for siRNAs design. A highly conserved sequence among the four DENV serotypes, located in the encoding sequence for NS4B and NS5 proteins was found. A total of 2,893 complete DENV genomes were downloaded from the NCBI, and after a depuration procedure to identify identical sequences, 220 complete DENV genomes were left. They were edited to select the NS4B and NS5 sequences, which were aligned to obtain a consensus sequence. Three different servers were used for siRNA design, and the resulting siRNAs were aligned to identify the most prevalent sequences. Three siRNAs were chosen, one targeted the genome region that codifies for NS4B protein and the other two; the region for NS5 protein. Predicted secondary structure for DENV genomes was used to demonstrate that the siRNAs were able to target the viral genome forming double stranded structures, necessary to activate the RNA silencing machinery.  相似文献   

18.
Many evolutionary models assume that behaviors are caused directly by genes. An implication is that behavioral uniformity should be found only in groups that are genetically uniform. Yet, the members of human social groups often behave in a uniform fashion, despite the fact that they are genetically diverse. Behavioral uniformity can occur through a variety of psychological mechanisms and social processes, such as imitation, consensus decision making, or the imposition of social norms. We present a series of models in which genes code for social transmission rules, which in turn govern the behaviors that are adopted. Transmission rules can evolve in randomly formed groups that concentrate phenotypic variation at the between-group level, favoring the evolution of altruistic behaviors and other group-advantageous traits. In addition, a direct bias toward adopting altruistic behaviors can evolve. Our models begin to show how group selection can be a strong force in human evolution, despite the absence of extreme genetic variation among groups.  相似文献   

19.
Animal liberation ethics and environmental ethics have recently come of age. Concerning concrete moral rules considered by researchers in these areas there is much consensus. Highly general theories formulated to justify the rules are more problematic. However, the search for such theories may well be misguided.  相似文献   

20.
Sequence logos: a new way to display consensus sequences.   总被引:100,自引:22,他引:78       下载免费PDF全文
A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号