首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
MOTIVATION: The availability of increasing amounts of sequence data about completely sequenced genomes spurs the development of new methods in the fields of automated annotation, and of comparative genomics. Tools allowing the visualization of results produced by analysis methods, superimposed on possibly annotated sequence data, and enabling synchronized navigation in multiple genomes, provide new means for interactive genome exploration. This kind of visual inspection can be used as a basis to assess the quality of new analysis algorithms, or to discover genome portions to be subjected to in-depth studies. RESULTS: We propose a software package, MuGeN, built for navigating through multiple annotated genomes. It is capable of retrieving annotated sequences in several formats, stored in local files, or available in databases over the network. From these, it then generates an interactive display, or an image file, in most common formats suitable for printing, further editing or integrating in Web pages. Genome maps may be mixed with computer analysis results loaded from XML files, whose format is generic enough to be adapted to a majority of sequence oriented analysis methods. AVAILABILITY: MuGeN is available at http://www-mig.jouy.inra.fr/bdsi/MuGeN.  相似文献   

3.
The analysis of genetic diversity within species is vital for understanding evolutionary processes at the population level and at the genomic level. A large quantity of data can now be produced at an unprecedented rate, requiring the use of dedicated computer programs to extract all embedded information. Several statistical packages have been recently developed, which offer a panel of standard and more sophisticated analyses. We describe here the functionalities, special features and assumptions of more than 20 such programs, indicate how they can interoperate, and discuss new directions that could lead to improved software and analyses.  相似文献   

4.
We present the ggtreeExtra package for visualizing heterogeneous data with a phylogenetic tree in a circular or rectangular layout (https://www.bioconductor.org/packages/ggtreeExtra). The package supports more data types and visualization methods than other tools. It supports using the grammar of graphics syntax to present data on a tree with richly annotated layers and allows evolutionary statistics inferred by commonly used software to be integrated and visualized with external data. GgtreeExtra is a universal tool for tree data visualization. It extends the applications of the phylogenetic tree in different disciplines by making more domain-specific data to be available to visualize and interpret in the evolutionary context.  相似文献   

5.
Although many of the statistical techniques used in comparative biology were originally developed in quantitative genetics, subsequent development of comparative techniques has progressed in relative isolation. Consequently, many of the new and planned developments in comparative analysis already have well‐tested solutions in quantitative genetics. In this paper, we take three recent publications that develop phylogenetic meta‐analysis, either implicitly or explicitly, and show how they can be considered as quantitative genetic models. We highlight some of the difficulties with the proposed solutions, and demonstrate that standard quantitative genetic theory and software offer solutions. We also show how results from Bayesian quantitative genetics can be used to create efficient Markov chain Monte Carlo algorithms for phylogenetic mixed models, thereby extending their generality to non‐Gaussian data. Of particular utility is the development of multinomial models for analysing the evolution of discrete traits, and the development of multi‐trait models in which traits can follow different distributions. Meta‐analyses often include a nonrandom collection of species for which the full phylogenetic tree has only been partly resolved. Using missing data theory, we show how the presented models can be used to correct for nonrandom sampling and show how taxonomies and phylogenies can be combined to give a flexible framework with which to model dependence.  相似文献   

6.
Phylogenetic comparative methods play a critical role in our understanding of the adaptive origin of primate behaviors. To incorporate evolutionary history directly into comparative behavioral research, behavioral ecologists rely on strong, well-resolved phylogenetic trees. Phylogenies provide the framework on which behaviors can be compared and homologies can be distinguished from similarities due to convergent or parallel evolution. Phylogenetic reconstructions are also of critical importance when inferring the ancestral state of behavioral patterns and when suggesting the evolutionary changes that behavior has undergone. Improvements in genome sequencing technologies have increased the amount of data available to researchers. Recently, several primate phylogenetic studies have used multiple loci to produce robust phylogenetic trees that include hundreds of primate species. These trees are now commonly used in comparative analyses and there is a perception that we have a complete picture of the primate tree. But how confident can we be in those phylogenies? And how reliable are comparative analyses based on such trees? Herein, we argue that even recent molecular phylogenies should be treated cautiously because they rely on many assumptions and have many shortcomings. Most phylogenetic studies do not model gene tree diversity and can produce misleading results, such as strong support for an incorrect species tree, especially in the case of rapid and recent radiations. We discuss implications that incorrect phylogenies can have for reconstructing the evolution of primate behaviors and we urge primatologists to be aware of the current limitations of phylogenetic reconstructions when applying phylogenetic comparative methods.  相似文献   

7.
The Human Proteome Organization's Proteomics Standards Initiative (PSI) promotes the development of exchange standards to improve data integration and interoperability. PSI specifies the suitable level of detail required when reporting a proteomics experiment (via the Minimum Information About a Proteomics Experiment), and provides extensible markup language (XML) exchange formats and dedicated controlled vocabularies (CVs) that must be combined to generate a standard compliant document. The framework presented here tackles the issue of checking that experimental data reported using a specific format, CVs and public bio‐ontologies (e.g. Gene Ontology, NCBI taxonomy) are compliant with the Minimum Information About a Proteomics Experiment recommendations. The semantic validator not only checks the XML syntax but it also enforces rules regarding the use of an ontology class or CV terms by checking that the terms exist in the resource and that they are used in the correct location of a document. Moreover, this framework is extremely fast, even on sizable data files, and flexible, as it can be adapted to any standard by customizing the parameters it requires: an XML Schema Definition, one or more CVs or ontologies, and a mapping file describing in a formal way how the semantic resources and the format are interrelated. As such, the validator provides a general solution to the common problem in data exchange: how to validate the correct usage of a data standard beyond simple XML Schema Definition validation. The framework source code and its various applications can be found at http://psidev.info/validator .  相似文献   

8.

Background

The explosion in biological information creates the need for databases that are easy to develop, easy to maintain and can be easily manipulated by annotators who are most likely to be biologists. However, deployment of scalable and extensible databases is not an easy task and generally requires substantial expertise in database development.

Results

BioBuilder is a Zope-based software tool that was developed to facilitate intuitive creation of protein databases. Protein data can be entered and annotated through web forms along with the flexibility to add customized annotation features to protein entries. A built-in review system permits a global team of scientists to coordinate their annotation efforts. We have already used BioBuilder to develop Human Protein Reference Database http://www.hprd.org, a comprehensive annotated repository of the human proteome. The data can be exported in the extensible markup language (XML) format, which is rapidly becoming as the standard format for data exchange.

Conclusions

As the proteomic data for several organisms begins to accumulate, BioBuilder will prove to be an invaluable platform for functional annotation and development of customizable protein centric databases. BioBuilder is open source and is available under the terms of LGPL.  相似文献   

9.
Functional genomics provides new opportunities to address issues of fundamental interest in evolutionary biology and suggests many new research directions that are ripe for evolutionary investigation. New types of data, and the ability to study biological processes from a whole genome perspective, are likely to have a profound impact on evolutionary biology and ecology. To illustrate, we discuss how genomewide gene expression studies can be used to reformulate questions about trade-offs and pleiotropy. We then touch on some of the new research opportunities that the application of functional genomics affords to evolutionary biologists. We end with some brief notes about how evolutionary biology and comparative approaches will probably have an impact on functional genomics.  相似文献   

10.

Background  

Evolutionary trees are central to a wide range of biological studies. In many of these studies, tree nodes and branches need to be associated (or annotated) with various attributes. For example, in studies concerned with organismal relationships, tree nodes are associated with taxonomic names, whereas tree branches have lengths and oftentimes support values. Gene trees used in comparative genomics or phylogenomics are usually annotated with taxonomic information, genome-related data, such as gene names and functional annotations, as well as events such as gene duplications, speciations, or exon shufflings, combined with information related to the evolutionary tree itself. The data standards currently used for evolutionary trees have limited capacities to incorporate such annotations of different data types.  相似文献   

11.
Smartphones and their apps (application software) are now used by millions of people worldwide and represent a powerful combination of sensors, information transfer, and computing power that deserves better exploitation by ecological and evolutionary researchers. We outline the development process for research apps, provide contrasting case studies for two new research apps, and scan the research horizon to suggest how apps can contribute to the rapid collection, interpretation, and dissemination of data in ecology and evolutionary biology. We emphasize that the usefulness of an app relies heavily on the development process, recommend that app developers are engaged with the process at the earliest possible stage, and commend efforts to create open‐source software scaffolds on which customized apps can be built by nonexperts. We conclude that smartphones and their apps could replace many traditional handheld sensors, calculators, and data storage devices in ecological and evolutionary research. We identify their potential use in the high‐throughput collection, analysis, and storage of complex ecological information.  相似文献   

12.
Animals produce a wide array of sounds with highly variable acoustic structures. It is possible to understand the causes and consequences of this variation across taxa with phylogenetic comparative analyses. Acoustic and evolutionary analyses are rapidly increasing in sophistication such that choosing appropriate acoustic and evolutionary approaches is increasingly difficult. However, the correct choice of analysis can have profound effects on output and evolutionary inferences. Here, we identify and address some of the challenges for this growing field by providing a roadmap for quantifying and comparing sound in a phylogenetic context for researchers with a broad range of scientific backgrounds. Sound, as a continuous, multidimensional trait can be particularly challenging to measure because it can be hard to identify variables that can be compared across taxa and it is also no small feat to process and analyse the resulting high-dimensional acoustic data using approaches that are appropriate for subsequent evolutionary analysis. Additionally, terminological inconsistencies and the role of learning in the development of acoustic traits need to be considered. Phylogenetic comparative analyses also have their own sets of caveats to consider. We provide a set of recommendations for delimiting acoustic signals into discrete, comparable acoustic units. We also present a three-stage workflow for extracting relevant acoustic data, including options for multivariate analyses and dimensionality reduction that is compatible with phylogenetic comparative analysis. We then summarize available phylogenetic comparative approaches and how they have been used in comparative bioacoustics, and address the limitations of comparative analyses with behavioural data. Lastly, we recommend how to apply these methods to acoustic data across a range of study systems. In this way, we provide an integrated framework to aid in quantitative analysis of cross-taxa variation in animal sounds for comparative phylogenetic analysis. In addition, we advocate the standardization of acoustic terminology across disciplines and taxa, adoption of automated methods for acoustic feature extraction, and establishment of strong data archival practices for acoustic recordings and data analyses. Combining such practices with our proposed workflow will greatly advance the reproducibility, biological interpretation, and longevity of comparative bioacoustic studies.  相似文献   

13.
The creation of cell models from annotated genome information, as well as additional data from other databases, requires both a format and medium for its distribution. Standards are described for the representation of the data in the form of Document Type Definitions (DTDs) for XML files. Separate DTDs are detailed for genetic, metabolic and gene product-interaction networks, which can be used to hold information on individual subsystems, or which may be combined to create a whole cell DTD. In the execution of this work, a fifth DTD was also created for a metabolite thesaurus, which allows incorporation of metabolite synonyms and generic nomenclature data into the models. A gene-regulation classification scheme was also created, to facilitate incorporation of gene regulatory information in an efficient manner. The work is described with particular reference to the metabolic network of Escherichia coli, which contains 808 individual enzymes. The assignment of confidence levels to these data, through the use of Gene Ontology evidence codes, is highlighted. In silico investigations may now be performed using the mathematical simulation workbench, DBsolve, which incorporates the facility to introduce data directly from XML.  相似文献   

14.
15.
16.
In an effort to understand how to improve student learning about evolution, a focus of science education research has been to document and address students?? naive ideas. Less research has investigated how students reason about alternative scientific models that attempt to explain the same phenomenon (e.g., which causal model best accounts for evolutionary change?). Within evolutionary biology, research has yet to explore how non-adaptive factors are situated within students?? conceptual ecologies of evolutionary causation. Do students construct evolutionary explanations that include non-adaptive and adaptive factors? If so, how are non-adaptive factors structured within students?? evolutionary explanations? We used clinical interviews and two paper and pencil instruments (one open-response and one multiple-choice) to investigate the use of non-adaptive and adaptive factors in undergraduate students?? patterns of evolutionary reasoning. After instruction that included non-adaptive causal factors (e.g., genetic drift), we found them to be remarkably uncommon in students?? explanatory models of evolutionary change in both written assessments and clinical interviews. However, consistent with many evolutionary biologists?? explanations, when students used non-adaptive factors they were conceptualized as causal alternatives to selection. Interestingly, use of non-adaptive factors was not associated with greater understanding of natural selection in interviews or written assessments, or with fewer naive ideas of natural selection. Thus, reasoning using non-adaptive factors appears to be a distinct facet of evolutionary thinking. We propose a theoretical framework for an expert?Cnovice continuum of evolutionary reasoning that incorporates both adaptive and non-adaptive factors, and can be used to inform instructional efficacy in evolutionary biology.  相似文献   

17.
The evolution of calcified tissues is a defining feature in vertebrate evolution. Investigating the evolution of proteins involved in tissue calcification should help elucidate how calcified tissues have evolved. The purpose of this study was to collect and compare sequences of matrix and bone gamma-carboxyglutamic acid proteins (MGP and BGP, respectively) to identify common features and determine the evolutionary relationship between MGP and BGP. Thirteen cDNAs and genes were cloned using standard methods or reconstructed through the use of comparative genomics and data mining. These sequences were compared with available annotated sequences (a total of 48 complete or nearly complete sequences, 28 BGPs and 20 MGPs) have been identified across 32 different species (representing most classes of vertebrates), and evolutionarily conserved features in both MGP and BGP were analyzed using bioinformatic tools and the Tree-Puzzle software. We propose that: 1) MGP and BGP genes originated from two genome duplications that occurred around 500 and 400 million years ago before jawless and jawed fish evolved, respectively; 2) MGP appeared first concomitantly with the emergence of cartilaginous structures, and BGP appeared thereafter along with bony structures; and 3) BGP derives from MGP. We also propose a highly specific pattern definition for the Gla domain of BGP and MGP.  相似文献   

18.
Co-evolution has produced many intriguing adaptations and made significant contributions to biodiversity through the co-adaptive radiations of interacting groups, such as pollinating insects and flowering plants or hosts and endosymbionts. New methods from molecular genetics and comparative genomics, in conjunction with advances in evolutionary genetic theory, are for the first time providing tools for detecting, investigating and understanding the genetic bases of the co-adaptive process and co-speciation. Advances in the emerging field of community genetics, which integrates genetics and community ecology, could revolutionize how co-evolution is studied, how genes are functionally annotated and how conservation geneticists implement preservation strategies.  相似文献   

19.
Recent developments in the analysis of comparative data   总被引:5,自引:0,他引:5  
Comparative methods can be used to test ideas about adaptation by identifying cases of either parallel or convergent evolutionary change across taxa. Phylogenetic relationships must be known or inferred if comparative methods are to separate the cross-taxonomic covariation among traits associated with evolutionary change from that attributable to common ancestry. Only the former can be used to test ideas linking convergent or parallel evolutionary change to some aspect of the environment. The comparative methods that are currently available differ in how they manage the effects brought about by phylogenetic relationships. One method is applicable only to discrete data, and uses cladistic techniques to identify evolutionary events that depart from phylogenetic trends. Techniques for continuous variables attempt to control for phylogenetic effects in a variety of ways. One method examines the taxonomic distribution of variance to identify the taxa within which character variation is small. The method assumes that taxa with small amounts of variation are those in which little evolutionary change has occurred, and thus variation is unlikely to be independent of ancestral trends. Analyses are then concentrated among taxa that show more variation, on the assumption that greater evolutionary change in the character has taken place. Several methods estimate directly the extent to which ancestry can predict the observed variation of a character, and subtract the ancestral effect to reveal variation of phylogeny. Yet another can remove phylogenetic effects if the true phylogeny is known. One class of comparative methods controls for phylogenetic effects by searching for comparative trends within rather than across taxa. With current knowledge of phylogenies, there is a trade-off in the choice of a comparative method: those that control phylogenetic effects with greater certainty are either less applicable to real data, or they make restrictive or untestable assumptions. Those that rely on statistical patterns to infer phylogenetic effects may not control phylogeny as efficiently but are more readily applied to existing data sets.  相似文献   

20.
MOTIVATION: The National Cancer Institute's Center for Bioinformatics (NCICB) has developed a Java based data management and information system called caCORE. One component of this software suite is the object oriented API (caBIO) used to access the rich biological datasets collected at the NCI. This API can access the data using native Java classes, SOAP requests or HTTP calls. Non-Java based clients wanting to use this API have to use the SOAP or HTTP interfaces with the data being returned from the NCI servers as an XML data stream. Although the XML can be read and manipulated using DOM or SAX parsers, one loses the convenience and usability of an object oriented programming paradigm. caBIONet is a set of .NET wrapper classes (managers, genes, chromosomes, sequences, etc.) capable of serializing the XML data stream into local .NET objects. The software is able to search NCICB databases and provide local objects representing the data that can be manipulated and used by other .NET programs. The software was written in C# and compiled as a .NET DLL.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号