首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
The functional form of spillover, measured as a gradient of abundance of fish, may provide insight about processes that control the spatial distribution of fish inside and outside the MPA. In this study, we aimed to infer on spillover mechanism of Diplodus spp. (family Sparidae) from a Mediterranean MPA (Carry-le-Rouet, France) from visual censuses and artisanal fisheries data. From the existing literature, three potential functional forms of spillover such as a linear gradient, an exponential gradient and a logistic gradient are defined. Each functional form is included in a spatial generalized linear mixed model allowing accounting for spatial autocorrelation of data. We select between the different forms of gradients by using a Bayesian model selection procedure. In a first step, the functional form of the spillover for visual census and artisanal fishing data is assessed separately. For both sets of data, our model selection favoured the negative exponential model, evidencing a decrease of the spatial abundance of fish vanishing around 1000 m from the MPA border. We combined both datasets in a joint model by including an observability parameter. This parameter captures how the different sources of data quantify the underlying spatial distribution of the harvested species. This enabled us to demonstrate that the different sampling methods do not affect the estimation of the underlying spatial distribution of Diplodus spp. inside and outside the MPA. We show that data from different sources can be pooled through spatial generalized linear mixed model. Our findings allow to better understand the underlying mechanisms that control spillover of fish from MPA.  相似文献   

2.

Background  

Information obtained from diverse data sources can be combined in a principled manner using various machine learning methods to increase the reliability and range of knowledge about protein function. The result is a weighted functional linkage network (FLN) in which linked neighbors share at least one function with high probability. Precision is, however, low. Aiming to provide precise functional annotation for as many proteins as possible, we explore and propose a two-step framework for functional annotation (1) construction of a high-coverage and reliable FLN via machine learning techniques (2) development of a decision rule for the constructed FLN to optimize functional annotation.  相似文献   

3.
Chen Y  Wang W  Zhou Y  Shields R  Chanda SK  Elston RC  Li J 《PloS one》2011,6(6):e21137
Identifying disease genes is crucial to the understanding of disease pathogenesis, and to the improvement of disease diagnosis and treatment. In recent years, many researchers have proposed approaches to prioritize candidate genes by considering the relationship of candidate genes and existing known disease genes, reflected in other data sources. In this paper, we propose an expandable framework for gene prioritization that can integrate multiple heterogeneous data sources by taking advantage of a unified graphic representation. Gene-gene relationships and gene-disease relationships are then defined based on the overall topology of each network using a diffusion kernel measure. These relationship measures are in turn normalized to derive an overall measure across all networks, which is utilized to rank all candidate genes. Based on the informativeness of available data sources with respect to each specific disease, we also propose an adaptive threshold score to select a small subset of candidate genes for further validation studies. We performed large scale cross-validation analysis on 110 disease families using three data sources. Results have shown that our approach consistently outperforms other two state of the art programs. A case study using Parkinson disease (PD) has identified four candidate genes (UBB, SEPT5, GPR37 and TH) that ranked higher than our adaptive threshold, all of which are involved in the PD pathway. In particular, a very recent study has observed a deletion of TH in a patient with PD, which supports the importance of the TH gene in PD pathogenesis. A web tool has been implemented to assist scientists in their genetic studies.  相似文献   

4.
5.
6.

Background  

Biological studies involve a growing number of distinct high-throughput experiments to characterize samples of interest. There is a lack of methods to visualize these different genomic datasets in a versatile manner. In addition, genomic data analysis requires integrated visualization of experimental data along with constantly changing genomic annotation and statistical analyses.  相似文献   

7.
Biologists are frequently faced with the problem of integrating information from multiple heterogeneous sources with their own experimental data. Given the large number of public sources, it is difficult to choose which sources to integrate without assistance. When doing this manually, biologists differ in their preferences concerning the sources to be queried as well as the strategies, i.e. the querying process they follow for navigating through the sources. In response to these findings, we have developed BioGuide to assist scientists search for relevant data within external sources while taking their preferences and strategies into account. In this article, we present BioGuideSRS, a user-friendly system which automatically retrieves instances of data by using BioGuide on top of the sequence retrieval system (SRS). BioGuideSRS is an Applet that can be run from its web page on any system with Java 5.0. AVAILABILITY: http://www.bioguide-project.net.  相似文献   

8.
The completion of the Arabidopsis genome and the large collections of other plant sequences generated in recent years have sparked extensive functional genomics efforts. However, the utilization of this data is inefficient, as data sources are distributed and heterogeneous and efforts at data integration are lagging behind. PlaNet aims to overcome the limitations of individual efforts as well as the limitations of heterogeneous, independent data collections. PlaNet is a distributed effort among European bioinformatics groups and plant molecular biologists to establish a comprehensive integrated database in a collaborative network. Objectives are the implementation of infrastructure and data sources to capture plant genomic information into a comprehensive, integrated platform. This will facilitate the systematic exploration of Arabidopsis and other plants. New methods for data exchange, database integration and access are being developed to create a highly integrated, federated data resource for research. The connection between the individual resources is realized with BioMOBY. BioMOBY provides an architecture for the discovery and distribution of biological data through web services. While knowledge is centralized, data is maintained at its primary source without a need for warehousing. To standardize nomenclature and data representation, ontologies and generic data models are defined in interaction with the relevant communities.Minimal data models should make it simple to allow broad integration, while inheritance allows detail and depth to be added to more complex data objects without losing integration. To allow expert annotation and keep databases curated, local and remote annotation interfaces are provided. Easy and direct access to all data is key to the project.  相似文献   

9.
10.
It has been a challenging task to integrate high-throughput data into investigations of the systematic and dynamic organization of biological networks. Here, we presented a simple hierarchical clustering algorithm that goes a long way to achieve this aim. Our method effectively reveals the modular structure of the yeast protein-protein interaction network and distinguishes protein complexes from functional modules by integrating high-throughput protein-protein interaction data with the added subcellular localization and expression profile data. Furthermore, we take advantage of the detected modules to provide a reliably functional context for the uncharacterized components within modules. On the other hand, the integration of various protein-protein association information makes our method robust to false-positives, especially for derived protein complexes. More importantly, this simple method can be extended naturally to other types of data fusion and provides a framework for the study of more comprehensive properties of the biological network and other forms of complex networks.  相似文献   

11.
12.

Background

Over the past decade the workflow system paradigm has evolved as an efficient and user-friendly approach for developing complex bioinformatics applications. Two popular workflow systems that have gained acceptance by the bioinformatics community are Taverna and Galaxy. Each system has a large user-base and supports an ever-growing repository of application workflows. However, workflows developed for one system cannot be imported and executed easily on the other. The lack of interoperability is due to differences in the models of computation, workflow languages, and architectures of both systems. This lack of interoperability limits sharing of workflows between the user communities and leads to duplication of development efforts.

Results

In this paper, we present Tavaxy, a stand-alone system for creating and executing workflows based on using an extensible set of re-usable workflow patterns. Tavaxy offers a set of new features that simplify and enhance the development of sequence analysis applications: It allows the integration of existing Taverna and Galaxy workflows in a single environment, and supports the use of cloud computing capabilities. The integration of existing Taverna and Galaxy workflows is supported seamlessly at both run-time and design-time levels, based on the concepts of hierarchical workflows and workflow patterns. The use of cloud computing in Tavaxy is flexible, where the users can either instantiate the whole system on the cloud, or delegate the execution of certain sub-workflows to the cloud infrastructure.

Conclusions

Tavaxy reduces the workflow development cycle by introducing the use of workflow patterns to simplify workflow creation. It enables the re-use and integration of existing (sub-) workflows from Taverna and Galaxy, and allows the creation of hybrid workflows. Its additional features exploit recent advances in high performance cloud computing to cope with the increasing data size and complexity of analysis. The system can be accessed either through a cloud-enabled web-interface or downloaded and installed to run within the user's local environment. All resources related to Tavaxy are available at http://www.tavaxy.org.  相似文献   

13.
BACKGROUND: Saccharomyces cerevisiae is recognized as a model system representing a simple eukaryote whose genome can be easily manipulated. Information solicited by scientists on its biological entities (Proteins, Genes, RNAs...) is scattered within several data sources like SGD, Yeastract, CYGD-MIPS, BioGrid, PhosphoGrid, etc. Because of the heterogeneity of these sources, querying them separately and then manually combining the returned results is a complex and time-consuming task for biologists most of whom are not bioinformatics expert. It also reduces and limits the use that can be made on the available data. RESULTS: To provide transparent and simultaneous access to yeast sources, we have developed YeastMed: an XML and mediator-based system. In this paper, we present our approach in developing this system which takes advantage of SB-KOM to perform the query transformation needed and a set of Data Services to reach the integrated data sources. The system is composed of a set of modules that depend heavily on XML and Semantic Web technologies. User queries are expressed in terms of a domain ontology through a simple form-based web interface. CONCLUSIONS: YeastMed is the first mediation-based system specific for integrating yeast data sources. It was conceived mainly to help biologists to find simultaneously relevant data from multiple data sources. It has a biologist-friendly interface easy to use. The system is available at http://www.khaos.uma.es/yeastmed/.  相似文献   

14.
Although Arabidopsis (Arabidopsis thaliana) is the best studied plant species, the biological role of one-third of its proteins is still unknown. We developed a probabilistic protein function prediction method that integrates information from sequences, protein-protein interactions, and gene expression. The method was applied to proteins from Arabidopsis. Evaluation of prediction performance showed that our method has improved performance compared with single source-based prediction approaches and two existing integration approaches. An innovative feature of our method is that it enables transfer of functional information between proteins that are not directly associated with each other. We provide novel function predictions for 5,807 proteins. Recent experimental studies confirmed several of the predictions. We highlight these in detail for proteins predicted to be involved in flowering and floral organ development.  相似文献   

15.
MAGIC Tool: integrated microarray data analysis   总被引:4,自引:1,他引:4  
Summary: Several programs are now available for analyzing thelarge datasets arising from cDNA microarray experiments. Mostprograms are expensive commercial packages or require expensivethird party software. Some are freely available to academicresearchers, but are limited to one operating system. MicroArrayGenome Imaging and Clustering Tool (MAGIC Tool) is an open sourceprogram that works on all major platforms, and takes users ‘fromtiff to gif’. Several unique features of MAGIC Tool areparticularly useful for research and teaching. Availability: http://www.bio.davidson.edu/MAGIC Contact: laheyer{at}davidson.edu  相似文献   

16.
Characterizing gene function is one of the major challenging tasks in the post-genomic era. To address this challenge, we have developed GeneFAS (Gene Function Annotation System), a new integrated probabilistic method for cellular function prediction by combining information from protein-protein interactions, protein complexes, microarray gene expression profiles, and annotations of known proteins through an integrative statistical model. Our approach is based on a novel assessment for the relationship between (1) the interaction/correlation of two proteins' high-throughput data and (2) their functional relationship in terms of their Gene Ontology (GO) hierarchy. We have developed a Web server for the predictions. We have applied our method to yeast Saccharomyces cerevisiae and predicted functions for 1548 out of 2472 unannotated proteins.  相似文献   

17.
Previous compartmental models have introduced variability either at the particle or at the replicate level. This paper integrates both types of variability through the concept of clustering. The paper develops two different, general clustered models, each with time-dependent hazard rates for the clusters and for the particles within the clusters, and each with random initial number and sizes of clusters. The coefficient of variation of the total number of particles,CV[X(t)], for either model is shown to be bounded below, under very broad conditions, by the coefficient of variation of the initial number of clusters,CV[c(0)]. This high relative variability of the clustered models makes them potentially very useful in kinetic modeling. In many applications, binding and clustering are common phenomena, and two applications of the models to such phenomena are breifly outlined.  相似文献   

18.
This paper describes a general data base management package defined for medical applications. CHRONOS is a user-oriented system which has been designed for physicians to get periodical reports and for researchers to prepare statistical treatments. The basic principles of the data base and program organization are described: many possibilities are offered for data acquisition and specific efforts have been made in order to analyze easily the evolution of patients. Several medical applications are now operational with CHRONOS in fields as different as psychiatry and nephrology.  相似文献   

19.
20.
Two key types of well-being, eudaimonic and hedonic, are reviewed. The first addresses ideas of self-development, personal growth and purposeful engagement, while the second is concerned with positive feelings such as happiness and contentment. How well-being varies by age and socio-economic standing is briefly summarized, followed by examination of its biological correlates (neuroendocrine, immune, cardiovascular, rapid eye movement (REM) sleep). Preliminary findings on a sample of ageing women showed that those with higher levels of eudaimonic well-being had lower levels of daily salivary cortisol, pro-inflammatory cytokines, cardiovascular risk, and longer duration REM sleep compared with those showing lower levels of eudaimonic well-being. Hedonic well-being, however, showed minimal linkage to biomarker assessments. Future research directions building on these initial findings are discussed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号