共查询到20条相似文献,搜索用时 15 毫秒
1.
Background
Protein fold recognition is a key step in protein three-dimensional (3D) structure discovery. There are multiple fold discriminatory data sources which use physicochemical and structural properties as well as further data sources derived from local sequence alignments. This raises the issue of finding the most efficient method for combining these different informative data sources and exploring their relative significance for protein fold classification. Kernel methods have been extensively used for biological data analysis. They can incorporate separate fold discriminatory features into kernel matrices which encode the similarity between samples in their respective data sources. 相似文献2.
Background
An alternative to standard approaches to uncover biologically meaningful structures in micro array data is to treat the data as a blind source separation (BSS) problem. BSS attempts to separate a mixture of signals into their different sources and refers to the problem of recovering signals from several observed linear mixtures. In the context of micro array data, "sources" may correspond to specific cellular responses or to co-regulated genes. 相似文献3.
Sacha?AFT?van Hijum Anne?de Jong Richard?JS?Baerends Harma?A?Karsens Naomi?E?Kramer Rasmus?Larsen Chris?D?den Hengst Casper?J?Albers Jan?Kok Oscar?P?Kuipers
Background
In research laboratories using DNA-microarrays, usually a number of researchers perform experiments, each generating possible sources of error. There is a need for a quick and robust method to assess data quality and sources of errors in DNA-microarray experiments. To this end, a novel and cost-effective validation scheme was devised, implemented, and employed. 相似文献4.
Background
Analysis of variance is a powerful approach to identify differentially expressed genes in a complex experimental design for microarray and macroarray data. The advantage of the anova model is the possibility to evaluate multiple sources of variation in an experiment. 相似文献5.
Background
Biological information is commonly used to cluster or classify entities of interest such as genes, conditions, species or samples. However, different sources of data can be used to classify the same set of entities and methods allowing the comparison of the performance of two data sources or the determination of how well a given classification agrees with another are frequently needed, especially in the absence of a universally accepted "gold standard" classification. 相似文献6.
A joint finite mixture model for clustering genes from independent Gaussian and beta distributed data 总被引:1,自引:0,他引:1
Background
Cluster analysis has become a standard computational method for gene function discovery as well as for more general explanatory data analysis. A number of different approaches have been proposed for that purpose, out of which different mixture models provide a principled probabilistic framework. Cluster analysis is increasingly often supplemented with multiple data sources nowadays, and these heterogeneous information sources should be made as efficient use of as possible. 相似文献7.
Anna K Jönsson Olav Spigset Micaela Tjäderborn Henrik Druid Staffan Hägg 《BMC clinical pharmacology》2009,9(1):7-5
Background
Pharmaceutical drug poisonings have previously been reported using single sources of information, either hospital data or forensic data, which might not reveal the true incidence. We therefore aimed to estimate the incidence of suspected fatal drug poisonings, defined as poisonings by pharmaceutical agents, by using all relevant case records from various sources in a Swedish population. 相似文献8.
Background
Current genomic research methods provide researchers with enormous amounts of data. Combining data from different high-throughput research technologies commonly available in biological databases can lead to novel findings and increase research efficiency. However, combining data from different heterogeneous sources is often a very arduous task. These sources can be different microarray technology platforms, genomic databases, or experiments performed on various species. Our aim was to develop a software program that could facilitate the combining of data from heterogeneous sources, and thus allow researchers to perform genomic cross-platform/cross-species studies and to use existing experimental data for compendium studies. 相似文献9.
Michael Baitaluk Xufei Qian Shubhada Godbole Alpan Raval Animesh Ray Amarnath Gupta 《BMC bioinformatics》2006,7(1):55-13
Background
The goal of information integration in systems biology is to combine information from a number of databases and data sets, which are obtained from both high and low throughput experiments, under one data management scheme such that the cumulative information provides greater biological insight than is possible with individual information sources considered separately. 相似文献10.
Diogo FT Veiga Helena F Deus Caner Akdemir Ana Tereza R Vasconcelos Jonas S Almeida 《BMC systems biology》2009,3(1):109-9
Background
DAS is a widely adopted protocol for providing syntactic interoperability among biological databases. The popularity of DAS is due to a simplified and elegant mechanism for data exchange that consists of sources exposing their RESTful interfaces for data access. As a growing number of DAS services are available for molecular biology resources, there is an incentive to explore this protocol in order to advance data discovery and integration among these resources. 相似文献11.
Xue Gong Ruihong Wu Yuannv Zhang Wenyuan Zhao Lixin Cheng Yunyan Gu Lin Zhang Jing Wang Jing Zhu Zheng Guo 《BMC bioinformatics》2010,11(1):76
Background
Hundreds of genes that are causally implicated in oncogenesis have been found and collected in various databases. For efficient application of these abundant but diverse data sources, it is of fundamental importance to evaluate their consistency. 相似文献12.
Background
Scientists striving to unlock mysteries within complex biological systems face myriad barriers in effectively integrating available information to enhance their understanding. While experimental techniques and available data sources are rapidly evolving, useful information is dispersed across a variety of sources, and sources of the same information often do not use the same format or nomenclature. To harness these expanding resources, scientists need tools that bridge nomenclature differences and allow them to integrate, organize, and evaluate the quality of information without extensive computation. 相似文献13.
Background
Success of metabolomics as the phenotyping platform largely depends on its ability to detect various sources of biological variability. Removal of platform-specific sources of variability such as systematic error is therefore one of the foremost priorities in data preprocessing. However, chemical diversity of molecular species included in typical metabolic profiling experiments leads to different responses to variations in experimental conditions, making normalization a very demanding task. 相似文献14.
Background
Information resources on the World Wide Web play an indispensable role in modern biology. But integrating data from multiple sources is often encumbered by the need to reformat data files, convert between naming systems, or perform ongoing maintenance of local copies of public databases. Opportunities for new ways of combining and re-using data are arising as a result of the increasing use of web protocols to transmit structured data. 相似文献15.
Background
Biological data resources have become heterogeneous and derive from multiple sources. This introduces challenges in the management and utilization of this data in software development. Although efforts are underway to create a standard format for the transmission and storage of biological data, this objective has yet to be fully realized. 相似文献16.
Background
Inference of protein interaction networks from various sources of data has become an important topic of both systems and computational biology. Here we present a supervised approach to identification of gene expression regulatory networks. 相似文献17.
Florent Baty Micha?l Facompré Jan Wiegand Joseph Schwager Martin H Brutsche 《BMC bioinformatics》2006,7(1):422
Background
Evaluating the importance of the different sources of variations is essential in microarray data experiments. Complex experimental designs generally include various factors structuring the data which should be taken into account. The objective of these experiments is the exploration of some given factors while controlling other factors. 相似文献18.
Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data 总被引:1,自引:0,他引:1
Bin Chen Xiao Dong Dazhi Jiao Huijun Wang Qian Zhu Ying Ding David J Wild 《BMC bioinformatics》2010,11(1):255
Background
Recently there has been an explosion of new data sources about genes, proteins, genetic variations, chemical compounds, diseases and drugs. Integration of these data sources and the identification of patterns that go across them is of critical interest. Initiatives such as Bio2RDF and LODD have tackled the problem of linking biological data and drug data respectively using RDF. Thus far, the inclusion of chemogenomic and systems chemical biology information that crosses the domains of chemistry and biology has been very limited 相似文献19.
Background
With the explosion of microarray studies, an enormous amount of data is being produced. Systematic integration of gene expression data from different sources increases statistical power of detecting differentially expressed genes and allows assessment of heterogeneity. The challenge, however, is in designing and implementing efficient analytic methodologies for combination of data generated by different research groups. 相似文献20.