期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Structure and evolution of protein interaction networks: a statistical model for link dynamics and gene duplications

Johannes?Berg Email author Michael?L?ssig Andreas?Wagner 《BMC evolutionary biology》2004,4(1):51

Background

The structure of molecular networks derives from dynamical processes on evolutionary time scales. For protein interaction networks, global statistical features of their structure can now be inferred consistently from several large-throughput datasets. Understanding the underlying evolutionary dynamics is crucial for discerning random parts of the network from biologically important properties shaped by natural selection. 相似文献

2.

GenomeGraphs: integrated genomic data visualization with R

Steffen Durinck James Bullard Paul T Spellman Sandrine Dudoit 《BMC bioinformatics》2009,10(1):2-9

Background

Biological studies involve a growing number of distinct high-throughput experiments to characterize samples of interest. There is a lack of methods to visualize these different genomic datasets in a versatile manner. In addition, genomic data analysis requires integrated visualization of experimental data along with constantly changing genomic annotation and statistical analyses. 相似文献

3.

Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments

James H Bullard Elizabeth Purdom Kasper D Hansen Sandrine Dudoit 《BMC bioinformatics》2010,11(1):94

相似文献

4.

A simple method to combine multiple molecular biomarkers for dichotomous diagnostic classification

Manju R Mamtani Tushar P Thakre Mrunal Y Kalkonde Manik A Amin Yogeshwar V Kalkonde Amit P Amin Hemant Kulkarni 《BMC bioinformatics》2006,7(1):442

Background

In spite of the recognized diagnostic potential of biomarkers, the quest for squelching noise and wringing in information from a given set of biomarkers continues. Here, we suggest a statistical algorithm that – assuming each molecular biomarker to be a diagnostic test – enriches the diagnostic performance of an optimized set of independent biomarkers employing established statistical techniques. We validated the proposed algorithm using several simulation datasets in addition to four publicly available real datasets that compared i) subjects having cancer with those without; ii) subjects with two different cancers; iii) subjects with two different types of one cancer; and iv) subjects with same cancer resulting in differential time to metastasis. 相似文献

5.

The MetabolomeExpress Project: enabling web-based processing,analysis and transparent dissemination of GC/MS metabolomics datasets

Adam J Carroll Murray R Badger A Harvey Millar 《BMC bioinformatics》2010,11(1):376

Background

Standardization of analytical approaches and reporting methods via community-wide collaboration can work synergistically with web-tool development to result in rapid community-driven expansion of online data repositories suitable for data mining and meta-analysis. In metabolomics, the inter-laboratory reproducibility of gas-chromatography/mass-spectrometry (GC/MS) makes it an obvious target for such development. While a number of web-tools offer access to datasets and/or tools for raw data processing and statistical analysis, none of these systems are currently set up to act as a public repository by easily accepting, processing and presenting publicly submitted GC/MS metabolomics datasets for public re-analysis. 相似文献

6.

AxPcoords &; parallel AxParafit: statistical co-phylogenetic analyses on thousands of taxa

Alexandros Stamatakis Alexander F Auch Jan Meier-Kolthoff Markus Göker 《BMC bioinformatics》2007,8(1):405

Background

Current tools for Co-phylogenetic analyses are not able to cope with the continuous accumulation of phylogenetic data. The sophisticated statistical test for host-parasite co-phylogenetic analyses implemented in Parafit does not allow it to handle large datasets in reasonable times. The Parafit and DistPCoA programs are the by far most compute-intensive components of the Parafit analysis pipeline. We present AxParafit and AxPcoords (Ax stands for Accelerated) which are highly optimized versions of Parafit and DistPCoA respectively. 相似文献

7.

Genomic distance entrained clustering and regression modelling highlights interacting genomic regions contributing to proliferation in breast cancer

Tim J Dexter David Sims Costas Mitsopoulos Alan Mackay Anita Grigoriadis Amar S Ahmad Marketa Zvelebil 《BMC systems biology》2010,4(1):127

Background

Genomic copy number changes and regional alterations in epigenetic states have been linked to grade in breast cancer. However, the relative contribution of specific alterations to the pathology of different breast cancer subtypes remains unclear. The heterogeneity and interplay of genomic and epigenetic variations means that large datasets and statistical data mining methods are required to uncover recurrent patterns that are likely to be important in cancer progression. 相似文献

8.

Comparative evaluation of gene-set analysis methods

Qi Liu Irina Dinu Adeniyi J Adewale John D Potter Yutaka Yasui 《BMC bioinformatics》2007,8(1):431

Background

Multiple data-analytic methods have been proposed for evaluating gene-expression levels in specific biological pathways, assessing differential expression associated with a binary phenotype. Following Goeman and Bühlmann's recent review, we compared statistical performance of three methods, namely Global Test, ANCOVA Global Test, and SAM-GS, that test "self-contained null hypotheses" Via. subject sampling. The three methods were compared based on a simulation experiment and analyses of three real-world microarray datasets. 相似文献

9.

An eScience-Bayes strategy for analyzing omics data

Martin Eklund Ola Spjuth Jarl ES Wikberg 《BMC bioinformatics》2010,11(1):282

Background

The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in ad hoc approaches to address specific problems. 相似文献

10.

Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types

Noor B Dawany Aydin Tozeren 《BMC bioinformatics》2010,11(1):483

Background

Much of the public access cancer microarray data is asymmetric, belonging to datasets containing no samples from normal tissue. Asymmetric data cannot be used in standard meta-analysis approaches (such as the inverse variance method) to obtain large sample sizes for statistical power enrichment. Noting that plenty of normal tissue microarray samples exist in studies not involving cancer, we investigated the viability and accuracy of an integrated microarray analysis approach based on significance analysis of microarrays (merged SAM) using a collection of data from separate diseased and normal samples. 相似文献

11.

ProbCD: enrichment analysis accounting for categorization uncertainty

Ricardo ZN Vêncio Ilya Shmulevich 《BMC bioinformatics》2007,8(1):383

Background

As in many other areas of science, systems biology makes extensive use of statistical association and significance estimates in contingency tables, a type of categorical data analysis known in this field as enrichment (also over-representation or enhancement) analysis. In spite of efforts to create probabilistic annotations, especially in the Gene Ontology context, or to deal with uncertainty in high throughput-based datasets, current enrichment methods largely ignore this probabilistic information since they are mainly based on variants of the Fisher Exact Test. 相似文献

12.

Bi-directional gene set enrichment and canonical correlation analysis identify key diet-sensitive pathways and biomarkers of metabolic syndrome

Melissa J Morine Jolene McMonagle Sinead Toomey Clare M Reynolds Aidan P Moloney Isobel C Gormley Peadar Ó Gaora Helen M Roche 《BMC bioinformatics》2010,11(1):499

相似文献

13.

Nh3D: A reference dataset of non-homologous protein structures

B Thiruv G Quon SA Saldanha B Steipe 《BMC structural biology》2005,5(1):12

相似文献

14.

Discover protein sequence signatures from protein-protein interaction data

Jianwen Fang Ryan J Haasl Yinghua Dong Gerald H Lushington 《BMC bioinformatics》2005,6(1):277

Background

The development of high-throughput technologies such as yeast two-hybrid systems and mass spectrometry technologies has made it possible to generate large protein-protein interaction (PPI) datasets. Mining these datasets for underlying biological knowledge has, however, remained a challenge. 相似文献

15.

An assessment of false discovery rates and statistical significance in label-free quantitative proteomics with combined filters

Qingbo Li Bryan AP Roxas 《BMC bioinformatics》2009,10(1):43-18

Background

Many studies have provided algorithms or methods to assess a statistical significance in quantitative proteomics when multiple replicates for a protein sample and a LC/MS analysis are available. But, confidence is still lacking in using datasets for a biological interpretation without protein sample replicates. Although a fold-change is a conventional threshold that can be used when there are no sample replicates, it does not provide an assessment of statistical significance such as a false discovery rate (FDR) which is an important indicator of the reliability to identify differentially expressed proteins. In this work, we investigate whether differentially expressed proteins can be detected with a statistical significance from a pair of unlabeled protein samples without replicates and with only duplicate LC/MS injections per sample. A FDR is used to gauge the statistical significance of the differentially expressed proteins. 相似文献

16.

Missing value imputation for microarray gene expression data using histone acetylation information

Qian Xiang Xianhua Dai Yangyang Deng Caisheng He Jiang Wang Jihua Feng Zhiming Dai 《BMC bioinformatics》2008,9(1):252

Background

It is an important pre-processing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis in bioinformatics. Although several methods have been suggested, their performances are not satisfactory for datasets with high missing percentages. 相似文献

17.

Probabilistic prediction and ranking of human protein-protein interactions

Michelle S Scott Geoffrey J Barton 《BMC bioinformatics》2007,8(1):239

Background

Although the prediction of protein-protein interactions has been extensively investigated for yeast, few such datasets exist for the far larger proteome in human. Furthermore, it has recently been estimated that the overall average false positive rate of available computational and high-throughput experimental interaction datasets is as high as 90%. 相似文献

18.

MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data

Jürgen Hartler Gerhard G Thallinger Gernot Stocker Alexander Sturn Thomas R Burkard Erik Körner Robert Rader Andreas Schmidt Karl Mechtler Zlatko Trajanoski 《BMC bioinformatics》2007,8(1):197

Background

The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. 相似文献

19.

AutoSOME: a clustering method for identifying gene expression modules without prior knowledge of cluster number

Aaron M Newman James B Cooper 《BMC bioinformatics》2010,11(1):117

Background

Clustering the information content of large high-dimensional gene expression datasets has widespread application in "omics" biology. Unfortunately, the underlying structure of these natural datasets is often fuzzy, and the computational identification of data clusters generally requires knowledge about cluster number and geometry. 相似文献

20.

Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies

Paolo Martini Davide Risso Gabriele Sales Chiara Romualdi Gerolamo Lanfranchi Stefano Cagnin 《BMC bioinformatics》2011,12(1):92

相似文献