首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
One of the main objectives in the analysis of microarray experiments is the identification of genes that are differentially expressed under two experimental conditions. This task is complicated by the noisiness of the data and the large number of genes that are examined simultaneously. Here, we present a novel technique for identifying differentially expressed genes that does not originate from a sophisticated statistical model but rather from an analysis of biological reasoning. The new technique, which is based on calculating rank products (RP) from replicate experiments, is fast and simple. At the same time, it provides a straightforward and statistically stringent way to determine the significance level for each gene and allows for the flexible control of the false-detection rate and familywise error rate in the multiple testing situation of a microarray experiment. We use the RP technique on three biological data sets and show that in each case it performs more reliably and consistently than the non-parametric t-test variant implemented in Tusher et al.'s significance analysis of microarrays (SAM). We also show that the RP results are reliable in highly noisy data. An analysis of the physiological function of the identified genes indicates that the RP approach is powerful for identifying biologically relevant expression changes. In addition, using RP can lead to a sharp reduction in the number of replicate experiments needed to obtain reproducible results.  相似文献   

2.
3.
4.
RNA interference (RNAi) screening is a powerful technology for functional characterization of biological pathways. Interpretation of RNAi screens requires computational and statistical analysis techniques. We describe a method that integrates all steps to generate a scored phenotype list from raw data. It is implemented in an open-source Bioconductor/R package, cellHTS (). The method is useful for the analysis and documentation of individual RNAi screens. Moreover, it is a prerequisite for the integration of multiple experiments.  相似文献   

5.
6.

Background  

One of the most time-consuming tasks after performing a gene expression experiment is the biological interpretation of the results by identifying physiologically important associations between the differentially expressed genes. A large part of the relevant functional evidence can be represented in the form of graphs, e.g. metabolic and signaling pathways, protein interaction maps, shared GeneOntology annotations, or literature co-citation relations. Such graphs are easily constructed from available genome annotation data. The problem of biological interpretation can then be described as identifying the subgraphs showing the most significant patterns of gene expression. We applied a graph-based extension of our iterative Group Analysis (iGA) approach to obtain a statistically rigorous identification of the subgraphs of interest in any evidence graph.  相似文献   

7.
8.
9.
Structural crystallography and nuclear magnetic resonance (NMR) spectroscopy are the predominant techniques for understanding the biological world on a molecular level. Crystallography is constrained by the ability to form a crystal that diffracts well and NMR is constrained to smaller proteins. Although powerful techniques, they leave many soluble, purified structurally uncharacterized protein samples. Small angle X-ray scattering (SAXS) is a solution technique that provides data on the size and multiple conformations of a sample, and can be used to reconstruct a low-resolution molecular envelope of a macromolecule. In this study, SAXS has been used in a high-throughput manner on a subset of 28 proteins, where structural information is available from crystallographic and/or NMR techniques. These crystallographic and NMR structures were used to validate the accuracy of molecular envelopes reconstructed from SAXS data on a statistical level, to compare and highlight complementary structural information that SAXS provides, and to leverage biological information derived by crystallographers and spectroscopists from their structures. All the ab initio molecular envelopes calculated from the SAXS data agree well with the available structural information. SAXS is a powerful albeit low-resolution technique that can provide additional structural information in a high-throughput and complementary manner to improve the functional interpretation of high-resolution structures.  相似文献   

10.
MOTIVATION: The result of a typical microarray experiment is a long list of genes with corresponding expression measurements. This list is only the starting point for a meaningful biological interpretation. Modern methods identify relevant biological processes or functions from gene expression data by scoring the statistical significance of predefined functional gene groups, e.g. based on Gene Ontology (GO). We develop methods that increase the explanatory power of this approach by integrating knowledge about relationships between the GO terms into the calculation of the statistical significance. RESULTS: We present two novel algorithms that improve GO group scoring using the underlying GO graph topology. The algorithms are evaluated on real and simulated gene expression data. We show that both methods eliminate local dependencies between GO terms and point to relevant areas in the GO graph that remain undetected with state-of-the-art algorithms for scoring functional terms. A simulation study demonstrates that the new methods exhibit a higher level of detecting relevant biological terms than competing methods.  相似文献   

11.
12.
13.
EASE is a customizable software application for rapid biological interpretation of gene lists that result from the analysis of microarray, proteomics, SAGE, and other high-throughput genomic data. The biological themes returned by EASE recapitulate manually determined themes in previously published gene lists and are robust to varying methods of normalization, intensity calculation and statistical selection of genes. EASE is a powerful tool for rapidly converting the results of functional genomics studies from "genes to themes."  相似文献   

14.
Identifying biological themes within lists of genes with EASE   总被引:2,自引:0,他引:2  
EASE is a customizable software application for rapid biological interpretation of gene lists that result from the analysis of microarray, proteomics, SAGE and other high-throughput genomic data. The biological themes returned by EASE recapitulate manually determined themes in previously published gene lists and are robust to varying methods of normalization, intensity calculation and statistical selection of genes. EASE is a powerful tool for rapidly converting the results of functional genomics studies from 'genes' to 'themes'.  相似文献   

15.
To assess the usefulness and applications of machine vision (MV) and machine learning (ML) techniques that have been used to develop a single cell-based phenotypic (live and fixed biomarkers) platform that correlates with tumor biological aggressiveness and risk stratification, 100 fresh prostate samples were acquired, and areas of prostate cancer were determined by post-surgery pathology reports logged by an independent pathologist. The prostate samples were dissociated into single-cell suspensions in the presence of an extracellular matrix formulation. These samples were analyzed via live-cell microscopy. Dynamic and fixed phenotypic biomarkers per cell were quantified using objective MV software and ML algorithms. The predictive nature of the ML algorithms was developed in two stages. First, random forest (RF) algorithms were developed using 70% of the samples. The developed algorithms were then tested for their predictive performance using the blinded test dataset that contained 30% of the samples in the second stage. Based on the ROC (receiver operating characteristic) curve analysis, thresholds were set to maximize both sensitivity and specificity. We determined the sensitivity and specificity of the assay by comparing the algorithm-generated predictions with adverse pathologic features in the radical prostatectomy (RP) specimens. Using MV and ML algorithms, the biomarkers predictive of adverse pathology at RP were ranked and a prostate cancer patient risk stratification test was developed that distinguishes patients based on surgical adverse pathology features. The ability to identify and track large numbers of individual cells over the length of the microscopy experimental monitoring cycles, in an automated way, created a large biomarker dataset of primary biomarkers. This biomarker dataset was then interrogated with ML algorithms used to correlate with post-surgical adverse pathology findings. Algorithms were generated that predicted adverse pathology with >0.85 sensitivity and specificity and an AUC (area under the curve) of >0.85. Phenotypic biomarkers provide cellular and molecular details that are informative for predicting post-surgical adverse pathologies when considering tumor biopsy samples. Artificial intelligence ML-based approaches for cancer risk stratification are emerging as important and powerful tools to compliment current measures of risk stratification. These techniques have capabilities to address tumor heterogeneity and the molecular complexity of prostate cancer. Specifically, the phenotypic test is a novel example of leveraging biomarkers and advances in MV and ML for developing a powerful prognostic and risk-stratification tool for prostate cancer patients.  相似文献   

16.
Darwin proposed a powerful functional principle-natural selection-to interpret phylogenetic diversity and complexity. Nevertheless, some 70 years elapsed before even biologists embraced his account. The triumph of natural selection required two additional factors: (a) biological mechanisms that implemented the functional principle (i.e. genetics); and (b) quantitative procedures that traced its cumulative effects (i.e. population genetics). Thorndike and, later, Skinner proposed a functional principle-selection by reinforcement-that interpreted ontogenetic diversity and complexity. This principle has been substantially refined by subsequent experimental and theoretical work and now provides an equally powerful functional account. However, a purely functional principle has once again not persuaded most scientists (apart from the behavior-analytic minority) that complex behavior can be understood by a selection principle. If the history of ontogeny recapitulates the history of phylogeny, the biological mechanisms that implement selection by reinforcement must be discovered and quantitative techniques that trace its effects must be devised. In short, the triumph of reinforcement may await the integration of behavior analysis with the neurosciences and the development of coordinated quantitative procedures. This article identifies specific interrelations with neuroscience that form the basis for a behavior-analytically faithful interpretation of reinforcement using neural-network techniques. Implications for the architecture of networks and for the learning algorithm are emphasized.  相似文献   

17.
Bone Morphogenetic Proteins (BMPs) are critical for pattern formation in many animals. In numerous tissues, BMPs become distributed in spatially non-uniform profiles. The gradients of signaling activity can be detected by a number of biological assays involving fluorescence microscopy. Quantitative analyses of BMP gradients are powerful tools to investigate the regulation of BMP signaling pathways during development. These approaches rely heavily on images as spatial representations of BMP activity levels, using them to infer signaling distributions that inform on regulatory mechanisms. In this perspective, we discuss current imaging assays and normalization methods used to quantify BMP activity profiles with a focus on the Drosophila wing primordium. We find that normalization tends to lower the number of samples required to establish statistical significance between profiles in controls and experiments, but the increased resolvability comes with a cost. Each normalization strategy makes implicit assumptions about the biology that impacts our interpretation of the data. We examine the tradeoffs for normalizing versus not normalizing, and discuss their impacts on experimental design and the interpretation of resultant data.  相似文献   

18.
李灏  姜颖  贺福初 《遗传》2008,30(4):389-399
在后基因组时代, 系统生物学研究成为人们关注的焦点。转录组学、蛋白质组学等功能基因组学研究方法可同时检测药物或其他因素影响下大量基因或蛋白质的表达变化情况, 但这些变化不能与生物学功能的变化建立直接联系。代谢组学方法则可为代谢物含量变化与生物表型变化建立直接相关性。代谢组学研究的目的是定量分析一个生物系统内所有代谢物的含量, 进行全面代谢物分析需要分析化学技术的支撑, 核磁共振和基于质谱的分析技术是代谢组学研究的两种主要技术手段。代谢组学研究可产生大量数据信息, 对这些数据进行分析离不开化学统计学的应用, 比如主成分分析、多维缩放、各种聚类分析技术以及功能差异分析等。文章综述了近年来代谢组学分析技术及数据分析技术的研究进展, 在此基础上, 对代谢组学在临床研究及临床前研究中的应用研究进展进行了综述。对疾病代谢表型图谱的研究有助于人们了解疾病发生、发展以及致死的机制; 在临床条件下, 这些代谢图谱可以作为疾病诊断、预后以及治疗的评判标准。代谢物组成的变化是毒物胁迫对机体造成的最终影响, 利用代谢组技术可以直接反映毒物对机体的影响。质谱技术、核磁共振技术的应用使得药物筛选过程可以快速完成, 并有助于实现个性化用药。此外, 利用代谢组学技术还可以进行已知酶的新活性研究, 也可以研究未知酶。  相似文献   

19.
Because of high dimensionality, machine learning algorithms typically rely on feature selection techniques in order to perform effective classification in microarray gene expression data sets. However, the large number of features compared to the number of samples makes the task of feature selection computationally hard and prone to errors. This paper interprets feature selection as a task of stochastic optimization, where the goal is to select among an exponential number of alternative gene subsets the one expected to return the highest generalization in classification. Blocking is an experimental design strategy which produces similar experimental conditions to compare alternative stochastic configurations in order to be confident that observed differences in accuracy are due to actual differences rather than to fluctuations and noise effects. We propose an original blocking strategy for improving feature selection which aggregates in a paired way the validation outcomes of several learning algorithms to assess a gene subset and compare it to others. This is a novelty with respect to conventional wrappers, which commonly adopt a sole learning algorithm to evaluate the relevance of a given set of variables. The rationale of the approach is that, by increasing the amount of experimental conditions under which we validate a feature subset, we can lessen the problems related to the scarcity of samples and consequently come up with a better selection. The paper shows that the blocking strategy significantly improves the performance of a conventional forward selection for a set of 16 publicly available cancer expression data sets. The experiments involve six different classifiers and show that improvements take place independent of the classification algorithm used after the selection step. Two further validations based on available biological annotation support the claim that blocking strategies in feature selection may improve the accuracy and the quality of the solution. The first validation is based on retrieving PubMEd abstracts associated to the selected genes and matching them to regular expressions describing the biological phenomenon underlying the expression data sets. The biological validation that follows is based on the use of the Bioconductor package GoStats in order to perform Gene Ontology statistical analysis.  相似文献   

20.
The MicroCore toolkit is a suite of analysis programs for microarray and proteomics data that is open source and programmed exclusively in Java. MicroCore provides a flexible and extensible environment for the interpretation of functional genomics data through visualization. The first version of the application (downloadable from the MicroCore website: http://www.ucl.ac.uk/oncology/MicroCore/microcore.htm), implements two programs-PIMs (protein interaction maps) and MicroExpress-and is soon to be followed by an extended version which will also feature a fuzzy k-means clustering application and a Java-based R plug-in for microarray analysis. PIMs and MicroExpress provide a simple yet powerful way of graphically relating large quantities of expression data from multiple experiments to cellular pathways and biological processes in a statistically meaningful way.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号