Similar literature
20 similar records found (search time: 15 ms)
3.

Background  

A main goal in understanding cell mechanisms is to explain the relationships among genes and related molecular processes through the combined use of technological platforms and bioinformatics analysis. High-throughput platforms such as microarrays enable investigation of the whole genome in a single experiment. Different kinds of microarray platforms exist, producing different types of binary data (images and raw data); moreover, even a single vendor typically offers several different chips. The analysis of microarray data requires an initial preprocessing phase (i.e., normalization and summarization) of the raw data to make it suitable for existing analysis platforms, such as the TIGR TM4 Suite. Furthermore, annotating the data with additional information, such as gene function, is needed to perform more powerful analyses. Raw-data preprocessing and annotation are often performed manually, in an error-prone way, and many available preprocessing tools do not support annotation. Novel, platform-independent, and preferably open-source tools enabling semi-automatic preprocessing and annotation of microarray data are therefore needed.
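The normalization step mentioned above can be illustrated with a minimal sketch. Quantile normalization is one common choice for this phase (the abstract does not name a specific method, and real pipelines such as the TM4 tools offer many alternatives); the matrix shape and values below are invented for illustration.

```python
import numpy as np

def quantile_normalize(X):
    """Force every array (column) of a genes-x-arrays intensity matrix to
    share the same empirical distribution (the mean of the sorted columns).
    Ties are broken by sort order; production tools average tied ranks."""
    ranks = np.argsort(np.argsort(X, axis=0), axis=0)  # per-column ranks
    reference = np.sort(X, axis=0).mean(axis=1)        # shared target distribution
    return reference[ranks]

# Invented 4-gene x 3-array example
X = np.array([[5.0, 4.0, 3.0],
              [2.0, 1.0, 4.0],
              [3.0, 4.0, 6.0],
              [4.0, 2.0, 8.0]])
Xn = quantile_normalize(X)
```

After normalization every column of `Xn` has exactly the same set of values, so any remaining between-array differences reflect gene ranks rather than chip-wide intensity shifts.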

4.
A probability-based quantification framework is presented for the calculation of relative peptide and protein abundance in label-free and label-dependent LC-MS proteomics data. The results are accompanied by credible intervals and regulation probabilities. The algorithm takes into account data uncertainties via Poisson statistics modified by a noise contribution that is determined automatically during an initial normalization stage. Protein quantification relies on assignments of component peptides to the acquired data. These assignments are generally of variable reliability and may not be present across all of the experiments comprising an analysis. It is also possible for a peptide to be assigned to more than one protein in a given mixture. For these reasons the algorithm accepts a prior probability of peptide assignment for each intensity measurement. The model is constructed in such a way that outliers of any type can be automatically reweighted. Two distinct normalization methods can be employed: the first is based on a user-defined subset of peptides, while the second relies on the presence of a dominant background of endogenous peptides whose concentration is assumed to be unaffected. Normalization is performed using the same computational and statistical procedures employed by the main quantification algorithm. The performance of the algorithm is illustrated on example data sets, and its utility demonstrated for typical proteomics applications. The quantification algorithm supports relative protein quantification based on precursor and product ion intensities acquired by data-dependent methods, originating from all common isotopically labeled approaches, as well as from label-free, ion-intensity-based data-independent methods.
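As a rough sketch of the kind of weighting this abstract describes (not the authors' actual model), peptide log-ratios can be rolled up into a protein-level estimate with weights that fold in both a Poisson-plus-noise variance and a prior assignment probability. The variance formula and the noise constant below are illustrative assumptions.

```python
import numpy as np

def protein_log_ratio(intens_a, intens_b, assign_prob, noise=10.0):
    """Combine peptide-level intensities from two conditions into one
    protein-level log2 ratio. Each peptide is weighted by its prior
    assignment probability and by an approximate inverse variance from
    counting (Poisson-like) statistics plus a flat noise term."""
    log_ratios = np.log2(intens_b / intens_a)
    # delta-method variance of a log intensity under var(I) ~ I + noise
    var = (1.0 / intens_a + 1.0 / intens_b) \
        + noise * (1.0 / intens_a**2 + 1.0 / intens_b**2)
    w = assign_prob / var
    return float(np.sum(w * log_ratios) / np.sum(w))
```

Low-intensity peptides (high relative counting noise) and doubtful assignments (low prior probability) both contribute less to the protein estimate.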

5.
Signal data from DNA-microarray ("chip") technology can be noisy; i.e., the signal variation of one gene on a series of repetitive chips can be substantial. It is becoming more and more recognized that a sufficient number of chip replicates has to be made in order to separate correct from incorrect signals. To reduce the systematic fraction of the noise deriving from pipetting errors, from different treatment of chips during hybridization, and from chip-to-chip manufacturing variability, normalization schemes are employed. We present here an iterative nonparametric nonlinear normalization scheme called simultaneous alternating conditional expectation (sACE), which is designed to maximize correlation between chip repeats in all-chip-against-all space. We tested sACE on 28 experiments with 158 Affymetrix one-color chips. The procedure should be equally applicable to other DNA-microarray technologies, e.g., two-color chips. We show that the reduction of noise compared to a simple normalization scheme like the widely used linear global normalization leads to fewer false-positive calls, i.e., to fewer genes that have to be laboriously confirmed by independent methods such as TaqMan or quantitative PCR.
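The objective that sACE maximizes, correlation between chip repeats in all-chip-against-all space, is easy to state in code even though the ACE machinery itself is not reproduced here:

```python
import numpy as np

def mean_pairwise_correlation(chips):
    """Average Pearson correlation over all pairs of chips (columns of a
    genes-x-chips matrix): the all-chip-against-all quantity that an
    sACE-style normalization tries to maximize on replicate chips."""
    C = np.corrcoef(chips.T)            # chips x chips correlation matrix
    iu = np.triu_indices_from(C, k=1)   # each unordered pair once
    return float(C[iu].mean())
```

A candidate normalization is judged by whether it raises this value on replicate chips without distorting true biological differences.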

6.
We propose new parametric frameworks of regression analysis with the conditional mode of a bounded response as the focal point of interest. Covariate effect estimation and prediction based on the maximum likelihood method under two new classes of regression models are demonstrated. We also develop graphical and numerical diagnostic tools to detect various sources of model misspecification. Predictions based on different central tendency measures inferred using various regression models are compared using synthetic data in simulations. Finally, we conduct a regression analysis of data from the Alzheimer's Disease Neuroimaging Initiative to demonstrate practical implementation of the proposed methods. Supporting Information containing technical details and additional simulation and data analysis results is available online.
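A minimal sketch of what a parametric mode-regression fit for a bounded response can look like, assuming a beta likelihood reparametrized by its mode with a logit link. This is one plausible member of such a model class, not necessarily one of the paper's; the link, concentration parametrization, and simulated data are all assumptions.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.special import expit
from scipy.stats import beta as beta_dist

def neg_log_lik(theta, x, y):
    """Beta likelihood with the conditional MODE linked to the covariate:
    mode(x) = expit(b0 + b1*x), concentration kappa = 2 + exp(theta[2]) > 2,
    via the mode parametrization a = 1 + m*(kappa-2), b = 1 + (1-m)*(kappa-2)."""
    b0, b1, log_k = theta
    m = expit(b0 + b1 * x)
    kappa = 2.0 + np.exp(log_k)
    a = 1.0 + m * (kappa - 2.0)
    b = 1.0 + (1.0 - m) * (kappa - 2.0)
    return -beta_dist.logpdf(y, a, b).sum()

# Synthetic bounded responses with a known mode curve
rng = np.random.default_rng(0)
x = rng.uniform(-2.0, 2.0, 500)
m_true = expit(0.5 + 1.0 * x)
kappa_true = 12.0
y = rng.beta(1.0 + m_true * (kappa_true - 2.0),
             1.0 + (1.0 - m_true) * (kappa_true - 2.0))
fit = minimize(neg_log_lik, x0=np.zeros(3), args=(x, y), method="Nelder-Mead")
```

Maximizing this likelihood recovers the covariate effect on the conditional mode, the quantity these frameworks place at the center of the analysis.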

8.
The Illumina Infinium HumanMethylation450 BeadChip has emerged as one of the most popular platforms for genome-wide profiling of DNA methylation. While the technology is widespread, systematic technical biases are believed to be present in the data. For example, this array incorporates two different chemical assays, i.e., Type I and Type II probes, which exhibit different technical characteristics and potentially complicate the computational and statistical analysis. Several normalization methods have been introduced recently to adjust for possible biases. However, there is considerable debate within the field on which normalization procedure should be used and indeed whether normalization is even necessary. Yet despite the importance of the question, there has been little comprehensive comparison of normalization methods. We sought to systematically compare several popular normalization approaches using the Norwegian Mother and Child Cohort Study (MoBa) methylation data set and the technical replicates analyzed with it as a case study. We assessed both the reproducibility between technical replicates following normalization and the effect of normalization on association analysis. Results indicate that the raw data are already highly reproducible, some normalization approaches can slightly improve reproducibility, but other normalization approaches may introduce more variability into the data. Results also suggest that differences in association analysis after applying different normalizations are not large when the signal is strong, but when the signal is more modest, different normalizations can yield very different numbers of findings that meet a weaker statistical significance threshold. Overall, our work provides a useful, objective assessment of the effectiveness of key normalization methods.
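The replicate-reproducibility criterion this kind of study relies on can be sketched as a root-mean-square discordance between paired technical replicates; this is a simplification, not the paper's exact metric.

```python
import numpy as np

def replicate_rmsd(beta_rep1, beta_rep2):
    """Root-mean-square difference between the methylation beta values of two
    technical replicates of the same sample; lower means more reproducible.
    A normalization is worth applying only if it lowers this versus raw data."""
    d = np.asarray(beta_rep1) - np.asarray(beta_rep2)
    return float(np.sqrt(np.mean(d ** 2)))
```

Comparing this value on raw versus normalized beta values directly operationalizes the abstract's finding that some normalizations improve reproducibility while others add variability.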

11.
Over the last decade, DNA microarray technology has made a great contribution to the life sciences. The MicroArray Quality Control (MAQC) project demonstrated how to analyze expression microarrays. More recently, microarray technology has been applied to comprehensive microRNA expression profiling, and several platforms of microRNA microarray chips are commercially available. We therefore compared the repeatability and comparability of five different microRNA microarray platforms (Agilent, Ambion, Exiqon, Invitrogen and Toray) using 309 microRNA probes, and the TaqMan microRNA system using 142 microRNA probes. This study demonstrated that microRNA microarrays have high intra-platform repeatability and good comparability to quantitative RT-PCR of microRNAs. Among the five platforms, the Agilent and Toray arrays showed relatively better performance than the others. However, the current lineup of commercially available microRNA microarray systems fails to show good inter-platform concordance, probably because of the lack of an adequate normalization method and severe divergence in the stringency of detection-call criteria between platforms. This study provides basic information about the performance of, and problems specific to, current microRNA microarray systems.
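Inter-platform concordance of the kind examined here is commonly summarized with rank correlations, since absolute intensity scales differ widely across vendors. A sketch, with platform names and values invented:

```python
import numpy as np
from scipy.stats import spearmanr

def interplatform_concordance(expr):
    """expr maps platform name -> intensity vector over the SAME probe set.
    Returns the Spearman rank correlation for every platform pair; ranks
    sidestep the very different absolute intensity scales across vendors."""
    names = sorted(expr)
    out = {}
    for i, p in enumerate(names):
        for q in names[i + 1:]:
            rho, _ = spearmanr(expr[p], expr[q])
            out[(p, q)] = float(rho)
    return out
```

Low pairwise values across a shared probe panel are exactly the poor inter-platform concordance the study reports.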

12.
《Epigenetics》2013,8(2):318-329

15.
ARChip Epoxy and ARChip UV are presented as novel chip platforms for oligonucleotide immobilization. ARChip Epoxy is made of reactive epoxy resin available commercially. ARChip UV consists of photoactivatable poly(styrene-co-4-vinylbenzylthiocyanate). Both ARChip surfaces are tested in a model assay based on oligonucleotide probes from a real-life genotyping project and are evaluated in comparison with five commercial chip surfaces based on nitrocellulose, epoxy, and aldehyde polymer, and two different aminosilanes. Optimum print buffer, spotter compatibility, and data normalization are discussed.

17.
Improving predictions of restoration outcomes is increasingly important to resource managers for accountability and adaptive management, yet there is limited guidance for selecting a predictive model from the multitude available. The goal of this article was to identify an optimal predictive framework for restoration ecology using 11 modeling frameworks (including machine learning, inferential, and ensemble approaches) and three data groups (field data, geographic data [GIS], and a combination thereof). We test this approach with a dataset from a large postfire sagebrush reestablishment project in the Great Basin, U.S.A. Predictive power varied among models and data groups, ranging from 58% to 79% accuracy. Finer‐scale field data generally had the greatest predictive power, although GIS data were present in the best models overall. An ensemble prediction computed from the 10 models parameterized to field data was well above average for accuracy but was outperformed by others that prioritized model parsimony by selecting predictor variables based on rankings of their importance among all candidate models. The variation in predictive power among a suite of modeling frameworks underscores the importance of a model comparison and refinement approach that evaluates multiple models and data groups, and selects variables based on their contribution to predictive power. The enhanced understanding of factors influencing restoration outcomes accomplished by this framework has the potential to aid the adaptive management process for improving future restoration outcomes.

18.
MOTIVATION: There is a large and growing effort toward improving the platforms, experiment designs, and data analysis methods for microarray expression profiling. Along with the growing richness of approaches comes growing confusion among scientists as to how to make objective comparisons and choices between them for different applications. There is a need for a standard framework for the microarray community to compare and improve analytical and statistical methods. RESULTS: We report a microarray data set comprising 204 in-situ-synthesized oligonucleotide arrays, each hybridized with two-color cDNA samples derived from 20 different human tissues and cell lines. The design of the approximately 24 000 60mer oligonucleotides that report approximately 2500 known genes on the arrays, and the design of the hybridization experiments, were carried out in a way that supports performance assessment of alternative data-processing approaches and of alternative experiment and array designs. We also propose standard figures of merit for success in detecting individual differential expression changes or expression levels, and for detecting similarities and differences in expression patterns across genes and experiments. We expect this data set and the proposed figures of merit to provide a standard framework for much of the microarray community to compare and improve many analytical and statistical methods relevant to microarray data analysis, including image processing, normalization, error modeling, combining of multiple reporters per gene, use of replicate experiments, and sample-referencing schemes in measurements based on expression change. AVAILABILITY/SUPPLEMENTARY INFORMATION: Expression data and supplementary information are available at http://www.rii.com/publications/2003/HE_SDS.htm
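One simple figure of merit for detecting individual differential-expression changes, in the spirit of those proposed, is a replicate-based z-score per gene; this is an illustrative stand-in, not the paper's exact definition.

```python
import numpy as np

def differential_call_z(log_ratios):
    """Per-gene z-score of the mean log-ratio across replicate two-color
    hybridizations (rows: genes, columns: replicates); the replicate spread
    supplies a simple error model for calling differential expression."""
    m = log_ratios.mean(axis=1)
    se = log_ratios.std(axis=1, ddof=1) / np.sqrt(log_ratios.shape[1])
    return m / se
```

Scoring competing processing pipelines by how well such z-scores separate genes with known changes from unchanged genes is exactly the kind of benchmark the data set is designed to support.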

19.

Background

Differences in sample collection, biomolecule extraction, and instrument variability introduce bias into data generated by liquid chromatography coupled with mass spectrometry (LC-MS). Normalization is used to address these issues. In this paper, we introduce a new normalization method using a Gaussian process regression model (GPRM) that utilizes information from individual scans within an extracted ion chromatogram (EIC) of a peak. The proposed method is particularly applicable for normalization based on the analysis order of LC-MS runs. Our method uses measurement variabilities estimated from LC-MS data acquired on quality control samples to correct for bias caused by instrument drift. A maximum likelihood approach is used to find the optimal parameters of the fitted GPRM. We review several normalization methods and compare their performance with that of the GPRM.

Results

To evaluate the performance of different normalization methods, we consider LC-MS data from a study in which a metabolomic approach is used to discover biomarkers for liver cancer. The LC-MS data were acquired by analysis of sera from liver cancer patients and cirrhotic controls. In addition, LC-MS runs from a quality control (QC) sample are included to assess run-to-run variability and to evaluate the ability of the various normalization methods to reduce this undesired variability. ANOVA models are then applied to the normalized LC-MS data to identify ions whose intensity measurements differ significantly between cases and controls.

Conclusions

One of the challenges in using label-free LC-MS for quantitation of biomolecules is systematic bias in the measurements. Several normalization methods have been introduced to overcome this issue, but there is no universally applicable approach at the present time; each data set should be carefully examined to determine the most appropriate normalization method. We review several existing methods and introduce the GPRM for normalization of LC-MS data. On our in-house data set, we show that the GPRM outperforms the other normalization methods considered here in terms of decreasing the variability of ion intensities among quality control runs.
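A sketch of run-order drift correction in the spirit of the GPRM, using scikit-learn's GP regressor as a stand-in for the authors' model; the kernel choice, QC spacing, and the simulated drift curve are assumptions for illustration.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel, ConstantKernel

def drift_correct(run_order, intensity, qc_mask):
    """Fit a GP to QC-sample intensities versus LC-MS analysis order and
    divide every run by the predicted drift trend (rescaled so the average
    run is left unchanged)."""
    X_qc = run_order[qc_mask].reshape(-1, 1)
    y_qc = intensity[qc_mask]
    kernel = ConstantKernel(1.0) * RBF(length_scale=10.0) + WhiteKernel(noise_level=1e-2)
    gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True)
    gp.fit(X_qc, y_qc)
    trend = gp.predict(run_order.reshape(-1, 1))
    return intensity / (trend / trend.mean())

# Simulated instrument drift: the same ion's intensity decays over 40 runs.
run_order = np.arange(40.0)
intensity = 1000.0 * (1.0 - 0.005 * run_order)
qc_mask = (run_order % 5 == 0)          # every fifth injection is a QC sample
corrected = drift_correct(run_order, intensity, qc_mask)
```

Since the QC sample is chemically identical across runs, any residual spread in `corrected[qc_mask]` measures how much drift the normalization failed to remove, which is the evaluation criterion used in the conclusions above.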
