共查询到20条相似文献,搜索用时 15 毫秒
1.
A random variance model for detection of differential gene expression in small microarray experiments 总被引:9,自引:0,他引:9
MOTIVATION: Microarray techniques provide a valuable way of characterizing the molecular nature of disease. Unfortunately expense and limited specimen availability often lead to studies with small sample sizes. This makes accurate estimation of variability difficult, since variance estimates made on a gene by gene basis will have few degrees of freedom, and the assumption that all genes share equal variance is unlikely to be true. RESULTS: We propose a model by which the within gene variances are drawn from an inverse gamma distribution, whose parameters are estimated across all genes. This results in a test statistic that is a minor variation of those used in standard linear models. We demonstrate that the model assumptions are valid on experimental data, and that the model has more power than standard tests to pick up large changes in expression, while not increasing the rate of false positives. AVAILABILITY: This method is incorporated into BRB-ArrayTools version 3.0 (http://linus.nci.nih.gov/BRB-ArrayTools.html). SUPPLEMENTARY MATERIAL: ftp://linus.nci.nih.gov/pub/techreport/RVM_supplement.pdf 相似文献
2.
The recent advent of exon microarrays has made it possible to reveal differences in alternative splicing events on a global scale. We introduce a novel statistical procedure that takes full advantage of the probe-level information on Affymetrix exon arrays when detecting differential splicing between sample groups. In comparison to existing ranking methods, the procedure shows superior reproducibility and accuracy in distinguishing true biological findings from background noise in high agreement with experimental validations. 相似文献
3.
Hironori Fujisawa Youko Horiuchi Yoshiaki Harushima Toyoyuki Takada Shinto Eguchi Takako Mochizuki Takayuki Sakaguchi Toshihiko Shiroishi Nori Kurata 《BMC bioinformatics》2009,10(1):131
Background
High-density short oligonucleotide microarrays are useful tools for studying biodiversity, because they can be used to investigate both nucleotide and expression polymorphisms. However, when different strains (or species) produce different signal intensities after mRNA hybridization, it is not easy to determine whether the signal intensities were affected by nucleotide or expression polymorphisms. To overcome this difficulty, nucleotide and expression polymorphisms are currently examined separately. 相似文献4.
5.
Seo J Bakay M Chen YW Hilmer S Shneiderman B Hoffman EP 《Bioinformatics (Oxford, England)》2004,20(16):2534-2544
MOTIVATION: The most commonly utilized microarrays for mRNA profiling (Affymetrix) include 'probe sets' of a series of perfect match and mismatch probes (typically 22 oligonucleotides per probe set). There are an increasing number of reported 'probe set algorithms' that differ in their interpretation of a probe set to derive a single normalized 'signal' representative of expression of each mRNA. These algorithms are known to differ in accuracy and sensitivity, and optimization has been done using a small set of standardized control microarray data. We hypothesized that different mRNA profiling projects have varying sources and degrees of confounding noise, and that these should alter the choice of a specific probe set algorithm. Also, we hypothesized that use of the Microarray Suite (MAS) 5.0 probe set detection p-value as a weighting function would improve the performance of all probe set algorithms. RESULTS: We built an interactive visual analysis software tool (HCE2W) to test and define parameters in Affymetrix analyses that optimize the ratio of signal (desired biological variable) versus noise (confounding uncontrolled variables). Five probe set algorithms were studied with and without statistical weighting of probe sets using the MAS 5.0 probe set detection p-values. The signal-to-noise ratio optimization method was tested in two large novel microarray datasets with different levels of confounding noise, a 105 sample U133A human muscle biopsy dataset (11 groups: mutation-defined, extensive noise), and a 40 sample U74A inbred mouse lung dataset (8 groups: little noise). Performance was measured by the ability of the specific probe set algorithm, with and without detection p-value weighting, to cluster samples into the appropriate biological groups (unsupervised agglomerative clustering with F-measure values). Of the total random sampling analyses, 50% showed a highly statistically significant difference between probe set algorithms by ANOVA [F(4,10) > 14, p < 0.0001], with weighting by MAS 5.0 detection p-value showing significance in the mouse data by ANOVA [F(1,10) > 9, p < 0.013] and paired t-test [t(9) = -3.675, p = 0.005]. Probe set detection p-value weighting had the greatest positive effect on performance of dChip difference model, ProbeProfiler and RMA algorithms. Importantly, probe set algorithms did indeed perform differently depending on the specific project, most probably due to the degree of confounding noise. Our data indicate that significantly improved data analysis of mRNA profile projects can be achieved by optimizing the choice of probe set algorithm with the noise levels intrinsic to a project, with dChip difference model with MAS 5.0 detection p-value continuous weighting showing the best overall performance in both projects. Furthermore, both existing and newly developed probe set algorithms should incorporate a detection p-value weighting to improve performance. AVAILABILITY: The Hierarchical Clustering Explorer 2.0 is available at http://www.cs.umd.edu/hcil/hce/ Murine arrays (40 samples) are publicly available at the PEPR resource (http://microarray.cnmcresearch.org/pgadatatable.asp http://pepr.cnmcresearch.org Chen et al., 2004). 相似文献
6.
Richard J. Wall Mohammad Zeeshan Nicholas J. Katris Rebecca Limenitakis Edward Rea Jessica Stock Declan Brady Ross F. Waller Anthony A. Holder Rita Tewari 《Cellular microbiology》2019,21(10)
The myosin superfamily comprises of actin‐dependent eukaryotic molecular motors important in a variety of cellular functions. Although well studied in many systems, knowledge of their functions in Plasmodium, the causative agent of malaria, is restricted. Previously, six myosins were identified in this genus, including three Class XIV myosins found only in Apicomplexa and some Ciliates. The well characterized MyoA is a Class XIV myosin essential for gliding motility and invasion. Here, we characterize all other Plasmodium myosins throughout the parasite life cycle and show that they have very diverse patterns of expression and cellular location. MyoB and MyoE, the other two Class XIV myosins, are expressed in all invasive stages, with apical and basal locations, respectively. Gene deletion revealed that MyoE is involved in sporozoite traversal, MyoF and MyoK are likely essential in the asexual blood stages, and MyoJ and MyoB are not essential. Both MyoB and its essential light chain (MCL‐B) are localised at the apical end of ookinetes but expressed at completely different time points. This work provides a better understanding of the role of actomyosin motors in Apicomplexan parasites, particularly in the motile and invasive stages of Plasmodium during sexual and asexual development within the mosquito. 相似文献
7.
8.
9.
A comparison of cDNA, oligonucleotide, and Affymetrix GeneChip gene expression microarray platforms.
Yong Woo Jason Affourtit Sandra Daigle Agnes Viale Kevin Johnson Jurgen Naggert Gary Churchill 《Journal of biomolecular techniques》2004,15(4):276-284
We have conducted a study to compare the variability in measured gene expression levels associated with three types of microarray platforms. Total RNA samples were obtained from liver tissue of four male mice, two each from inbred strains A/J and C57BL/6J. The same four samples were assayed on Affymetrix Mouse Genome Expression Set 430 GeneChips (MOE430A and MOE430B), spotted cDNA microarrays, and spotted oligonucleotide microarrays using eight arrays of each type. Variances associated with measurement error were observed to be comparable across all microarray platforms. The MOE430A GeneChips and cDNA arrays had higher precision across technical replicates than the MOE430B GeneChips and oligonucleotide arrays. The Affymetrix platform showed the greatest range in the magnitude of expression levels followed by the oligonucleotide arrays. We observed good concordance in both estimated expression level and statistical significance of common genes between the Affymetrix MOE430A GeneChip and the oligonucleotide arrays. Despite their apparently high precision, cDNA arrays showed poor concordance with other platforms. 相似文献
10.
Cortez DA Tonon AP Colepicolo P Vêncio RZ 《Genetics and molecular research : GMR》2011,10(4):3586-3595
HTself is a web-based bioinformatics tool designed to deal with the classification of differential gene expression in low replication microarray studies. It is based on a statistical test that uses self-self experiments to derive intensity-dependent cutoffs. We developed an extension of HTself, originally released in 2005, by calculating P values instead of using a fixed acceptance level α. As before, the statistic used to compute single-spot P values is obtained from the Gaussian kernel density estimator method applied to self-self data. Different spots corresponding to the same biological gene (replicas) give rise to a set of independent P values that can be combined by well-known statistical methods. The combined P value can be used to decide whether a gene can be considered differentially expressed or not. HTself2 is a new version of HTself that uses P values combination. It is implemented as a user-friendly desktop application to help laboratories without a bioinformatics infrastructure. 相似文献
11.
Joshua D. Campbell Gang Liu Lingqi Luo Ji Xiao Joseph Gerrein Brenda Juan-Guardela John Tedrow Yuriy O. Alekseyev Ivana V. Yang Mick Correll Mark Geraci John Quackenbush Frank Sciurba David A. Schwartz Naftali Kaminski W. Evan Johnson Stefano Monti Avrum Spira Jennifer Beane Marc E. Lenburg 《RNA (New York, N.Y.)》2015,21(2):164-171
12.
Background
To identify differentially expressed genes (DEGs) from microarray data, users of the Affymetrix GeneChip system need to select both a preprocessing algorithm to obtain expression-level measurements and a way of ranking genes to obtain the most plausible candidates. We recently recommended suitable combinations of a preprocessing algorithm and gene ranking method that can be used to identify DEGs with a higher level of sensitivity and specificity. However, in addition to these recommendations, researchers also want to know which combinations enhance reproducibility. 相似文献13.
Square-plate culture method allows detection of differential gene expression and screening of novel, region-specific genes in Aspergillus oryzae 总被引:1,自引:0,他引:1
Masai K Maruyama J Sakamoto K Nakajima H Akita O Kitamoto K 《Applied microbiology and biotechnology》2006,71(6):881-891
When grown on solid agar medium, the mycelium of a filamentous fungus, Aspergillus oryzae, forms three morphologically distinct regions: the tip (T), white (W), and basal (B) regions. In this study, we developed the square-plate culture method, a novel culture method that enabled the extraction of mRNA samples from the three regions and analyzed the differential gene expression of the A. oryzae mycelium in concert with the microarray technique. Expression of genes involved in protein synthesis was predominant in the T region; relative expression was, at most, six times higher in the T region compared to the other regions. Genes encoding hypothetical proteins were expressed at high levels in the W and B regions. In addition, genes coding transporters/permeases were predominantly transcribed in the B region. By analyzing the expression patterns of genes in the three regions, we demonstrated the dynamic changes in the regulation of gene expression that occur along the mycelium of filamentous fungi. Consequently, our study established a method to analyze and screen for region-specific genes whose function may be essential for morphogenesis and differentiation in filamentous fungi and whose traits may be beneficial to the biotechnology industry.Electronic Supplementary Materials Supplementary material is available for this article at 相似文献
14.
15.
16.
《Animal : an international journal of animal bioscience》2012,6(9):1377-1388
Eventing competitions in Great Britain (GB) comprise three disciplines, each split into four grades, yielding 12 discipline-grade traits. As there is a demand for tools to estimate (co)variance matrices with a large number of traits, the aim of this work was to investigate different methods to produce large (co)variance matrices using GB eventing data. Data from 1999 to 2008 were used and penalty points were converted to normal scores. A sire model was utilised to estimate fixed effects of gender, age and class, and random effects of sire, horse and rider. Three methods were used to estimate (co)variance matrices. Method 1 used a method based on Gibbs sampling and data augmentation and imputation. Methods 2a and 2b combined sub-matrices from bivariate analyses; one took samples from a multivariate Normal distribution defined by the covariance matrix from each bivariate analysis, then analysed these data in a 12-trait multivariate analysis; the other replaced negative eigenvalues in the matrix with positive values to obtain a positive definite (co)variance matrix. A formal comparison of models could not be conducted; however, estimates from all methods, particularly Methods 2a/2b, were in reasonable agreement. The computational requirements of Method 1 were much less compared with Methods 2a or 2b. Method 2a heritability estimates were as follows: for dressage 7.2% to 9.0%, for show jumping 8.9% to 16.2% and for cross-country 1.3% to 1.4%. Method 1 heritability estimates were higher for the advanced grades, particularly for dressage (17.1%) and show jumping (22.6%). Irrespective of the model, genetic correlations between grades, for dressage and show jumping, were positive, high and significant, ranging from 0.59 to 0.99 for Method 2a and 0.78 to 0.95 for Method 1. For cross-country, using Method 2a, genetic correlations were only significant between novice and pre-novice (0.75); however, using Method 1 estimates were all significant and low to moderate (0.36 to 0.70). Between-discipline correlations were all low and of mixed sign. All methods produced positive definite 12 × 12 (co)variance matrices, suitable for the prediction of breeding values. Method 1 benefits from much reduced computational requirements, and by performing a true multivariate analysis. 相似文献
17.
18.
19.
20.
Sébastien Lemieux 《BMC bioinformatics》2006,7(1):391-9