共查询到20条相似文献,搜索用时 0 毫秒
1.
A method for cross-species gene expression analysis with high-density oligonucleotide arrays 总被引:7,自引:0,他引:7
DNA microarrays have been widely used in gene expression analysis of biological processes. Due to a lack of sequence information, the applications have been largely restricted to humans and a few model organisms. Presented within this study are results of the cross-species hybridization with Affymetrix human high-density oligonucleotide arrays or GeneChip® using distantly related mammalian species; cattle, pig and dog. Based on the unique feature of the Affymetrix GeneChip® where every gene is represented by multiple probes, we hypothesized that sequence conservation within mammals is high enough to generate sufficient signals from some of the probes for expression analysis. We demonstrated that while overall hybridization signals are low for cross-species hybridization, a few probes of most genes still generated signals equivalent to the same-species hybridization. By masking the poorly hybridized probes electronically, the remaining probes provided reliable data for gene expression analysis. We developed an algorithm to select the reliable probes for analysis utilizing the match/mismatch feature of GeneChip®. When comparing gene expression between two tissues using the selected probes, we found a linear correlation between the cross-species and same-species hybridization. In addition, we validated cross-species hybridization results by quantitative PCR using randomly selected genes. The method shown herein could be applied to both plant and animal research. 相似文献
2.
Lahti L Elo LL Aittokallio T Kaski S 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2011,8(1):217-225
Probe defects are a major source of noise in gene expression studies. While existing approaches detect noisy probes based on external information such as genomic alignments, we introduce and validate a targeted probabilistic method for analyzing probe reliability directly from expression data and independently of the noise source. This provides insights into the various sources of probe-level noise and gives tools to guide probe design. 相似文献
3.
To date, most studies of multigenic expression patterns by long DNA array have used DNA fragments as probes. These probes are usually obtained as PCR products, and this represents a time-consuming and error-prone approach, requiring strict quality control. The present study examines the use of 40- and 70-mer synthetic oligonucleotides as probes for DNA array analysis with radioactive labeled targets. Design, spotting onto nylon filters, and hybridization conditions were determined and optimized. In this approach, the sensitivity and the specificity of the hybridization appear comparable to the conventional long DNA probes assay, permitting the analysis of small samples of approximately 1 microg total RNA. The long oligonucleotide array thus provides a very convenient method for the analysis of gene expression patterns in biological specimens and in clinical research. 相似文献
4.
5.
MOTIVATION: Oligonucleotide expression arrays exhibit systematic and reproducible variation produced by the multiple distinct probes used to represent a gene. Recently, a gene expression index has been proposed that explicitly models probe effects, and provides improved fits of hybridization intensity for arrays containing perfect match (PM) and mismatch (MM) probe pairs. RESULTS: Here we use a combination of analytical arguments and empirical data to show directly that the estimates provided by model-based expression indexes are superior to those provided by commercial software. The improvement is greatest for genes in which probe effects vary substantially, and modeling the PM and MM intensities separately is superior to using the PM-MM differences. To empirically compare expression indexes, we designed a mixing experiment involving three groups of human fibroblast cells (serum starved, serum stimulated, and a 50:50 mixture of starved/stimulated), with six replicate HuGeneFL arrays in each group. Careful spiking of control genes provides evidence that 88-98% of the genes on the array are detectably transcribed, and that the model-based estimates can accurately detect the presence versus absence of a gene. The use of extensive replication from single RNA sources enables exploration of the technical variability of the array. 相似文献
6.
Background
One of the important challenges in microarray analysis is to take full advantage of previously accumulated data, both from one's own laboratory and from public repositories. Through a comparative analysis on a variety of datasets, a more comprehensive view of the underlying mechanism or structure can be obtained. However, as we discover in this work, continual changes in genomic sequence annotations and probe design criteria make it difficult to compare gene expression data even from different generations of the same microarray platform. 相似文献7.
8.
9.
We present an analysis of physical chemical constraints on the accuracy of DNA micro-arrays under equilibrium and nonequilibrium conditions. At the beginning of the article we describe an algorithm for choosing a probe set with high specificity for targeted genes under equilibrium conditions. The algorithm as well as existing methods is used to select probes from the full Saccharomyces cerevisiae genome, and these probe sets, along with a randomly selected set, are used to simulate array experiments and identify sources of error. Inasmuch as specificity and sensitivity are maximum at thermodynamic equilibrium, we are particularly interested in the factors that affect the approach to equilibrium. These are analyzed later in the article, where we develop and apply a rapidly executable method to simulate the kinetics of hybridization on a solid phase support. Although the difference between solution phase and solid phase hybridization is of little consequence for specificity and sensitivity when equilibrium is achieved, the kinetics of hybridization has a pronounced effect on both. We first use the model to estimate the effects of diffusion, crosshybridization, relaxation time, and target concentration on the hybridization kinetics, and then investigate the effects of the most important kinetic parameters on specificity. We find even when using probe sets that have high specificity at equilibrium that substantial crosshybridization is present under nonequilibrium conditions. Although those complexes that differ from perfect complementarity by more than a single base do not contribute to sources of error at equilibrium, they slow the approach to equilibrium dramatically and confound interpretation of the data when they dissociate on a time scale comparable to the time of the experiment. For the best probe set, our simulation shows that steady-state behavior is obtained in a relaxation time of approximately 12-15 h for experimental target concentrations approximately (10(-13) - 10(-14))M, but the time is greater for lower target concentrations in the range (10(-15)-10(-16))M. The result points to an asymmetry in the accuracy with which up- and downregulated genes are identified. 相似文献
10.
MOTIVATION: The need for normalization in microarray experiments has been well documented in the literature. Currently, many analysis methods treat normalization and analysis as a series of steps, with summarized data carried forward to the next step. RESULTS: We present a unified algorithm which incorporates normalization and class comparison in one analysis using probe level perfect match and mismatch data. The algorithm is based on calibration models common to most biological assays, and the resulting chip-specific parameters have a natural interpretation. We show that the algorithm fits into the statistical generalized linear models framework, describe a practical fitting strategy and present results of the algorithm applied to an example dataset as well as based on metrics used in affycomp. The algorithm ranks amongst the top third of the affycomp competitors, performing best in measures of bias. AVAILABILITY: R functions are available on request from the authors. 相似文献
11.
A model for measurement error for gene expression arrays. 总被引:20,自引:0,他引:20
We introduce a model for measurement error in gene expression arrays as a function of the expression level. This model, together with analysis methods, data transformations, and weighting, allows much more precise comparisons of gene expression, and provides guidance for analysis of background, determination of confidence intervals, and preprocessing data for multivariate analysis. 相似文献
12.
Background
Affymetrix GeneChips™ are an important tool in many facets of biological research. Recently, notable design changes to the chips have been made. In this study, we use publicly available data from Affymetrix to gauge the performance of three human gene expression arrays: Human Genome U133 Plus 2.0 (U133), Human Exon 1.0 ST (HuEx) and Human Gene 1.0 ST (HuGene). 相似文献13.
A rapid method for the construction of oligonucleotide arrays 总被引:2,自引:0,他引:2
A simple method has been devised to construct oligonucleotide array on a variety of surfaces using commonly available reagents and chemistry with good efficiency and accuracy. The method involves the generation of hydroxyl functionalities on glass, polypropylene, polyethylene, and commonly used surfaces for construction of oligonucleotide arrays followed by their activation with trifluoroethanesulfonyl chloride (tresyl chloride). The activated surface in the subsequent reaction is used to covalently immobilize oligonucleotides in regioselective fashion to create an oligonucleotide array. The surface bound tresyl sulfonate esters allow the immobilization of oligonucleotides specifically via their 3'- or 5'-end having mercaptohexyl- or aminohexyl functionalities. The constructed oligonucleotide arrays were successfully used to analyze oligonucleotides by hybridization technique. 相似文献
14.
15.
Microarray experiments are being increasingly used in molecular biology. A common task is to detect genes with differential
expression across two experimental conditions, such as two different tissues or the same tissue at two time points of biological
development. To take proper account of statistical variability, some statistical approaches based on the t-statistic have been proposed. In constructing the t-statistic, one needs to estimate the variance of gene expression levels. With a small number of replicated array experiments,
the variance estimation can be challenging. For instance, although the sample variance is unbiased, it may have large variability,
leading to a large mean squared error. For duplicated array experiments, a new approach based on simple averaging has recently
been proposed in the literature. Here we consider two more general approaches based on nonparametric smoothing. Our goal is
to assess the performance of each method empirically. The three methods are applied to a colon cancer data set containing
2,000 genes. Using two arrays, we compare the variance estimates obtained from the three methods. We also consider their impact
on the t-statistics. Our results indicate that the three methods give variance estimates close to each other. Due to its simplicity
and generality, we recommend the use of the smoothed sample variance for data with a small number of replicates.
Electronic Publication 相似文献
16.
17.
18.
Goldstein DR 《Bioinformatics (Oxford, England)》2006,22(19):2364-2372
MOTIVATION: Studies of gene expression using high-density short oligonucleotide arrays have become a standard in a variety of biological contexts. Of the expression measures that have been proposed to quantify expression in these arrays, multi-chip-based measures have been shown to perform well. As gene expression studies increase in size, however, utilizing multi-chip expression measures is more challenging in terms of computing memory requirements and time. RESULTS: A strategic alternative to exact multi-chip quantification on a full large chip set is to approximate expression values based on subsets of chips. This paper introduces an extrapolation method, Extrapolation Averaging (EA), and a resampling method, Partition Resampling (PR), to approximate expression in large studies. An examination of properties indicates that subset-based methods can perform well compared with exact expression quantification. The focus is on short oligonucleotide chips, but the same ideas apply equally well to any array type for which expression is quantified using an entire set of arrays, rather than for only a single array at a time. AVAILABILITY: Software implementing Partition Resampling and Extrapolation Averaging is under development as an R package for the BioConductor project. 相似文献
19.