首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 406 毫秒
1.

Background

High-density oligonucleotide arrays have become a valuable tool for high-throughput gene expression profiling. Increasing the array information density and improving the analysis algorithms are two important computational research topics.

Results

A new algorithm, Match-Only Integral Distribution (MOID), was developed to analyze high-density oligonucleotide arrays. Using known data from both spiking experiments and no-change experiments performed with Affymetrix GeneChip® arrays, MOID and the Affymetrix algorithm implemented in Microarray Suite 4.0 (MAS4) were compared. While MOID gave similar performance to MAS4 in the spiking experiments, better performance was observed in the no-change experiments. MOID also provides a set of alternative statistical analysis tools to MAS4. There are two main features that distinguish MOID from MAS4. First, MOID uses continuous P values for the likelihood of gene presence, while MAS4 resorts to discrete absolute calls. Secondly, MOID uses heuristic confidence intervals for both gene expression levels and fold change values, while MAS4 categorizes the significance of gene expression level changes into discrete fold change calls.

Conclusions

The results show that by using MOID, Affymetrix GeneChip® arrays may need as little as ten probes per gene without compromising analysis accuracy.  相似文献   

2.
Model-based cluster analysis of microarray gene-expression data   总被引:3,自引:0,他引:3  
Pan W  Lin J  Le CT 《Genome biology》2002,3(2):research0009.1-research00098

Background

Microarray technologies are emerging as a promising tool for genomic studies. The challenge now is how to analyze the resulting large amounts of data. Clustering techniques have been widely applied in analyzing microarray gene-expression data. However, normal mixture model-based cluster analysis has not been widely used for such data, although it has a solid probabilistic foundation. Here, we introduce and illustrate its use in detecting differentially expressed genes. In particular, we do not cluster gene-expression patterns but a summary statistic, the t-statistic.

Results

The method is applied to a data set containing expression levels of 1,176 genes of rats with and without pneumococcal middle-ear infection. Three clusters were found, two of which contain more than 95% genes with almost no altered gene-expression levels, whereas the third one has 30 genes with more or less differential gene-expression levels.

Conclusions

Our results indicate that model-based clustering of t-statistics (and possibly other summary statistics) can be a useful statistical tool to exploit differential gene expression for microarray data.  相似文献   

3.
Effects of filtering by Present call on analysis of microarray experiments   总被引:1,自引:0,他引:1  

Background

Affymetrix GeneChips® are widely used for expression profiling of tens of thousands of genes. The large number of comparisons can lead to false positives. Various methods have been used to reduce false positives, but they have rarely been compared or quantitatively evaluated. Here we describe and evaluate a simple method that uses the detection (Present/Absent) call generated by the Affymetrix microarray suite version 5 software (MAS5) to remove data that is not reliably detected before further analysis, and compare this with filtering by expression level. We explore the effects of various thresholds for removing data in experiments of different size (from 3 to 10 arrays per treatment), as well as their relative power to detect significant differences in expression.

Results

Our approach sets a threshold for the fraction of arrays called Present in at least one treatment group. This method removes a large percentage of probe sets called Absent before carrying out the comparisons, while retaining most of the probe sets called Present. It preferentially retains the more significant probe sets (p ≤ 0.001) and those probe sets that are turned on or off, and improves the false discovery rate. Permutations to estimate false positives indicate that probe sets removed by the filter contribute a disproportionate number of false positives. Filtering by fraction Present is effective when applied to data generated either by the MAS5 algorithm or by other probe-level algorithms, for example RMA (robust multichip average). Experiment size greatly affects the ability to reproducibly detect significant differences, and also impacts the effect of filtering; smaller experiments (3–5 samples per treatment group) benefit from more restrictive filtering (≥50% Present).

Conclusion

Use of a threshold fraction of Present detection calls (derived by MAS5) provided a simple method that effectively eliminated from analysis probe sets that are unlikely to be reliable while preserving the most significant probe sets and those turned on or off; it thereby increased the ratio of true positives to false positives.  相似文献   

4.

Background

Although expression microarrays have become a standard tool used by biologists, analysis of data produced by microarray experiments may still present challenges. Comparison of data from different platforms, organisms, and labs may involve complicated data processing, and inferring relationships between genes remains difficult.

Results

S TAR N ET 2 is a new web-based tool that allows post hoc visual analysis of correlations that are derived from expression microarray data. S TAR N ET 2 facilitates user discovery of putative gene regulatory networks in a variety of species (human, rat, mouse, chicken, zebrafish, Drosophila, C. elegans, S. cerevisiae, Arabidopsis and rice) by graphing networks of genes that are closely co-expressed across a large heterogeneous set of preselected microarray experiments. For each of the represented organisms, raw microarray data were retrieved from NCBI's Gene Expression Omnibus for a selected Affymetrix platform. All pairwise Pearson correlation coefficients were computed for expression profiles measured on each platform, respectively. These precompiled results were stored in a MySQL database, and supplemented by additional data retrieved from NCBI. A web-based tool allows user-specified queries of the database, centered at a gene of interest. The result of a query includes graphs of correlation networks, graphs of known interactions involving genes and gene products that are present in the correlation networks, and initial statistical analyses. Two analyses may be performed in parallel to compare networks, which is facilitated by the new H EAT S EEKER module.

Conclusion

S TAR N ET 2 is a useful tool for developing new hypotheses about regulatory relationships between genes and gene products, and has coverage for 10 species. Interpretation of the correlation networks is supported with a database of previously documented interactions, a test for enrichment of Gene Ontology terms, and heat maps of correlation distances that may be used to compare two networks. The list of genes in a S TAR N ET network may be useful in developing a list of candidate genes to use for the inference of causal networks. The tool is freely available at http://vanburenlab.medicine.tamhsc.edu/starnet2.html, and does not require user registration.  相似文献   

5.
6.
7.

Background

In the last decade, a large amount of microarray gene expression data has been accumulated in public repositories. Integrating and analyzing high-throughput gene expression data have become key activities for exploring gene functions, gene networks and biological pathways. Effectively utilizing these invaluable microarray data remains challenging due to a lack of powerful tools to integrate large-scale gene-expression information across diverse experiments and to search and visualize a large number of gene-expression data points.

Results

Gene Expression Browser is a microarray data integration, management and processing system with web-based search and visualization functions. An innovative method has been developed to define a treatment over a control for every microarray experiment to standardize and make microarray data from different experiments homogeneous. In the browser, data are pre-processed offline and the resulting data points are visualized online with a 2-layer dynamic web display. Users can view all treatments over control that affect the expression of a selected gene via Gene View, and view all genes that change in a selected treatment over control via treatment over control View. Users can also check the changes of expression profiles of a set of either the treatments over control or genes via Slide View. In addition, the relationships between genes and treatments over control are computed according to gene expression ratio and are shown as co-responsive genes and co-regulation treatments over control.

Conclusion

Gene Expression Browser is composed of a set of software tools, including a data extraction tool, a microarray data-management system, a data-annotation tool, a microarray data-processing pipeline, and a data search & visualization tool. The browser is deployed as a free public web service (http://www.ExpressionBrowser.com) that integrates 301 ATH1 gene microarray experiments from public data repositories (viz. the Gene Expression Omnibus repository at the National Center for Biotechnology Information and Nottingham Arabidopsis Stock Center). The set of Gene Expression Browser software tools can be easily applied to the large-scale expression data generated by other platforms and in other species.  相似文献   

8.
9.

Background

During the lifetime of a fermenter culture, the soil bacterium S. coelicolor undergoes a major metabolic switch from exponential growth to antibiotic production. We have studied gene expression patterns during this switch, using a specifically designed Affymetrix genechip and a high-resolution time-series of fermenter-grown samples.

Results

Surprisingly, we find that the metabolic switch actually consists of multiple finely orchestrated switching events. Strongly coherent clusters of genes show drastic changes in gene expression already many hours before the classically defined transition phase where the switch from primary to secondary metabolism was expected. The main switch in gene expression takes only 2 hours, and changes in antibiotic biosynthesis genes are delayed relative to the metabolic rearrangements. Furthermore, global variation in morphogenesis genes indicates an involvement of cell differentiation pathways in the decision phase leading up to the commitment to antibiotic biosynthesis.

Conclusions

Our study provides the first detailed insights into the complex sequence of early regulatory events during and preceding the major metabolic switch in S. coelicolor, which will form the starting point for future attempts at engineering antibiotic production in a biotechnological setting.  相似文献   

10.

Background

Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays.

Results

We evaluated four publicly available software packages for high throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regards to overall suitability for high-throughput 100 K SNP array data analysis, as well as effectiveness of normalization, scaling with various reference sets and feature extraction, as well as true and false positive rates of genomic copy number variant (CNV) detection.

Conclusion

We observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity.  相似文献   

11.

Background

Upon repeated or chronic antigen stimulation, activated T cells undergo a T cell receptor (TCR)-triggered propriocidal cell death important for governing the intensity of immune responses. This is thought to be chiefly mediated by an extrinsic signal through the Fas-FasL pathway. However, we observed that TCR restimulation still potently induced apoptosis when this interaction was blocked, or genetically impaired in T cells derived from autoimmune lymphoproliferative syndrome (ALPS) patients, prompting us to examine Fas-independent, intrinsic signals.

Results

Upon TCR restimulation, we specifically noted a marked increase in the expression of BIM, a pro-apoptotic Bcl-2 family protein known to mediate lymphocyte apoptosis induced by cytokine withdrawal. In fact, T cells from an ALPS type IV patient in which BIM expression is suppressed were more resistant to restimulation-induced death. Strikingly, knockdown of BIM expression rescued normal T cells from TCR-induced death to as great an extent as Fas disruption.

Conclusion

Our data implicates BIM as a critical mediator of apoptosis induced by restimulation as well as growth cytokine withdrawal. These findings suggest an important role for BIM in eliminating activated T cells even when IL-2 is abundant, working in conjunction with Fas to eliminate chronically stimulated T cells and maintain immune homeostasis.

Reviewers

This article was reviewed by Dr. Wendy Davidson (nominated by Dr. David Scott), Dr. Mark Williams (nominated by Dr. Neil Greenspan), and Dr. Laurence C. Eisenlohr.  相似文献   

12.

Background

After fertilization, embryo development involves differentiation, as well as development of the fetal body and extra-embryonic tissues until the moment of implantation. During this period various cellular and molecular changes take place with a genetic origin, e.g. the elongation of embryonic tissues, cell-cell contact between the mother and the embryo and placentation. To identify genetic profiles and search for new candidate molecules involved during this period, embryonic gene expression was analyzed with a custom designed utero-placental complementary DNA (cDNA) microarray.

Methods

Bovine embryos on days 7, 14 and 21, extra-embryonic membranes on day 28 and fetuses on days 28 were collected to represent early embryo, elongating embryo, pre-implantation embryo, post-implantation extra-embryonic membrane and fetus, respectively. Gene expression at these different time points was analyzed using our cDNA microarray. Two clustering algorithms such as k-means and hierarchical clustering methods identified the expression patterns of differentially expressed genes across pre-implantation period. Novel candidate genes were confirmed by real-time RT-PCR.

Results

In total, 1,773 individual genes were analyzed by complete k-means clustering. Comparison of day 7 and day 14 revealed most genes increased during this period, and a small number of genes exhibiting altered expression decreased as gestation progressed. Clustering analysis demonstrated that trophoblast-cell-specific molecules such as placental lactogens (PLs), prolactin-related proteins (PRPs), interferon-tau, and adhesion molecules apparently all play pivotal roles in the preparation needed for implantation, since their expression was remarkably enhanced during the pre-implantation period. The hierarchical clustering analysis and RT-PCR data revealed new functional roles for certain known genes (dickkopf-1, NPM, etc) as well as novel candidate genes (AW464053, AW465434, AW462349, AW485575) related to already established trophoblast-specific genes such as PLs and PRPs.

Conclusions

A large number of genes in extra-embryonic membrane increased up to implantation and these profiles provide information fundamental to an understanding of extra-embryonic membrane differentiation and development. Genes in significant expression suggest novel molecules in trophoblast differentiation.  相似文献   

13.
14.
15.

Background

Polyethyleneimine (PEI), a cationic polymer, is one of the successful and widely used vectors for non-viral gene transfection in vitro. However, its in vivo application was greatly limited due to its high cytotoxicity and short duration of gene expression. To improve its biocompatibility and transfection efficiency, PEI has been modified with PEG, folic acid, and chloroquine in order to improve biocompatibility and enhance targeting.

Results

Poly(ε-caprolactone)-Pluronic-Poly(ε-caprolactone) (PCFC) was synthesized by ring-opening polymerization, and PCFC-g-PEI was obtained by Michael addition reaction with GMA-PCFC-GMA and polyethyleneimine (PEI, 25 kD). The prepared PCFC-g-PEI was characterized by 1H-NMR, SEC-MALLS. Meanwhile, DNA condensation, DNase I protection, the particle size and zeta potential of PCFC-g-PEI/DNA complexes were also determined. According to the results of flow cytometry and MTT assay, the synthesized PCFC-g-PEI, with considerable transfection efficiency, had obviously lower cytotoxicity against 293 T and A549 cell lines compared with that of PEI 25 kD.

Conclusion

The cytotoxicity and in vitro transfection study indicated that PCFC-g-PEI copolymer prepared in this paper was a novel gene delivery system with lower cytotoxicity and considerable transfection efficiency compared with commercial PEI (25 kD).  相似文献   

16.

Background

With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods.

Results

Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets were pre-scaled.

Conclusion

The Bayesian meta-analysis model that combines probabilities across studies does not aggregate gene expression measures, thus an inter-study variability parameter is not included in the model. This results in a simpler modeling approach than aggregating expression measures, which accounts for variability across studies. The probability integration model identified more true discovered genes and fewer true omitted genes than combining expression measures, for our data sets.  相似文献   

17.

Background

Genomic instability in cancer leads to abnormal genome copy number alterations (CNA) as a mechanism underlying tumorigenesis. Using microarrays and other technologies, tumor CNA are detected by comparing tumor sample CN to normal reference sample CN. While advances in microarray technology have improved detection of copy number alterations, the increase in the number of measured signals, noise from array probes, variations in signal-to-noise ratio across batches and disparity across laboratories leads to significant limitations for the accurate identification of CNA regions when comparing tumor and normal samples.

Methods

To address these limitations, we designed a novel "Virtual Normal" algorithm (VN), which allowed for construction of an unbiased reference signal directly from test samples within an experiment using any publicly available normal reference set as a baseline thus eliminating the need for an in-lab normal reference set.

Results

The algorithm was tested using an optimal, paired tumor/normal data set as well as previously uncharacterized pediatric malignant gliomas for which a normal reference set was not available. Using Affymetrix 250K Sty microarrays, we demonstrated improved signal-to-noise ratio and detected significant copy number alterations using the VN algorithm that were validated by independent PCR analysis of the target CNA regions.

Conclusions

We developed and validated an algorithm to provide a virtual normal reference signal directly from tumor samples and minimize noise in the derivation of the raw CN signal. The algorithm reduces the variability of assays performed across different reagent and array batches, methods of sample preservation, multiple personnel, and among different laboratories. This approach may be valuable when matched normal samples are unavailable or the paired normal specimens have been subjected to variations in methods of preservation.  相似文献   

18.

Background

Foam cell formation in diabetic patients often occurs in the presence of high insulin and glucose levels. To test whether hyperinsulinemic hyperglycemic conditions affect foam cell differentiation, we examined gene expression, cytokine production, and Akt phosphorylation in human monocyte-derived macrophages incubated with two types of oxidized low density lipoprotein (LDL), minimally modified LDL (mmLDL) and extensively oxidized LDL (OxLDL).

Methods and results

Using Affymetrix GeneChip® arrays, we found that several genes directly related to insulin signaling were changed. The insulin receptor and glucose-6-phosphate dehydrogenase were upregulated by mmLDL and OxLDL, whereas insulin-induced gene 1 was significantly down-regulated. In hyperinsulinemic hyperglycemic conditions, modified LDL upregulated Akt phosphorylation and expression of the insulin-regulated aminopeptidase. The level of proinflammatory cytokines, IL-lβ, IL-12, and IL-6, and of a 5-lipoxygenase eicosanoid, 5-hydroxyeicosatetraenoic acid (5-HETE), was also increased.

Conclusion

These results suggest that the exposure of macrophages to modified low density lipoproteins in hyperglycemic hyperinsulinemic conditions affects insulin signaling and promotes the release of proinflammatory stimuli, such as cytokines and eicosanoids. These in turn may contribute to the development of insulin resistance.  相似文献   

19.

Background

The soybean cyst nematode Heterodera glycines is the most important parasite in soybean production worldwide. A comprehensive analysis of large-scale gene expression changes throughout the development of plant-parasitic nematodes has been lacking to date.

Results

We report an extensive genomic analysis of H. glycines, beginning with the generation of 20,100 expressed sequence tags (ESTs). In-depth analysis of these ESTs plus approximately 1,900 previously published sequences predicted 6,860 unique H. glycines genes and allowed a classification by function using InterProScan. Expression profiling of all 6,860 genes throughout the H. glycines life cycle was undertaken using the Affymetrix Soybean Genome Array GeneChip. Our data sets and results represent a comprehensive resource for molecular studies of H. glycines. Demonstrating the power of this resource, we were able to address whether arrested development in the Caenorhabditis elegans dauer larva and the H. glycines infective second-stage juvenile (J2) exhibits shared gene expression profiles. We determined that the gene expression profiles associated with the C. elegans dauer pathway are not uniformly conserved in H. glycines and that the expression profiles of genes for metabolic enzymes of C. elegans dauer larvae and H. glycines infective J2 are dissimilar.

Conclusion

Our results indicate that hallmark gene expression patterns and metabolism features are not shared in the developmentally arrested life stages of C. elegans and H. glycines, suggesting that developmental arrest in these two nematode species has undergone more divergent evolution than previously thought and pointing to the need for detailed genomic analyses of individual parasite species.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号