首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 171 毫秒
1.

Background  

Recent advances in automation technologies have enabled the use of flow cytometry for high throughput screening, generating large complex data sets often in clinical trials or drug discovery settings. However, data management and data analysis methods have not advanced sufficiently far from the initial small-scale studies to support modeling in the presence of multiple covariates.  相似文献   

2.

Background  

The development of algorithms to infer the structure of gene regulatory networks based on expression data is an important subject in bioinformatics research. Validation of these algorithms requires benchmark data sets for which the underlying network is known. Since experimental data sets of the appropriate size and design are usually not available, there is a clear need to generate well-characterized synthetic data sets that allow thorough testing of learning algorithms in a fast and reproducible manner.  相似文献   

3.

Background

The rapid progress currently being made in genomic science has created interest in potential clinical applications; however, formal translational research has been limited thus far. Studies of population genetics have demonstrated substantial variation in allele frequencies and haplotype structure at loci of medical relevance and the genetic background of patient cohorts may often be complex.

Methods and Findings

To describe the heterogeneity in an unselected clinical sample we used the Affymetrix 6.0 gene array chip to genotype self-identified European Americans (N = 326), African Americans (N = 324) and Hispanics (N = 327) from the medical practice of Mount Sinai Medical Center in Manhattan, NY. Additional data from US minority groups and Brazil were used for external comparison. Substantial variation in ancestral origin was observed for both African Americans and Hispanics; data from the latter group overlapped with both Mexican Americans and Brazilians in the external data sets. A pooled analysis of the African Americans and Hispanics from NY demonstrated a broad continuum of ancestral origin making classification by race/ethnicity uninformative. Selected loci harboring variants associated with medical traits and drug response confirmed substantial within- and between-group heterogeneity.

Conclusion

As a consequence of these complementary levels of heterogeneity group labels offered no guidance at the individual level. These findings demonstrate the complexity involved in clinical translation of the results from genome-wide association studies and suggest that in the genomic era conventional racial/ethnic labels are of little value.  相似文献   

4.

Background  

Phylogenetic studies using expressed sequence tags (EST) are becoming a standard approach to answer evolutionary questions. Such studies are usually based on large sets of newly generated, unannotated, and error-prone EST sequences from different species. A first crucial step in EST-based phylogeny reconstruction is to identify groups of orthologous sequences. From these data sets, appropriate target genes are selected, and redundant sequences are eliminated to obtain suitable sequence sets as input data for tree-reconstruction software. Generating such data sets manually can be very time consuming. Thus, software tools are needed that carry out these steps automatically.  相似文献   

5.

Background  

Detection of Loss of Heterozygosity (LOH) is one of the most common molecular applications in the study of human diseases, in particular cancer. The technique is commonly used to examine whether a known tumour suppressor gene is inactivated or to map unknown tumour suppressor gene(s). However, with the increasing number of samples analysed using different software, no tool is currently available to integrate and facilitate the extensive and efficient data retrieval and analyses, such as correlation of LOH data with various clinical data sets.  相似文献   

6.

Background  

New technologies are enabling the measurement of many types of genomic and epigenomic information at scales ranging from the atomic to nuclear. Much of this new data is increasingly structural in nature, and is often difficult to coordinate with other data sets. There is a legitimate need for integrating and visualizing these disparate data sets to reveal structural relationships not apparent when looking at these data in isolation.  相似文献   

7.

Background  

Survival prediction from high-dimensional genomic data is an active field in today's medical research. Most of the proposed prediction methods make use of genomic data alone without considering established clinical covariates that often are available and known to have predictive value. Recent studies suggest that combining clinical and genomic information may improve predictions, but there is a lack of systematic studies on the topic. Also, for the widely used Cox regression model, it is not obvious how to handle such combined models.  相似文献   

8.

Background  

There is an increasing demand to assemble and align large-scale biological sequence data sets. The commonly used multiple sequence alignment programs are still limited in their ability to handle very large amounts of sequences because the system lacks a scalable high-performance computing (HPC) environment with a greatly extended data storage capacity.  相似文献   

9.

Background  

Gene set enrichment analysis (GSEA) is a microarray data analysis method that uses predefined gene sets and ranks of genes to identify significant biological changes in microarray data sets. GSEA is especially useful when gene expression changes in a given microarray data set is minimal or moderate.  相似文献   

10.
11.
12.
13.

Background  

Large biological data sets, such as expression profiles, benefit from reduction of random noise. Principal component (PC) analysis has been used for this purpose, but it tends to remove small features as well as random noise.  相似文献   

14.

Background  

The superfamily Pterioidea is a morphologically and ecologically diverse lineage of epifaunal marine bivalves distributed throughout the tropical and subtropical continental shelf regions. This group includes commercially important pearl culture species and model organisms used for medical studies of biomineralization. Recent morphological treatment of selected pterioideans and molecular phylogenetic analyses of higher-level relationships in Bivalvia have challenged the traditional view that pterioidean families are monophyletic. This issue is examined here in light of molecular data sets composed of DNA sequences for nuclear and mitochondrial loci, and a published character data set of anatomical and shell morphological characters.  相似文献   

15.

Background

Recent reports indicate that in vitro drug screens combined with gene expression profiles (GEP) of cancer cell lines may generate informative signatures predicting the clinical outcome of chemotherapy. In multiple myeloma (MM) a range of new drugs have been introduced and now challenge conventional therapy including high dose melphalan. Consequently, the generation of predictive signatures for response to melphalan may have a clinical impact. The hypothesis is that melphalan screens and GEPs of B-cell cancer cell lines combined with multivariate statistics may provide predictive clinical information.

Materials and Methods

Microarray based GEPs and a melphalan growth inhibition screen of 59 cancer cell lines were downloaded from the National Cancer Institute database. Equivalent data were generated for 18 B-cell cancer cell lines. Linear discriminant analyses (LDA), sparse partial least squares (SPLS) and pairwise comparisons of cell line data were used to build resistance signatures from both cell line panels. A melphalan resistance index was defined and estimated for each MM patient in a publicly available clinical data set and evaluated retrospectively by Cox proportional hazards and Kaplan-Meier survival analysis.

Principal Findings

Both cell line panels performed well with respect to internal validation of the SPLS approach but only the B-cell panel was able to predict a significantly higher risk of relapse and death with increasing resistance index in the clinical data sets. The most sensitive and resistant cell lines, MOLP-2 and RPMI-8226 LR5, respectively, had high leverage, which suggests their differentially expressed genes to possess important predictive value.

Conclusion

The present study presents a melphalan resistance index generated by analysis of a B-cell panel of cancer cell lines. However, the resistance index needs to be functionally validated and correlated to known MM biomarkers in independent data sets in order to better understand the mechanism underlying the preparedness to melphalan resistance.  相似文献   

16.
17.

Background  

Multiple sequence alignments are a fundamental tool for the comparative analysis of proteins and nucleic acids. However, large data sets are no longer manageable for visualization and investigation using the traditional stacked sequence alignment representation.  相似文献   

18.

Background  

The Golden Spike data set has been used to validate a number of methods for summarizing Affymetrix data sets, sometimes with seemingly contradictory results. Much less use has been made of this data set to evaluate differential expression methods. It has been suggested that this data set should not be used for method comparison due to a number of inherent flaws.  相似文献   

19.

Background  

Ultrasound scanning uses the medical imaging format, DICOM, for electronically storing the images and data associated with a particular scan. Large health care facilities typically use a picture archiving and communication system (PACS) for storing and retrieving such images. However, these systems are usually not suitable for managing large collections of anonymized ultrasound images gathered during a clinical screening trial.  相似文献   

20.

Background  

Gene expression measurements from breast cancer (BrCa) tumors are established clinical predictive tools to identify tumor subtypes, identify patients showing poor/good prognosis, and identify patients likely to have disease recurrence. However, diverse breast cancer datasets in conjunction with diagnostic clinical arrays show little overlap in the sets of genes identified. One approach to identify a set of consistently dysregulated candidate genes in these tumors is to employ meta-analysis of multiple independent microarray datasets. This allows one to compare expression data from a diverse collection of breast tumor array datasets generated on either cDNA or oligonucleotide arrays.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号