首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The progression and clonal development of tumors often involve amplifications and deletions of genomic DNA. Estimation of allele-specific copy number, which quantifies the number of copies of each allele at each variant loci rather than the total number of chromosome copies, is an important step in the characterization of tumor genomes and the inference of their clonal history. We describe a new method, falcon, for finding somatic allele-specific copy number changes by next generation sequencing of tumors with matched normals. falcon is based on a change-point model on a bivariate mixed Binomial process, which explicitly models the copy numbers of the two chromosome haplotypes and corrects for local allele-specific coverage biases. By using the Binomial distribution rather than a normal approximation, falcon more effectively pools evidence from sites with low coverage. A modified Bayesian information criterion is used to guide model selection for determining the number of copy number events. Falcon is evaluated on in silico spike-in data and applied to the analysis of a pre-malignant colon tumor sample and late-stage colorectal adenocarcinoma from the same individual. The allele-specific copy number estimates obtained by falcon allows us to draw detailed conclusions regarding the clonal history of the individual''s colon cancer.  相似文献   

2.
3.

Background

Copy number variation (CNV) is important and widespread in the genome, and is a major cause of disease and phenotypic diversity. Herein, we performed a genome-wide CNV analysis in 12 diversified chicken genomes based on whole genome sequencing.

Results

A total of 8,840 CNV regions (CNVRs) covering 98.2 Mb and representing 9.4% of the chicken genome were identified, ranging in size from 1.1 to 268.8 kb with an average of 11.1 kb. Sequencing-based predictions were confirmed at a high validation rate by two independent approaches, including array comparative genomic hybridization (aCGH) and quantitative PCR (qPCR). The Pearson’s correlation coefficients between sequencing and aCGH results ranged from 0.435 to 0.755, and qPCR experiments revealed a positive validation rate of 91.71% and a false negative rate of 22.43%. In total, 2,214 (25.0%) predicted CNVRs span 2,216 (36.4%) RefSeq genes associated with specific biological functions. Besides two previously reported copy number variable genes EDN3 and PRLR, we also found some promising genes with potential in phenotypic variation. Two genes, FZD6 and LIMS1, related to disease susceptibility/resistance are covered by CNVRs. The highly duplicated SOCS2 may lead to higher bone mineral density. Entire or partial duplication of some genes like POPDC3 may have great economic importance in poultry breeding.

Conclusions

Our results based on extensive genetic diversity provide a more refined chicken CNV map and genome-wide gene copy number estimates, and warrant future CNV association studies for important traits in chickens.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-962) contains supplementary material, which is available to authorized users.  相似文献   

4.

Background

Retrospective studies of archived human specimens, with known clinical follow-up, are used to identify predictive and prognostic molecular markers of disease. Due to biochemical differences, however, formalin-fixed paraffin-embedded (FFPE) DNA and RNA have generally been extracted separately from either different tissue sections or from the same section by dividing the digested tissue. The former limits accurate correlation whilst the latter is impractical when utilizing rare or limited archived specimens.

Principal Findings

For effective recovery of genomic DNA and total RNA from a single FFPE specimen, without splitting the proteinase-K digested tissue solution, we optimized a co-extraction method by using TRIzol and purifying DNA from the lower aqueous and RNA from the upper organic phases. Using a series of seven different archived specimens, we evaluated the total amounts of genomic DNA and total RNA recovered by our TRIzol-based co-extraction method and compared our results with those from two commercial kits, the Qiagen AllPrep DNA/RNA FFPE kit, for co-extraction, and the Ambion RecoverAll™ Total Nucleic Acid Isolation kit, for separate extraction of FFPE-DNA and -RNA. Then, to accurately assess the quality of DNA and RNA co-extracted from a single FFPE specimen, we used qRT-PCR, gene expression profiling and methylation assays to analyze microRNAs, mRNAs, and genomic DNA recovered from matched fresh and FFPE MCF10A cells. These experiments show that the TRIzol-based co-extraction method provides larger amounts of FFPE-DNA and –RNA than the two other methods, and particularly provides higher quality microRNAs and genomic DNA for subsequent molecular analyses.

Significance

We determined that co-extraction of genomic DNA and total RNA from a single FFPE specimen is an effective recovery approach to obtain high-quality material for parallel molecular and high-throughput analyses. Our optimized approach provides the option of collecting DNA, which would otherwise be discarded or degraded, for additional or subsequent studies.  相似文献   

5.
拷贝数变异是指基因组中发生大片段的DNA序列的拷贝数增加或者减少。根据现有的研究可知,拷贝数变异是多种人类疾病的成因,与其发生与发展机制密切相关。高通量测序技术的出现为拷贝数变异检测提供了技术支持,在人类疾病研究、临床诊疗等领域,高通量测序技术已经成为主流的拷贝数变异检测技术。虽然不断有新的基于高通量测序技术的算法和软件被人们开发出来,但是准确率仍然不理想。本文全面地综述基于高通量测序数据的拷贝数变异检测方法,包括基于reads深度的方法、基于双末端映射的方法、基于拆分read的方法、基于从头拼接的方法以及基于上述4种方法的组合方法,深入探讨了每类不同方法的原理,代表性的软件工具以及每类方法适用的数据以及优缺点等,并展望未来的发展方向。  相似文献   

6.
Biswas  Bipasa  Lai  Yinglei 《BMC genomics》2019,20(2):35-47
Background

The next generation sequencing technology allows us to obtain a large amount of short DNA sequence (DNA-seq) reads at a genome-wide level. DNA-seq data have been increasingly collected during the recent years. Count-type data analysis is a widely used approach for DNA-seq data. However, the related data pre-processing is based on the moving window method, in which a window size need to be defined in order to obtain count-type data. Furthermore, useful information can be reduced after data pre-processing for count-type data.

Results

In this study, we propose to analyze DNA-seq data based on the related distance-type measure. Distances are measured in base pairs (bps) between two adjacent alignments of short reads mapped to a reference genome. Our experimental data based simulation study confirms the advantages of distance-type measure approach in both detection power and detection accuracy. Furthermore, we propose artificial censoring for the distance data so that distances larger than a given value are considered potential outliers. Our purpose is to simplify the pre-processing of DNA-seq data. Statistically, we consider a mixture of right censored geometric distributions to model the distance data. Additionally, to reduce the GC-content bias, we extend the mixture model to a mixture of generalized linear models (GLMs). The estimation of model can be achieved by the Newton-Raphson algorithm as well as the Expectation-Maximization (E-M) algorithm. We have conducted simulations to evaluate the performance of our approach. Based on the rank based inverse normal transformation of distance data, we can obtain the related z-values for a follow-up analysis. For an illustration, an application to the DNA-seq data from a pair of normal and tumor cell lines is presented with a change-point analysis of z-values to detect DNA copy number alterations.

Conclusion

Our distance-type measure approach is novel. It does not require either a fixed or a sliding window procedure for generating count-type data. Its advantages have been demonstrated by our simulation studies and its practical usefulness has been illustrated by an experimental data application.

  相似文献   

7.
Targeted proteomics research, based on the enrichment of disease-relevant proteins from isolated cell populations selected from high-quality tissue specimens, offers great potential for the identification of diagnostic, prognostic, and predictive biological markers for use in the clinical setting and during preclinical testing and clinical trials, as well as for the discovery and validation of new protein drug targets. Formalin-fixed and paraffin-embedded (FFPE) tissue collections, with attached clinical and outcome information, are invaluable resources for conducting retrospective protein biomarker investigations and performing translational studies of cancer and other diseases. Combined capillary isoelectric focusing/nano-reversed-phase liquid chromatography separations equipped with nano-electrospray ionization-tandem mass spectrometry are employed for the studies of proteins extracted from microdissected FFPE glioblastoma tissues using a heat-induced antigen retrieval (AR) technique. A total of 14,478 distinct peptides are identified, leading to the identification of 2733 non-redundant SwissProt protein entries. Eighty-three percent of identified FFPE tissue proteins overlap with those obtained from the pellet fraction of fresh-frozen tissue of the same patient. This large degree of protein overlapping is attributed to the application of detergent-based protein extraction in both the cell pellet preparation protocol and the AR technique.  相似文献   

8.
Accurate and efficient genome-wide detection of copy number variants (CNVs) is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH), Single Nucleotide Polymorphism (SNP) genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications.  相似文献   

9.
《Genomics》2019,111(6):1745-1751
The copy number variation (CNV) is an important genetic marker in cancer and other diseases. To detect CNVs of specific genetic loci, the multiplex ligation-dependent probe amplification (MLPA) is an appropriate approach, but the experimental optimization and probe synthesis are still great challenges. The multiplex competitive PCR is an alternative method for CNV detection. However, the construction of internal competitive template and establishment of a stable multiplex PCR system are the main limiting factors for this method. Here, we introduce a novel multiplex fluorescent competitive PCR (NMFC-PCR) for detecting CNVs. In this method, the blunt hairpin primers are used to rapidly establish a stable multiplex PCR system due to the reduction of non-specific amplification, and limited cycles' amplification is used to obtain the internal competitive template instead of artificial synthesis. With this method, we tested 21 clinical samples with potential LIM homeobox 1 (LHX1) or T-box 6 (TBX6) deletion. Every three segments located on the LHX1 and TBX6 were selected as the target regions, while two segments located on X-chromosome and five segments located on autosome were selected as the reference regions for detecting CNVs. The results showed that the gender information of 21 samples can be accurately inferred by the copy number ratio (CNR) of X-chromosomal reference region to autosomal reference region (X/A), and 2 samples had one copy of LHX1 and 9 samples had one copy of TBX6. To evaluate the accuracy of NMFC-PCR, 5 random samples with CNV were also detected by array-based comparative genomic hybridization (aCGH), and the results of aCGH were consistent with the NMFC-PCR results. To further assess the performance of NMFC-PCR, 60 normal samples were simultaneously tested. The results showed that the gender results were exactly the same as known information, and CNVs of LHX1 or TBX6 were not found. In conclusion, the method is a cheap, efficient, accurate, and convenient competitive PCR method for CNV detection.  相似文献   

10.
Construction of DNA fragment libraries for next-generation sequencing can prove challenging, especially for samples with low DNA yield. Protocols devised to circumvent the problems associated with low starting quantities of DNA can result in amplification biases that skew the distribution of genomes in metagenomic data. Moreover, sample throughput can be slow, as current library construction techniques are time-consuming. This study evaluated Nextera, a new transposon-based method that is designed for quick production of DNA fragment libraries from a small quantity of DNA. The sequence read distribution across nine phage genomes in a mock viral assemblage met predictions for six of the least-abundant phages; however, the rank order of the most abundant phages differed slightly from predictions. De novo genome assemblies from Nextera libraries provided long contigs spanning over half of the phage genome; in four cases where full-length genome sequences were available for comparison, consensus sequences were found to match over 99% of the genome with near-perfect identity. Analysis of areas of low and high sequence coverage within phage genomes indicated that GC content may influence coverage of sequences from Nextera libraries. Comparisons of phage genomes prepared using both Nextera and a standard 454 FLX Titanium library preparation protocol suggested that the coverage biases according to GC content observed within the Nextera libraries were largely attributable to bias in the Nextera protocol rather than to the 454 sequencing technology. Nevertheless, given suitable sequence coverage, the Nextera protocol produced high-quality data for genomic studies. For metagenomics analyses, effects of GC amplification bias would need to be considered; however, the library preparation standardization that Nextera provides should benefit comparative metagenomic analyses.  相似文献   

11.
Breast cancer is the most common malignancy among females in the world. In Iran, age and family history are the major risk factors for the development of this disease. Mutations of BRCA1 and BRCA2 genes are associated with a greatly increased risk for familial breast cancer. The frequency of BRCA mutations was identified in familial breast cancers (FBCs) and nonfamilial breast cancers (NFBCs) by molecular genetics, and morphological and immunohistochemical methods. Thirty-four formalin-fixed, paraffin-embedded breast tissue tumors were analyzed from 16 patients with FBCs and 18 patients with NFBCs. Three 5382insC mutations were detected by multiplex PCR in 16 FBCs. The immunohistochemical method was used to detect estrogen receptors (ER), progesterona receptors (PR), and TP53. Comparison of ER, PR, and TP53 exhibited a high difference (P < 0.0001) in FBCs and NFBCs. Our results demonstrated that 5382insC mutation, ER, PR, TP53, mitotic activity, polymorphism, necrosis and tubules can serve as the major risk factors for FBC. The text was submitted by the authors in English.  相似文献   

12.
13.
Breast cancer is the most common malignancy among females in the world. Age and familial history are the major risk factors for the development of this disease in Iran. Mutations of BRCA1 and BRCA2 genes are associated with a greatly increased risk for development of familial breast cancer. Frequency of BRCA mutations was identified in familial breast cancers (FBC) and non-familial breast cancers (NFBC) by molecular genetics, morphological and Immunohistochemical methods. Thirty forth formalin-fixed, paraffin-embedded breast tissue tumors were analyzed from 16 patients with FBC and 18 patients with NFBC. Three 5382insC mutations detected by multiplex PCR in 16 familial breast cancers. Immunohistochemical method was used to detect estrogen receptor (ER) and progesterona receptor (PR) and TP53. Comparison of ER, PR and TP53 exhibited high difference (P < 0.0001) in familial breast cancers and non-familial breast cancers. Our results demonstrated that 5382insC mutation, ER, PR, TP53, mitotic activity, polymorphism, necrosis and tubules can serve as the major risk factors for the development of FBC.  相似文献   

14.
15.

Background

mRNAs are highly versatile, non-toxic molecules that are easy to produce and store, which can allow transient protein expression in all cell types. The safety aspects of mRNA-based treatments in gene therapy make this molecule one of the most promising active components of therapeutic or prophylactic methods. The use of mRNA as strategy for the stimulation of the immune system has been used mainly in current strategies for the cancer treatment but until now no one tested this molecule as vaccine for infectious disease.

Results

We produce messenger RNA of Hsp65 protein from Mycobacterium leprae and show that vaccination of mice with a single dose of 10 μg of naked mRNA-Hsp65 through intranasal route was able to induce protection against subsequent challenge with virulent strain of Mycobacterium tuberculosis. Moreover it was shown that this immunization was associated with specific production of IL-10 and TNF-alpha in spleen. In order to determine if antigen presenting cells (APCs) present in the lung are capable of capture the mRNA, labeled mRNA-Hsp65 was administered by intranasal route and lung APCs were analyzed by flow cytometry. These experiments showed that after 30 minutes until 8 hours the populations of CD11c+, CD11b+ and CD19+ cells were able to capture the mRNA. We also demonstrated in vitro that mRNA-Hsp65 leads nitric oxide (NO) production through Toll-like receptor 7 (TLR7).

Conclusions

Taken together, our results showed a novel and efficient strategy to control experimental tuberculosis, besides opening novel perspectives for the use of mRNA in vaccines against infectious diseases and clarifying the mechanisms involved in the disease protection we noticed as well.  相似文献   

16.
17.

Background

Deviations in the amount of genomic content that arise during tumorigenesis, called copy number alterations, are structural rearrangements that can critically affect gene expression patterns. Additionally, copy number alteration profiles allow insight into cancer discrimination, progression and complexity. On data obtained from high-throughput sequencing, improving quality through GC bias correction and keeping false positives to a minimum help build reliable copy number alteration profiles.

Results

We introduce seqCNA, a parallelized R package for an integral copy number analysis of high-throughput sequencing cancer data. The package includes novel methodology on (i) filtering, reducing false positives, and (ii) GC content correction, improving copy number profile quality, especially under great read coverage and high correlation between GC content and copy number. Adequate analysis steps are automatically chosen based on availability of paired-end mapping, matched normal samples and genome annotation.

Conclusions

seqCNA, available through Bioconductor, provides accurate copy number predictions in tumoural data, thanks to the extensive filtering and better GC bias correction, while providing an integrated and parallelized workflow.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-178) contains supplementary material, which is available to authorized users.  相似文献   

18.
19.
Assay for nanogram quantities of DNA in cellular homogenates.   总被引:76,自引:0,他引:76  
The DNA concentration of a crude cellular homogenate can be measured accurately in the nanogram range using the fluorescence enhancement of 4′,6-diamidino-2-phenylindole (DAPI) or bisbenzimidazole (Hoechst H 33258) complexed with DNA. A simple assay has been devised including an internal standard, which allows reliable measurement and compensates for any quenching due to cellular components or buffer. The fluorescence enhancement is highly specific for DNA; no other cell component produces significant fluorescence. The response is linear over a broad dynamic range making the measurement of unknown DNA concentrations convenient.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号