共查询到20条相似文献,搜索用时 0 毫秒
1.
2.
3.
Lucas A. Nell 《Molecular ecology resources》2020,20(4):1132-1140
High‐throughput sequencing (HTS) is central to the study of population genomics and has an increasingly important role in constructing phylogenies. Choices in research design for sequencing projects can include a wide range of factors, such as sequencing platform, depth of coverage and bioinformatic tools. Simulating HTS data better informs these decisions, as users can validate software by comparing output to the known simulation parameters. However, current standalone HTS simulators cannot generate variant haplotypes under even somewhat complex evolutionary scenarios, such as recombination or demographic change. This greatly reduces their usefulness for fields such as population genomics and phylogenomics. Here I present the R package jackalope that simply and efficiently simulates (i) sets of variant haplotypes from a reference genome and (ii) reads from both Illumina and Pacific Biosciences platforms. Haplotypes can be simulated using phylogenies, gene trees, coalescent‐simulation output, population‐genomic summary statistics, and Variant Call Format (VCF) files. jackalope can simulate single, paired‐end or mate‐pair Illumina reads, as well as reads from Pacific Biosciences. These simulations include sequencing errors, mapping qualities, multiplexing and optical/PCR duplicates. It can read reference genomes from fasta files and can simulate new ones, and all outputs can be written to standard file formats. jackalope is available for Mac, Windows and Linux systems. 相似文献
4.
The ‘TranSeq’ 3′‐end sequencing method for high‐throughput transcriptomics and gene space refinement in plant genomes 下载免费PDF全文
Oren Tzfadia Samuel Bocobza Jonas Defoort Efrat Almekias‐Siegl Sayantan Panda Matan Levy Veronique Storme Stephane Rombauts Diego Adhemar Jaitin Hadas Keren‐Shaul Yves Van de Peer Asaph Aharoni 《The Plant journal : for cell and molecular biology》2018,96(1):223-232
5.
High‐throughput sequencing (HTS) technologies generate millions of sequence reads from DNA/RNA molecules rapidly and cost‐effectively, enabling single investigator laboratories to address a variety of ‘omics’ questions in nonmodel organisms, fundamentally changing the way genomic approaches are used to advance biological research. One major challenge posed by HTS is the complexity and difficulty of data quality control (QC). While QC issues associated with sample isolation, library preparation and sequencing are well known and protocols for their handling are widely available, the QC of the actual sequence reads generated by HTS is often overlooked. HTS‐generated sequence reads can contain various errors, biases and artefacts whose identification and amelioration can greatly impact subsequent data analysis. However, a systematic survey on QC procedures for HTS data is still lacking. In this review, we begin by presenting standard ‘health check‐up’ QC procedures recommended for HTS data sets and establishing what ‘healthy’ HTS data look like. We next proceed by classifying errors, biases and artefacts present in HTS data into three major types of ‘pathologies’, discussing their causes and symptoms and illustrating with examples their diagnosis and impact on downstream analyses. We conclude this review by offering examples of successful ‘treatment’ protocols and recommendations on standard practices and treatment options. Notwithstanding the speed with which HTS technologies – and consequently their pathologies – change, we argue that careful QC of HTS data is an important – yet often neglected – aspect of their application in molecular ecology, and lay the groundwork for developing a HTS data QC ‘best practices’ guide. 相似文献
6.
F. Graeme Frost Praveen F. Cherukuri Samuel Milanovich Cornelius F. Boerkoel 《Journal of cellular and molecular medicine》2020,24(1):418-430
Numerous genetic and epigenetic alterations cause functional changes in cell biology underlying cancer. These hallmark functional changes constitute potentially tissue‐independent anticancer therapeutic targets. We hypothesized that RNA‐Seq identifies gene expression changes that underly those hallmarks, and thereby defines relevant therapeutic targets. To test this hypothesis, we analysed the publicly available TCGA‐TARGET‐GTEx gene expression data set from the University of California Santa CruzToil recompute project using WGCNA to delineate co‐correlated ‘modules’ from tumour gene expression profiles and functional enrichment of these modules to hierarchically cluster tumours. This stratified tumours according to T cell activation, NK‐cell activation, complement cascade, ATM, Rb, angiogenic, MAPK, ECM receptor and histone modification signalling. These correspond to the cancer hallmarks of avoiding immune destruction, tumour‐promoting inflammation, evading growth suppressors, inducing angiogenesis, sustained proliferative signalling, activating invasion and metastasis, and genome instability and mutation. This approach did not detect pathways corresponding to the cancer enabling replicative immortality, resisting cell death or deregulating cellular energetics hallmarks. We conclude that RNA‐Seq stratifies tumours along some, but not all, hallmarks of cancer and, therefore, could be used in conjunction with other analyses collectively to inform precision therapy. 相似文献
7.
8.
9.
10.
Estimating differences in gene expression among alleles is of high interest for many areas in biology and medicine. Here, we present a user‐friendly software tool, Allim, to estimate allele‐specific gene expression. Because mapping bias is a major problem for reliable estimates of allele‐specific gene expression using RNA‐seq, Allim combines two different strategies to account for the mapping biases. In order to reduce the mapping bias, Allim first generates a polymorphism‐aware reference genome that accounts for the sequence variation between the alleles. Then, a sequence‐specific simulation tool estimates the residual mapping bias. Statistical tests for allelic imbalance are provided that can be used with the bias corrected RNA‐seq data. 相似文献
11.
Stephany Foster Yee Voan Teo Nicola Neretti Nathalie Oulhen Gary M. Wessel 《Molecular reproduction and development》2019,86(8):931-934
Sea urchin embryos are excellent for in vivo functional studies because of their transparency and tractability in manipulation. They are also favorites for pharmacological approaches since they develop in an aquatic environment and addition of test substances is straightforward. A concern in many pharmacological tests though is the potential for pleiotropic effects that confound the conclusions drawn from the results. Precise cellular interpretations are often not feasible because the impact of the perturbant is not known. Here we use single‐cell mRNA (messenger RNA) sequencing as a metric of cell types in the embryo and to determine the selectivity of two commonly used inhibitors, one each for the Wnt and the Delta‐Notch pathways, on these nascent cell types. We identified 11 distinct cell types based on mRNA profiling, and that the cell lineages affected by Wnt and Delta/Notch inhibition were distinct from each other. These data support specificity and distinct effects of these signaling pathways in the embryo and illuminate how these conserved pathways selectively regulate cell lineages at a single cell level. Overall, we conclude that single cell RNA‐seq analysis in this embryo is revealing of the cell types present during development, of the changes in the gene regulatory network resulting from inhibition of various signaling pathways, and of the selectivity of these pathways in influencing developmental trajectories. 相似文献
12.
13.
Pierre‐François Perroud Fabian B. Haas Manuel Hiss Kristian K. Ullrich Alessandro Alboresi Mojgan Amirebrahimi Kerrie Barry Roberto Bassi Sandrine Bonhomme Haodong Chen Juliet C. Coates Tomomichi Fujita Anouchka Guyon‐Debast Daniel Lang Junyan Lin Anna Lipzen Fabien Nogué Melvin J. Oliver Inés Ponce de León Ralph S. Quatrano Catherine Rameau Bernd Reiss Ralf Reski Mariana Ricca Younousse Saidi Ning Sun Péter Szövényi Avinash Sreedasyam Jane Grimwood Gary Stacey Jeremy Schmutz Stefan A. Rensing 《The Plant journal : for cell and molecular biology》2018,95(1):168-182
14.
15.
16.
17.
RNA‐sequencing profiles hippocampal gene expression in a validated model of cancer‐induced depression 下载免费PDF全文
To investigate the pathophysiology of cancer‐induced depression (CID), we have recently developed a validated CID mouse model. Given that the efficacy of antidepressants in cancer patients is controversial, it remains unclear whether CID is a biologically distinct form of depression. We used RNA‐sequencing (RNA‐seq) to investigate differentially expressed genes (DEGs) in hippocampi of animals from our CID model relative a positive control model of depressive‐like behavior induced with chronic corticosterone (CORT). To validate RNA‐seq results, we performed quantitative real‐time RT‐PCR (qRT‐PCR) on a subset of DEGs. Enrichment analysis using DAVID was performed on DEGs to identify enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and biological process gene ontologies (GO:BP). qRT‐PCR results significantly predicted RNA‐seq results. RNA‐seq revealed that most DEGs identified in the CORT model overlapped with the CID model. Enrichment analyses identified KEGG pathways and GO:BP terms associated with ion homeostasis and neuronal communication for both the CORT and CID model. In addition, CID DEGs were enriched in pathways and terms relating to neuronal development, intracellular signaling, learning and memory. This study is the first to investigate CID at the mRNA level. We have shown that most hippocampal mRNA changes that are associated with a depressive‐like state are also associated with cancer. Several other changes occur at the mRNA level in cancer, suggesting that the CID model may represent a biologically distinct form of a depressive‐like state. 相似文献
18.
19.
Alexander Sullivan Priyank K. Purohit Nowlan H. Freese Asher Pasha Eddi Esteban Jamie Waese Alison Wu Michelle Chen Chih Y. Chin Richard Song Sneha R. Watharkar Agnes P. Chan Vivek Krishnakumar Matthew W. Vaughn Chris Town Ann E. Loraine Nicholas J. Provart 《The Plant journal : for cell and molecular biology》2019,100(3):641-654
20.
PipeCraft: Flexible open‐source toolkit for bioinformatics analysis of custom high‐throughput amplicon sequencing data 下载免费PDF全文
Sten Anslan Mohammad Bahram Indrek Hiiesalu Leho Tedersoo 《Molecular ecology resources》2017,17(6):e234-e240
High‐throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in public and private sector. These methods provide vast amount of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis for more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user‐friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24 hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable. 相似文献