首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Gene expression analysis is becoming increasingly utilized in neuro-immunology research, and there is a growing need for non-programming scientists to be able to analyze their own genomic data. MGEnrichment is a web application developed both to disseminate to the community our curated database of microglia-relevant gene lists, and to allow non-programming scientists to easily conduct statistical enrichment analysis on their gene expression data. Users can upload their own gene IDs to assess the relevance of their expression data against gene lists from other studies. We include example datasets of differentially expressed genes (DEGs) from human postmortem brain samples from Autism Spectrum Disorder (ASD) and matched controls. We demonstrate how MGEnrichment can be used to expand the interpretations of these DEG lists in terms of regulation of microglial gene expression and provide novel insights into how ASD DEGs may be implicated specifically in microglial development, microbiome responses and relationships to other neuropsychiatric disorders. This tool will be particularly useful for those working in microglia, autism spectrum disorders, and neuro-immune activation research. MGEnrichment is available at https://ciernialab.shinyapps.io/MGEnrichmentApp/ and further online documentation and datasets can be found at https://github.com/ciernialab/MGEnrichmentApp. The app is released under the GNU GPLv3 open source license.  相似文献   

2.
3.
4.
Genetic prediction of complex traits has great promise for disease prevention, monitoring, and treatment. The development of accurate risk prediction models is hindered by the wide diversity of genetic architecture across different traits, limited access to individual level data for training and parameter tuning, and the demand for computational resources. To overcome the limitations of the most existing methods that make explicit assumptions on the underlying genetic architecture and need a separate validation data set for parameter tuning, we develop a summary statistics-based nonparametric method that does not rely on validation datasets to tune parameters. In our implementation, we refine the commonly used likelihood assumption to deal with the discrepancy between summary statistics and external reference panel. We also leverage the block structure of the reference linkage disequilibrium matrix for implementation of a parallel algorithm. Through simulations and applications to twelve traits, we show that our method is adaptive to different genetic architectures, statistically robust, and computationally efficient. Our method is available at https://github.com/eldronzhou/SDPR.  相似文献   

5.
We present a systematic assessment of polygenic risk score (PRS) prediction across more than 1,500 traits using genetic and phenotype data in the UK Biobank. We report 813 sparse PRS models with significant (p < 2.5 x 10−5) incremental predictive performance when compared against the covariate-only model that considers age, sex, types of genotyping arrays, and the principal component loadings of genotypes. We report a significant correlation between the number of genetic variants selected in the sparse PRS model and the incremental predictive performance (Spearman’s ⍴ = 0.61, p = 2.2 x 10−59 for quantitative traits, ⍴ = 0.21, p = 9.6 x 10−4 for binary traits). The sparse PRS model trained on European individuals showed limited transferability when evaluated on non-European individuals in the UK Biobank. We provide the PRS model weights on the Global Biobank Engine (https://biobankengine.stanford.edu/prs).  相似文献   

6.
7.
The explosive outbreaks of COVID-19 seen in congregate settings such as prisons and nursing homes, has highlighted a critical need for effective outbreak prevention and mitigation strategies for these settings. Here we consider how different types of control interventions impact the expected number of symptomatic infections due to outbreaks. Introduction of disease into the resident population from the community is modeled as a stochastic point process coupled to a branching process, while spread between residents is modeled via a deterministic compartmental model that accounts for depletion of susceptible individuals. Control is modeled as a proportional decrease in the number of susceptible residents, the reproduction number, and/or the proportion of symptomatic infections. This permits a range of assumptions about the density dependence of transmission and modes of protection by vaccination, depopulation and other types of control. We find that vaccination or depopulation can have a greater than linear effect on the expected number of cases. For example, assuming a reproduction number of 3.0 with density-dependent transmission, we find that preemptively reducing the size of the susceptible population by 20% reduced overall disease burden by 47%. In some circumstances, it may be possible to reduce the risk and burden of disease outbreaks by optimizing the way a group of residents are apportioned into distinct residential units. The optimal apportionment may be different depending on whether the goal is to reduce the probability of an outbreak occurring, or the expected number of cases from outbreak dynamics. In other circumstances there may be an opportunity to implement reactive disease control measures in which the number of susceptible individuals is rapidly reduced once an outbreak has been detected to occur. Reactive control is most effective when the reproduction number is not too high, and there is minimal delay in implementing control. We highlight the California state prison system as an example for how these findings provide a quantitative framework for understanding disease transmission in congregate settings. Our approach and accompanying interactive website (https://phoebelu.shinyapps.io/DepopulationModels/) provides a quantitative framework to evaluate the potential impact of policy decisions governing infection control in outbreak settings.  相似文献   

8.
Lots of cell death initiator and effector molecules, signalling pathways and subcellular sites have been identified as key mediators in both cell death processes in cancer. The XDeathDB visualization platform provides a comprehensive cell death and their crosstalk resource for deciphering the signaling network organization of interactions among different cell death modes associated with 1461 cancer types and COVID-19, with an aim to understand the molecular mechanisms of physiological cell death in disease and facilitate systems-oriented novel drug discovery in inducing cell deaths properly. Apoptosis, autosis, efferocytosis, ferroptosis, immunogenic cell death, intrinsic apoptosis, lysosomal cell death, mitotic cell death, mitochondrial permeability transition, necroptosis, parthanatos, and pyroptosis related to 12 cell deaths and their crosstalk can be observed systematically by the platform. Big data for cell death gene-disease associations, gene-cell death pathway associations, pathway-cell death mode associations, and cell death-cell death associations is collected by literature review articles and public database from iRefIndex, STRING, BioGRID, Reactom, Pathway’s commons, DisGeNET, DrugBank, and Therapeutic Target Database (TTD). An interactive webtool, XDeathDB, is built by web applications with R-Shiny, JavaScript (JS) and Shiny Server Iso. With this platform, users can search specific interactions from vast interdependent networks that occur in the realm of cell death. A multilayer spectral graph clustering method that performs convex layer aggregation to identify crosstalk function among cell death modes for a specific cancer. 147 hallmark genes of cell death could be observed in detail in these networks. These potential druggable targets are displayed systematically and tailoring networks to visualize specified relations is available to fulfil user-specific needs. Users can access XDeathDB for free at https://pcm2019.shinyapps.io/XDeathDB/.Subject terms: Cell division, Cancer  相似文献   

9.
10.
Existing methods for identifying structural variants (SVs) from short read datasets are inaccurate. This complicates disease-gene identification and efforts to understand the consequences of genetic variation. In response, we have created Wham (Whole-genome Alignment Metrics) to provide a single, integrated framework for both structural variant calling and association testing, thereby bypassing many of the difficulties that currently frustrate attempts to employ SVs in association testing. Here we describe Wham, benchmark it against three other widely used SV identification tools–Lumpy, Delly and SoftSearch–and demonstrate Wham’s ability to identify and associate SVs with phenotypes using data from humans, domestic pigeons, and vaccinia virus. Wham and all associated software are covered under the MIT License and can be freely downloaded from github (https://github.com/zeeev/wham), with documentation on a wiki (http://zeeev.github.io/wham/). For community support please post questions to https://www.biostars.org/.
This is PLOS Computational Biology software paper.
  相似文献   

11.

Background

A growing trend in the biomedical community is the use of Next Generation Sequencing (NGS) technologies in genomics research. The complexity of downstream differential expression (DE) analysis is however still challenging, as it requires sufficient computer programing and command-line knowledge. Furthermore, researchers often need to evaluate and visualize interactively the effect of using differential statistical and error models, assess the impact of selecting different parameters and cutoffs, and finally explore the overlapping consensus of cross-validated results obtained with different methods. This represents a bottleneck that slows down or impedes the adoption of NGS technologies in many labs.

Results

We developed DEApp, an interactive and dynamic web application for differential expression analysis of count based NGS data. This application enables models selection, parameter tuning, cross validation and visualization of results in a user-friendly interface.

Conclusions

DEApp enables labs with no access to full time bioinformaticians to exploit the advantages of NGS applications in biomedical research. This application is freely available at https://yanli.shinyapps.io/DEAppand https://gallery.shinyapps.io/DEApp.
  相似文献   

12.
13.
Drosophila melanogaster has been widely used as a model of human Mendelian disease, but its value in modeling complex disease has received little attention. Fly models of complex disease would enable high-resolution mapping of disease-modifying loci and the identification of novel targets for therapeutic intervention. Here, we describe a fly model of permanent neonatal diabetes mellitus and explore the complexity of this model. The approach involves the transgenic expression of a misfolded mutant of human preproinsulin, hINSC96Y, which is a cause of permanent neonatal diabetes. When expressed in fly imaginal discs, hINSC96Y causes a reduction of adult structures, including the eye, wing, and notum. Eye imaginal discs exhibit defects in both the structure and the arrangement of ommatidia. In the wing, expression of hINSC96Y leads to ectopic expression of veins and mechano-sensory organs, indicating disruption of wild-type signaling processes regulating cell fates. These readily measurable “disease” phenotypes are sensitive to temperature, gene dose, and sex. Mutant (but not wild-type) proinsulin expression in the eye imaginal disc induces IRE1-mediated XBP1 alternative splicing, a signal for endoplasmic reticulum stress response activation, and produces global change in gene expression. Mutant hINS transgene tester strains, when crossed to stocks from the Drosophila Genetic Reference Panel, produce F1 adults with a continuous range of disease phenotypes and large broad-sense heritability. Surprisingly, the severity of mutant hINS-induced disease in the eye is not correlated with that in the notum in these crosses, nor with eye reduction phenotypes caused by the expression of two dominant eye mutants acting in two different eye development pathways, Drop (Dr) or Lobe (L), when crossed into the same genetic backgrounds. The tissue specificity of genetic variability for mutant hINS-induced disease has, therefore, its own distinct signature. The genetic dominance of disease-specific phenotypic variability in our model of misfolded human proinsulin makes this approach amenable to genome-wide association study in a simple F1 screen of natural variation.  相似文献   

14.
The genetic basis of traits shapes and constrains how adaptation proceeds in nature; rapid adaptation can proceed using stores of polygenic standing genetic variation or hard selective sweeps, and increasing polygenicity fuels genetic redundancy, reducing gene re-use (genetic convergence). Guppy life history traits evolve rapidly and convergently among natural high- and low-predation environments in northern Trinidad. This system has been studied extensively at the phenotypic level, but little is known about the underlying genetic architecture. Here, we use four independent F2 QTL crosses to examine the genetic basis of seven (five female, two male) guppy life history phenotypes and discuss how these genetic architectures may facilitate or constrain rapid adaptation and convergence. We use RAD-sequencing data (16,539 SNPs) from 370 male and 267 female F2 individuals. We perform linkage mapping, estimates of genome-wide and per-chromosome heritability (multi-locus associations), and QTL mapping (single-locus associations). Our results are consistent with architectures of many loci of small-effect for male age and size at maturity and female interbrood period. Male trait associations are clustered on specific chromosomes, but female interbrood period exhibits a weak genome-wide signal suggesting a potentially highly polygenic component. Offspring weight and female size at maturity are also associated with a single significant QTL each. These results suggest rapid, repeatable phenotypic evolution of guppies may be facilitated by polygenic trait architectures, but subsequent genetic redundancy may limit gene re-use across populations, in agreement with an absence of strong signatures of genetic convergence from recent analyses of wild guppies.Subject terms: Evolutionary genetics, Quantitative trait  相似文献   

15.
Expression quantitative trait loci (eQTL) studies are used to understand the regulatory function of non-coding genome-wide association study (GWAS) risk loci, but colocalization alone does not demonstrate a causal relationship of gene expression affecting a trait. Evidence for mediation, that perturbation of gene expression in a given tissue or developmental context will induce a change in the downstream GWAS trait, can be provided by two-sample Mendelian Randomization (MR). Here, we introduce a new statistical method, MRLocus, for Bayesian estimation of the gene-to-trait effect from eQTL and GWAS summary data for loci with evidence of allelic heterogeneity, that is, containing multiple causal variants. MRLocus makes use of a colocalization step applied to each nearly-LD-independent eQTL, followed by an MR analysis step across eQTLs. Additionally, our method involves estimation of the extent of allelic heterogeneity through a dispersion parameter, indicating variable mediation effects from each individual eQTL on the downstream trait. Our method is evaluated against other state-of-the-art methods for estimation of the gene-to-trait mediation effect, using an existing simulation framework. In simulation, MRLocus often has the highest accuracy among competing methods, and in each case provides more accurate estimation of uncertainty as assessed through interval coverage. MRLocus is then applied to five candidate causal genes for mediation of particular GWAS traits, where gene-to-trait effects are concordant with those previously reported. We find that MRLocus’s estimation of the causal effect across eQTLs within a locus provides useful information for determining how perturbation of gene expression or individual regulatory elements will affect downstream traits. The MRLocus method is implemented as an R package available at https://mikelove.github.io/mrlocus.  相似文献   

16.
Male sexual characters are often among the first traits to diverge between closely related species and identifying the genetic basis of such changes can contribute to our understanding of their evolutionary history. However, little is known about the genetic architecture or the specific genes underlying the evolution of male genitalia. The morphology of the claspers, posterior lobes, and anal plates exhibit striking differences between Drosophila mauritiana and D. simulans. Using QTL and introgression-based high-resolution mapping, we identified several small regions on chromosome arms 3L and 3R that contribute to differences in these traits. However, we found that the loci underlying the evolution of clasper differences between these two species are independent from those that contribute to posterior lobe and anal plate divergence. Furthermore, while most of the loci affect each trait in the same direction and act additively, we also found evidence for epistasis between loci for clasper bristle number. In addition, we conducted an RNAi screen in D. melanogaster to investigate if positional and expression candidate genes located on chromosome 3L, are also involved in genital development. We found that six of these genes, including components of Wnt signaling and male-specific lethal 3 (msl3), regulate the development of genital traits consistent with the effects of the introgressed regions where they are located and that thus represent promising candidate genes for the evolution these traits.  相似文献   

17.
There is substantial interest in uncovering the genetic basis of the traits underlying adaptive responses in tree species, as this information will ultimately aid conservation and industrial endeavors across populations, generations, and environments. Fundamentally, the characterization of such genetic bases is within the context of a genetic architecture, which describes the mutlidimensional relationship between genotype and phenotype through the identification of causative variants, their relative location within a genome, expression, pleiotropic effect, environmental influence, and degree of dominance, epistasis, and additivity. Here, we review theory related to polygenic local adaptation and contextualize these expectations with methods often used to uncover the genetic basis of traits important to tree conservation and industry. A broad literature survey suggests that most tree traits generally exhibit considerable heritability, that underlying quantitative genetic variation (QST) is structured more so across populations than neutral expectations (FST) in 69% of comparisons across the literature, and that single-locus associations often exhibit small estimated per-locus effects. Together, these results suggest differential selection across populations often acts on tree phenotypes underlain by polygenic architectures consisting of numerous small to moderate effect loci. Using this synthesis, we highlight the limits of using solely single-locus approaches to describe underlying genetic architectures and close by addressing hurdles and promising alternatives towards such goals, remark upon the current state of tree genomics, and identify future directions for this field. Importantly, we argue, the success of future endeavors should not be predicated on the shortcomings of past studies and will instead be dependent upon the application of theory to empiricism, standardized reporting, centralized open-access databases, and continual input and review of the community’s research.  相似文献   

18.
Since its identification in 1983, HIV-1 has been the focus of a research effort unprecedented in scope and difficulty, whose ultimate goals — a cure and a vaccine – remain elusive. One of the fundamental challenges in accomplishing these goals is the tremendous genetic variability of the virus, with some genes differing at as many as 40% of nucleotide positions among circulating strains. Because of this, the genetic bases of many viral phenotypes, most notably the susceptibility to neutralization by a particular antibody, are difficult to identify computationally. Drawing upon open-source general-purpose machine learning algorithms and libraries, we have developed a software package IDEPI (IDentify EPItopes) for learning genotype-to-phenotype predictive models from sequences with known phenotypes. IDEPI can apply learned models to classify sequences of unknown phenotypes, and also identify specific sequence features which contribute to a particular phenotype. We demonstrate that IDEPI achieves performance similar to or better than that of previously published approaches on four well-studied problems: finding the epitopes of broadly neutralizing antibodies (bNab), determining coreceptor tropism of the virus, identifying compartment-specific genetic signatures of the virus, and deducing drug-resistance associated mutations. The cross-platform Python source code (released under the GPL 3.0 license), documentation, issue tracking, and a pre-configured virtual machine for IDEPI can be found at https://github.com/veg/idepi.
This is a PLOS Computational Biology Software Article
  相似文献   

19.
Estimating effects of parental and sibling genotypes (indirect genetic effects) can provide insight into how the family environment influences phenotypic variation. There is growing molecular genetic evidence for effects of parental phenotypes on their offspring (e.g. parental educational attainment), but the extent to which siblings affect each other is currently unclear. Here we used data from samples of unrelated individuals, without (singletons) and with biological full-siblings (non-singletons), to investigate and estimate sibling effects. Indirect genetic effects of siblings increase (or decrease) the covariance between genetic variation and a phenotype. It follows that differences in genetic association estimates between singletons and non-singletons could indicate indirect genetic effects of siblings if there is no heterogeneity in other sources of genetic association between singletons and non-singletons. We used UK Biobank data to estimate polygenic score (PGS) associations for height, BMI and educational attainment in self-reported singletons (N = 50,143) and non-singletons (N = 328,549). The educational attainment PGS association estimate was 12% larger (95% C.I. 3%, 21%) in the non-singleton sample than in the singleton sample, but the height and BMI PGS associations were consistent. Birth order data suggested that the difference in educational attainment PGS associations was driven by individuals with older siblings rather than firstborns. The relationship between number of siblings and educational attainment PGS associations was non-linear; PGS associations were 24% smaller in individuals with 6 or more siblings compared to the rest of the sample (95% C.I. 11%, 38%). We estimate that a 1 SD increase in sibling educational attainment PGS corresponds to a 0.025 year increase in the index individual’s years in schooling (95% C.I. 0.013, 0.036). Our results suggest that older siblings may influence the educational attainment of younger siblings, adding to the growing evidence that effects of the environment on phenotypic variation partially reflect social effects of germline genetic variation in relatives.  相似文献   

20.
Scaffolding, i.e. ordering and orienting contigs is an important step in genome assembly. We present a method for scaffolding using second generation sequencing reads based on likelihoods of genome assemblies. A generative model for sequencing is used to obtain maximum likelihood estimates of gaps between contigs and to estimate whether linking contigs into scaffolds would lead to an increase in the likelihood of the assembly. We then link contigs if they can be unambiguously joined or if the corresponding increase in likelihood is substantially greater than that of other possible joins of those contigs. The method is implemented in a tool called Swalo with approximations to make it efficient and applicable to large datasets. Analysis on real and simulated datasets reveals that it consistently makes more or similar number of correct joins as other scaffolders while linking very few contigs incorrectly, thus outperforming other scaffolders and demonstrating that substantial improvement in genome assembly may be achieved through the use of statistical models. Swalo is freely available for download at https://atifrahman.github.io/SWALO/.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号