首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Recurrent neural networks with memory and attention mechanisms are widely used in natural language processing because they can capture short and long term sequential information for diverse tasks. We propose an integrated deep learning model for microbial DNA sequence data, which exploits convolutional neural networks, recurrent neural networks, and attention mechanisms to predict taxonomic classifications and sample-associated attributes, such as the relationship between the microbiome and host phenotype, on the read/sequence level. In this paper, we develop this novel deep learning approach and evaluate its application to amplicon sequences. We apply our approach to short DNA reads and full sequences of 16S ribosomal RNA (rRNA) marker genes, which identify the heterogeneity of a microbial community sample. We demonstrate that our implementation of a novel attention-based deep network architecture, Read2Pheno, achieves read-level phenotypic prediction. Training Read2Pheno models will encode sequences (reads) into dense, meaningful representations: learned embedded vectors output from the intermediate layer of the network model, which can provide biological insight when visualized. The attention layer of Read2Pheno models can also automatically identify nucleotide regions in reads/sequences which are particularly informative for classification. As such, this novel approach can avoid pre/post-processing and manual interpretation required with conventional approaches to microbiome sequence classification. We further show, as proof-of-concept, that aggregating read-level information can robustly predict microbial community properties, host phenotype, and taxonomic classification, with performance at least comparable to conventional approaches. An implementation of the attention-based deep learning network is available at https://github.com/EESI/sequence_attention (a python package) and https://github.com/EESI/seq2att (a command line tool).  相似文献   

2.
wings apart (wap) is a recessive, semilethal gene located on the X chromosome in Drosophila melanogaster, which is required for normal wing-vein patterning. We show that the wap mutation also results in loss of the adult jump muscle. We use complementation mapping and gene-specific RNA interference to localize the wap locus to the proximal X chromosome. We identify the annotated gene CG14614 as the gene affected by the wap mutation, since one wap allele contains a non-sense mutation in CG14614, and a genomic fragment containing only CG14614 rescues the jump-muscle phenotypes of two wap mutant alleles. The wap gene lies centromere-proximal to touch-insensitive larva B and centromere-distal to CG14619, which is tentatively assigned as the gene affected in introverted mutants. In mutant wap animals, founder cell precursors for the jump muscle are specified early in development, but are later lost. Through tissue-specific knockdowns, we demonstrate that wap function is required in both the musculature and the nervous system for normal jump-muscle formation. wap/CG14614 is homologous to vertebrate wdr68, DDB1 and CUL4 associated factor 7, which also are expressed in neuromuscular tissues. Thus, our findings provide insight into mechanisms of neuromuscular development in higher animals and facilitate the understanding of neuromuscular diseases that may result from mis-expression of muscle-specific or neuron-specific genes.  相似文献   

3.
Over the past 35 years, developmental geneticists have made impressive progress toward an understanding of how genes specify morphology and function, particularly as they relate to the specification of each physical component of an organism. In the last 20 years, male courtship behavior in Drosophila melanogaster has emerged as a robust model system for the study of genetic specification of behavior. Courtship behavior is both complex and innate, and a single gene, fruitless (fru), is both necessary and sufficient for all aspects of the courtship ritual. Typically, loss of male-specific Fruitless protein function results in male flies that perform the courtship ritual incorrectly, slowly, or not at all. Here we describe a novel requirement for fru: we have identified a group of cells in which male Fru proteins are required to reduce the speed of courtship initiation. In addition, we have identified a gene, Trapped in endoderm 1 (Tre1), which is required in these cells for normal courtship and mating behavior. Tre1 encodes a G-protein-coupled receptor required for establishment of cell polarity and cell migration and has previously not been shown to be involved in courtship behavior. We describe the results of feminization of the Tre1-expressing neurons, as well as the effects on courtship behavior of mutation of Tre1. In addition, we show that Tre1 is expressed in a sexually dimorphic pattern in the central and peripheral nervous systems and investigate the role of the Tre1 cells in mate identification.  相似文献   

4.
Members of the RecQ family of helicases are known for their roles in DNA repair, replication, and recombination. Mutations in the human RecQ helicases, WRN and BLM, cause Werner and Bloom syndromes, which are diseases characterized by genome instability and an increased risk of cancer. While WRN contains both a helicase and an exonuclease domain, the Drosophila melanogaster homolog, WRNexo, contains only the exonuclease domain. Therefore the Drosophila model system provides a unique opportunity to study the exonuclease functions of WRN separate from the helicase. We created a null allele of WRNexo via imprecise P-element excision. The null WRNexo mutants are not sensitive to double-strand break-inducing reagents, suggesting that the exonuclease does not play a key role in homologous recombination-mediated repair of DSBs. However, WRNexo mutant embryos have a reduced hatching frequency and larvae are sensitive to the replication fork-stalling reagent, hydroxyurea (HU), suggesting that WRNexo is important in responding to replication stress. The role of WRNexo in the HU-induced stress response is independent of Rad51. Interestingly, the hatching defect and HU sensitivity of WRNexo mutants do not occur in flies containing an exonuclease-dead copy of WRNexo, suggesting that the role of WRNexo in replication is independent of exonuclease activity. Additionally, WRNexo and Blm mutants exhibit similar sensitivity to HU and synthetic lethality in combination with mutations in structure-selective endonucleases. We propose that WRNexo and BLM interact to promote fork reversal following replication fork stalling and in their absence regressed forks are restarted through a Rad51-mediated process.  相似文献   

5.
Normalization is an essential step in the analysis of high-throughput data. Multi-sample global normalization methods, such as quantile normalization, have been successfully used to remove technical variation. However, these methods rely on the assumption that observed global changes across samples are due to unwanted technical variability. Applying global normalization methods has the potential to remove biologically driven variation. Currently, it is up to the subject matter experts to determine if the stated assumptions are appropriate. Here, we propose a data-driven alternative. We demonstrate the utility of our method (quantro) through examples and simulations. A software implementation is available from http://www.bioconductor.org/packages/release/bioc/html/quantro.html.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-015-0679-0) contains supplementary material, which is available to authorized users.  相似文献   

6.
Genetic screens in Drosophila melanogaster and other organisms have been pursued to filter the genome for genetic functions important for memory formation. Such screens have employed primarily chemical or transposon-mediated mutagenesis and have identified numerous mutants including classical memory mutants, dunce and rutabaga. Here, we report the results of a large screen using panneuronal RNAi expression to identify additional genes critical for memory formation. We identified >500 genes that compromise memory when inhibited (low hits), either by disrupting the development and normal function of the adult animal or by participating in the neurophysiological mechanisms underlying memory formation. We also identified >40 genes that enhance memory when inhibited (high hits). The dunce gene was identified as one of the low hits and further experiments were performed to map the effects of the dunce RNAi to the α/β and γ mushroom body neurons. Additional behavioral experiments suggest that dunce knockdown in the mushroom body neurons impairs memory without significantly affecting acquisition. We also characterized one high hit, sickie, to show that RNAi knockdown of this gene enhances memory through effects in dopaminergic neurons without apparent effects on acquisition. These studies further our understanding of two genes involved in memory formation, provide a valuable list of genes that impair memory that may be important for understanding the neurophysiology of memory or neurodevelopmental disorders, and offer a new resource of memory suppressor genes that will aid in understanding restraint mechanisms employed by the brain to optimize resources.  相似文献   

7.

Background

The Immunoglobulins (IG) and the T cell receptors (TR) play the key role in antigen recognition during the adaptive immune response. Recent progress in next-generation sequencing technologies has provided an opportunity for the deep T cell receptor repertoire profiling. However, a specialised software is required for the rational analysis of massive data generated by next-generation sequencing.

Results

Here we introduce tcR, a new R package, representing a platform for the advanced analysis of T cell receptor repertoires, which includes diversity measures, shared T cell receptor sequences identification, gene usage statistics computation and other widely used methods. The tool has proven its utility in recent research studies.

Conclusions

tcR is an R package for the advanced analysis of T cell receptor repertoires after primary TR sequences extraction from raw sequencing reads. The stable version can be directly installed from The Comprehensive R Archive Network (http://cran.r-project.org/mirrors.html). The source code and development version are available at tcR GitHub (http://imminfo.github.io/tcr/) along with the full documentation and typical usage examples.  相似文献   

8.
9.
Although evolutionary changes must take place in neural connectivity and synaptic architecture as nervous systems become more complex, we lack understanding of the general principles and specific mechanisms by which these changes occur. Previously, we found that morphology of the larval neuromuscular junction (NMJ) varies extensively among different species of Drosophila but is relatively conserved within a species. To identify specific genes as candidates that might underlie phenotypic differences in NMJ morphology among Drosophila species, we performed a genetic analysis on one of two phenotypic variants we found among 20 natural isolates of Drosophila melanogaster. We discovered genetic polymorphisms for both positive and negative regulators of NMJ growth segregating within the variant line. Focusing on one subline, that displayed NMJ overgrowth, we mapped the phenotype to Mob2 [Monopolar spindle (Mps) one binding protein 2)], a gene encoding a Nuclear Dbf2 (Dumbbell formation 2)-Related (NDR) kinase activator. We confirmed this identification by transformation rescue experiments and showed that presynaptic expression of Mob2 is necessary and sufficient to regulate NMJ growth. Mob2 interacts in a dominant, dose-dependent manner with tricornered but not with warts, to cause NMJ overgrowth, suggesting that Mob2 specifically functions in combination with the former NDR kinase to regulate NMJ development. These results demonstrate the feasibility and utility of identifying genetic variants affecting NMJ morphology in natural populations of Drosophila. These variants can lead to discovery of new genes and molecular mechanisms that regulate NMJ development while also providing new information that can advance our understanding of mechanisms that underlie nervous system evolution.  相似文献   

10.
11.

Background

High-throughput RNA interference (RNAi) screening has become a widely used approach to elucidating gene functions. However, analysis and annotation of large data sets generated from these screens has been a challenge for researchers without a programming background. Over the years, numerous data analysis methods were produced for plate quality control and hit selection and implemented by a few open-access software packages. Recently, strictly standardized mean difference (SSMD) has become a widely used method for RNAi screening analysis mainly due to its better control of false negative and false positive rates and its ability to quantify RNAi effects with a statistical basis. We have developed GUItars to enable researchers without a programming background to use SSMD as both a plate quality and a hit selection metric to analyze large data sets.

Results

The software is accompanied by an intuitive graphical user interface for easy and rapid analysis workflow. SSMD analysis methods have been provided to the users along with traditionally-used z-score, normalized percent activity, and t-test methods for hit selection. GUItars is capable of analyzing large-scale data sets from screens with or without replicates. The software is designed to automatically generate and save numerous graphical outputs known to be among the most informative high-throughput data visualization tools capturing plate-wise and screen-wise performances. Graphical outputs are also written in HTML format for easy access, and a comprehensive summary of screening results is written into tab-delimited output files.

Conclusion

With GUItars, we demonstrated robust SSMD-based analysis workflow on a 3840-gene small interfering RNA (siRNA) library and identified 200 siRNAs that increased and 150 siRNAs that decreased the assay activities with moderate to stronger effects. GUItars enables rapid analysis and illustration of data from large- or small-scale RNAi screens using SSMD and other traditional analysis methods. The software is freely available at http://sourceforge.net/projects/guitars/.  相似文献   

12.
Since the discovery of microRNAs (miRNAs) only two decades ago, they have emerged as an essential component of the gene regulatory machinery. miRNAs have seemingly paradoxical features: a single miRNA is able to simultaneously target hundreds of genes, while its presence is mostly dispensable for animal viability under normal conditions. It is known that miRNAs act as stress response factors; however, it remains challenging to determine their relevant targets and the conditions under which they function. To address this challenge, we propose a new workflow for miRNA function analysis, by which we found that the evolutionarily young miRNA family, the mir-310s (mir-310/mir-311/mir-312/mir-313), are important regulators of Drosophila metabolic status. mir-310s-deficient animals have an abnormal diet-dependent expression profile for numerous diet-sensitive components, accumulate fats, and show various physiological defects. We found that the mir-310s simultaneously repress the production of several regulatory factors (Rab23, DHR96, and Ttk) of the evolutionarily conserved Hedgehog (Hh) pathway to sharpen dietary response. As the mir-310s expression is highly dynamic and nutrition sensitive, this signal relay model helps to explain the molecular mechanism governing quick and robust Hh signaling responses to nutritional changes. Additionally, we discovered a new component of the Hh signaling pathway in Drosophila, Rab23, which cell autonomously regulates Hh ligand trafficking in the germline stem cell niche. How organisms adjust to dietary fluctuations to sustain healthy homeostasis is an intriguing research topic. These data are the first to report that miRNAs can act as executives that transduce nutritional signals to an essential signaling pathway. This suggests miRNAs as plausible therapeutic agents that can be used in combination with low calorie and cholesterol diets to manage quick and precise tissue-specific responses to nutritional changes.  相似文献   

13.
14.
Members of the M13 class of metalloproteases have been implicated in diseases and in reproductive fitness. Nevertheless, their physiological role remains poorly understood. To obtain a tractable model with which to analyze this protein family’s function, we characterized the gene family in Drosophila melanogaster and focused on reproductive phenotypes. The D. melanogaster genome contains 24 M13 class protease homologs, some of which are orthologs of human proteases, including neprilysin. Many are expressed in the reproductive tracts of either sex. Using RNAi we individually targeted the five Nep genes most closely related to vertebrate neprilysin, Nep1-5, to investigate their roles in reproduction. A reduction in Nep1, Nep2, or Nep4 expression in females reduced egg laying. Nep1 and Nep2 are required in the CNS and the spermathecae for wild-type fecundity. Females that are null for Nep2 also show defects as hosts of sperm competition as well as an increased rate of depletion for stored sperm. Furthermore, eggs laid by Nep2 mutant females are fertilized normally, but arrest early in embryonic development. In the male, only Nep1 was required to induce normal patterns of female egg laying. Reduction in the expression of Nep2-5 in the male did not cause any dramatic effects on reproductive fitness, which suggests that these genes are either nonessential for male fertility or perform redundant functions. Our results suggest that, consistent with the functions of neprilysins in mammals, these proteins are also required for reproduction in Drosophila, opening up this model system for further functional analysis of this protein class and their substrates.  相似文献   

15.
16.
We highlight a case on a normal left testicle with a fibrovascular cord with three nodules consistent with splenic tissue. The torsed splenule demonstrated hemorrhage with neutrophilic infiltrate and thrombus consistent with chronic infarction and torsion. Splenogonadal fusion (SGF) is a rather rare entity, with approximately 184 cases reported in the literature. The most comprehensive review was that of 123 cases completed by Carragher in 1990. Since then, an additional 61 cases have been reported in the scientific literature. We have studied these 61 cases in detail and have included a summary of that information here.Key words: Splenogonadal fusion, Acute scrotumA 10-year-old boy presented with worsening left-sided scrotal pain of 12 hours’ duration. The patient reported similar previous episodes occurring intermittently over the past several months. His past medical history was significant for left hip dysplasia, requiring multiple hip surgeries. On examination, he was found to have an edematous left hemiscrotum with a left testicle that was rigid, tender, and noted to be in a transverse lie. The ultrasound revealed possible polyorchism, with two testicles on the left and one on the right (Figure 1), and left epididymitis. One of the left testicles demonstrated a loss of blood flow consistent with testicular torsion (Figure 2).Open in a separate windowFigure 1Ultrasound of the left hemiscrotum reveals two spherical structures; the one on the left is heterogeneous and hyperdense in comparison to the right.Open in a separate windowFigure 2Doppler ultrasound of left hemiscrotum. No evidence of blood flow to left spherical structure.The patient was taken to the operating room for immediate scrotal exploration. A normalappearing left testicle with a normal epididymis was noted. However, two accessory structures were noted, one of which was torsed 720°; (Figure 3). An inguinal incision was then made and a third accessory structure was noted. All three structures were connected with fibrous tissue, giving a “rosary bead” appearance. The left accessory structures were removed, a left testicular biopsy was taken, and bilateral scrotal orchipexies were performed.Open in a separate windowFigure 3Torsed accessory spleen with splenogonadal fusion.Pathology revealed a normal left testicle with a fibrovascular cord with three nodules consistent with splenic tissue. The torsed splenule demonstrated hemorrhage with neutrophillic infiltrate and thrombus consistent with chronic infarction and torsion (Figure 4).Open in a separate windowFigure 4Splenogonadal fusion, continuous type with three accessory structures.  相似文献   

17.
Genome-scale metabolic models have been recognised as useful tools for better understanding living organisms’ metabolism. merlin (https://www.merlin-sysbio.org/) is an open-source and user-friendly resource that hastens the models’ reconstruction process, conjugating manual and automatic procedures, while leveraging the user''s expertise with a curation-oriented graphical interface. An updated and redesigned version of merlin is herein presented. Since 2015, several features have been implemented in merlin, along with deep changes in the software architecture, operational flow, and graphical interface. The current version (4.0) includes the implementation of novel algorithms and third-party tools for genome functional annotation, draft assembly, model refinement, and curation. Such updates increased the user base, resulting in multiple published works, including genome metabolic (re-)annotations and model reconstructions of multiple (lower and higher) eukaryotes and prokaryotes. merlin version 4.0 is the only tool able to perform template based and de novo draft reconstructions, while achieving competitive performance compared to state-of-the art tools both for well and less-studied organisms.  相似文献   

18.
The Bloom syndrome helicase, BLM, has numerous functions that prevent mitotic crossovers. We used unique features of Drosophila melanogaster to investigate origins and properties of mitotic crossovers that occur when BLM is absent. Induction of lesions that block replication forks increased crossover frequencies, consistent with functions for BLM in responding to fork blockage. In contrast, treatment with hydroxyurea, which stalls forks, did not elevate crossovers, even though mutants lacking BLM are sensitive to killing by this agent. To learn about sources of spontaneous recombination, we mapped mitotic crossovers in mutants lacking BLM. In the male germline, irradiation-induced crossovers were distributed randomly across the euchromatin, but spontaneous crossovers were nonrandom. We suggest that regions of the genome with a high frequency of mitotic crossovers may be analogous to common fragile sites in the human genome. Interestingly, in the male germline there is a paucity of crossovers in the interval that spans the pericentric heterochromatin, but in the female germline this interval is more prone to crossing over. Finally, our system allowed us to recover pairs of reciprocal crossover chromosomes. Sequencing of these revealed the existence of gene conversion tracts and did not provide any evidence for mutations associated with crossovers. These findings provide important new insights into sources and structures of mitotic crossovers and functions of BLM helicase.  相似文献   

19.
Learning processes in Drosophila have been studied through the use of Pavlovian associative memory tests, and these paradigms have been extremely useful in identifying both genetic factors and neuroanatomical structures that are essential to memory formation. Whether these same genes and brain compartments also contribute to memory formed from nonassociative experiences is not well understood. Exposures to environmental stressors such as predators are known to induce innate behavioral responses and can lead to new memory formation that allows a predator response to persist for days after the predator threat has been removed. Here, we utilize a unique form of nonassociative behavior in Drosophila where female flies detect the presence of endoparasitoid predatory wasps and alter their oviposition behavior to lay eggs in food containing high levels of alcohol. The predator-induced change in fly oviposition preference is maintained for days after wasps are removed, and this persistence in behavior requires a minimum continuous exposure time of 14 hr. Maintenance of this behavior is dependent on multiple long-term memory genes, including orb2, dunce, rutabaga, amnesiac, and Fmr1. Maintenance of the behavior also requires intact synaptic transmission of the mushroom body. Surprisingly, synaptic output from the mushroom body (MB) or the functions of any of these learning and memory genes are not required for the change in behavior when female flies are in constant contact with wasps. This suggests that perception of this predator that leads to an acute change in oviposition behavior is not dependent on the MB or dependent on learning and memory gene functions. Because wasp-induced oviposition behavior can last for days and its maintenance requires a functional MB and the wild-type products of several known learning and memory genes, we suggest that this constitutes a paradigm for a bona fide form of nonassociative long-term memory that is not dependent on associated experiences.  相似文献   

20.
The identification of subnetworks of interest—or active modules—by integrating biological networks with molecular profiles is a key resource to inform on the processes perturbed in different cellular conditions. We here propose MOGAMUN, a Multi-Objective Genetic Algorithm to identify active modules in MUltiplex biological Networks. MOGAMUN optimizes both the density of interactions and the scores of the nodes (e.g., their differential expression). We compare MOGAMUN with state-of-the-art methods, representative of different algorithms dedicated to the identification of active modules in single networks. MOGAMUN identifies dense and high-scoring modules that are also easier to interpret. In addition, to our knowledge, MOGAMUN is the first method able to use multiplex networks. Multiplex networks are composed of different layers of physical and functional relationships between genes and proteins. Each layer is associated to its own meaning, topology, and biases; the multiplex framework allows exploiting this diversity of biological networks. We applied MOGAMUN to identify cellular processes perturbed in Facio-Scapulo-Humeral muscular Dystrophy, by integrating RNA-seq expression data with a multiplex biological network. We identified different active modules of interest, thereby providing new angles for investigating the pathomechanisms of this disease.Availability: MOGAMUN is available at https://github.com/elvanov/MOGAMUN and as a Bioconductor package at https://bioconductor.org/packages/release/bioc/html/MOGAMUN.html. Contact: rf.uma-vinu@toduab.siana  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号