Similar literature
 Found 20 similar documents (search time: 15 ms)
1.
Advances in high-throughput sequencing (HTS) have fostered rapid developments in the field of microbiome research, and massive microbiome datasets are now being generated. However, the diversity of software tools and the complexity of analysis pipelines make it difficult to access this field. Here, we systematically summarize the advantages and limitations of microbiome methods. Then, we recommend specific pipelines for amplicon and metagenomic analyses, and describe commonly used software and databases, to help researchers select the appropriate tools. Furthermore, we introduce statistical and visualization methods suitable for microbiome analysis, including alpha- and beta-diversity, taxonomic composition, difference comparisons, correlation, networks, machine learning, evolution, source tracing, and common visualization styles to help researchers make informed choices. Finally, a step-by-step reproducible analysis guide is introduced. We hope this review will allow researchers to carry out data analysis more effectively and to quickly select the appropriate tools in order to efficiently mine the biological significance behind the data.

2.
3.
The analysis of cytosine methylation provides a new way to assess and describe epigenetic regulation at a whole-genome level in many eukaryotes. DNA methylation has a demonstrated role in genome stability and protection, regulation of gene expression and many other aspects of genome function and maintenance. BS-seq is a relatively unbiased method for profiling DNA methylation, with a resolution capable of measuring methylation at individual cytosines. Here we describe, as an example, a workflow to handle DNA methylation analysis, from BS-seq library preparation to data visualization. We describe some applications for the analysis and interpretation of these data. Our laboratory provides public access to plant DNA methylation data via visualization tools available at our “Next-Gen Sequence” websites (http://mpss.udel.edu), along with small RNA, RNA-seq and other data types.

4.
SUMMARY: We present a Cytoscape plugin for the inference and visualization of networks from high-resolution mass spectrometry metabolomic data. The software also provides access to basic topological analysis. This open source, multi-platform software has been successfully used to interpret metabolomic experiments and will enable others using filtered, high mass accuracy mass spectrometric data sets to build and analyse networks. AVAILABILITY: http://compbio.dcs.gla.ac.uk/fabien/abinitio/abinitio.html

5.
6.
Legacy biodiversity data from natural history and survey collections are rapidly becoming available in a common format over the Internet. Over 110 million records are already being served from the Global Biodiversity Information Facility (GBIF). However, our ability to use this information effectively for ecological research, management and conservation lags behind. A solution is a web-based Geographic Information System for enabling visualization and analysis of this rapidly expanding data resource. In this paper we detail a case study system, GBIF Mapping and Analysis Portal Application (MAPA), developed for deployment at distributed database portals. Building such a system requires overcoming a series of technical and research challenges. These challenges include: assuring fast speed of access to the vast amounts of data available through these distributed biodiversity databases; developing open standards based access to suitable environmental data layers for analyzing biodiversity distribution; building suitably flexible and intuitive map interfaces for refining the scope and criteria of an analysis; and building appropriate web-services based analysis tools that are of primary importance to the ecological community and make manifest the value of online biodiversity GBIF data. After discussing how we overcome these challenges, we provide case studies showing two examples of the use of GBIF-MAPA analysis tools.

7.
Proper visualization of scientific data is important for understanding spatial relationships. Particularly in the field of structural biology, where researchers seek to gain an understanding of the structure and function of biological macromolecules, it is important to have access to visualization programs which are fast, flexible, and customizable. We present KiNG, a Java program for visualizing scientific data, with a focus on macromolecular visualization. KiNG uses the kinemage graphics format, which is tuned for macromolecular structures, but is also ideal for many other kinds of spatially embedded information. KiNG is written in cross‐platform, open‐source Java code, and can be extended by end users through simple or elaborate “plug‐in” modules. Here, we present three such applications of KiNG to problems in structural biology (protein backbone rebuilding), bioinformatics of high‐dimensional data (e.g., protein sidechain chi angles), and classroom education (molecular illustration). KiNG is a mature platform for rapidly creating and capitalizing on scientific visualizations. As a research tool, it is invaluable as a test bed for new methods of visualizing scientific data and information. It is also a powerful presentation tool, whether for structure browsing, teaching, direct 3D display on the web, or as a method for creating pictures and videos for publications. KiNG is freely available for download at http://kinemage.biochem.duke.edu.

8.
Marla S, Singh VK. In Silico Biology, 2007, 7(4-5): 543-545
Recent sequencing of genomes of several microorganisms provides an opportunity to have access to huge volumes of data stored in various databases. This has resulted in the development of various computational and visualization tools to aid in retrieval and analysis of data. Development of user-friendly genome data mapping and visualization tools helps researchers closely examine various features of genes and efficiently make inferences from the displayed data. PGV (Prokaryotic Genome Viewer) is a Java-based web application capable of generating high-quality interactive circular chromosome maps. With simple mouse rollover actions on the region of interest on the displayed map, the user is provided with features such as feature labeling, multi-fold zooming, image rotation and hyperlinking to different information resources. The tool is capable of instantaneously generating maps using user-supplied sequence data.

9.
Genomic studies are now being undertaken on thousands of samples requiring new computational tools that can rapidly analyze data to identify clinically important features. Inferring structural variations in cancer genomes from mate-paired reads is a combinatorially difficult problem. We introduce Fastbreak, a fast and scalable toolkit that enables the analysis and visualization of large amounts of data from projects such as The Cancer Genome Atlas.

10.
11.
We report a new approach to probing DNA-protein interactions by combining optical tweezers with a high-throughput DNA curtains technique. Here we determine the forces required to remove the individual lipid-anchored DNA molecules from the bilayer. We demonstrate that DNA anchored to the bilayer through a single biotin-streptavidin linkage withstands ∼20 pN before being pulled free from the bilayer, whereas molecules anchored to the bilayer through multiple attachment points can withstand ≥65 pN; access to this higher force regime is sufficient to probe the responses of protein-DNA interactions to force changes. As a proof-of-principle, we visualized DNA-bound fluorescently-tagged RNA polymerase while simultaneously stretching the DNA molecules. This work presents a step towards a powerful experimental platform that will enable concurrent visualization of DNA curtains while applying defined forces through optical tweezers.

12.
We describe Vivaldi (VIsualization and VALidation DIsplay; http://pdbe.org/vivaldi), a web‐based service for the analysis, visualization, and validation of NMR structures in the Protein Data Bank (PDB). Vivaldi provides access to model coordinates and several types of experimental NMR data using interactive visualization tools, augmented with structural annotations and model‐validation information. The service presents information about the modeled NMR ensemble, validation of experimental chemical shifts, residual dipolar couplings, distance and dihedral angle constraints, as well as validation scores based on empirical knowledge and databases. Vivaldi was designed for both expert NMR spectroscopists and casual non‐expert users who wish to obtain a better grasp of the information content and quality of NMR structures in the public archive. © Proteins 2013. © 2012 Wiley Periodicals, Inc.

13.
As next-generation sequencing (NGS) technology has become widely used to identify genetic causal variants for various diseases and traits, a number of packages for checking NGS data quality have sprung up in the public domain. In addition to the quality of sequencing data, sample quality issues, such as gender mismatch, abnormal inbreeding coefficient, cryptic relatedness, and population outliers, can also have a fundamental impact on downstream analysis. However, there is a lack of tools specialized in identifying problematic samples from NGS data, often due to the limitation of sample size and variant counts. We developed SeqSQC, a Bioconductor package, to automate and accelerate sample cleaning in NGS data of any scale. SeqSQC is designed for efficient data storage and access, and equipped with interactive plots for intuitive data visualization to expedite the identification of problematic samples. SeqSQC is available at http://bioconductor.org/packages/SeqSQC.

14.
Physcomitrella patens is a bryophyte model plant that is often used to study plant evolution and development. Its resources are of great importance for comparative genomics and evo‐devo approaches. However, expression data from Physcomitrella patens were so far generated using different gene annotation versions and three different platforms: CombiMatrix and NimbleGen expression microarrays and RNA sequencing. The currently available P. patens expression data are distributed across three tools with different visualization methods to access the data. Here, we introduce an interactive expression atlas, Physcomitrella Expression Atlas Tool (PEATmoss), that unifies publicly available expression data for P. patens and provides multiple visualization methods to query the data in a single web‐based tool. Moreover, PEATmoss includes 35 expression experiments not previously available in any other expression atlas. To facilitate gene expression queries across different gene annotation versions, and to access P. patens annotations and related resources, a lookup database and web tool linked to PEATmoss was implemented. PEATmoss can be accessed at https://peatmoss.online.uni-marburg.de

15.
MOTIVATION: Protein sequence and family data is accumulating at such a rapid rate that state-of-the-art databases and interface tools are required to aid curators with their classifications. We have designed such a system, MetaFam, to facilitate the comparison and integration of public protein sequence and family data. This paper presents the global schema, integration issues, and query capabilities of MetaFam. RESULTS: MetaFam is an integrated data warehouse of information about protein families and their sequences. This data has been collected into a consistent global schema, and stored in an Oracle relational database. The warehouse implementation allows for quick removal of outdated data sets. In addition to the relational implementation of the primary schema, we have developed several derived tables that enable efficient access from data visualization and exploration tools. Through a series of straightforward SQL queries, we demonstrate the usefulness of this data warehouse for comparing protein family classifications and for functional assignment of new sequences.

16.
17.
We introduce a new and unified, compressed volumetric representation for macromolecular structures at varying feature resolutions, as well as for many computed associated properties. Important features of this compressed representation are fast random data access and decompression operations. Many computational tasks for manipulating large structures, including those requiring interactivity such as real-time visualization, are greatly enhanced by utilizing this compact representation. The compression scheme is obtained by using a custom-designed hierarchical wavelet basis construction. Due to the continuity offered by these wavelets, we retain very good accuracy of molecular surfaces, at very high compression ratios, for macromolecular structures at multiple resolutions.

18.
Brain atlases and associated databases have great potential as gateways for navigating, accessing, and visualizing a wide range of neuroscientific data. Recent progress towards realizing this potential includes the establishment of probabilistic atlases, surface-based atlases and associated databases, combined with improvements in visualization capabilities and internet access.

19.
With several plant genomes sequenced, the power of comparative genome analysis can now be applied. However, genome-scale cross-species analyses are limited by the effort for data integration. To develop an integrated cross-species plant genome resource, we maintain comprehensive databases for model plant genomes, including Arabidopsis (Arabidopsis thaliana), maize (Zea mays), Medicago truncatula, and rice (Oryza sativa). Integration of data and resources is emphasized, both in house as well as with external partners and databases. Manual curation and state-of-the-art bioinformatic analysis are combined to achieve quality data. Easy access to the data is provided through Web interfaces and visualization tools, bulk downloads, and Web services for application-level access. This allows a consistent view of the model plant genomes for comparative and evolutionary studies, the transfer of knowledge between species, and the integration with functional genomics data.

20.
Major advances in high-throughput, high-resolution, 3D microscopy techniques have enabled the acquisition of large volumes of neuroanatomical data at submicrometer resolution. One of the first such instruments producing whole-brain-scale data is the Knife-Edge Scanning Microscope (KESM) [7, 5, 9], developed and hosted in the authors' lab. KESM has been used to section and image whole mouse brains at submicrometer resolution, revealing the intricate details of the neuronal networks (Golgi) [1, 4, 8], vascular networks (India ink) [1, 4], and cell body distribution (Nissl) [3]. The use of KESM is restricted to neither the mouse nor the brain. We have successfully imaged the octopus brain [6], mouse lung, and rat brain. We are currently working on whole zebrafish embryos. Data like these can greatly contribute to connectomics research [10]; to microcirculation and hemodynamic research; and to stereology research by providing an exact ground truth. In this article, we will describe the pipeline, including specimen preparation (fixing, staining, and embedding), KESM configuration and setup, sectioning and imaging with the KESM, image processing, data preparation, and data visualization and analysis. The emphasis will be on specimen preparation and visualization/analysis of obtained KESM data. We expect the detailed protocol presented in this article to help broaden the access to KESM and increase its utilization.
