
1.

MitoLSDB: A Comprehensive Resource to Study Genotype to Phenotype Correlations in Human Mitochondrial DNA Variations

Shamnamole K Saakshi Jalali Vinod Scaria Anshu Bhardwaj 《PloS one》2013,8(4)

Human mitochondrial DNA (mtDNA) encodes a set of 37 genes which are essential structural and functional components of the electron transport chain. Variations in these genes have been implicated in a broad spectrum of diseases and are extensively reported in literature and various databases. In this study, we describe MitoLSDB, an integrated platform to catalogue disease association studies on mtDNA (http://mitolsdb.igib.res.in). The main goal of MitoLSDB is to provide a central platform for direct submissions of novel variants that can be curated by the Mitochondrial Research Community. MitoLSDB provides access to standardized and annotated data from literature and databases encompassing information from 5231 individuals, 675 populations and 27 phenotypes. This platform is developed using the Leiden Open (source) Variation Database (LOVD) software. MitoLSDB houses information on all 37 genes in each population amounting to 132397 variants, of which 5147 are unique. For each variant its genomic location as per the Revised Cambridge Reference Sequence, codon and amino acid change for variations in protein-coding regions, frequency, disease/phenotype, population, reference and remarks are also listed. MitoLSDB curators have also reported errors documented in literature which include 94 phantom mutations, 10 NUMTs, six documentation errors and one artefactual recombination. MitoLSDB is the largest repository of mtDNA variants systematically standardized and presented using the LOVD platform. We believe that this is a good starting resource to curate mtDNA variants and will facilitate direct submissions enhancing data coverage, annotation in context of pathogenesis and quality control by ensuring non-redundancy in reporting novel disease associated variants.

2.

CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics

Mohit Verma Vinay Kumar Ravi K. Patel Rohini Garg Mukesh Jain 《PloS one》2015,10(8)


3.

LncRBase: An Enriched Resource for lncRNA Information

Sohini Chakraborty Aritra Deb Ranjan Kumar Maji Sudipto Saha Zhumur Ghosh 《PloS one》2014,9(9)


4.

ALDB: A Domestic-Animal Long Noncoding RNA Database

Aimin Li Junying Zhang Zhongyin Zhou Lei Wang Yujuan Liu Yajun Liu 《PloS one》2015,10(4)

Background

Long noncoding RNAs (lncRNAs) have attracted significant attention in recent years due to their important roles in many biological processes. Domestic animals constitute a unique resource for understanding the genetic basis of phenotypic variation and are ideal models relevant to diverse areas of biomedical research. With improving sequencing technologies, numerous domestic-animal lncRNAs are now available. Thus, there is an immediate need for a database resource that can assist researchers to store, organize, analyze and visualize domestic-animal lncRNAs.

Results

The domestic-animal lncRNA database, named ALDB, is the first comprehensive database with a focus on the domestic-animal lncRNAs. It currently archives 12,103 pig intergenic lncRNAs (lincRNAs), 8,923 chicken lincRNAs and 8,250 cow lincRNAs. In addition to the annotations of lincRNAs, it offers related data that is not available yet in existing lncRNA databases (lncRNAdb and NONCODE), such as genome-wide expression profiles and animal quantitative trait loci (QTLs) of domestic animals. Moreover, a collection of interfaces and applications, such as the Basic Local Alignment Search Tool (BLAST), the Generic Genome Browser (GBrowse) and flexible search functionalities, are available to help users effectively explore, analyze and download data related to domestic-animal lncRNAs.

Conclusions

ALDB enables the exploration and comparative analysis of lncRNAs in domestic animals. A user-friendly web interface, integrated information and tools make it valuable to researchers in their studies. ALDB is freely available from http://res.xaut.edu.cn/aldb/index.jsp.

5.

An Approach to Function Annotation for Proteins of Unknown Function (PUFs) in the Transcriptome of Indian Mulberry

K. H. Dhanyalakshmi Mahantesha B. N. Naika R. S. Sajeevan Oommen K. Mathew K. Mohamed Shafi Ramanathan Sowdhamini Karaba N. Nataraja 《PloS one》2016,11(3)


6.

FmMDb: A Versatile Database of Foxtail Millet Markers for Millets and Bioenergy Grasses Research

Venkata Suresh B Mehanathan Muthamilarasan Gopal Misra Manoj Prasad 《PloS one》2013,8(8)

The prominent attributes of foxtail millet (Setaria italica L.) including its small genome size, short life cycle, inbreeding nature, and phylogenetic proximity to various biofuel crops have made this crop an excellent model system to investigate various aspects of architectural, evolutionary and physiological significances in Panicoid bioenergy grasses. After release of its whole genome sequence, large-scale genomic resources in terms of molecular markers were generated for the improvement of both foxtail millet and its related species. Hence it is now essential to congregate, curate and make available these genomic resources for the benefit of researchers and breeders working towards crop improvement. In view of this, we have constructed the Foxtail millet Marker Database (FmMDb; http://www.nipgr.res.in/foxtail.html), a comprehensive online database for information retrieval, visualization and management of large-scale marker datasets with unrestricted public access. FmMDb is the first database which provides complete marker information to the plant science community attempting to produce elite cultivars of millet and bioenergy grass species, thus addressing global food insecurity.

7.

ORFcor: Identifying and Accommodating ORF Prediction Inconsistencies for Phylogenetic Analysis

Jonathan L. Klassen Cameron R. Currie 《PloS one》2013,8(3)


8.

Genome-wide discovery and characterization of maize long non-coding RNAs

Lin Li Steven R Eichten Rena Shimizu Katherine Petsch Cheng-Ting Yeh Wei Wu Antony M Chettoor Scott A Givan Rex A Cole John E Fowler Matthew M S Evans Michael J Scanlon Jianming Yu Patrick S Schnable Marja C P Timmermans Nathan M Springer Gary J Muehlbauer 《Genome biology》2014,15(2):R40


9.

PROmiRNA: a new miRNA promoter recognition method uncovers the complex regulation of intronic miRNAs

Annalisa Marsico Matthew R Huska Julia Lasserre Haiyang Hu Dubravka Vucicevic Anne Musahl Ulf Andersson Orom Martin Vingron 《Genome biology》2013,14(8):R84

The regulation of intragenic miRNAs by their own intronic promoters is one of the open problems of miRNA biogenesis. Here, we describe PROmiRNA, a new approach for miRNA promoter annotation based on a semi-supervised statistical model trained on deepCAGE data and sequence features. We validate our results with existing annotation, PolII occupancy data and read coverage from RNA-seq data. Compared to previous methods PROmiRNA increases the detection rate of intronic promoters by 30%, allowing us to perform a large-scale analysis of their genomic features, as well as elucidate their contribution to tissue-specific regulation. PROmiRNA can be downloaded from http://promirna.molgen.mpg.de.

10.

A new approach for annotation of transposable elements using small RNA mapping

Moaine El Baidouri Kyung Do Kim Brian Abernathy Siwaret Arikit Florian Maumus Olivier Panaud Blake C. Meyers Scott A. Jackson 《Nucleic acids research》2015,43(13):e84

Transposable elements (TEs) are mobile genomic DNA sequences found in most organisms. They so densely populate the genomes of many eukaryotic species that they are often the major constituents. With the rapid generation of many plant genome sequencing projects over the past few decades, there is an urgent need for improved TE annotation as a prerequisite for genome-wide studies. Analogous to the use of RNA-seq for gene annotation, we propose a new method for de novo TE annotation that uses as a guide 24 nt-siRNAs that are a part of TE silencing pathways. We use this new approach, called TASR (for Transposon Annotation using Small RNAs), for de novo annotation of TEs in Arabidopsis, rice and soybean and demonstrate that this strategy can be successfully applied for de novo TE annotation in plants. Executable PERL is available for download from: http://tasr-pipeline.sourceforge.net/

11.

ChIPseek, a web-based analysis tool for ChIP data

Ting-Wen Chen Hsin-Pai Li Chi-Ching Lee Ruei-Chi Gan Po-Jung Huang Timothy H Wu Cheng-Yang Lee Yi-Feng Chang Petrus Tang 《BMC genomics》2014,15(1)


12.

Structured RNAs and synteny regions in the pig genome

Christian Anthon Hakim Tafer Jakob H Havgaard Bo Thomsen Jakob Hedegaard Stefan E Seemann Sachin Pundhir Stephanie Kehr Sebastian Bartschat Mathilde Nielsen Rasmus O Nielsen Merete Fredholm Peter F Stadler Jan Gorodkin 《BMC genomics》2014,15(1)

Background

Annotating mammalian genomes for noncoding RNAs (ncRNAs) is nontrivial since far from all ncRNAs are known and the computational models are resource demanding. Currently, the human genome holds the best mammalian ncRNA annotation, a result of numerous efforts by several groups. However, a more direct strategy is desired for the increasing number of sequenced mammalian genomes of which some, such as the pig, are relevant as disease models and production animals.

Results

We present a comprehensive annotation of structured RNAs in the pig genome. Combining sequence and structure similarity search as well as class specific methods, we obtained a conservative set with a total of 3,391 structured RNA loci of which 1,011 and 2,314, respectively, hold strong sequence and structure similarity to structured RNAs in existing databases. The RNA loci cover 139 cis-regulatory element loci, 58 lncRNA loci, 11 conflicts of annotation, and 3,183 ncRNA genes. The ncRNA genes comprise 359 miRNAs, 8 ribozymes, 185 rRNAs, 638 snoRNAs, 1,030 snRNAs, 810 tRNAs and 153 ncRNA genes not belonging to the aforementioned classes. When running the pipeline on a local shuffled version of the genome, we obtained no matches at the highest confidence level. Additional analysis of RNA-seq data from a pooled library from 10 different pig tissues added another 165 miRNA loci, yielding an overall annotation of 3,556 structured RNA loci. This annotation represents our best effort at making an automated annotation. To further enhance the reliability, 571 of the 3,556 structured RNAs were manually curated by methods depending on the RNA class while 1,581 were declared as pseudogenes. We further created a multiple alignment of pig against 20 representative vertebrates, from which RNAz predicted 83,859 de novo RNA loci with conserved RNA structures. 528 of the RNAz predictions overlapped with the homology based annotation or novel miRNAs. We further present a substantial synteny analysis which includes 1,004 lineage specific de novo RNA loci and 4 ncRNA loci in the known annotation specific for Laurasiatheria (pig, cow, dolphin, horse, cat, dog, hedgehog).

Conclusions

We have obtained one of the most comprehensive annotations for structured ncRNAs of a mammalian genome, which is likely to play central roles in both health modelling and production. The core annotation is available in Ensembl 70 and the complete annotation is available at http://rth.dk/resources/rnannotator/susscr102/version1.02.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-459) contains supplementary material, which is available to authorized users.
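
The locus counts quoted in the Results of this entry are internally consistent, which a few lines of Python can confirm. This is only an arithmetic check on the figures given in the abstract; the variable names are illustrative and are not taken from the authors' pipeline.

```python
# Counts quoted in the abstract; the names below are illustrative only.
ncrna_classes = {
    "miRNA": 359,
    "ribozyme": 8,
    "rRNA": 185,
    "snoRNA": 638,
    "snRNA": 1030,
    "tRNA": 810,
    "other": 153,
}
ncrna_genes = sum(ncrna_classes.values())

# cis-regulatory elements + lncRNA loci + annotation conflicts + ncRNA genes
homology_loci = 139 + 58 + 11 + ncrna_genes

# 165 further miRNA loci were added from the RNA-seq analysis
total_loci = homology_loci + 165

print(ncrna_genes, homology_loci, total_loci)
```

The three sums match the 3,183 ncRNA genes, 3,391 structured RNA loci and 3,556 overall loci stated in the abstract.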

13.

SNPAAMapperT2K: A genome-wide SNP downstream analysis and annotation pipeline for species annotated with NCBI.tbl data files

Yongsheng Bai 《Bioinformation》2014,10(11):711-715


14.

PLAZA: A Comparative Genomics Resource to Study Gene and Genome Evolution in Plants

Sebastian Proost Michiel Van Bel Lieven Sterck Kenny Billiau Thomas Van Parys Yves Van de Peer Klaas Vandepoele 《The Plant cell》2009,21(12):3718-3731


15.

CNVannotator: A Comprehensive Annotation Server for Copy Number Variation in the Human Genome

Min Zhao Zhongming Zhao 《PloS one》2013,8(11)

Copy number variation (CNV) is one of the most prevalent genetic variations in the genome, leading to an abnormal number of copies of moderate to large genomic regions. High-throughput technologies such as next-generation sequencing often identify thousands of CNVs involved in biological or pathological processes. Despite the growing demand to filter and classify CNVs by factors such as frequency in population, biological features, and function, surprisingly, no online web server for CNV annotations has been made available to the research community. Here, we present CNVannotator, a web server that accepts an input set of human genomic positions in a user-friendly tabular format. CNVannotator can perform genomic overlaps of the input coordinates using various functional features, including a list of the reported 356,817 common CNVs, 181,261 disease CNVs, as well as 140,342 SNPs from genome-wide association studies. In addition, CNVannotator incorporates 2,211,468 genomic features, including ENCODE regulatory elements, cytoband, segmental duplication, genome fragile site, pseudogene, promoter, enhancer, CpG island, and methylation site. For cancer research community users, CNVannotator can apply various filters to retrieve a subgroup of CNVs pinpointed in hundreds of tumor suppressor genes and oncogenes. In total, 5,277,234 unique genomic coordinates with functional features are available to generate an output in a plain text format that is free to download. In summary, we provide a comprehensive web resource for human CNVs. The annotated results along with the server can be accessed at http://bioinfo.mc.vanderbilt.edu/CNVannotator/.
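
The genomic overlap step described in this abstract amounts to intersecting query intervals with annotated feature intervals. The following is a minimal sketch of that idea, assuming half-open, 0-based coordinates and a simple linear scan. The function name, tuple layout and example features are hypothetical and do not reflect CNVannotator's actual implementation or data.

```python
from typing import List, Tuple

Feature = Tuple[str, int, int, str]  # (chromosome, start, end, label), hypothetical layout
Query = Tuple[str, int, int]         # (chromosome, start, end)


def overlapping_features(query: Query, features: List[Feature]) -> List[str]:
    """Return labels of features that overlap the query interval.

    Intervals are treated as half-open [start, end), so two intervals
    that merely touch at an endpoint do not count as overlapping.
    """
    chrom, start, end = query
    hits = []
    for f_chrom, f_start, f_end, label in features:
        if f_chrom == chrom and f_start < end and start < f_end:
            hits.append(label)
    return hits


# Made-up example features, for illustration only.
features = [
    ("chr1", 100, 200, "promoter_A"),
    ("chr1", 500, 900, "CpG_island_B"),
    ("chr2", 100, 200, "enhancer_C"),
]

print(overlapping_features(("chr1", 150, 600), features))
```

For feature sets of the size quoted in the abstract, a linear scan would likely be too slow, and an interval tree or a sorted sweep would be the more usual choice.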

16.

SeqBuster, a bioinformatic tool for the processing and analysis of small RNAs datasets, reveals ubiquitous miRNA modifications in human embryonic cells

Lorena Pantano Xavier Estivill Eulàlia Martí 《Nucleic acids research》2010,38(5):e34


17.

PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme

Aimin Li Junying Zhang Zhongyin Zhou 《BMC bioinformatics》2014,15(1)


18.

SEED Servers: High-Performance Access to the SEED Genomes, Annotations, and Metabolic Models

Ramy K. Aziz Scott Devoid Terrence Disz Robert A. Edwards Christopher S. Henry Gary J. Olsen Robert Olson Ross Overbeek Bruce Parrello Gordon D. Pusch Rick L. Stevens Veronika Vonstein Fangfang Xia 《PloS one》2012,7(10)

The remarkable advance in sequencing technology and the rising interest in medical and environmental microbiology, biotechnology, and synthetic biology resulted in a deluge of published microbial genomes. Yet, genome annotation, comparison, and modeling remain a major bottleneck to the translation of sequence information into biological knowledge, hence computational analysis tools are continuously being developed for rapid genome annotation and interpretation. Among the earliest, most comprehensive resources for prokaryotic genome analysis, the SEED project, initiated in 2003 as an integration of genomic data and analysis tools, now contains >5,000 complete genomes, a constantly updated set of curated annotations embodied in a large and growing collection of encoded subsystems, a derived set of protein families, and hundreds of genome-scale metabolic models. Until recently, however, maintaining current copies of the SEED code and data at remote locations has been a pressing issue. To allow high-performance remote access to the SEED database, we developed the SEED Servers (http://www.theseed.org/servers): four network-based servers intended to expose the data in the underlying relational database, support basic annotation services, offer programmatic access to the capabilities of the RAST annotation server, and provide access to a growing collection of metabolic models that support flux balance analysis. The SEED servers offer open access to regularly updated data, the ability to annotate prokaryotic genomes, the ability to create metabolic reconstructions and detailed models of metabolism, and access to hundreds of existing metabolic models. This work offers and supports a framework upon which other groups can build independent research efforts. 
Large integrations of genomic data represent one of the major intellectual resources driving research in biology, and programmatic access to the SEED data will provide significant utility to a broad collection of potential users.

19.

CAGEr: precise TSS data retrieval and high-resolution promoterome mining for integrative analyses

Vanja Haberle Alistair R.R. Forrest Yoshihide Hayashizaki Piero Carninci Boris Lenhard 《Nucleic acids research》2015,43(8):e51


20.

Draft genome sequence of marine alphaproteobacterial strain HIMB11, the first cultivated representative of a unique lineage within the Roseobacter clade possessing an unusually small genome

Bryndan P. Durham Jana Grote Kerry A. Whittaker Sara J. Bender Haiwei Luo Sharon L. Grim Julia M. Brown John R. Casey Antony Dron Lennin Florez-Leiva Andreas Krupke Catherine M. Luria Aric H. Mine Olivia D. Nigro Santhiska Pather Agathe Talarmin Emma K. Wear Thomas S. Weber Jesse M. Wilson Matthew J. Church Edward F. DeLong David M. Karl Grieg F. Steward John M. Eppley Nikos C. Kyrpides Stephan Schuster Michael S. Rappé 《Standards in genomic sciences》2014,9(3):632-645

Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters.