期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts

Alison C Testa James K Hane Simon R Ellwood Richard P Oliver 《BMC genomics》2015,16(1)

相似文献

2.

IC4R-2.0:Rice Genome Reannotation Using Massive RNA-seq Data

《基因组蛋白质组与生物信息学报(英文版)》2020,18(2):161-172

Genome reannotation aims for complete and accurate characterization of gene models and thus is of critical significance for in-depth exploration of gene function. Although the availability of massive RNA-seq data provides great opportunities for gene model refinement, few efforts have been made to adopt these precious data in rice genome reannotation. Here we reannotate the rice (Oryza sativa L. ssp. japonica) genome based on integration of large-scale RNA-seq data and release a new annotation system IC4R-2.0. In general, IC4R-2.0 significantly improves the completeness of gene structure, identifies a number of novel genes, and integrates a variety of functional annotations. Furthermore, long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) are systematically characterized in the rice genome. Performance evaluation shows that compared to previous annotation systems, IC4R-2.0 achieves higher integrity and quality, primarily attributable to massive RNA-seq data applied in genome annotation. Consequently, we incorporate the improved annotations into the Information Commons for Rice (IC4R), a database integrating multiple omics data of rice, and accordingly update IC4R by providing more user-friendly web interfaces and implementing a series of practical online tools. Together, the updated IC4R, which is equipped with the improved annotations, bears great promise for comparative and functional genomic studies in rice and other monocotyledonous species. The IC4R-2.0 annotation system and related resources are freely accessible at http://ic4r.org/. 相似文献

3.

Improving the Annotation of Arabidopsis lyrata Using RNA-Seq Data

Vimal Rawat Ahmed Abdelsamad Bj?rn Pietzenuk Danelle K. Seymour Daniel Koenig Detlef Weigel Ales Pecinka Korbinian Schneeberger 《PloS one》2015,10(9)

相似文献

4.

Transcriptome analysis of the model protozoan, Tetrahymena thermophila, using Deep RNA sequencing

Xiong J Lu X Zhou Z Chang Y Yuan D Tian M Zhou Z Wang L Fu C Orias E Miao W 《PloS one》2012,7(2):e30630

相似文献

5.

Tiling Assembly: a new tool for reference annotation-independent transcript assembly and novel gene identification by RNA-sequencing

Kenneth A. Watanabe Arielle Homayouni Tara Tufano Jennifer Lopez Patricia Ringler Paul Rushton Qingxi J. Shen 《DNA research》2015,22(5):319-329

相似文献

6.

Predicting the functional repertoire of an organism from unassembled RNA–seq data

Manuel Landesfeind Peter Meinicke 《BMC genomics》2014,15(1)

相似文献

7.

A Transcriptome Map of Actinobacillus pleuropneumoniae at Single-Nucleotide Resolution Using Deep RNA-Seq

Zhipeng Su Jiawen Zhu Zhuofei Xu Ran Xiao Rui Zhou Lu Li Huanchun Chen 《PloS one》2016,11(3)

相似文献

8.

A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae

Intawat Nookaew Marta Papini Natapol Pornputtapong Gionata Scalcinati Linn Fagerberg Matthias Uhl��n Jens Nielsen 《Nucleic acids research》2012,40(20):10084-10097

相似文献

9.

Multifaceted biological insights from a draft genome sequence of the tobacco hornworm moth,Manduca sexta

《Insect biochemistry and molecular biology》2016

Manduca sexta, known as the tobacco hornworm or Carolina sphinx moth, is a lepidopteran insect that is used extensively as a model system for research in insect biochemistry, physiology, neurobiology, development, and immunity. One important benefit of this species as an experimental model is its extremely large size, reaching more than 10 g in the larval stage. M. sexta larvae feed on solanaceous plants and thus must tolerate a substantial challenge from plant allelochemicals, including nicotine. We report the sequence and annotation of the M. sexta genome, and a survey of gene expression in various tissues and developmental stages. The Msex_1.0 genome assembly resulted in a total genome size of 419.4 Mbp. Repetitive sequences accounted for 25.8% of the assembled genome. The official gene set is comprised of 15,451 protein-coding genes, of which 2498 were manually curated. Extensive RNA-seq data from many tissues and developmental stages were used to improve gene models and for insights into gene expression patterns. Genome wide synteny analysis indicated a high level of macrosynteny in the Lepidoptera. Annotation and analyses were carried out for gene families involved in a wide spectrum of biological processes, including apoptosis, vacuole sorting, growth and development, structures of exoskeleton, egg shells, and muscle, vision, chemosensation, ion channels, signal transduction, neuropeptide signaling, neurotransmitter synthesis and transport, nicotine tolerance, lipid metabolism, and immunity. This genome sequence, annotation, and analysis provide an important new resource from a well-studied model insect species and will facilitate further biochemical and mechanistic experimental studies of many biological systems in insects. 相似文献

10.

Re-annotation of the woodland strawberry (Fragaria vesca) genome

Omar Darwish Rachel Shahan Zhongchi Liu Janet P Slovin Nadim W Alkharouf 《BMC genomics》2015,16(1)

相似文献

11.

Using online tools at the Bovine Genome Database to manually annotate genes in the new reference genome

D. A. Triant J. J. Le Tourneau C. M. Diesh D. R. Unni M. Shamimuzzaman A. T. Walsh J. Gardiner A. K. Goldkamp Y. Li H. N. Nguyen C. Roberts Z. Zhao L. J. Alexander J. E. Decker R. D. Schnabel S. G. Schroeder T. S. Sonstegard J. F. Taylor R. M. Rivera D. E. Hagen C. G. Elsik 《Animal genetics》2020,51(5):675-682

With the availability of a new highly contiguous Bos taurus reference genome assembly (ARS-UCD1.2), it is the opportune time to upgrade the bovine gene set by seeking input from researchers. Furthermore, advances in graphical genome annotation tools now make it possible for researchers to leverage sequence data generated with the latest technologies to collaboratively curate genes. For many years the Bovine Genome Database (BGD) has provided tools such as the Apollo genome annotation editor to support manual bovine gene curation. The goal of this paper is to explain the reasoning behind the decisions made in the manual gene curation process while providing examples using the existing BGD tools. We will describe the sources of gene annotation evidence provided at the BGD, including RNA-seq and Iso-Seq data. We will also explain how to interpret various data visualizations when curating gene models, and will demonstrate the value of manual gene annotation. The process described here can be applied to manual gene curation for other species with similar tools. With a better understanding of manual gene annotation, researchers will be encouraged to edit gene models and contribute to the enhancement of livestock gene sets. 相似文献

12.

Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm

Alexandre Lomsadze Paul D. Burns Mark Borodovsky 《Nucleic acids research》2014,42(15):e119

相似文献

13.

The Solanum commersonii Genome Sequence Provides Insights into Adaptation to Stress Conditions and Genome Evolution of Wild Potato Relatives

Riccardo Aversano Felice Contaldi Maria Raffaella Ercolano Valentina Grosso Massimo Iorizzo Filippo Tatino Luciano Xumerle Alessandra Dal Molin Carla Avanzato Alberto Ferrarini Massimo Delledonne Walter Sanseverino Riccardo Aiese Cigliano Salvador Capella-Gutierrez Toni Gabaldón Luigi Frusciante James M. Bradeen Domenico Carputo 《The Plant cell》2015,27(4):954-968

Here, we report the draft genome sequence of Solanum commersonii, which consists of ∼830 megabases with an N50 of 44,303 bp anchored to 12 chromosomes, using the potato (Solanum tuberosum) genome sequence as a reference. Compared with potato, S. commersonii shows a striking reduction in heterozygosity (1.5% versus 53 to 59%), and differences in genome sizes were mainly due to variations in intergenic sequence length. Gene annotation by ab initio prediction supported by RNA-seq data produced a catalog of 1703 predicted microRNAs, 18,882 long noncoding RNAs of which 20% are shown to target cold-responsive genes, and 39,290 protein-coding genes with a significant repertoire of nonredundant nucleotide binding site-encoding genes and 126 cold-related genes that are lacking in S. tuberosum. Phylogenetic analyses indicate that domesticated potato and S. commersonii lineages diverged ∼2.3 million years ago. Three duplication periods corresponding to genome enrichment for particular gene families related to response to salt stress, water transport, growth, and defense response were discovered. The draft genome sequence of S. commersonii substantially increases our understanding of the domesticated germplasm, facilitating translation of acquired knowledge into advances in crop stability in light of global climate and environmental changes. 相似文献

14.

The ‘TranSeq’ 3′‐end sequencing method for high‐throughput transcriptomics and gene space refinement in plant genomes

下载免费PDF全文

Oren Tzfadia Samuel Bocobza Jonas Defoort Efrat Almekias‐Siegl Sayantan Panda Matan Levy Veronique Storme Stephane Rombauts Diego Adhemar Jaitin Hadas Keren‐Shaul Yves Van de Peer Asaph Aharoni 《The Plant journal : for cell and molecular biology》2018,96(1):223-232

相似文献

15.

Towards precise classification of cancers based on robust gene functional expression profiles

Zheng Guo Tianwen Zhang Xia Li Qi Wang Jianzhen Xu Hui Yu Jing Zhu Haiyun Wang Chenguang Wang Eric J Topol Qing Wang Shaoqi Rao 《BMC bioinformatics》2005,6(1):1-12

Background

Despite the continuous production of genome sequence for a number of organisms, reliable, comprehensive, and cost effective gene prediction remains problematic. This is particularly true for genomes for which there is not a large collection of known gene sequences, such as the recently published chicken genome. We used the chicken sequence to test comparative and homology-based gene-finding methods followed by experimental validation as an effective genome annotation method.

Results

We performed experimental evaluation by RT-PCR of three different computational gene finders, Ensembl, SGP2 and TWINSCAN, applied to the chicken genome. A Venn diagram was computed and each component of it was evaluated. The results showed that de novo comparative methods can identify up to about 700 chicken genes with no previous evidence of expression, and can correctly extend about 40% of homology-based predictions at the 5' end.

Conclusions

De novo comparative gene prediction followed by experimental verification is effective at enhancing the annotation of the newly sequenced genomes provided by standard homology-based methods. 相似文献

16.

Citrus sinensis Annotation Project (CAP): A Comprehensive Database for Sweet Orange Genome

Jia Wang Dijun Chen Yang Lei Ji-Wei Chang Bao-Hai Hao Feng Xing Sen Li Qiang Xu Xiu-Xin Deng Ling-Ling Chen 《PloS one》2014,9(1)

相似文献

17.

Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing,RNA-Seq and ESTs

Nicholas J. Schurch Christian Cole Alexander Sherstnev Junfang Song Céline Duc Kate G. Storey W. H. Irwin McLean Sara J. Brown Gordon G. Simpson Geoffrey J. Barton 《PloS one》2014,9(4)

The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3′ untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3′ polyadenylation sites to within +/− 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3′ UTR re-annotation (including extension of one 3′ UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental data. 相似文献

18.

ShortStack: Comprehensive annotation and quantification of small RNA genes

Michael J. Axtell 《RNA (New York, N.Y.)》2013,19(6):740-751

相似文献

19.

Improving mRNA 5′ coding sequence determination in the mouse genome

Allison Piovesan Maria Caracausi Maria Chiara Pelleri Lorenza Vitale Silvia Martini Chiara Bassani Annalisa Gurioli Raffaella Casadei Giulia Soldà Pierluigi Strippoli 《Mammalian genome》2014,25(3-4):149-159

相似文献

20.

Characterizing and annotating the genome using RNA-seq data 总被引：2，自引：0，他引：2

Geng Chen Tieliu Shi Leming Shi

《中国科学：生命科学英文版》

相似文献