首页 | 本学科首页   官方微博 | 高级检索  
     


Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum
Authors:Clícia Grativol  Michael Regulski  Marcelo Bertalan  W. Richard McCombie  Felipe Rodrigues da Silva  Adhemar Zerlotini Neto  Renato Vicentini  Laurent Farinelli  Adriana Silva Hemerly  Robert A. Martienssen  Paulo Cavalcanti Gomes Ferreira
Affiliation:1. Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, , Rio de Janeiro, 21941‐599 RJ, Brazil;2. Cold Spring Harbor Laboratory, , Cold Spring Harbor, NY, 11724 USA;3. Institute of Biological Psychiatry Mental Health Center, , Roskilde, DK‐4000 Denmark;4. Embrapa Informática Agropecuária, , Campinas, SP, Brazil;5. Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas, , Campinas, SP, Brazil;6. Fasteris SA, , Plan‐les‐Ouates, 1228 Switzerland;7. Cold Spring Harbor Laboratory, Howard Hughes Medical Institute and Gordon and Betty Moore Foundation, , Cold Spring Harbor, NY, 11724 USA
Abstract:Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun (WGS). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene‐rich regions. Gene‐enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with McrBC endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to gene‐enrichment strategy, we have compared assemblies using methyl‐filtered (MF) and unfiltered (UF) libraries. The use of methy filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with unfiltered dataset. The coverage of sorghum coding sequences (CDS) by MF scaffolds was at least 36% higher than by the use of UF scaffolds. Using MF technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes (BACs), 97.2% of sugarcane expressed sequence tags (ESTs), 92.7% of sugarcane RNA‐seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single‐nucleotide polymorphisms (SNPs) in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes was also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus Saccharum and for improvement of sugarcane as a biofuel crop.
Keywords:methylation filtration  gene‐rich regions  de novo assembly  sugarcane  Saccharum spp
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号