首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Augmenting Chinese hamster genome assembly by identifying regions of high confidence
Authors:Nandita Vishwanathan  Arpan A Bandyopadhyay  Hsu‐Yuan Fu  Mohit Sharma  Kathryn C Johnson  Joann Mudge  Thiruvarangan Ramaraj  Getiria Onsongo  Kevin A T Silverstein  Nitya M Jacob  Huong Le  George Karypis  Wei‐Shou Hu
Institution:1. Department of Chemical Engineering and Materials Science, University of Minnesota, Minneapolis, MN, USA;2. Department of Computer Science & Engineering, University of Minnesota, Minneapolis, MN, USA;3. National Center for Genome Resources (NCGR), Santa Fe, New Mexico, USA;4. Minnesota Supercomputing Institute (MSI), University of Minnesota, Minneapolis, MN, USA
Abstract:Chinese hamster Ovary (CHO) cell lines are the dominant industrial workhorses for therapeutic recombinant protein production. The availability of genome sequence of Chinese hamster and CHO cells will spur further genome and RNA sequencing of producing cell lines. However, the mammalian genomes assembled using shot‐gun sequencing data still contain regions of uncertain quality due to assembly errors. Identifying high confidence regions in the assembled genome will facilitate its use for cell engineering and genome engineering. We assembled two independent drafts of Chinese hamster genome by de novo assembly from shotgun sequencing reads and by re‐scaffolding and gap‐filling the draft genome from NCBI for improved scaffold lengths and gap fractions. We then used the two independent assemblies to identify high confidence regions using two different approaches. First, the two independent assemblies were compared at the sequence level to identify their consensus regions as ”high confidence regions“ which accounts for at least 78 % of the assembled genome. Further, a genome wide comparison of the Chinese hamster scaffolds with mouse chromosomes revealed scaffolds with large blocks of collinearity, which were also compiled as high‐quality scaffolds. Genome scale collinearity was complemented with EST based synteny which also revealed conserved gene order compared to mouse. As cell line sequencing becomes more commonly practiced, the approaches reported here are useful for assessing the quality of assembly and potentially facilitate the engineering of cell lines.
Keywords:Annotation  CHO cells  NextGen sequencing  Scaffolds  Synteny
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号