首页 | 本学科首页   官方微博 | 高级检索  
     


Mapping multiple co-sequenced T-DNA integration sites within the Arabidopsis genome
Authors:Presting Gernot G
Affiliation:Torrey Mesa Research Institute, 3115 Merryfield Row, San Diego, CA 92121, USA. gernot@genome.clemson.edu
Abstract:MOTIVATION: Insertion mutagenesis, using transgenes or endogenous transposons, is a popular method for generating null mutations (knockouts) in model organisms. Insertions are mapped to specific genes by amplifying (via TAIL-PCR) and sequencing genomic regions flanking the inserted DNA. The presence of multiple TAIL-PCR templates in one sequencing reaction results in chimeric sequence of intermittently low quality. Standard processing of this sequence by applying Phred quality requirements results in loss of informative sequence, whereas not trimming low-quality sequence causes inclusion of low-complexity homopolymers from the ends of sequence runs. Accurate mapping of the flanking sequences is complicated by the presence of gene families. RESULTS: Methods for extracting informative regions from sequence traces obtained by sequencing multiple TAIL-PCR fragments in a single reaction are described. The completely sequenced Arabidopsis genome was used to identify informative TAIL-PCR sequence regions. Methods were devised to define and select high quality matches and precisely map each insert to the correct genome location. These methods were used to analyze sequence of TAIL-PCR-amplified flanking regions of the inserts from individual plants in a T-DNA-mutagenized population of Arabidopsis thaliana, and are applicable to similar situations where a reference genome can be used to extract information from poor-quality sequence.
Keywords:
本文献已被 PubMed Oxford 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号