Discovery of transgene insertion sites by high throughput sequencing of mate pair libraries |
| |
Authors: | Anuj Srivastava Vivek M Philip Ian Greenstein Lucy B Rowe Mary Barter Cathleen Lutz Laura G Reinholdt |
| |
Affiliation: | .Computational Sciences, The Jackson Laboratory, Bar Harbor, ME USA ;.Genetic Resource Sciences, The Jackson Laboratory, Bar Harbor, ME USA ;.Genome Technologies, The Jackson Laboratory, Bar Harbor, ME USA |
| |
Abstract: | BackgroundTransgenesis by random integration of a transgene into the genome of a zygote has become a reliable and powerful method for the creation of new mouse strains that express exogenous genes, including human disease genes, tissue specific reporter genes or genes that allow for tissue specific recombination. Nearly 6,500 transgenic alleles have been created by random integration in embryos over the last 30 years, but for the vast majority of these strains, the transgene insertion sites remain uncharacterized.ResultsTo obtain a complete understanding of how insertion sites might contribute to phenotypic outcomes, to more cost effectively manage transgenic strains, and to fully understand mechanisms of instability in transgene expression, we’ve developed methodology and a scoring scheme for transgene insertion site discovery using high throughput sequencing data.ConclusionsSimilar to other molecular approaches to transgene insertion site discovery, high-throughput sequencing of standard paired-end libraries is hindered by low signal to noise ratios. This problem is exacerbated when the transgene consists of sequences that are also present in the host genome. We’ve found that high throughput sequencing data from mate-pair libraries are more informative when compared to data from standard paired end libraries. We also show examples of the genomic regions that harbor transgenes, which have in common a preponderance of repetitive sequences.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-367) contains supplementary material, which is available to authorized users. |
| |
Keywords: | High-throughput sequencing Mate pair library Transgenic Transgene insertion sites |
|
|