首页 | 本学科首页   官方微博 | 高级检索  
     


Understanding the Differences between Genome Sequences of Escherichia coli B Strains REL606 and BL21(DE3) and Comparison of the E. coli B and K-12 Genomes
Authors:F. William Studier  Patrick Daegelen  Richard E. Lenski  Jihyun F. Kim
Affiliation:1 Biology Department, Brookhaven National Laboratory, PO Box 5000, Upton, NY 11973-5000, USA
2 CNRS UMR 8030, Genoscope (CEA), 2 rue Gaston Crémieux, CP 5706, 91000 Evry Cedex, France
3 Inserm, 101 rue de Tolbiac, 75013 Paris, France
4 Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824, USA
5 Department of Condensed Matter Physics and Materials Science, Brookhaven National Laboratory, Upton, NY 11973, USA
6 Industrial Biotechnology and Bioenergy Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), 111 Gwahangno, Yuseong, Daejeon 305-806, Korea
7 Functional Genomics Program, University of Science and Technology, Yuseong, Daejeon 305-333, Korea
Abstract:Each difference between the genome sequences of Escherichia coli B strains REL606 and BL21(DE3) can be interpreted in light of known laboratory manipulations plus a gene conversion between ribosomal RNA operons. Two treatments with 1-methyl-3-nitro-1-nitrosoguanidine in the REL606 lineage produced at least 93 single-base-pair mutations (∼ 90% GC-to-AT transitions) and 3 single-base-pair GC deletions. Two UV treatments in the BL21(DE3) lineage produced only 4 single-base-pair mutations but 16 large deletions. P1 transductions from K-12 into the two B lineages produced 317 single-base-pair differences and 9 insertions or deletions, reflecting differences between B DNA in BL21(DE3) and integrated restriction fragments of K-12 DNA inherited by REL606. Two sites showed selective enrichment of spontaneous mutations. No unselected spontaneous single-base-pair mutations were evident. The genome sequences revealed that a progenitor of REL606 had been misidentified, explaining initially perplexing differences. Limited sequencing of other B strains defined characteristic properties of B and allowed assembly of the inferred genome of the ancestral B of Delbrück and Luria. Comparison of the B and K-12 genomes shows that more than half of the 3793 proteins of their basic genomes are predicted to be identical, although ∼ 310 appear to be functional in either B or K-12 but not in both. The ancestral basic genome appears to have had ∼ 4039 coding sequences occupying ∼ 4.0 Mbp. Repeated horizontal transfer from diverged Escherichia coli genomes and homologous recombination may explain the observed variable distribution of single-base-pair differences. Fifteen sites are occupied by phage-related elements, but only six by comparable elements at the same site. More than 50 sites are occupied by IS elements in both B and K, 16 in common, and likely founding IS elements are identified. A signature of widespread cryptic phage P4-type mobile elements was identified. Complex deletions (dense clusters of small deletions and substitutions) apparently removed nonessential genes from ∼ 30 sites in the basic genomes.
Keywords:SNP, single-base-pair difference   MNNG, 1-methyl-3-nitro-1-nitrosoguanidine   LPS, lipopolysaccharide   indel, insertion or deletion   orf, open reading frame (protein-coding sequence)
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号