期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

共查询到20条相似文献，搜索用时 93 毫秒

Cloning,Sequence Analysis,and Characterization of the Genes Involved in Isoprimeverose Metabolism in Lactobacillus pentosus

Stéphane Chaillou B. Christien Lokman Rob J. Leer Clara Posthuma Pieter W. Postma Peter H. Pouwels 《Journal of bacteriology》1998,180(9):2312-2320

相似文献

Homo-d-Lactic Acid Fermentation from Arabinose by Redirection of the Phosphoketolase Pathway to the Pentose Phosphate Pathway in l-Lactate Dehydrogenase Gene-Deficient Lactobacillus plantarum

Kenji Okano Shogo Yoshida Tsutomu Tanaka Chiaki Ogino Hideki Fukuda Akihiko Kondo 《Applied and environmental microbiology》2009,75(15):5175-5178

相似文献

DdlN from Vancomycin-Producing Amycolatopsis orientalis C329.2 Is a VanA Homologue with d-Alanyl-d-Lactate Ligase Activity

下载免费PDF全文

C. Gary Marshall Gerard D. Wright 《Journal of bacteriology》1998,180(21):5792-5795

Vancomycin-resistant enterococci acquire high-level resistance to glycopeptide antibiotics through the synthesis of peptidoglycan terminating in d-alanyl-d-lactate. A key enzyme in this process is a d-alanyl-d-alanine ligase homologue, VanA or VanB, which preferentially catalyzes the synthesis of the depsipeptide d-alanyl-d-lactate. We report the overexpression, purification, and enzymatic characterization of DdlN, a VanA and VanB homologue encoded by a gene of the vancomycin-producing organism Amycolatopsis orientalis C329.2. Evaluation of kinetic parameters for the synthesis of peptides and depsipeptides revealed a close relationship between VanA and DdlN in that depsipeptide formation was kinetically preferred at physiologic pH; however, the DdlN enzyme demonstrated a narrower substrate specificity and commensurately increased affinity for d-lactate in the C-terminal position over VanA. The results of these functional experiments also reinforce the results of previous studies that demonstrated that glycopeptide resistance enzymes from glycopeptide-producing bacteria are potential sources of resistance enzymes in clinically relevant bacteria.The origin of antibiotic resistance determinants is of significant interest for several reasons, including the prediction of the emergence and spread of resistance patterns, the design of new antimicrobial agents, and the identification of potential reservoirs for resistance elements. Antibiotic resistance can occur either through spontaneous mutation in the target or by the acquisition of external genetic elements such as plasmids or transposons which carry resistance genes (7). The origins of these acquired genes are varied, but it has long been recognized that potential reservoirs are antibiotic-producing organisms which naturally harbor antibiotic resistance genes to protect themselves from the actions of toxic compounds ().High-level resistance to glycopeptide antibiotics such as vancomycin and teicoplanin in vancomycin-resistant enterococci (VRE) is conferred by the presence of three genes, vanH, vanA (or vanB), and vanX, which, along with auxiliary genes necessary for inducible gene expression, are found on transposons integrated into plasmids or the bacterial genome (, ). These three genes are essential to resistance and serve to change the C-terminal peptide portion of the peptidoglycan layer from d-alanyl-d-alanine (d-Ala-d-Ala) to d-alanyl-d-lactate (d-Ala-d-Lac). This change results in the loss of a critical hydrogen bond between vancomycin and the d-Ala-d-Ala terminus and in a 1,000-fold decrease in binding affinity between the antibiotic and the peptidoglycan layer, which is the basis for the bactericidal action of this class of compounds (). The vanH gene encodes a d-lactate dehydrogenase which provides the requisite d-Lac (, ), while the vanX gene encodes a highly specific dd-peptidase which cleaves only d-Ala-d-Ala produced endogenously while leaving d-Ala-d-Lac intact (, ). The final gene, vanA or vanB, encodes an ATP-dependent d-Ala-d-Lac ligase (, , ). This enzyme has sequence homology with the chromosomal d-Ala-d-Ala ligases, which are essential for peptidoglycan synthesis but which generally lack the ability to synthesize d-Ala-d-Lac ().We have recently cloned vanH, vanA, and vanX homologues from two glycopeptide antibiotic-synthesizing organisms: Amycolatopsis orientalis C329.2, which produces vancomycin, and Streptomyces toyocaensis NRRL 15009, which produces A47934 (). In addition, the vanH-vanA-vanX gene cluster was identified in several other glycopeptide producers. We have also demonstrated that the VanA homologue from S. toyocaensis NRRL 15009 can synthesize d-Ala-d-Lac in vitro and in the glycopeptide-sensitive host Streptomyces lividans (, ). We now report the expression of the A. orientalis C329.2 VanA homologue DdlN in Escherichia coli, its purification, and its enzymatic characterization. These data reinforce the striking similarity between vancomycin resistance elements in VRE and glycopeptide-producing organisms and support the possibility of a common origin for these enzymes.

Expression, purification, and specificity of DdlN.

DdlN was overexpressed in E. coli under the control of the bacteriophage T7 promoter. The construct gave good yields of highly purified enzyme following a four-step purification procedure (Table (Table1;1; Fig. Fig.1).1). Like other dd-ligases, DdlN behaved like a dimer in solution (not shown).

TABLE 1

Purification of DdlN from E. coli BL21 (DE3)/pETDdlN

Sample	Protein (mg)	Activity (nmol/min)	Sp act (nmol/ min/mg)	Recovery (%)	Purification (fold)
Lysate	124	843	6.82	100
Ammonium sulfate (20–50% saturation)	67.6	780	11.5	92	1.7
Sephacryl S200	11.6	825	71.4	98	11
Q Sepharose	2.8	742	265	88	39
Phenyl Superose	0.4	299	748	35	110

Open in a separate window Open in a separate window FIG. 1Purification of DdlN from E. coli BL21 (DE3)/pETDdlN. Proteins were separated on an SDS–11% polyacrylamide gel and stained with Coomassie blue. Lane 1, molecular mass markers (masses are noted at the left in kilodaltons); lane 2, whole-cell lysate; lane 3, ammonium sulfate fraction (20 to 50% saturation); lane 4, Sephacryl S200; lane 5, Q Sepharose; lane 6, phenyl Superose.The amino acid substrate specificity of DdlN was assessed by incubation of ¹⁴C-d-Ala with all 20 common amino acids in the d configuration. Purified DdlN catalyzed the synthesis of d-Ala-d-Ala in addition to that of several other mixed dipeptides, including d-Ala-d-Met and d-Ala-d-Phe (Fig. (Fig.2).2). Thus, DdlN exhibits a substrate specificity which is similar to that of VanA (), with the capacity to synthesize not only d-Ala-d-Ala but also mixed dipeptides with bulky side chains in the C-terminal position.Open in a separate window FIG. 2Substrate specificity of DdlN. Autoradiogram from thin-layer chromatography analysis of DdlN substrate specificity. All reaction mixtures contained 2.5 mM d-Ala and 1 mM ATP, and the radiolabel was ¹⁴C-d-Ala, except where noted. Lane 1, d-Ala; lane 2, d-Lac with ¹⁴C-d-Lac label; lane 3, d,l-methionine; lane 4, dl-phenylalanine; lane 5, d-Hbut; lane 6, d-hydroxyvalerate. Letters indicate the following: A, d-Ala-d-Lac; B, d-Lac; C, d-Ala-d-Met; D, d-Ala-d-Phe; E, d-Ala-d-Hbut; F, d-Ala-d-hydroxyvalerate.Importantly, DdlN is a depsipeptide synthase with the ability to synthesize d-Ala-d-Lac, d-Ala-d-hydroxybutyrate (Hbut), and d-Ala-d-hydroxyvalerate (Fig. (Fig.2).2). However, unlike VanA (), d-hydroxycaproate and d-phenyllactate are not substrates (not shown). Thus, DdlN is a broad-spectrum d-Ala-d-X ligase with depsipeptide synthase activity.

Characterization of d-Ala-d-X ligase activity.

Following the initial assessment of the specificity of the enzyme, several substrates were selected for quantitative analysis by evaluation of their steady-state kinetic parameters (Table (Table2).2). DdlN has two amino acid (or hydroxy acid) K_m values. Steady-state kinetic plots indicated that, like other dd-ligases, the N-terminal K_m (K_m1) was significantly lower (higher specificity) than the C-terminal K_m (K_m2). Since the former value is expected to be independent of the C-terminal substrate, only K_m2 values were determined and are reported here.

TABLE 2

Characterization of steady-state parameters of DdlN and VanA

Ligase	Substrate	K_m2 (mM)	k_cat (min⁻¹)	k_cat/K_m2 (M⁻¹ s⁻¹)
DdlN	d-Ala	21 ± 2	229 ± 7	1.8 × 10²
	d-Lac	0.4 ± 0.05	55 ± 1	2.3 × 10³
	d-Hbut	2.5 ± 0.3	32 ± 2	2.1 × 10²
	ATP^a	1.2 ± 0.2	71 ± 5	0.98 × 10²
DdlM^b	d-Ala	166 ± 27
	d-Lac	1.08 ± 0.10
VanA^c	d-Ala	38	295	1.3 × 10²
	d-Lac	7.1	94	2.2 × 10²
	d-Hbut	0.60	108	3.0 × 10³

Open in a separate window^a Determined in the presence of 10 mM d-Lac. ^b Data from reference . ^c Data from reference . DdlN showed good d-Ala-d-Ala ligase activity but with a very high and physiologically questionable K_m2 (21 mM). On the other hand, d-Ala-d-Lac synthesis was excellent, with a 4-fold decrease in k_cat, compared to d-Ala-d-Ala synthesis, which was offset by a 52-fold drop in K_m that resulted in a >12-fold increase in specificity (k_cat/K_m2). d-Hbut was also a good substrate, with a k_cat/K_m2 comparable to that of d-Ala.Steady-state kinetic parameters for d-Ala-d-X formation showed trends similar to those found with both VanA and DdlN. For example, the k_cat values between VanA and DdlN were virtually the same for most substrates. There were significant differences, however. For instance, while the K_m2 values for d-Ala were very high for all three enzymes, DdlN does have greater affinity for d-Ala, with a 1.8- and 7.9-fold lower K_m2 than those of VanA and DdlM, respectively. Additionally, the K_m2 for d-Lac was 17.8- and 2.7-fold lower than those for VanA and DdlM. Thus, DdlN has a more restrictive specificity for the C-terminal residue than VanA, which is compensated for by a higher affinity for the critical substrate d-Lac.

pH dependence of peptide versus that of depsipeptide synthesis activity.

The partitioning of the syntheses of d-Ala-d-Ala and d-Ala-d-Hbut in VanA and other depsipeptide-competent dd-ligases has been shown to be pH dependent (). Determination of the pH dependence of DdlN in synthesizing peptide versus depsipeptide (Fig. (Fig.3)3) directly paralleled the results obtained with VanA in similar experiments. At lower pHs (<7), d-Ala-d-Hbut synthesis predominates and is exclusive at a pH of <6 (Fig. (Fig.3).3). At pH 7.5, levels of synthesis of d-Ala-d-Hbut and d-Ala-d-Ala are relatively equal, while at a pH greater than 8, the capacity to synthesize peptide overtakes the capacity to synthesize depsipeptide, although the latter is never abolished. Open in a separate window FIG. 3pH dependence of partitioning of the syntheses of peptide and depsipeptide by DdlN. (A) Autoradiogram of a thin-layer chromatography separation of the products of reaction mixtures containing ¹⁴C-D-Ala, unlabeled D-Ala, and d-Hbut. (B) Quantification of reaction products following phosphorimage analysis. Filled circles, D-Ala-d-Hbut; open circles, D-Ala-D-Ala.The partitioning of the formation of peptide versus depsipeptide as a function of pH by DdlM is comparable to that by VanA and depsipeptide-competent mutants of DdlB (), which show essentially exclusively depsipeptide formation at lower pHs and increasing peptide formation as the pH increases. This implies a potential role for the protonated ammonium group of d-Ala2 in second-substrate recognition and suggests a mechanism for the discrimination between d-Ala and d-Lac at physiologic pH. The structural basis for this distinction remains obscure for DdlB and VanA or DdlN.

Concluding remarks.

Resistance to vancomycin and other glycopeptides is mediated through the synthesis of a peptidoglycan which does not terminate with the canonical d-Ala-d-Ala dipeptide. Thus, enterococci which exhibit the VanC phenotype, which consists of low-level, noninducible resistance to vancomycin only, have peptidoglycan terminating in d-Ala-d-Ser (). On the other hand, bacteria which are constitutively resistant to high concentrations of glycopeptides, such as lactic acid bacteria and VRE exhibiting the VanA or VanB phenotype (high-level inducible resistance to vancomycin), incorporate the depsipeptide d-Ala-d-Lac into their cell walls (, , ). The enzymes responsible for the intracellular synthesis of d-Ala-d-Lac not surprisingly have significant amino acid sequence similarity with d-Ala-d-Ala ligases, which are responsible for d-Ala-d-Ala synthesis in all bacteria with a cell wall ().The d-Ala-d-Lac synthases can be subdivided into two groups based on sequence homology: those found in the constitutively resistant lactic acid bacteria and those found in glycopeptide-producing organisms and VanA or VanB VRE (, ). The former have more similarity with exclusive d-Ala-d-Ala ligases. Indeed, single point mutations in d-Ala-d-Ala ligases which yield sequences more similar to those of lactic acid bacterium d-Ala-d-Lac ligases are sufficient to induce significant depsipeptide synthase activity in these enzymes (). Similarly, mutational studies of the d-Ala-d-Lac ligase from Leuconostoc mesenteroides have demonstrated that the converse also holds (). On the other hand, the molecular basis for depsipeptide synthesis by the VanA or VanB ligases is unknown, in large part due to the lack of protein structural information on which to base mutational studies, unlike the situation with d-Ala-d-Ala ligases, where the E. coli DdlB structure serves as a template for mechanistic research ().Significantly, a major difference in the VanA or VanB ligases and other dd-ligases lies in the amino acid sequence of the ω-loop region, which closes off the active site of DdlB () and has been shown to contribute amino acid residues with the capacity to control the syntheses of d-Ala-d-Ala and d-Ala-d-Lac, notably, Tyr216 (, ). Until recently, the VanA and VanB ligases were exceptional in amino acid structure and had no known homologues. The sequencing of resistance genes from glycopeptide-producing bacteria has uncovered enzymes with >60% homology to VanA or VanB and which are virtually superimposable in the critical ω-loop region (, ). One of these, DdlM from S. toyocaensis NRRL 15009, has been shown to have d-Ala-d-Lac ligase ability (, ), although no rigorous analysis of this activity has been performed. The results presented here demonstrate that DdlN from the vancomycin producer A. orientalis C329.2 not only is a d-Ala-d-Lac ligase but also has significant functional homology with VanA. It is not known at present if, like S. toyocaensis NRRL 15009 (), A. orientalis C329.2 also possess a d-Ala-d-Ala-exclusive ligase, though the presence of a vanX gene () suggests that it may.These studies demonstrate that DdlN cloned from a vancomycin-producing bacterium is a d-Ala-d-Lac ligase which has not only amino acid sequence homology with the dd-ligases from VRE but also functional homology. Thus, VanA, VanB, DdlN, and DdlM have likely evolved from similar origins. The fact that a vanH-vanA-vanX gene cluster can be found in other glycopeptide producers as well () suggests that the genes now found in VRE may have originated in glycopeptide-producing bacteria. Our finding that overexpressed, purified, DdlN shows many enzymatic characteristics similar (though not identical) to those of VanA suggests that the genes from glycopeptide-producing bacteria can be important in elucidating biochemical and protein structural aspects of the VRE proteins. 相似文献

Interaction Between Eye Pigment Genes and Tau-Induced Neurodegeneration in Drosophila melanogaster

Surendra S. Ambegaokar George R. Jackson 《Genetics》2010,186(1):435-442

相似文献

Improved Production of Homo-d-Lactic Acid via Xylose Fermentation by Introduction of Xylose Assimilation Genes and Redirection of the Phosphoketolase Pathway to the Pentose Phosphate Pathway in l-Lactate Dehydrogenase Gene-Deficient Lactobacillus plantarum

Kenji Okano Shogo Yoshida Ryosuke Yamada Tsutomu Tanaka Chiaki Ogino Hideki Fukuda Akihiko Kondo 《Applied and environmental microbiology》2009,75(24):7858-7861

相似文献

The Bacillus subtilis AraE Protein Displays a Broad Substrate Specificity for Several Different Sugars

下载免费PDF全文

Oliver Krispin Rudolf Allmansberger 《Journal of bacteriology》1998,180(12):3250-3252

相似文献

On the Classification of Epistatic Interactions

Hong Gao Julie M. Granka Marcus W. Feldman 《Genetics》2010,184(3):827-837

Modern genomewide association studies are characterized by the problem of “missing heritability.” Epistasis, or genetic interaction, has been suggested as a possible explanation for the relatively small contribution of single significant associations to the fraction of variance explained. Of particular concern to investigators of genetic interactions is how to best represent and define epistasis. Previous studies have found that the use of different quantitative definitions for genetic interaction can lead to different conclusions when constructing genetic interaction networks and when addressing evolutionary questions. We suggest that instead, multiple representations of epistasis, or epistatic “subtypes,” may be valid within a given system. Selecting among these epistatic subtypes may provide additional insight into the biological and functional relationships among pairs of genes. In this study, we propose maximum-likelihood and model selection methods in a hypothesis-testing framework to choose epistatic subtypes that best represent functional relationships for pairs of genes on the basis of fitness data from both single and double mutants in haploid systems. We gauge the performance of our method with extensive simulations under various interaction scenarios. Our approach performs reasonably well in detecting the most likely epistatic subtype for pairs of genes, as well as in reducing bias when estimating the epistatic parameter (ɛ). We apply our approach to two available data sets from yeast (Saccharomyces cerevisiae) and demonstrate through overlap of our identified epistatic pairs with experimentally verified interactions and functional links that our results are likely of biological significance in understanding interaction mechanisms. We anticipate that our method will improve detection of epistatic interactions and will help to unravel the mysteries of complex biological systems.UNDERSTANDING the nature of genetic interactions is crucial to obtaining a more complete picture of complex biological systems and their evolution. The discovery of genetic interactions has been the goal of many researchers studying a number of model systems, including but not limited to Saccharomyces cerevisiae, Caenorhabditis elegans, and Escherichia coli (; ; ; ; ; ; ; ; ; ; ; ). Recently, high-throughput experimental approaches, such as epistatic mini-array profiles (E-MAPs) and genetic interaction analysis technology for E. coli (GIANT-coli), have enabled the study of epistasis on a large scale (, ; , ; ). However, it remains unclear whether the computational and statistical methods currently in use to identify these interactions are indeed the most appropriate.The study of genetic interaction, or “epistasis,” has had a long and somewhat convoluted history. Bateson (1909) first used the term epistasis to describe the ability of a gene at one locus to “mask” the mutational influence of a gene at another locus (). The term “epistacy” was later coined by Fisher (1918) to denote the statistical deviation of multilocus genotype values from an additive linear model for the value of a phenotype (, ).These origins are the basis for the two main current interpretations of epistasis. The first, as introduced by Bateson (1909), is the “biological,” “physiological,” or “compositional” form of epistasis, concerned with the influence of an individual''s genetic background on an allele''s effect on phenotype (; , ; ; ). The second interpretation, attributed to Fisher, is “statistical” epistasis, which in its linear regression framework places the phenomenon of epistasis in the context of a population (; Wade et al. 2001; ; ; ). Each of these approaches is equally valid in studying genetic interactions; however, confusion still exists about how to best reconcile the methods and results of the two (, ; ; ; ; ).Aside from the distinction between the statistical and the physiological definitions of epistasis, inconsistencies exist when studying solely physiological epistasis. For categorical traits, physiological epistasis is clear as a “masking” effect. When noncategorical or numerical traits are measured, epistasis is defined as the deviation of the phenotype of the multiple mutant from that expected under independence of the underlying genes.The “expectation” of the phenotype under independence, that is, in the absence of epistasis, is not defined consistently between studies. For clarity, consider epistasis between pairs of genes and, without loss of generality, consider fitness as the phenotype. The first commonly used definition of independence, originating from additivity, defines the effect of two independent mutations to be equal to the sum of the individual mutational effects. A second, motivated by the use of fitness as a phenotype, defines the effect of the two mutations as the product of the individual effects (; ; ). A third definition of independence has been referred to as “minimum,” where alleles at two loci are independent if the double mutant has the same fitness as the less-fit single mutant. claim that this has been used when identifying pairwise epistasis by searching for synthetic lethal double mutants (, ; , ; ). A fourth is the “Log” definition presented by and . The less-frequently used “scaled ɛ” () measure of epistasis takes the multiplicative definition of independence with a scaling factor.These different definitions of independence are partly due to distinct measurement “scales.” For some traits, a multiplicative definition of independence may be necessary to identify epistasis between two genes, whereas for other traits, additivity may be appropriate (Falconer and Mackay 1995; Wade et al. 2001; ; ). An interaction found under one independence definition may not necessarily be found under another, leading to different biological conclusions (). suggest that there may be an “ideal” definition of independence for all gene pairs for identifying functional relationships. However, it is plausible that different representations of independence for two genes may reflect different biological properties of the relationship (; ). “Two categories of general interest [the additive and multiplicative definitions, respectively] are those in which etiologic factors act interchangeably in the same step in a multistep process, or alternatively act at different steps in the process” (, p. 468). In some cases, the discovery of epistasis may merely be an artifact of using an incorrect null model (). It may be necessary to represent “independence” differently, resulting in different statistical measures of interactions, for different pairs of genes depending on their functions.Previous studies have suggested that different pairs of loci may have different modes of interaction and have attempted to subclassify genetic interactions into regulatory hierarchies and mutually exclusive “interaction subtypes” to elucidate underlying biological properties (; ; ). We suggest that epistatic relationships can be divided into several subtypes, or forms, corresponding to the aforementioned definitions of independence. As a particular gene pair may deviate from independence according to several criteria, we do not claim that these subtypes are necessarily mutually exclusive. We attempt to select the most likely epistatic subtype that is the best statistical representation of the relationship between two genes. To further subclassify interactions, epistasis among deleterious mutations can take one of two commonly used forms: positive (equivalently alleviating, antagonistic, or buffering) epistasis, where the phenotype of the double mutant is less severe than expected under independence, and negative (equivalently aggravating, synergistic, or synthetic), where the phenotype is more severe than expected (; ; ; ).Another objective of such distinctions is to reduce the bias of the estimator of the epistatic parameter (ɛ), which measures the extent and direction of epistasis for a given gene pair. , assuming that the overall distribution of ɛ should be centered around 0, find that inaccurately choosing a definition of independence can result in increased bias when estimating ɛ. For example, using the minimum definition results in the most severe bias when single mutants have moderate fitness effects, and the additive definition results in the largest positive bias when at least one gene has an extreme fitness defect (). Therefore, it is important to select an optimal estimator for ɛ for each pair of genes from among the subtypes of epistatic interactions.Epistasis may be important to consider in genomic association studies, as a gene with a weak main effect may be identified only through its interaction with another gene or other genes (; ; ; ; ). Epistasis has also been studied extensively in the context of the evolution of sex and recombination. The mutational deterministic hypothesis proposes that the evolution of sex and recombination would be favored by negative epistatic interactions (; ); many other studies have also studied the importance of the form of epistasis (; ; ; ; ; ). Indeed, according to , p. 3466), “the choice of definition [of epistasis] alters conclusions relevant to the adaptive value of sex and recombination.”Given fitness data from single and double mutants in haploid organisms, we implement a likelihood method to determine the subtype that is the best statistical representation of the epistatic interaction for pairs of genes. We use maximum-likelihood estimation and the Bayesian information criteria (BIC) (Schwarz 1978) with a likelihood-ratio test to select the most appropriate null or epistatic model for each putative interaction. We conduct extensive simulations to gauge the performance of our method and demonstrate that it performs reasonably well under various interaction scenarios. We apply our method to two data sets with fitness measurements obtained from yeast (; ), whose authors assume only multiplicative epistasis for all interactions. By examining functional links and experimentally validated interactions among epistatic pairs, we demonstrate that our results are biologically meaningful. Studying a random selection of genes, we find that minimum epistasis is more prevalent than both additive and multiplicative epistasis and that the overall distribution of ɛ is not significantly different from zero (as suggest). For genes in a particular pathway, we advise selecting among fewer epistatic subtypes. We believe that our method of epistatic subtype classification will aid in understanding genetic interactions and their properties.

St. Onge et al. (2007) data set:

examined 26 nonessential genes known to confer resistance to MMS, constructed double-deletion strains for 323 double-mutant strains (all but two of the total possible pairs), and assumed the multiplicative form of epistasis for all interactions (see Methods: Analysis of experimental data). Following these authors, we focus on single- and double-mutant fitnesses measured in the presence of MMS. (For results in the absence of MMS, see File S1 and File S1_2.)Using the resampling method described in Analysis of experimental data and File S1, 222 gene pairs pass the cutoff of having epistasis inferred in at least 900 of 1000 replicates. This does not include 5 synthetic lethal gene pairs. Hypothesis testing and a multiple-testing procedure (for 222 simultaneous hypotheses) are necessary to determine the final epistatic pairs.To select one among the three multiple-testing procedures, we follow and examine gene pairs that share specific functional links (see Analysis of experimental data). The Bonferroni method is likely too conservative, yielding only 25 significantly epistatic pairs with only one functional link among them; alternatively, the pFDR procedure appears to be too lenient in rejecting independence for all 222 pairs. Therefore, we use the FDR procedure (although the number of functional links is not significant) and detect 193 epistatic pairs, of which 5 (2.6%) are synthetic lethals, 19 (9.8%) have additive epistasis, 33 (17.1%) have multiplicative epistasis, and 136 (70.5%) have minimum epistasis (File S1_1). We find 29 gene pairs with positive (alleviating) epistasis and 159 gene pairs with negative (aggravating) epistasis.

TABLE 2

Summary of gene pairs with the indicated epistatic subtypes, inferred using the FDR procedure with the BIC method that considers all three epistatic subtypes and their corresponding null models

Epistatic subtype	Study S	Study J
All	193 (100%)	352 (100%)
	= −0.060	= −0.001
	= −0.096	= −0.059
Additive	19 (9.8%)	35 (9.9%)
	= 0.115*	= 0.193***
	= 0.131	= 0.188
Multiplicative	33 (17.1%)	63 (17.9%)
	= 0.048	= 0.017
	= −0.166	= −0.115
Minimum	136 (70.5%)	254 (72.2%)
	= −0.111***	= −0.032**
	= −0.091	= −0.065

Open in a separate windowNumbers are the counts of each type, and percentages are given of the total number of epistatic pairs. The mean () and median () of the epistatic parameter (ɛ) are given for each subtype, with “*” indicating that the mean of ɛ is significantly different from 0 (*, P-value ≤0.05; **, P-value ≤0.01; ***, P-value ≤0.001). Study S refers to the data set, and study J refers to the data set. (For study S, five of the epistatic pairs are synthetic lethals and are not shown; as a result, percentages do not sum to 100%.)To further validate the use of our method and the FDR procedure, we assess by Fisher''s exact test the significance of an enrichment of both Biological Process and all GO Slim term links among epistatic pairs, neither of which are significant (; www.yeastgenome.org; Stark et al. 2006); Table S4]. Although some of the previously unidentified interactions that we identify could be false positives, many are likely to be new discoveries.

TABLE 3

Comparison of validation measures for each data set for different variations of the FDR and BIC procedures, considering only a subset of epistatic subtypes with their corresponding null models: all epistatic subtypes (A, P, and M); only the additive and multiplicative subtypes (A and P); and only the additive (A), only the multiplicative (P), or only the minimum (M) subtype (see text for details)

	Subtypes considered in BIC procedure
	A, P, M	A, P	A	P	M
Study J
No. found (636)	352	273	263	231	329
Functional links (25)	19 (0.0255)*	13 (0.2320)	11 (0.4689)	10 (0.4227)	15 (0.2619)
GO Slim terms (Biological Process) (115)	69 (0.1573)	50 (0.4874)	55 (0.0736)	44 (0.3534)	68 (0.04902)*
GO Slim terms (all) (369)	224 (0.0009)*	172 (0.01654)*	160 (0.1297)	146 (0.0273)*	213 (0.0003)*
Experimentally identified (3)	3	2	1	2	3
Study S
No. found (323)	193	192	247	171	243
Functional links (36)	21 (0.6450)	29 (0.0041)*	34 (0.0031)*	29 (0.0003)*	24 (0.9256)
GO Slim terms (Biological Process) (283)	174 (0.0657)	174 (0.03656)*	223 (0.0010)*	153 (0.1825)	213 (0.5534)
GO Slim terms (all) (307)	185 (0.2866)	182 (0.6926)	237 (0.1472)	162 (0.6997)	231 (0.5908)
Experimentally identified (29)	17	22	24	23	21

Open in a separate windowNumbers in parentheses indicate P-values by Fisher''s exact test. “*” indicates significance. Study J refers to the data set, and study S refers to the data set measured in the presence of MMS. Numbers in parentheses indicate the total number of tested pairs and the total number of each type of link found in each complete data set.The epistatic subtypes we consider are not necessarily mutually exclusive. To more fully assess the assumptions of our method, we also consider several of the possible subsets of the epistatic subtypes (and their corresponding null models) in our procedure. As the minimum epistatic subtype was the most frequently selected in this data set, we first do not include the minimum null model or the minimum epistatic model in our procedure (i.e., we select from among four rather than six models for a pair; Table S4). However, there are a significant number of epistatic pairs with functional links only when the minimum epistatic subtype is not included (also see Table S4 and Table S5). It is not immediately clear which epistatic subtypes are the most appropriate for these data, although including the minimum subtype may not be appropriate () (see discussion).Although it may be best to consider fewer epistatic subtypes for this specific data set, we report our results including all three epistatic subtypes and their corresponding null models (St. Onge et al. (2007), although we identify 105 epistatic pairs not identified by the original authors (Figure S4, Table S4). find that epistatic pairs with a functional link have a positively shifted distribution of epistasis. We find no such shift in epistasis values (Figure S5). We also demonstrate [described in application to simulated data: Bias and variance of the epistatic parameter (ɛ)] that our method seems to reduce bias of the epistatic parameter (ɛ) (Table S3).] When considering only a subset of the epistatic subtypes, however, we find to be positive and significantly different from zero (results not shown). See File S1, Figure S6, and Figure S7 for additional discussion of the epistatic pairs we identify.

Jasnos and Korona (2007) data set:

The data set included 758 yeast gene deletions known to cause growth defects and reports fitnesses of only a sparse subset of all possible gene pairs [≈0.2% of the possible pairwise genotypes, or 639 pairs of ]. Because the authors do not identify epistatic pairs in a hypothesis-testing framework, we cannot explicitly compare our conclusions with theirs.To validate our method, we examine gene pairs that have specific functional links (see methods: Analysis of experimental data). When defining a functional link using GO terms () with <30 genes associated with them, only 1 of 639 tested gene pairs has a functional link. Raising the threshold of associated genes to 50 and 100, the number of tested pairs with functional links rises only to 3 and 9, respectively. Because of the large number of random genes and the sparse number of gene pairs in this data set, we follow and select GO terms that have associated with them ≤200 genes. Twenty-five of 639 tested pairs then have a functional link.Only the FDR multiple-testing procedure results in a significant enrichment of functional links among epistatic pairs (File S1). With the FDR procedure we find 352 significant epistatic pairs, of which 35 (9.9%) have additive epistasis, 63 (17.9%) have multiplicative epistasis, and 254 (72.2%) have minimum epistasis (File S1_3). These proportions of inferred subtypes suggest that the authors'' original restriction to multiplicative epistasis may be inappropriate. We find 141 gene pairs with positive epistasis and 211 gene pairs with negative epistasis.We do not find a significant number of epistatic pairs with shared GO Slim Biological Process terms (see Analysis of experimental data), but do when considering all shared GO Slim terms (St. Onge et al. (2007) data set, we also consider some of the possible subsets of the three epistatic subtypes (and their corresponding null models) in our model selection procedure (Table S5). In contrast to the data set, using all three epistatic subtypes results in a significant number of epistatic pairs with functional links; this measure is not significant when using any of the other subsets of the subtypes. This suggests that our proposed method with three epistatic subtypes may indeed be the most appropriate for data sets with randomly selected genes.We examined the distribution of the estimated values of the epistatic parameter (ɛ) for all pairs with significant epistasis. , in assuming only multiplicative epistasis, conclude that epistasis is predominantly positive. However, we find that the estimated mean of epistasis is not significantly different from zero (two-sided t-test, P-value = 0.9578; Figure 1 and File S1.Open in a separate window Figure 1.—Distribution of the epistasis values (ɛ) for significant epistatic pairs in the data set, determined using the FDR procedure and the BIC method including all three epistatic subtypes and their corresponding null models. Mean of ɛ is −0.0009, with a standard deviation of 0.3177; median value is −0.0587. A similar plot is shown in Figure 3 of . 相似文献

Peptidoglycan Fine Structure of the Radiotolerant Bacterium Deinococcus radiodurans Sark

José Carlos Quintela Francisco García-del Portillo Ernst Pittenauer Günter Allmaier Miguel A. de Pedro 《Journal of bacteriology》1999,181(1):334-337

Peptidoglycan from Deinococcus radiodurans was analyzed by high-performance liquid chromatography and mass spectrometry. The monomeric subunit was: N-acetylglucosamine–N-acetylmuramic acid–l-Ala–d-Glu-(γ)–l-Orn-[(δ)Gly-Gly]–d-Ala–d-Ala. Cross-linkage was mediated by (Gly)₂ bridges, and glycan strands were terminated in (1→6)anhydro-muramic acid residues. Structural relations with the phylogenetically close Thermus thermophilus are discussed.The gram-positive bacterium Deinococcus radiodurans is remarkable because of its extreme resistance to ionizing radiation (14). Phylogenetically the closest relatives of Deinococcus are the extreme thermophiles of the genus Thermus (4, 11). In 16S rRNA phylogenetic trees, the genera Thermus and Deinococcus group together as one of the older branches in bacterial evolution (11). Both microorganisms have complex cell envelopes with outer membranes, S-layers, and ornithine-Gly-containing mureins (, , , , , ). However, Deinococcus and Thermus differ in their response to the Gram reaction, having positive and negative reactions, respectively (4, 14). The murein structure for Thermus thermophilus HB8 has been recently elucidated (). Here we report the murein structure of Deinococcus radiodurans with similar detail.D. radiodurans Sark () was used in the present study. Cultures were grown in Luria-Bertani medium () at 30°C with aeration. Murein was purified and subjected to amino acid and high-performance liquid chromatography (HPLC) analyses as previously described (, , , ). For further analysis muropeptides were purified, lyophilized, and desalted as reported elsewhere (, ). Purified muropeptides were subjected to plasma desorption linear time-of-flight mass spectrometry (PDMS) as described previously (1, , , ). Positive and negative ion mass spectra were obtained on a short linear ²⁵²californium time-of-flight instrument (BioIon AB, Uppsala, Sweden). The acceleration voltage was between 17 and 19 kV, and spectra were accumulated for 1 to 10 million fission events. Calibration of the mass spectra was done in the positive ion mode with H⁺ and Na⁺ ions and in the negative ion mode with H⁻ and CN⁻ ions. Calculated m/z values are based on average masses.Amino acid analysis of muramidase (Cellosyl; Hoechst, Frankfurt am Main, Germany)-digested sacculi (50 μg) revealed Glu, Orn, Ala, and Gly as the only amino acids in the muramidase-solubilized material. Less than 3% of the total Orn remained in the muramidase-insoluble fraction, indicating an essentially complete solubilization of murein.Muramidase-digested murein samples (200 μg) were analyzed by HPLC as described in reference . The muropeptide pattern (Fig. (Fig.1)1) was relatively simple, with five dominating components (DR5 and DR10 to DR13 [Fig. 1]). The muropeptides resolved by HPLC were collected, desalted, and subjected to PDMS. The results are presented in Table Table11 compared with the m/z values calculated for best-matching muropeptides made up of N-acetylglucosamine (GlucNAc), N-acetylmuramic acid (MurNAc), and the amino acids detected in the murein. The more likely structures are shown in Fig. Fig.1.1. According to the m/z values, muropeptides DR1 to DR7 and DR9 were monomers; DR8, DR10, and DR11 were dimers; and DR12 and DR13 were trimers. The best-fitting structures for DR3 to DR8, DR11, and DR13 coincided with muropeptides previously characterized in T. thermophilus HB8 () and had identical retention times in comparative HPLC runs. The minor muropeptide DR7 (Fig. (Fig.1)1) was the only one detected with a d-Ala–d-Ala dipeptide and most likely represents the basic monomeric subunit. The composition of the major cross-linked species DR11 and DR13 confirmed that cross-linking is mediated by (Gly)₂ bridges, as proposed previously (). Open in a separate window FIG. 1HPLC muropeptide elution patterns of murein purified from D. radiodurans. Muramidase-digested murein samples were subjected to HPLC analysis, and the A₂₀₄ of the eluate was recorded. The most likely structures for each muroeptide as deduced by PDMS are shown. The position of residues in brackets is the most likely one as deduced from the structures of other muropeptides but could not be formally demonstrated. R = GlucNac–MurNac–l-Ala–d-Glu-(γ)→.

TABLE 1

Calculated and measured m/z values for the molecular ions of the major muropeptides from D. radiodurans

Muropeptide^a	Ion	m/z		Δm^b	Error (%)^c	Muropeptide composition						Muropeptide abundance (mol%)
Muropeptide^a	Ion	Calculated	Measured	Δm^b	Error (%)^c	NAG^d	NAM^e	Glu	Orn	Ala	Gly	Muropeptide abundance (mol%)
DR1	[M+H]⁺	699.69	700.1	0.41	0.06	1	1	1	0	1	0	12.0
DR2	[M+H]⁺	927.94	928.3	0.36	0.04	1	1	1	1	1	2	5.7
DR3	[M+Na]⁺	1,006.97	1,007.5	0.53	0.05	1	1	1	1	1	3	3.0
DR4	[M+Na]⁺	963.95	964.6	0.65	0.07	1	1	1	1	2	1	2.5
DR5	[M+H]⁺	999.02	999.8	0.78	0.08	1	1	1	1	2	2	27.7
	[M−H]⁻	997.00	997.3	0.30	0.03
DR6	[M+Na]⁺	1,078.5	1,078.8	0.75	0.07	1	1	1	1	2	3	2.4
DR7	[M+H]⁺	1,070.09	1,071.0	0.90	0.08	1	1	1	1	3	2	2.2
DR8	[M+Na]⁺	1,520.53	1,521.6	1.08	0.07	1	1	2	2	4	4	2.2
DR9	[M+Na]⁺	701.64	702.1	0.46	0.03	1	1^f	1	0	1	0	5.0
DR10	[M+H]⁺	1,907.94	1,907.8	0.14	0.01	2	2	2	2	3	4	10.1
	[M−H]⁻	1,905.92	1,906.6	0.68	0.04
DR11	[M+H]⁺	1,979.01	1,979.1	0.09	0.01	2	2	2	2	4	4	19.1
	[M−H]⁻	1,977.00	1,977.3	0.30	0.02
DR12	[M+H]⁺	2,887.93	2,886.5	−1.43	−0.05	3	3	3	3	5	6	4.4
	[M−H]⁻	2,885.91	2,885.8	−0.11	−0.01
DR13	[M+H]⁺	2,959.00	2,957.8	−1.20	−0.04	3	3	3	3	6	6	3.6
	[M−H]⁻	2,956.99	2,955.9	−1.09	−0.04

Open in a separate window^aDR5 and DR10 to DR13 were analyzed in both the positive and negative ion modes. Muropeptides DR1 to DR4 and DR6 to DR9 were analyzed in the positive mode only due to the small amounts of sample available. ^bMass difference between measured and calculated quasimolecular ion values. ^c[(Measured mass−calculated mass)/calculated mass] × 100. ^dN-Acetylglucosamine. ^eN-Acetylmuramitol. ^f(1→6)Anhydro-N-acetylmuramic acid. Structural assignments of muropeptides DR1, DR2, DR8 to DR10, and DR12 deserve special comments. The low m/z value measured for DR1 (700.1) fitted very well with the value calculated for GlucNAc–MurNAc–l-Ala–d-Glu (699.69). Even smaller was the mass deduced for DR9 from the m/z value of the molecular ion of the sodium adduct (702.1) (Fig. (Fig.2).2). The mass difference between DR1 and DR9 (19.9 mass units) was very close indeed to the calculated difference between N-acetylmuramitol and the (1→6)anhydro form of MurNAc (20.04 mass units). Therefore, DR9 was identified as GlucNAc–(1→6)anhydro-MurNAc–l-Ala–d-Glu (Fig. (Fig.1).1). Muropeptides with (1→6)anhydro muramic acid have been identified in mureins from diverse origins (, , , ), indicating that it might be a common feature among peptidoglycan-containing microorganisms. Open in a separate window FIG. 2Positive-ion linear PDMS of muropeptide DR9. Muropeptide DR9 was purified, desalted by HPLC, and subjected to PDMS to determine the molecular mass. The masses for the dominant molecular ions are indicated.The measured m/z value for the [M+Na]⁺ ion of DR8 was 1,521.6, very close to the mass calculated for a cross-linked dimer without one disaccharide moiety (1,520.53) (Fig. (Fig.1;1; Table Table1).1). Such muropeptides, also identified in T. thermophilus HB8 and other bacteria (, ), are most likely generated by the enzymatic clevage of MurNAc–l-Ala amide bonds in murein by an N-acetylmuramyl–l-alanine amidase (21). In particular, DR8 could derive from DR11. The difference between measured m/z values for DR8 and DR11 was 478.7, which fits with the mass contribution of a disaccharide moiety (480.5) within the mass accuracy of the instrument.The m/z values for muropeptides DR2, DR10, and DR12 supported the argument for structures in which the two d-Ala residues from the d-Ala–d-Ala C-terminal dipeptide were lost, leaving Orn as the C-terminal amino acid.The position of one Gly residue in muropeptides DR2, DR8, and DR10 to DR13 could not be formally demonstrated. One of the Gly residues could be at either the N- or the C-terminal positions. However, the N-terminal position seems more likely. The structure of the basic muropeptide (DR7), with a (Gly)₂ acylating the δ-NH₂ group of Orn, suggests that major muropeptides should present a (Gly)₂ dipeptide. The scarcity of DR3 and DR6, which unambiguously have Gly as the C-terminal amino acid (Fig. (Fig.1),1), supports our assumption.Molar proportions for each muropeptide were calculated as proposed by Glauner et al. () and are shown in Table Table1.1. For calculations the structures of DR10 to DR13 were assumed to be those shown in Fig. Fig.1.1. The degree of cross-linkage calculated was 47.2%. Trimeric muropeptides were rather abundant (8 mol%) and made a substantial contribution to total cross-linkage. However, higher-order oligomers were not detected, in contrast with other gram-positive bacteria, such as Staphylococcus aureus, which is rich in such oligomers (8). The proportion of muropeptides with (1→6)anhydro-muramic acid (5 mol%) corresponded to a mean glycan strand length of 20 disaccharide units, which is in the range of values published for other bacteria (, ).The results of our study indicate that mureins from D. radiodurans and T. thermophilus HB8 () are certainly related in their basic structures but have distinct muropeptide compositions. In accordance with the phylogenetic proximity of Thermus and Deinococcus (11), both mureins are built up from the same basic monomeric subunit (DR7 in Fig. Fig.1),1), are cross-linked by (Gly)₂ bridges, and have (1→6)anhydro-muramic acid at the termini of glycan strands. Most interestingly, Deinococcus and Thermus are the only microorganisms identified at present with the murein chemotype A3β as defined by Schleifer and Kandler (). Nevertheless, the differences in muropeptide composition were substantial. Murein from D. radiodurans was poor in d-Ala–d-Ala- and d-Ala–Gly-terminated muropeptides (2.2 and 2.4 mol%, respectively) but abundant in Orn-terminated muropeptides (23.8 mol%) and in muropeptides with a peptide chain reduced to the dipeptide l-Ala–d-Glu (18 mol%). In contrast, neither Orn- nor Glu-terminated muropeptides have been detected in T. thermophilus HB8 murein, which is highly enriched in muropeptides with d-Ala–d-Ala and d-Ala–Gly (). Furthermore, no traces of phenyl acetate-containing muropeptides, a landmark for T. thermophilus HB8 murein (), were found in D. radiodurans. Cross-linkage was definitely higher in D. radiodurans than in T. thermophilus HB8 (47.4 and 27%, respectively), largely due to the higher proportion of trimers in the former.The similarity in murein basic structure suggests that the difference between D. radiodurans and T. thermophilus HB8 with respect to the Gram reaction may simply be a consequence of the difference in the thickness of cell walls (, 3, ). Interestingly, D. radiodurans murein turned out to be relatively simple for a gram-positive organism, possibly reflecting the primitive nature of this genus as deduced from phylogenetic trees (11). Our results illustrate the phylogenetic proximity between Deinococcus and Thermus at the cell wall level but also point out the structural divergences originated by the evolutionary history of each genus. 相似文献

Molecular and Biochemical Characterization of the Protein Template Controlling Biosynthesis of the Lipopeptide Lichenysin

Dirk Konz Sascha Doekel Mohamed A. Marahiel 《Journal of bacteriology》1999,181(1):133-140

Lichenysins are surface-active lipopeptides with antibiotic properties produced nonribosomally by several strains of Bacillus licheniformis. Here, we report the cloning and sequencing of an entire 26.6-kb lichenysin biosynthesis operon from B. licheniformis ATCC 10716. Three large open reading frames coding for peptide synthetases, designated licA, licB (three modules each), and licC (one module), could be detected, followed by a gene, licTE, coding for a thioesterase-like protein. The domain structure of the seven identified modules, which resembles that of the surfactin synthetases SrfA-A to -C, showed two epimerization domains attached to the third and sixth modules. The substrate specificity of the first, fifth, and seventh recombinant adenylation domains of LicA to -C (cloned and expressed in Escherichia coli) was determined to be Gln, Asp, and Ile (with minor Val and Leu substitutions), respectively. Therefore, we suppose that the identified biosynthesis operon is responsible for the production of a lichenysin variant with the primary amino acid sequence l-Gln–l-Leu–d-Leu–l-Val–l-Asp–d-Leu–l-Ile, with minor Leu and Val substitutions at the seventh position.Many strains of Bacillus are known to produce lipopeptides with remarkable surface-active properties (). The most prominent of these powerful lipopeptides is surfactin from Bacillus subtilis (). Surfactin is an acylated cyclic heptapeptide that reduces the surface tension of water from 72 to 27 mN m⁻¹ even in a concentration below 0.05% and shows some antibacterial and antifungal activities (). Some B. subtilis strains are also known to produce other, structurally related lipoheptapeptides (Table (Table1),1), like iturin (, ) and bacillomycin (, , ), or the lipodecapeptides fengycin () and plipastatin ().

TABLE 1

Lipoheptapeptide antibiotics of Bacillus spp.

Lipopeptide	Organism	Structure	Reference
Lichenysin A	B. licheniformis	`FA`^a`-L-Glu-L-Leu-D-Leu-L-Val-L-Asn-D-Leu-L-Ile`	51, 52
Lichenysin B		`FA`^a`-L-Glu-L-Leu-D-Leu-L-Val-L-Asp-D-Leu-L-Leu`	23, 26
Lichenysin C		`FA`^a`-L-Glu-L-Leu-D-Leu-L-Val-L-Asp-D-Leu-L-Ile`	17
Lichenysin D		`FA`^a`-L-Gln-L-Leu-D-Leu-L-Val-L-Asp-D-Leu-L-Ile`	This work
Surfactant 86	B. licheniformis	`FA`^a`-L-Glx`^d`-L-Leu-D-Leu-L-Val-L-Asx`^d`-D-Leu-L-Ile`^e	14, 15
		`L-Val`
Surfactin	B. subtilis	`FA`^a`-L-Glu-L-Leu-D-Leu-L-Val-L-Asp-D-Leu-L-Leu`	1, 7, 49
Esperin	B. subtilis	`FA`^b`-L-Glu-L-Leu-D-Leu-L-Val-L-Asp-D-Leu-L-Leu`^e	45
		`L-Val`
Iturin A	B. subtilis	`FA`^c`-L-Asn-D-Tyr-D-Asn-L-Gln-L-Pro-D-Asn-L-Ser`	32
Iturin C		`FA`^c`-L-Asn-D-Tyr-D-Asn-L-Gln-L-Pro-D-Asn`^e`-L-Asn`^e	34
		`D-Ser-L-Thr`
Bacillomycin L	B. subtilis	`FA`^c`-L-Asp-D-Tyr-D-Asn-L-Ser-L-Gln-D-Pro`^e`-L-Thr`	3
		`D-Ser-`
Bacillomycin D		`FA`^c`-L-Asp-D-Tyr-D-Asn-L-Pro-L-Glu-D-Ser-L-Thr`	30, 31
Bacillomycin F		`FA`^c`-L-Asn-D-Tyr-D-Asn-L-Gln-L-Pro-D-Asn-L-Thr`	27

Open in a separate window^aFA, β-hydroxy fatty acid. The β-hydroxy group forms an ester bond with the carboxy group of the C-terminal amino acid. ^bFA, β-hydroxy fatty acid. The β-hydroxy group forms an ester bond with the carboxy group of Asp5. ^cFA, β-amino fatty acid. The β-amino group forms a peptide bond with the carboxy group of the C-terminal amino acid. ^dOnly the following combinations of amino acid 1 and 5 are allowed: Gln-Asp or Glu-Asn. ^eWhere an alternative amino acid may be present in a structure, the alternative is also presented. In addition to B. subtilis, several strains of Bacillus licheniformis have been described as producing the lipopeptide lichenysin (14, , , , ). Lichenysins can be grouped under the general sequence l-Glx–l-Leu–d-Leu–l-Val–l-Asx–d-Leu–l-Ile/Leu/Val (Table (Table1).1). The first amino acid is connected to a β-hydroxyl fatty acid, and the carboxy-terminal amino acid forms a lactone ring to the β-OH group of the lipophilic part of the molecule. In contrast to the lipopeptide surfactin, lichenysins seem to be synthesized during growth under aerobic and anaerobic conditions (, ). The isolation of lichenysins from cells growing on liquid mineral salt medium on glucose or sucrose basic has been studied intensively. Antimicrobial properties and the ability to reduce the surface tension of water have also been described (14, , , ). The structural elucidation of the compounds revealed slight differences, depending on the producer strain. Various distributions of branched and linear fatty acid moieties of diverse lengths and amino acid variations in three defined positions have been identified (Table (Table11).In contrast to the well-defined methods for isolation and structural characterization of lichenysins, little is known about the biosynthetic mechanisms of lichenysin production. The structural similarity of lichenysins and surfactin suggests that the peptide moiety is produced nonribosomally by multifunctional peptide synthetases (, , , , ). Peptide synthetases from bacterial and fungal sources describe an alternative route in peptide bond formation in addition to the ubiquitous ribosomal pathway. Here, large multienzyme complexes affect the ordered recognition, activation, and linking of amino acids by utilizing the thiotemplate mechanism (, , ). According to this model, peptide synthetases activate their substrate amino acids as aminoacyl adenylates by ATP hydrolysis. These unstable intermediates are subsequently transferred to a covalently enzyme-bound 4′-phosphopantetheinyl cofactor as thioesters. The thioesterified amino acids are then integrated into the peptide product through a stepwise elongation by a series of transpeptidations directed from the amino terminals to the carboxy terminals. Peptide synthetases have not only awakened interest because of their mechanistic features; many of the nonribosomally processed peptide products also possess important biological and medical properties.In this report we describe the identification and characterization of a putative lichenysin biosynthesis operon from B. licheniformis ATCC 10716. Cloning and sequencing of the entire lic operon (26.6 kb) revealed three genes, licA, licB, and licC, with structural patterns common to peptide synthetases and a gene designated licTE, which codes for a putative thioesterase. The modular organization of the sequenced genes resembles the requirements for the biosynthesis of the heptapeptide lichenysin. Based on the arrangement of the seven identified modules and the tested substrate specificities, we propose that the identified genes are involved in the nonribosomal synthesis of the portion of the lichenysin peptide with the primary sequence l-Gln–l-Leu–d-Leu–l-Val–l-Asp–d-Leu–l-Ile (with minor Val and Leu substitutions). 相似文献

10.

Experimentally Increased Codon Bias in the Drosophila Adh Gene Leads to an Increase in Larval,But Not Adult,Alcohol Dehydrogenase Activity

Winfried Hense Nathan Anderson Stephan Hutter Wolfgang Stephan John Parsch David B. Carlini 《Genetics》2010,184(2):547-555

相似文献

11.

Adaptive Divergence in Experimental Populations of Pseudomonas fluorescens. IV. Genetic Constraints Guide Evolutionary Trajectories in a Parallel Adaptive Radiation

下载免费PDF全文

Michael J. McDonald Stefanie M. Gehrig Peter L. Meintjes Xue-Xian Zhang Paul B. Rainey 《Genetics》2009,183(3):1041-1053

The capacity for phenotypic evolution is dependent upon complex webs of functional interactions that connect genotype and phenotype. Wrinkly spreader (WS) genotypes arise repeatedly during the course of a model Pseudomonas adaptive radiation. Previous work showed that the evolution of WS variation was explained in part by spontaneous mutations in wspF, a component of the Wsp-signaling module, but also drew attention to the existence of unknown mutational causes. Here, we identify two new mutational pathways (Aws and Mws) that allow realization of the WS phenotype: in common with the Wsp module these pathways contain a di-guanylate cyclase-encoding gene subject to negative regulation. Together, mutations in the Wsp, Aws, and Mws regulatory modules account for the spectrum of WS phenotype-generating mutations found among a collection of 26 spontaneously arising WS genotypes obtained from independent adaptive radiations. Despite a large number of potential mutational pathways, the repeated discovery of mutations in a small number of loci (parallel evolution) prompted the construction of an ancestral genotype devoid of known (Wsp, Aws, and Mws) regulatory modules to see whether the types derived from this genotype could converge upon the WS phenotype via a novel route. Such types—with equivalent fitness effects—did emerge, although they took significantly longer to do so. Together our data provide an explanation for why WS evolution follows a limited number of mutational pathways and show how genetic architecture can bias the molecular variation presented to selection.UNDERSTANDING—and importantly, predicting—phenotypic evolution requires knowledge of the factors that affect the translation of mutation into phenotypic variation—the raw material of adaptive evolution. While much is known about mutation rate (e.g., ; ), knowledge of the processes affecting the translation of DNA sequence variation into phenotypic variation is minimal.Advances in knowledge on at least two fronts suggest that progress in understanding the rules governing the generation of phenotypic variation is possible (). The first stems from increased awareness of the genetic architecture underlying specific adaptive phenotypes and recognition of the fact that the capacity for evolutionary change is likely to be constrained by this architecture (Schlichting and Murren 2004; Hansen 2006). The second is the growing number of reports of parallel evolution (e.g., Pigeon et al. 1997; ; ; ; ; ; ; ; ; ; ; ; )—that is, the independent evolution of similar or identical features in two or more lineages—which suggests the possibility that evolution may follow a limited number of pathways (Schluter 1996). Indeed, giving substance to this idea are studies that show that mutations underlying parallel phenotypic evolution are nonrandomly distributed and typically clustered in homologous genes ().While the nonrandom distribution of mutations during parallel genetic evolution may reflect constraints due to genetic architecture, some have argued that the primary cause is strong selection (e.g., ; ). A means of disentangling the roles of population processes (selection) from genetic architecture is necessary for progress (Maynard Smith et al. 1985; ); also necessary is insight into precisely how genetic architecture might bias the production of mutations presented to selection.Despite their relative simplicity, microbial populations offer opportunities to advance knowledge. The wrinkly spreader (WS) morphotype is one of many different niche specialist genotypes that emerge when experimental populations of Pseudomonas fluorescens are propagated in spatially structured microcosms (). Previous studies defined, via gene inactivation, the essential phenotypic and genetic traits that define a single WS genotype known as LSWS (, ) (Figure 1). LSWS differs from the ancestral SM genotype by a single nonsynonymous nucleotide change in wspF. Functionally (see Figure 2), WspF is a methyl esterase and negative regulator of the WspR di-guanylate cyclase (DGC) () that is responsible for the biosynthesis of c-di-GMP (), the allosteric activator of cellulose synthesis enzymes (). The net effect of the wspF mutation is to promote physiological changes that lead to the formation of a microbial mat at the air–liquid interface of static broth microcosms ().Open in a separate window Figure 1.—Outline of experimental strategy for elucidation of WS-generating mutations and their subsequent identity and distribution among a collection of independently evolved, spontaneously arising WS genotypes. The strategy involves, first, the genetic analysis of a specific WS genotype (e.g., LSWS) to identify the causal mutation, and second, a survey of DNA sequence variation at specific loci known to harbor causal mutations among a collection of spontaneously arising WS genotypes. For example, suppressor analysis of LSWS using a transposon to inactivate genes necessary for expression of the wrinkly morphology delivered a large number of candidate genes (top left) (). Genetic and functional analysis of these candidate genes (e.g., ) led eventually to the identity of the spontaneous mutation (in wspF) responsible for the evolution of LSWS from the ancestral SM genotype (). Subsequent analysis of the wspF sequence among 26 independent WS genotypes (bottom) showed that 50% harbored spontaneous mutations (of different kinds; see Open in a separate window Figure 2.—Network diagram of DGC-encoding pathways underpinning the evolution of the WS phenotype and their regulation. Overproduction of c-di-GMP results in overproduction of cellulose and other adhesive factors that determine the WS phenotype. The ancestral SBW25 genome contains 39 putative DGCs, each in principle capable of synthesizing the production of c-di-GMP, and yet WS genotypes arise most commonly as a consequence of mutations in just three DGC-containing pathways: Wsp, Aws, and Mws. In each instance, the causal mutations are most commonly in the negative regulatory component: wspF, awsX, and the phosphodiesterase domain of mwsR (see text).To determine whether spontaneous mutations in wspF are a common cause of the WS phenotype, the nucleotide sequence of this gene was obtained from a collection of 26 spontaneously arising WS genotypes (WS_A-Z) taken from 26 independent adaptive radiations, each founded by the same ancestral SM genotype (Figure 1): 13 contained mutations in wspF (Bantinaki et al. 2007). The existence of additional mutational pathways to WS provided the initial motivation for this study.

TABLE 1

Mutational causes of WS

WS genotype	Gene	Nucleotide change	Amino acid change	Source/reference
LSWS	wspF	A901C	S301R	Bantinaki et al. (2007)
AWS	awsX	Δ100-138	ΔPDPADLADQRAQA	This study
MWS	mwsR	G3247A	E1083K	This study
WS_A	wspF	T14G	I5S	Bantinaki et al. (2007)
WS_B	wspF	Δ620-674	P206Δ (8)^{^a}	Bantinaki et al. (2007)
WS_C	wspF	G823T	G275C	Bantinaki et al. (2007)
WS_D	wspE	A1916G	D638G	This study
WS_E	wspF	G658T	V220L	Bantinaki et al. (2007)
WS_F	wspF	C821T	T274I	Bantinaki et al. (2007)
WS_G	wspF	C556T	H186Y	Bantinaki et al. (2007)
WS_H	wspE	A2202C	K734N	This study
WS_I	wspE	G1915T	D638Y	This study
WS_J	wspF	Δ865-868	R288Δ (3)^{^a}	Bantinaki et al. (2007)
WS_K	awsO	G125T	G41V	This study
WS_L	wspF	G482A	G161D	Bantinaki et al. (2007)
WS_M	awsR	C164T	S54F	This study
WS_N	wspF	A901C	S301R	Bantinaki et al. (2007)
WS_O	wspF	Δ235-249	V79Δ (6)^{^a}	Bantinaki et al. (2007)
WS_P	awsR	222insGCCACCGAA	74insATE	This study
WS_Q	mwsR	3270insGACGTG	1089insDV	This study
WS_R	mwsR	T2183C	V272A	This study
WS_S	awsX	C472T	Q158STOP	This study
WS_T	awsX	Δ229-261	ΔYTDDLIKGTTQ	This study
WS_U	wspF	Δ823-824	T274Δ (13)^{^a}	Bantinaki et al. (2007)
WS_V	awsX	T74G	L24R	This study
WS_W	wspF	Δ149	L49Δ (1)^{^a}	Bantinaki et al. (2007)
WS_X^{^b}	?	?	?	This study
WS_Y	wspF	Δ166-180	Δ(L51-I55)	Bantinaki et al. (2007)
WS_Z	mwsR	G3055A	A1018T	This study

Open in a separate window^aP206Δ(8) indicates a frameshift; the number of new residues before a stop codon is reached is in parentheses.^bSuppressor analysis implicates the wsp locus (17 transposon insertions were found in this locus). However, repeated sequencing failed to identify a mutation.Here we define and characterize two new mutational routes (Aws and Mws) that together with the Wsp pathway account for the evolution of 26 spontaneously arising WS genotypes. Each pathway offers approximately equal opportunity for WS evolution; nonetheless, additional, less readily realized genetic routes producing WS genotypes with equivalent fitness effects exist. Together our data show that regulatory pathways with specific functionalities and interactions bias the molecular variation presented to selection. 相似文献

12.

Allelic Variation in Cell Wall Candidate Genes Affecting Solid Wood Properties in Natural Populations and Land Races of Pinus radiata

S. K. Dillon M. Nolan W. Li C. Bell H. X. Wu S. G. Southerton 《Genetics》2010,185(4):1477-1487

相似文献

13.

Retrograde Intraflagellar Transport Mutants Identify Complex A Proteins With Multiple Genetic Interactions in Chlamydomonas reinhardtii

下载免费PDF全文

Carlo Iomini Linya Li Jessica M. Esparza Susan K. Dutcher 《Genetics》2009,183(3):885-896

The intraflagellar transport machinery is required for the assembly of cilia. It has been investigated by biochemical, genetic, and computational methods that have identified at least 21 proteins that assemble into two subcomplexes. It has been hypothesized that complex A is required for retrograde transport. Temperature-sensitive mutations in FLA15 and FLA17 show defects in retrograde intraflagellar transport (IFT) in Chlamydomonas. We show that IFT144 and IFT139, two complex A proteins, are encoded by FLA15 and FLA17, respectively. The fla15 allele is a missense mutation in a conserved cysteine and the fla17 allele is an in-frame deletion of three exons. The flagellar assembly defect of each mutant is rescued by the respective transgenes. In fla15 and fla17 mutants, bulges form in the distal one-third of the flagella at the permissive temperature and this phenotype is also rescued by the transgenes. These bulges contain the complex B component IFT74/72, but not α-tubulin or p28, a component of an inner dynein arm, which suggests specificity with respect to the proteins that accumulate in these bulges. IFT144 and IFT139 are likely to interact with each other and other proteins on the basis of three distinct genetic tests: (1) Double mutants display synthetic flagellar assembly defects at the permissive temperature, (2) heterozygous diploid strains exhibit second-site noncomplemention, and (3) transgenes confer two-copy suppression. Since these tests show different levels of phenotypic sensitivity, we propose they illustrate different gradations of gene interaction between complex A proteins themselves and with a complex B protein (IFT172).CILIA and flagella are microtubule-based organelles that are found on most mammalian cells. They provide motility to cells and participate in many sensory processes. Defects in or loss of cilia/flagella cause a variety of human diseases that include polycystic kidney disease, retinal degeneration, infertility, obesity, respiratory defects, left–right axis determination, and polydactyly (). Mouse mutants demonstrate that cilia are essential for viability, neural tube closure, and bone development (; ). Cilia and flagella are also present in protists, algae, moss, and some fungi.The assembly and maintenance of cilia and flagella require intraflagellar transport (IFT) (). IFT involves the movement of 100- to 200-nm-long protein particles from the basal body located in the cell body to the tip of the flagella using the heterotrimeric kinesin-2 (anterograde movement) () and movement back to the cell body (retrograde movement) using the cytoplasmic dynein complex (; ). IFT particles change their direction of movement as well as their size, speed, and frequency at the ends of the flagella as they switch from anterograde to retrograde movement (). Biochemical isolation of IFT particles reveals that they are composed of at least 16 proteins and that these particles can be dissociated into two complexes in vitro by changing the salt concentration (; ). Recent genetic and bioinformatics analysis adds at least 7 more proteins to the IFT particle () (Eggenschwiler and Anderson 2007).

TABLE 1

Proteins and gene names for the intraflagellar transport particles in Chlamydomonas, C. elegans, and mouse

Protein	Motif	Chlamydomonas gene	C. elegans gene	Mouse gene	References to worm and mouse genes
Complex A
IFT144	WD	FLA15
IFT140	WD	—	che-11		Qin et al. (2001)
IFT139	TRP	FLA17	dyf-2	THM1	Efimenko et al. (2006); Tran et al. (2008)
IFT122	WD	—	IFTA-1		Blacque et al. (2006)
IFT121	WD	—	daf-10		Bell et al. (2006)
IFT43		—
Complex B
IFT172	WD	FLA11	osm-1	Wimple	Huangfu et al. (2003); Pedersen et al. (2005); Bell et al. (2006)
IFT88	TRP	IFT88	osm-5	Tg737/Polaris	Pazour et al. (2000); Qin et al. (2001)
IFT81	Coil	—	ift-81	CDV1	Kobayashi et al. (2007)
IFT80	WD	—	che-2	Wdr56	Fujiwara et al. (1999)
IFT74/72	Coil	—	ift-74	Cmg1	Kobayashi et al. (2007)
IFT57/55	Coil	—	che-13	Hippi	Haycraft et al. (2003)
IFT54	Microtubule binding domain MIP-T3	—	dyf-11	Traf3IP1	Kunitomo and Iino (2008); Li et al. (2008); Omori et al. (2008); Follit et al. (2009)
IFT52	ABC type	BLD1	osm-6	Ngd2	Brazelton et al. (2001); Bell et al. (2006)
IFT46		IFT46	dyf-6		Bell et al. (2006); Hou et al. (2007)
IFT27	G protein	—	Not present	Rabl4
IFT25	Hsp20	—	Not present	HSP16.1	Follit et al. (2009)
IFT22	G protein	—	IFTA-2	Rabl5	Schafer et al. (2006)
IFT20	Coil	—			Follit et al. (2006)
FAP22	Cluamp related protein	—	dyf-3	Cluamp1	Murayama et al. (2005); Follit et al. (2009)
DYF13		—	dyf-13	Ttc26	Blacque et al. (2005)

Open in a separate window—, no mutant found to date in Chlamydomonas.A collection of temperature-sensitive mutant strains that fail to assemble flagella at the restrictive temperature of 32° was isolated in Chlamydomonas (; ; ; ). Analysis of the flagella at 21° permits the measurement of the velocity and frequency of IFT particles in the mutant strains. This analysis suggested that assembly has four phases: recruitment to the basal body, anterograde movement (phases I and II), retrograde movement, and return to the cytoplasm (phases III and IV) (). Different mutants were classified as defective in these four phases. However, because different alleles of FLA8 were classified as defective in different phases (; ), we combined mutants with IFT defects into just two classes. The first group (phases I and II) includes mutant strains that show decreased anterograde velocities, a decreased ratio of anterograde to retrograde particles, and an accumulation of complex A proteins at the basal body. This group includes mutations in the FLA8 and FLA10 genes, which encode the two motor subunits of kinesin-2 (; ), as well as mutations in three unknown genes (FLA18, FLA27, and FLA28). The second group includes mutant strains that show the reciprocal phenotype (phases III and IV); these phenotypes include decreased retrograde velocities, an increased ratio of anterograde to retrograde particles, and an accumulation of complex B proteins in the flagella. With the exception of the FLA11 gene, which encodes IFT172, a component of complex B (), the gene products in this class are unknown (FLA2, FLA15, FLA16, FLA17, and FLA24). One might predict that mutations in this group would map to genes that encode complex A or retrograde motor subunits. Interestingly, IFT particles isolated from fla11, fla15, fla16, and fla17-1 flagella show depletion of complex A polypeptides (; ). The inclusion of IFT172 in this class is explained by the observations that IFT172 plays a role in remodeling the IFT particles at the flagellar tip to transition from anterograde to retrograde movement (). The remaining mutant strains do not show obvious defects in velocities, ratios, or accumulation at 21° and may reflect a less severe phenotype at the permissive temperature or a non-IFT role for these genes.Direct interactions occur between components of complex B. IFT81 and IFT74/72 interact to form a scaffold required for IFT complex B assembly (). IFT57 and IFT20 also interact with each other and kinesin-2 (). While physical interactions are being used to define IFT particle architecture, genetic interactions among loci encoding IFT components should be instructive regarding their function as well. To probe retrograde movement and its function, we have identified the gene products encoded by two retrograde defective mutant strains. They are FLA15 and FLA17 and encode IFT144 and IFT139, respectively. The genetic interactions of these loci provide interesting clues about the assembly of the IFT particles and possible physical interactions in the IFT particles. 相似文献

14.

A Problem With the Correlation Coefficient as a Measure of Gene Expression Divergence

Vini Pereira David Waxman Adam Eyre-Walker 《Genetics》2009,183(4):1597-1600

The correlation coefficient is commonly used as a measure of the divergence of gene expression profiles between different species. Here we point out a potential problem with this statistic: if measurement error is large relative to the differences in expression, the correlation coefficient will tend to show high divergence for genes that have relatively uniform levels of expression across tissues or time points. We show that genes with a conserved uniform pattern of expression have significantly higher levels of expression divergence, when measured using the correlation coefficient, than other genes, in a data set from mouse, rat, and human. We also show that the Euclidean distance yields low estimates of expression divergence for genes with a conserved uniform pattern of expression.IT is now possible to measure the expression levels of thousands of genes in multiple tissues at multiple times. This has led to investigations into the evolution of gene expression and how the pattern of expression changes on a genomic scale. In some analyses, the evolution of expression is considered only within one tissue, but in many studies the evolution across multiple tissues is investigated. In this latter case, the evolution of an expression profile—a vector of expression levels of a gene across several tissues—is considered.Several different statistics have been proposed to measure the divergence between gene expression profiles. The two most popular measures are the Euclidean distance (; ; ; ) and Pearson''s correlation coefficient (; ; ; ; ,; ; ). The correlation coefficient is often subtracted from one, so that the statistic varies from zero, when there has been no expression divergence, to a maximum of two; we refer to this statistic as the Pearson distance. Here we describe a significant shortcoming of the Pearson distance that is not shared by the Euclidean distance.To investigate properties of these two measures of expression divergence, we compiled a data set of 2859 orthologous genes from human, mouse, and rat for which we had microarray expression data from nine homologous tissues: bone marrow, heart, kidney, large intestine, pituitary, skeletal muscle, small intestine, spleen, and thymus). The expression data for rat came from , the mouse data from , and the human data from . Each tissue experiment had two replicates in mouse, a varying number of replicates in rat, and one in humans; some genes were also matched by multiple probe sets. To obtain an average across experiments and probe sets we processed the data as follows:

Raw CEL files of gene expression levels were obtained from the NCBI Gene Expression Omnibus database (http://www.ncbi.nlm.nih.gov/projects/geo/).
The results from the mouse, rat, and human arrays were normalized separately using both the MAS5 (Affymetrix 2001) and the RMA algorithms (Irizarry et al. 2003) as implemented in Bioconductor (Gentleman et al. 2004). The results are qualitatively similar for the two normalization procedures, although recent analyses suggest that MAS5 normalization is generally better (Ploner et al. 2005; Lim et al. 2007).
The expression of each gene within a tissue was averaged across experiments and probe sets.

We computed expression distances (ED) between orthologous gene expression profiles, for each of the three species comparisons, rat–mouse, rat–human, and mouse–human, according to the two different distance metrics, the Euclidean distance and the Pearson distance:(1)Here x_ij is the expression level of the gene under consideration in species i in tissue j, and is the average expression level of the gene in species i across tissues. Expression levels are known in a total of k tissues.Because expression levels are measured on different microarray platforms in the three species, we compute relative abundance (RA) values, before calculating the Euclidean distance (). The RA is the expression of a gene in a particular tissue divided by the sum of the expression values of that gene across all tissues. We calculated RA values to remove “probe” effects (the tendency for a gene to bind its probe set on one platform more efficiently than on another platform). Because of probe effects it is not easy to distinguish absolute changes in expression and differences in binding efficiency. Calculating RA values removes this problem from the Euclidean distance. Pearson''s distance does not change under such a rescaling and so this is unnecessary.In some analyses the logarithm of the expression or RA values are used (e.g., ; ; ), and in others the expression values are used without this transformation (e.g., ; ; ; ,; ; ). We calculated both the Pearson and the Euclidean distances on log-transformed and untransformed expression values. The results are qualitatively similar so here we present only the results obtained using the logarithm of the expression or RA values.It is natural to expect the two measures of expression divergence to be positively correlated with one another; however, the Euclidean and Pearson distances are almost completely uncorrelated (MAS5 normalization, mouse–rat correlation coefficient = 0.06, human–rat r = 0.13, human–mouse r = 0.10; RMA normalization, mouse–rat correlation coefficient = −0.12, human–rat r = −0.00, human–mouse r = −0.08; Figure 1). This could, plausibly, be because the two statistics measure different aspects of divergence. However, irrespective of this, there is a potential problem associated with the Pearson distance. Imagine that we have a gene that is expressed at identical levels in all tissues in two species (i.e., expression levels are uniform between tissues and also between species). We quite reasonably assume that measured expression levels contain noise. Thus each measured expression level (x_ij) is the sum of the (assumed) uniform expression level and an independent random number representing noise. In this case there is no real divergence in the expression profile between the species. However, the two measures of divergence may differ greatly in this case. The Euclidean distance reflects only the noise present in the data and hence will be small if the noise is small. By contrast, the Pearson distance will have a value close to 1 since the second term in PeaD in Equation 1 will be close to zero, reflecting the fact that the noise components of different expression levels are independent. Thus the Pearson distance will give the impression that expression divergence is great, but all this apparent divergence is noise. This will be a problem with Pearson''s distance whenever measurement error is of the same magnitude as the differences in expression between tissues. This will therefore tend to be a problem for lowly expressed genes, where measurement error can be large relative to the true value.Open in a separate window Figure 1.—The correlation between the Euclidean and Pearson distances for (a) mouse–rat, (b) human–rat, and (c) human–mouse. Only the results from MAS5 normalization are shown; qualitatively similar results were obtained with RMA.The above example is unrealistic because real gene expression profiles are rarely perfectly uniform. To investigate whether this shortcoming of the Pearson distance is a problem in real data sets, we determined genes with a relatively uniform pattern of expression in all three species considered above. To do this we computed the entropy of a gene''s expression, which is a measure of uniformity in expression across tissues (): the higher the value of the entropy, the more uniform is the expression. We calculated the entropy for each gene in each of the three species, averaged these across species, and then took those genes in the upper quartile of mean entropy values as a data set of genes with a relatively conserved pattern of uniform expression.It is natural to expect those genes with a conserved uniform pattern of expression to have relatively low expression divergence; however, on average these genes have significantly higher Pearson distances than other genes (Figure 2; supporting information, Figure S1 and Figure S2). By contrast, the Euclidean distance shows the pattern one would anticipate; all of the conserved uniform genes have low expression divergence. It therefore seems likely that the Pearson distance is sensitive to measurement error and hence may not be a good measure of expression divergence.Open in a separate window Figure 2.—The distribution of expression divergence values for those genes with a uniform pattern of expression that is conserved across species vs. the distribution for all genes for (a) Pearson and (b) Euclidean distances for mouse–rat. We present similar values for human–mouse and human–rat in Figure S1 and Figure S2. Only the results from MAS5 normalization are shown; qualitatively similar results were obtained with RMA.

TABLE 1

The median expression divergence for genes that have a conserved uniform pattern of expression (upper quartile of mean entropy values) vs. all other genes

Data set	Statistic	Conserved uniform genes	Other genes	Wilcoxon test P-value
MAS5 normalization
Mouse–rat	Euclidean	1.66	2.79	<10⁻¹⁵
	Pearson	0.70	0.47	<10⁻¹⁵
Human–mouse	Euclidean	1.67	3.13	<10⁻¹⁵
	Pearson	0.78	0.58	<10⁻¹⁵
Human–rat	Euclidean	1.83	3.21	<10⁻¹⁵
	Pearson	0.78	0.58	<10⁻¹⁵
RMA normalization
Mouse–rat	Euclidean	0.59	1.40	<10⁻¹⁵
	Pearson	0.82	0.38	<10⁻¹⁵
Human–mouse	Euclidean	0.59	1.58	<10⁻¹⁵
	Pearson	0.81	0.48	<10⁻¹⁵
Human–rat	Euclidean	0.58	1.55	<10⁻¹⁵
	Pearson	0.73	0.50	<10⁻¹⁵

Open in a separate windowWe note that there are two additional advantages of the Euclidean distance. First, it can take into account differences in the absolute level of expression if those data are available, either because the method of assay allows this, for example, if ESTs, SAGE, sequencing, or RNA-Seq data are used, or because expression in the two species has been assessed on the same platform using probes that are conserved between the two species. Second, the square of the Euclidean distance is expected to increase linearly with time. have previously shown that the squared difference in log expression level increases linearly with time under a Brownian motion model of gene expression evolution. It is therefore expected that the squared Euclidean distance will increase with time since the squared Euclidean distance is the sum of the squared differences across tissues. We prove this in File S1; we also show that this linearity holds, approximately, when relative abundance values are used (see also ). 相似文献

15.

Surprising Fitness Consequences of GC-Biased Gene Conversion: I. Mutation Load and Inbreeding Depression

Sylvain Glémin 《Genetics》2010,185(3):939-959

GC-biased gene conversion (gBGC) is a recombination-associated process mimicking selection in favor of G and C alleles. It is increasingly recognized as a widespread force in shaping the genomic nucleotide landscape. In recombination hotspots, gBGC can lead to bursts of fixation of GC nucleotides and to accelerated nucleotide substitution rates. It was recently shown that these episodes of strong gBGC could give spurious signatures of adaptation and/or relaxed selection. There is also evidence that gBGC could drive the fixation of deleterious amino acid mutations in some primate genes. This raises the question of the potential fitness effects of gBGC. While gBGC has been metaphorically termed the “Achilles'' heel” of our genome, we do not know whether interference between gBGC and selection merely has practical consequences for the analysis of sequence data or whether it has broader fundamental implications for individuals and populations. I developed a population genetics model to predict the consequences of gBGC on the mutation load and inbreeding depression. I also used estimates available for humans to quantitatively evaluate the fitness impact of gBGC. Surprising features emerged from this model: (i) Contrary to classical mutation load models, gBGC generates a fixation load independent of population size and could contribute to a significant part of the load; (ii) gBGC can maintain recessive deleterious mutations for a long time at intermediate frequency, in a similar way to overdominance, and these mutations generate high inbreeding depression, even if they are slightly deleterious; (iii) since mating systems affect both the selection efficacy and gBGC intensity, gBGC challenges classical predictions concerning the interaction between mating systems and deleterious mutations, and gBGC could constitute an additional cost of outcrossing; and (iv) if mutations are biased toward A and T alleles, very low gBGC levels can reduce the load. A robust prediction is that the gBGC level minimizing the load depends only on the mutational bias and population size. These surprising results suggest that gBGC may have nonnegligible fitness consequences and could play a significant role in the evolution of genetic systems. They also shed light on the evolution of gBGC itself.GC-BIASED gene conversion (gBGC) is increasingly recognized as a widespread force in shaping genome evolution. In different species, gene conversion occurring during double-strand break recombination repair is thought to be biased toward G and C alleles. In heterozygotes, GC alleles undergo a kind of molecular meiotic drive that mimics selection (reviewed in ). This process can rapidly increase the GC content, especially around recombination hotspots (), and, more broadly, can affect genome-wide nucleotide landscapes (). For instance, it is thought to play a role in shaping isochore structure evolution in mammals (; ; ) and birds (). Direct experimental evidence of gBGC mainly comes from studies in yeast (; ; but see ) and humans (). However, associations between recombination and the nucleotide landscape and frequency spectra biased toward GC alleles provide indirect evidence in very diverse organisms (

Pedigree	Pedigree class	No. of nonoverlapping sibships in the pedigree	No. of retained sibships	No. of meioses
1	Multigenerational	17	17	464
2	Multigenerational	27	20	728
3	Multigenerational	23	19	602
4	Multigenerational	14	9	254
5	Multigenerational	11	9	242
6	Multigenerational	5	3	68
7	Multigenerational	4	3	100
8	Multigenerational	2	1	16
9	Sibship^{^a}	2	1	20
32–85	Sibship		51	1146
Total		180	132	3640

Gene designation		Gene product	% Identity of protein^b
Previous	New (acces- sion no.)	Gene product	% Identity of protein^b
gntV	idnK (P39208)	d-Gluconate kinase	45 (GntK^c)
yjgV	idnD (P39346)	l-Idonate 5-dehydrogenase	30.6 (sheep DHSO^d)
yjgU	idnO (P39345)	5-Keto-d-gluconate 5-reductase	56 (GNO^e)
gntW	idnT (P39344)	l-Idonate transporter	61 (GntT^f)
yjgS	idnR (P39343)	l-Idonate regulator	46 (GntR^g)

Expression, purification, and specificity of DdlN.

TABLE 1

Characterization of d-Ala-d-X ligase activity.

TABLE 2

pH dependence of peptide versus that of depsipeptide synthesis activity.

Concluding remarks.

St. Onge et al. (2007) data set:

TABLE 2

TABLE 3

Jasnos and Korona (2007) data set:

TABLE 1

TABLE 1

TABLE 1

TABLE 1

TABLE 1

Impacts of gBGC on inbreeding depression:

gBGC and the genetic basis of inbreeding depression in panmictic populations:

Joint effect of gBGC and mating system on the load and inbreeding depression:

Minimum load and the evolution of gBGC and recombination landscapes:

gBGC and the evolution of mating systems:

Inbreeding depression and the shift in mating systems:

gBGC and the long-term evolution of mating systems:

Conclusion:

TABLE 1

TABLE 1