首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Patterns of molecular evolution in birds have long been considered anomalous. Compared with other vertebrates, birds have reduced levels of genetic divergence between groups of similar taxonomic ranks for a variety of nuclear and mitochondrial markers. This observation led to the avian constraint hypothesis, which identifies increased functional constraint on avian proteins as the cause for the reduction in genetic divergence. Subsequent investigations provided additional support for the avian constraint hypothesis when rates of molecular evolution were found to be slower in birds than in mammals in a variety of independent calibrations. It is possible to test the avian constraint hypothesis as an explanation for this avian slowdown by comparing DNA sequence data from protein-coding regions in birds and homologous regions in mammals. The increased selective constraints should lead to a reduction in the proportion of amino acid replacement substitutions. To test for such a decrease, we calculated the numbers of amino acid replacement substitutions per replacement site (dN) and silent substitutions per silent site (dS) for the complete mitochondrial cytochrome b gene using 38 avian and 43 mammalian comparisons that were phylogenetically independent. We find that dN/dS is significantly smaller in birds than in mammals. This difference cannot be explained by differences in codon bias affecting dS values. We suggest that the avian slowdown can be explained, at least in part, by a decreased tolerance for amino acid substitutions in avian species relative to mammalian species.  相似文献   

2.
The PH75 strain of filamentous bacteriophage (Inovirus) grows in the thermophilic bacterium Thermus thermophilus at 70 degrees C. We have characterized the viral DNA and determined the amino acid sequence of the major coat protein, p8. The p8 protein is synthesized without a leader sequence, like that of bacteriophage Pf3 but unlike that of bacteriophage Pf1, both of which grow in the mesophile Pseudomonas aeruginosa. X-ray diffraction patterns from ordered fibres of the PH75 virion are similar to those from bacteriophages Pf1 and Pf3, indicating that the protein capsid of the PH75 virion has the same helix symmetry and subunit shape, even though the primary structures of the major coat proteins are quite different and the virions assemble at very different temperatures. We have used this information to build a molecular model of the PH75 protein capsid based on that of Pf1, and refined the model by simulated annealing, using fibre diffraction data extending to 2.4 A resolution in the meridional direction and to 3.1 A resolution in the equatorial direction. The common design may reflect a fundamental motif of alpha-helix packing, although differences exist in the DNA packaging and in the means of insertion of the major coat protein of these filamentous bacteriophages into the membrane of the host bacterial cell. These may reflect differences in the assembly mechanisms of the virions.  相似文献   

3.
Two-dimensional Fourier transform methods for homonuclear proton NMR spectroscopy have been introduced by Wüthrich and Ernst as a means of extending assignments in spectra of proteins. Multinuclear two-dimensional approaches also appear promising. We are applying current one- and two-dimensional NMR methods to protein family members that differ from one another by one or more amino acid substitutions. The overall goal is to elucidate relationships among the sequences, structures, and functions of these proteins: for example, to delineate primary structural requirements for changes in observable properties such as conformation, amino acid side chain dynamics, hydrogen exchange dynamics, pK'a values, and oxidation-reduction potentials. The ovomucoids from a variety of species of birds, which include a single set of 12 pairs of third-domain proteins (Mr = 6062 for turkey third domain, similar for others) that differ by single amino acid substitutions, provide a favorable system for the study of the structural and dynamic effects of single amino acid replacements. X-ray crystallographic structures of two ovomucoid third domains are available. Other series of proteins being studied by these methods include the photosynthetic electron transport proteins ferredoxin and plastocyanin.  相似文献   

4.
A set of aligned homologous protein sequences is divided into two groups consisting of m and n sequences. Each group contains sequences from the most related organisms. Value of the position dissimilarity of proteins from different groups of m and n sequences is defined as a number of mismatches in comparison of all possible m X n pairs of amino acid residues in the position (each from different group) divided by m X n. Ten position average of dissimilarity values is plotted vs. the first position number. Area of the figure between the profile of dissimilarity values and its mean value line characterizes the overall irregularity of amino acid substitutions along the protein sequences. If the area is greater than the average area for 1000 random profiles by more than two standard deviation units, the profile extrema containing the "surplus" of area are cut off. The cut-off stretches are likely to be variable and constant regions. If necessary, each of stretches may be separately tested and statistically estimated using a standard size sample of artificial protein families. Intergroup comparison of protein sequences reveals high overall irregularity of amino acid substitutions and identifies variable and conservative regions for all considered families of proteins: phospholipases A2, aspartate aminotransferases, alpha-subunits of Na+, K(+)-ATPase, L- and M-subunits of photosynthetic bacteria photoreaction centre, human rhodopsins.  相似文献   

5.
An algorithm is presented for localizing variable and constant regions in homologous protein sequences. A set of aligned protein sequences is divided into two groups consisting of m and n sequences. Each group contains sequences of most related species. Value of the position dissimilarity of proteins from different groups of m and n sequences is defined as a number of failures to coincide in comparison with all possible mXn pairs of amino acid residues in the position (each from different group) divided by mXn. The position dissimilarity value of m protein sequences within a group is defined as the number of failures to coincide in comparison with all possible mX X(m-1)/2 pairs of amino acid residues divided by mX(m-1)/2. Ten position average of dissimilarity values is plotted vs. the first position number. Area of the figure included between the profile of dissimilarity values and its mean value line characterizes the overall irregularity of amino acid substitutions along the protein sequences. If the area value is greater than the average area for 1000 random profile by more than two standard deviation units, the profile extrema containing the "surplus" of area are cut off. The cut off stretches are likely to be variable and constant regions. In case of "between groups" comparisons it is found that the overall irregularity of amino acid substitutions is very high for all considered families of proteins; phospholipases A2, aspartate aminotransferases, alpha-subunits of Na+,K(+)-ATPase, L- and M-subunits of photosynthetic bacteria photoreaction centre, human rhodopsins.  相似文献   

6.
Here we investigate the correlations between coding sequence substitutions as a function of their separation along the protein sequence. We consider both substitutions between the reference genomes of several Drosophilids as well as polymorphisms in a population sample of Zimbabwean Drosophila melanogaster. We find that amino acid substitutions are "clustered" along the protein sequence, that is, the frequency of additional substitutions is strongly enhanced within ≈10 residues of a first such substitution. No such clustering is observed for synonymous substitutions, supporting a "correlation length" associated with selection on proteins as the causative mechanism. Clustering is stronger between substitutions that arose in the same lineage than it is between substitutions that arose in different lineages. We consider several possible origins of clustering, concluding that epistasis (interactions between amino acids within a protein that affect function) and positional heterogeneity in the strength of purifying selection are primarily responsible. The role of epistasis is directly supported by the tendency of nearby substitutions that arose on the same lineage to preserve the total charge of the residues within the correlation length and by the preferential cosegregation of neighboring derived alleles in our population sample. We interpret the observed length scale of clustering as a statistical reflection of the functional locality (or modularity) of proteins: amino acids that are near each other on the protein backbone are more likely to contribute to, and collaborate toward, a common subfunction.  相似文献   

7.
A putative deoxyuridine triphosphatase (dUTPase) gene from chlorella virus PBCV-1 was cloned, and the recombinant protein was expressed in Escherichia coli. The recombinant protein has dUTPase activity and requires Mg(2+) for optimal activity, while it retains some activity in the presence of other divalent cations. Kinetic studies of the enzyme revealed a K(m) of 11.7 microM, a turnover k(cat) of 6.8 s(-1), and a catalytic efficiency of k(cat)/K(m) = 5.8 x 10(5) M(-1) s(-1). dUTPase genes were cloned and expressed from two other chlorella viruses IL-3A and SH-6A. The two dUTPases have similar properties to PBCV-1 dUTPase except that IL-3A dUTPase has a lower temperature optimum (37 degrees C) than PBCV-1 dUTPase (50 degrees C). The IL-3A dUTPase differs from the PBCV-1 enzyme by nine amino acids, including two amino acid substitutions, Glu81-->Ser81 and Thr84-->Arg84, in the highly conserved motif III of the proteins. To investigate the difference in temperature optima between the two enzymes, homology modeling and docking simulations were conducted. The results of the simulation and comparisons of amino acid sequence suggest that adjacent amino acids are important in the temperature optima. To confirm this suggestion, three site-directed amino acid substitutions were made in the IL-3A enzyme: Thr84-->Arg84, Glu81-->Ser81, and Glu81-->Ser81 plus Thr84-->Arg84. The single substitutions affected the optimal temperature for enzyme activity. The temperature optimum increased from 37 to 55 degrees C for the enzyme containing the two amino acid substitutions. We postulate that the change in temperature optimum is due to reduction in charge and balkiness in the active cavity that allows more movement of the ligand and protein before the enzyme and substrate complex is formed.  相似文献   

8.
Substitutions of individual amino acids in proteins may be under very different evolutionary restraints depending on their structural and functional roles. The Environment Specific Substitution Table (ESST) describes the pattern of substitutions in terms of amino acid location within elements of secondary structure, solvent accessibility, and the existence of hydrogen bonds between side chains and neighbouring amino acid residues. Clearly amino acids that have very different local environments in their functional state compared to those in the protein analysed will give rise to inconsistencies in the calculation of amino acid substitution tables. Here, we describe how the calculation of ESSTs can be improved by discarding the functional residues from the calculation of substitution tables. Four categories of functions are examined in this study: protein–protein interactions, protein–nucleic acid interactions, protein–ligand interactions, and catalytic activity of enzymes. Their contributions to residue conservation are measured and investigated. We test our new ESSTs using the program CRESCENDO, designed to predict functional residues by exploiting knowledge of amino acid substitutions, and compare the benchmark results with proteins whose functions have been defined experimentally. The new methodology increases the Z-score by 98% at the active site residues and finds 16% more active sites compared with the old ESST. We also find that discarding amino acids responsible for protein–protein interactions helps in the prediction of those residues although they are not as conserved as the residues of active sites. Our methodology can make the substitution tables better reflect and describe the substitution patterns of amino acids that are under structural restraints only.  相似文献   

9.
Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life.  相似文献   

10.
We have cloned and sequenced the gene for DNA ligase from Thermus thermophilus. A comparison of this sequence and those of other ligases reveals significant homology only with that of Escherichia coli. The overall amino acid composition of the thermophilic ligase and the pattern of amino acid substitutions between the two proteins are consistent with compositional biases in other thermophilic enzymes. We have engineered the expression of the T. thermophilus gene in Escherichia coli, and we show that E. coli proteins may be substantially removed from the thermostable ligase by a simple heat precipitation step.  相似文献   

11.
We study to what degree patterns of amino acid substitution vary between genes using two models of protein-coding gene evolution. The first divides the amino acids into groups, with one substitution rate for pairs of residues in the same group and a second for those in differing groups. Unlike previous applications of this model, the groups themselves are estimated from data by simulated annealing. The second model makes substitution rates a function of the physical and chemical similarity between two residues. Because we model the evolution of coding DNA sequences as opposed to protein sequences, artifacts arising from the differing numbers of nucleotide substitutions required to bring about various amino acid substitutions are avoided. Using 10 alignments of related sequences (five of orthologous genes and five gene families), we do find differences in substitution patterns. We also find that, although patterns of amino acid substitution vary temporally within the history of a gene, variation is not greater in paralogous than in orthologous genes. Improved understanding of such gene-specific variation in substitution patterns may have implications for applications such as sequence alignment and phylogenetic inference.  相似文献   

12.
During protein evolution the amino acid substitutions accumulate with time. However, the effect of accumulation of the amino acid substitutions to structural changes has not been estimated well. We will propose that the discordance of amino acid substitution on the HA protein of influenza A virus is useful for the assessment of structural changes during evolution. Discordance value can be obtained from the experimental data of tolerance or intolerance by introducing site directed mutagenesis at the homologous positions of two HA proteins holding the same amino acid residues. The value of discordance correlated to the number of amino acid differences among proteins. In the H3HA discordance rate was calculated to be 0.45% per one amino acid change. Furthermore, discordance of amino acid substitutions suggests that tolerable amino acid substitutions in different order have a probability of promoting irreversible divergence of the HA protein to different subtypes.  相似文献   

13.
The potential for engineering stable proteins with multiple amino acid substitutions was explored. Eleven lysine, five methionine, two tryptophan, one glycine, and three threonine substitutions were simultaneously made in barley chymotrypsin inhibitor-2 (CI-2) to substantially improve the essential amino acid content of the protein. These substitutions were chosen based on the three-dimensional structure of CI-2 and an alignment of homologous sequences. The initial engineered protein folded into a wild-type-like structure, but had a free energy of unfolding of only 2.2 kcal/mol, considerably less than the wild-type value of 7.5 kcal/mol. Restoration of the lysine mutation at position 67 to the wild-type arginine increased the free energy of unfolding to 3.1 kcal/mol. Subsequent cysteine substitutions at positions 22 and 82 resulted in disulfide bond formation and a protein with nearly wild-type thermodynamic stability (7.0 kcal/mol). None of the engineered proteins retained inhibitory activity against chymotrypsin or elastase, and all had substantially reduced inhibitory activity against subtilisin. The proteolytic stabilities of the proteins correlated with their thermodynamic stabilities. Reduction of the disulfide bond resulted in substantial loss of both thermodynamic and proteolytic stabilities, confirming that the disulfide bond, and not merely the cysteine substitutions, was responsible for the increased stability. We conclude that it is possible to replace over a third of the residues in CI-2 with minimal disruption of stability and structural integrity.  相似文献   

14.
To examine further the dependence of immunological cross-reactivity on sequence resemblance among proteins, we carried out micro-complement fixation studies with rabbit antisera to bacterial azurins of known amino acid sequence. There is a strong correlation (r = 0.9) between number of amino acid substitutions and degree of antigenic difference (immunological distance) among these azurins. The antigenic effects of amino acid substitutions are thus approximately equal and approximately additive. Similar observations and inferences were made before with a series of bird lysozymes. Indeed, the same approximate relationship between immunological distance (y) and percent difference in amino acid sequence (x) holds for both azurins and lysozymes, namely y congruent to 5x. An explanation is given for the dependence of immunological cross-reactivity on sequence resemblance among proteins. This entails reviewing evidence regarding the nature and number of antigenic sites on globular protein antigens as well as evidence for the existence of evolutionary biases against substitutions that are internal or cause large conformational changes. The explanation we give may apply only to those naturally occurring, globular, monomeric, isofunctional proteins whose sequences differ substantially from that of any rabbit protein.  相似文献   

15.
KNOX homeodomain (HD) proteins encoded by KNOTTED1-like homeobox genes (KNOX genes) are thought to work as switches for cells to change from an indeterminate to a determinate state, although their direct functions are not clear. In the process of isolating KNOX genes from rice, we found that one gene, named OSH3, has two amino acid substitutions in three of the invariant amino acid residues in the HD of KNOX proteins. These amino acid substitutions are not universal in rice: two of the cultivars from the Indica variety of rice do not carry those substitutions but two of the cultivars from Japonica variety do. We tested the effect of these amino acid substitutions on their ability to form dimers and to induce abnormal morophologies when overexpressed in transgenic plants. We found that OSH3 without those substitutions can form dimers and can induce an abnormal phenotype in overexpression studies, and that OSH3 with those amino acid substitutions is defective in both. Based on these observations, we concluded that OSH3 from two of the cultivars from the Japonica variety could have lost its original function, or could have acquired a novel function by modifying the action of HD, or both.  相似文献   

16.
Miyazawa S 《PloS one》2011,6(3):e17244

Background

Empirical substitution matrices represent the average tendencies of substitutions over various protein families by sacrificing gene-level resolution. We develop a codon-based model, in which mutational tendencies of codon, a genetic code, and the strength of selective constraints against amino acid replacements can be tailored to a given gene. First, selective constraints averaged over proteins are estimated by maximizing the likelihood of each 1-PAM matrix of empirical amino acid (JTT, WAG, and LG) and codon (KHG) substitution matrices. Then, selective constraints specific to given proteins are approximated as a linear function of those estimated from the empirical substitution matrices.

Results

Akaike information criterion (AIC) values indicate that a model allowing multiple nucleotide changes fits the empirical substitution matrices significantly better. Also, the ML estimates of transition-transversion bias obtained from these empirical matrices are not so large as previously estimated. The selective constraints are characteristic of proteins rather than species. However, their relative strengths among amino acid pairs can be approximated not to depend very much on protein families but amino acid pairs, because the present model, in which selective constraints are approximated to be a linear function of those estimated from the JTT/WAG/LG/KHG matrices, can provide a good fit to other empirical substitution matrices including cpREV for chloroplast proteins and mtREV for vertebrate mitochondrial proteins.

Conclusions/Significance

The present codon-based model with the ML estimates of selective constraints and with adjustable mutation rates of nucleotide would be useful as a simple substitution model in ML and Bayesian inferences of molecular phylogenetic trees, and enables us to obtain biologically meaningful information at both nucleotide and amino acid levels from codon and protein sequences.  相似文献   

17.
We have used analysis of variance to partition the variation in synonymous and amino acid substitution rates between three effects (gene, lineage, and a gene-by-lineage interaction) in mammalian nuclear and mitochondrial genes. We find that gene effects are stronger for amino acid substitution rates than for synonymous substitution rates and that lineage effects are stronger for synonymous substitution rates than for amino acid substitution rates. Gene-by-lineage interactions, equivalent to overdispersion corrected for lineage effects, are found in amino acid substitutions but not in synonymous substitutions. The variance in the ratio of amino acid and synonymous substitution rates is dominated by gene effects, but there is also a significant gene-by-lineage interaction.  相似文献   

18.
Olson MA 《Biophysical journal》2001,81(4):1841-1853
The problem of calculating binding affinities of protein-RNA complexes is addressed by analyzing a computational strategy of modeling electrostatic free energies based on a nonlinear Poisson-Boltzmann (NLPB) model and linear response approximation (LRA). The underlying idea is to treat binding as a two-step process. Solutions to the NLPB equation calculate free energies arising from electronic polarizability and the LRA is constructed from molecular dynamics simulations to model reorganization free energies due to conformational transitions. By implementing a consistency condition of requiring the NLPB model to reproduce the solute-solvent free-energy transitions determined by the LRA, a "macromolecule dielectric constant" (epsilon(m)) for treating reorganization is obtained. The applicability of this hybrid approach was evaluated by calculating the absolute free energy of binding and free-energy changes for amino acid substitutions in the complex between the U1A spliceosomal protein and its cognate RNA hairpin. Depending on the residue substitution, epsilon(m) varied from 3 to 18, and reflected dipolar reorientation not included in the polarization modeled by epsilon(m) = 2. Although the changes in binding affinities from substitutions modeled strictly at the implicit level by the NLPB equation with epsilon(m) = 4 reproduced the experimental values with good overall agreement, substitutions problematic to this simple treatment showed significant improvement when solved by the NLPB-LRA approach.  相似文献   

19.
Enveloped virus entry into host cells is typically initiated by an interaction between a viral envelope glycoprotein and a host cell receptor. For budded virions of the baculovirus Autographa californica multicapsid nucleopolyhedrovirus, the envelope glycoprotein GP64 is involved in host cell receptor binding, and GP64 is sufficient to mediate low-pH-triggered membrane fusion. To better define the role of GP64 in receptor binding, we generated and characterized a panel of antisera against subdomains of GP64. Eight subdomain-specific antisera were generated, and their reactivities with GP64 proteins and neutralization of virus infectivity and binding were examined. Antibodies directed against the N-terminal region of GP64 (amino acids 21 to 159) showed strong neutralization of infectivity and effectively inhibited binding of (35)S-labeled budded virions to Sf9 cells. In addition, we generated virions displaying truncated GP64 constructs. A construct displaying the N-terminal 274 amino acids (residues 21 to 294) of the ectodomain was sufficient to mediate virion binding. Additional studies of antisera directed against small subdomains revealed that an antiserum against a 40-amino-acid region (residues 121 to 160) neutralized virus infectivity. Site-directed mutagenesis was subsequently used for functional analysis of that region. Recombinant viruses expressing GP64 proteins with single amino acid substitutions within amino acids 120 to 124 and 142 to 148 replicated to high titers, suggesting that those amino acids were not critical for receptor binding or other important GP64 functions. In contrast, GP64 proteins with single amino acid substitutions of residues 153 and 156 were unable to substitute for wild-type GP64 and did not rescue a gp64 knockout virus. Further analysis showed that these substitutions substantially reduced binding of recombinant virus to Sf9 cells. Thus, the amino acid region from positions 21 to 159 was identified as a putative receptor binding domain, and amino acids 153 and 156 appear to be important for receptor binding.  相似文献   

20.
The envelope glycoprotein of human immunodeficiency virus type 1 (HIV-1) interacts with receptors on the target cell and mediates virus entry by fusing the viral and cell membranes. To maintain the viral infectivity, amino acids that interact with receptors are expected to be more conserved than the other sites on the protein surface. In contrast to the functional constraint of amino acids for the receptor binding, some amino acid changes in this protein may produce antigenic variations that enable the virus to escape from recognition of the host immune system. Therefore, both positive selection (higher fitness) and negative selection (lower fitness) against amino acid changes are taking place during evolution of surface proteins of parasites To elucidate the evolutionary mechanisms of the whole HIV-1 gp120 envelope glycoprotein at the single site level, we collected and analyzed all available sequence data for the protein. By analyzing 186 sequences of the HIV-1 gp120 (subtype B), we reevaluated amino acid variability at the single site level, and estimated the numbers of synonymous and nonsynonymous substitutions at each codon position to detect positive and negative selection. We identified 33 amino acid positions which may be under positive selection. Some of these positions may form discontinuous epitopes. We also analyzed amino acid sequences to find amino acid positions responsible for usage of the second receptor. We found that, in addition to the V3 loop, amino acid variation at residue 440 in C4 region is clearly linked with the usage of CXCR 4.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号