Similar literature
 20 similar documents found (search time: 31 ms)
1.
The ability of the principle of parsimony to accurately reconstruct molecular evolutionary pathways from an analysis of amino acid or nucleic acid sequences from extant organisms is tested by direct comparison with a known pathway. Topological errors occur under specified conditions. Importantly, given no errors in the topology, and error-free experimental sequences, the ancestral sequences inferred by the parsimony principle err significantly, the magnitude of the error increasing with the distance of the nodal sequence from the present. These errors are irreducible as an inherent consequence of any evolutionary process in which chance processes operate within the constraints imposed by Darwinian selection. Formulae are derived which predict the errors in the ancestral sequences from a knowledge of only the internodal distances. The parsimony solution is not a reliably good solution. It is necessary to develop a detailed understanding of the interaction between chance processes and natural selection to further advance our understanding of molecular change in proteins and nucleic acids.  相似文献   
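
As a rough illustration of the kind of inference this abstract evaluates, the sketch below runs the bottom-up pass of Fitch's small-parsimony algorithm for a single aligned site on a rooted binary tree. The abstract does not say which parsimony procedure was used, so the algorithm choice, the tuple-based tree encoding, and the names `fitch_sets`, `example_tree`, and `example_states` are all assumptions made for illustration.

```python
from typing import Dict, Set, Tuple, Union

# Hypothetical tree encoding: a leaf is a taxon name, an internal node is a pair.
Tree = Union[str, Tuple["Tree", "Tree"]]


def fitch_sets(tree: Tree, leaf_states: Dict[str, str]) -> Tuple[Set[str], int]:
    """Bottom-up Fitch pass for one site.

    Returns the set of equally parsimonious states at this node and the
    minimum number of changes required in the subtree below it.
    """
    if isinstance(tree, str):
        # A leaf carries its observed residue and needs no changes.
        return {leaf_states[tree]}, 0
    left_set, left_cost = fitch_sets(tree[0], leaf_states)
    right_set, right_cost = fitch_sets(tree[1], leaf_states)
    common = left_set & right_set
    if common:
        # The children agree on at least one state: no extra change needed.
        return common, left_cost + right_cost
    # The children disagree: keep both options and count one change.
    return left_set | right_set, left_cost + right_cost + 1


# Illustrative data only: four extant sequences observed at one site.
example_tree: Tree = (("A", "B"), ("C", "D"))
example_states = {"A": "K", "B": "K", "C": "R", "D": "K"}
root_set, changes = fitch_sets(example_tree, example_states)
print(root_set, changes)
```

The pass returns only the most parsimonious state set at the root. It carries no measure of how likely that inference is to be wrong, which is the kind of error the abstract reports as growing with distance from the present.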

2.
Ancestral sequence reconstruction has had recent success in decoding the origins and the determinants of complex protein functions. However, phylogenetic analyses of remote homologues must handle extreme amino acid sequence diversity resulting from extended periods of evolutionary change. We exploited the wealth of protein structures to develop an evolutionary model based on protein secondary structure. The approach follows the differences between discrete secondary structure states observed in modern proteins and those hypothesized in their immediate ancestors. We implemented maximum likelihood-based phylogenetic inference to reconstruct ancestral secondary structure. The predictive accuracy from the use of the evolutionary model surpasses that of comparative modeling and sequence-based prediction; the reconstruction extracts information not available from modern structures or the ancestral sequences alone. Based on a phylogenetic analysis of a sequence-diverse protein family, we showed that the model can highlight relationships that are evolutionarily rooted in structure and not evident in amino acid-based analysis.  相似文献   

3.
Phylogenetic analyses of three families of arthropod apyrases were used to reconstruct the evolutionary relationships of salivary-expressed apyrases, which have an anti-coagulant function in blood-feeding arthropods. Members of the 5′-nucleotidase family were recruited for salivary expression in blood-feeding species at least five separate times in the history of arthropods, while members of the Cimex-type apyrase family have been recruited at least twice. In spite of these independent events of recruitment for salivary function, neither of these families showed evidence of convergent amino acid sequence evolution in salivary-expressed members. On the contrary, in the 5′-nucleotidase family, salivary-expressed proteins conserved ancestral amino acid residues to a significantly greater extent than related proteins without salivary function, implying parallel evolution by conservation of ancestral characters. This unusual pattern of sequence evolution suggests the hypothesis that purifying selection favoring conservation of ancestral residues is particularly strong in salivary-expressed members of the 5′-nucleotidase family of arthropods because of constraints arising from expression within the vertebrate host.

4.
Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life.  相似文献   

5.
The role of sequence divergence in functional divergence of duplicate genes is a topic of great interest. In this study, we compare the numbers of amino acid substitutions in each sequence since two yeast duplicates diverged, using a preduplication ancestral outgroup. Using this strategy, we explored the relationship between sequence divergence and functional divergence between duplicate partners. We show that the degree of relative functional asymmetry between duplicate proteins is proportional to the relative sequence divergence between them. Furthermore, of the two duplicates, the copy closer to their ancestral sequence (the one with fewer amino acid substitutions) interacts with more proteins and affects fitness more severely when deleted. Therefore, asymmetric sequence divergence between duplicates is correlated with asymmetric functional divergence and may underlie the duplicate's role in genetic robustness against mutations. Among the functional traits considered, protein abundance appears to have the strongest correlation with the nonsynonymous divergence between duplicates. Taken together with the results from whole-genome analyses, our results indicate that within-species duplicates are subject to the same evolutionary force that acts on interspecific sequence and functional divergence. In particular, we detect signs of purifying selection on the more slowly evolving duplicate.

6.
Several lines of evidence such as the basal location of thermophilic lineages in large-scale phylogenetic trees and the ancestral sequence reconstruction of single enzymes or large protein concatenations support the conclusion that the ancestors of the bacterial and archaeal domains were thermophilic organisms which were adapted to hot environments during the early stages of the Earth. A parsimonious reasoning would therefore suggest that the last universal common ancestor (LUCA) was also thermophilic. Various authors have used branch-wise non-homogeneous evolutionary models that better capture the variation of molecular compositions among lineages to accurately reconstruct the ancestral G + C contents of ribosomal RNAs and the ancestral amino acid composition of highly conserved proteins. They confirmed the thermophilic nature of the ancestors of Bacteria and Archaea but concluded that LUCA, their last common ancestor, was a mesophilic organism having a moderate optimal growth temperature. In this letter, we investigate the unknown nature of the phylogenetic signal that informs ancestral sequence reconstruction to support this non-parsimonious scenario. We find that rate variation across sites of molecular sequences provides information at different time scales by recording the oldest adaptation to temperature in slow-evolving regions and subsequent adaptations in fast-evolving ones.  相似文献   

7.
Adaptive evolution at the molecular level can be studied by detecting convergent and parallel evolution at the amino acid sequence level. For a set of homologous protein sequences, the ancestral amino acids at all interior nodes of the phylogenetic tree of the proteins can be statistically inferred. The amino acid sites that have experienced convergent or parallel changes on independent evolutionary lineages can then be identified by comparing the amino acids at the beginning and end of each lineage. At present, the efficiency of the methods of ancestral sequence inference in identifying convergent and parallel changes is unknown. More seriously, when we identify convergent or parallel changes, it is unclear whether these changes are attributable to random chance. For these reasons, claims of convergent and parallel evolution at the amino acid sequence level have been disputed. We have conducted computer simulations to assess the efficiencies of the parsimony and Bayesian methods of ancestral sequence inference in identifying convergent and parallel-change sites. Our results showed that the Bayesian method performs better than the parsimony method in identifying parallel changes, and both methods are inefficient in identifying convergent changes. However, the Bayesian method is recommended for estimating the number of convergent-change sites because it gives a conservative estimate. We have developed statistical tests for examining whether the observed numbers of convergent and parallel changes are due to random chance. As an example, we reanalyzed the stomach lysozyme sequences of foregut fermenters and found that parallel evolution is statistically significant, whereas convergent evolution is not well supported.
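
The comparison of amino acids at the beginning and end of each lineage can be sketched as a small classifier. The abstract does not spell out the definitions, so this sketch assumes the usual ones: a parallel change is the same derived residue arising from the same ancestral residue on two independent lineages, and a convergent change is the same derived residue arising from different ancestral residues. The function name `classify_site` and its arguments are hypothetical.

```python
def classify_site(anc1: str, desc1: str, anc2: str, desc2: str) -> str:
    """Classify one site across two independent lineages.

    anc1/desc1 are the residues at the start and end of lineage 1,
    anc2/desc2 those of lineage 2 (ancestral states are inferred upstream).
    """
    changed_on_both = anc1 != desc1 and anc2 != desc2
    if not changed_on_both or desc1 != desc2:
        # Either a lineage did not change, or the lineages ended in different states.
        return "neither"
    # Both lineages changed and ended in the same residue.
    return "parallel" if anc1 == anc2 else "convergent"


# Illustrative residues only, not data from the study.
print(classify_site("D", "N", "D", "N"))  # same start, same end
print(classify_site("D", "N", "E", "N"))  # different start, same end
print(classify_site("D", "D", "E", "N"))  # lineage 1 unchanged
```

A classifier like this only labels sites. Whether the number of labelled sites exceeds what chance would produce is the separate statistical question the abstract addresses.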

8.
The phylogenetic inference of ancestral protein sequences is a powerful technique for the study of molecular evolution, but any conclusions drawn from such studies are only as good as the accuracy of the reconstruction method. Every inference method leads to errors in the ancestral protein sequence, resulting in potentially misleading estimates of the ancestral protein's properties. To assess the accuracy of ancestral protein reconstruction methods, we performed computational population evolution simulations featuring near-neutral evolution under purifying selection, speciation, and divergence using an off-lattice protein model where fitness depends on the ability to be stable in a specified target structure. We were thus able to compare the thermodynamic properties of the true ancestral sequences with the properties of “ancestral sequences” inferred by maximum parsimony, maximum likelihood, and Bayesian methods. Surprisingly, we found that methods such as maximum parsimony and maximum likelihood that reconstruct a “best guess” amino acid at each position overestimate thermostability, while a Bayesian method that sometimes chooses less-probable residues from the posterior probability distribution does not. Maximum likelihood and maximum parsimony apparently tend to eliminate variants at a position that are slightly detrimental to structural stability simply because such detrimental variants are less frequent. Other properties of ancestral proteins might be similarly overestimated. This suggests that ancestral reconstruction studies require greater care to come to credible conclusions regarding functional evolution. Inferred functional patterns that mimic reconstruction bias should be reevaluated.  相似文献   

9.
The group-specific component (Gc) is a plasma protein that binds vitamin D. Recent characterization of human Gc cDNA demonstrated homology with serum albumin and alpha-fetoprotein. This study compares the sequences of the three proteins and demonstrates a strong evolutionary relationship. Albumin, alpha-fetoprotein and Gc evolved from an ancestral gene containing an intragenic triplication. Comparison of the amino acid sequences and patterns of double disulfide bonds suggests that the Gc gene may have diverged from an ancestral gene earlier in evolution than the genes encoding albumin and alpha-fetoprotein. Analysis of the amino acid and nucleotide sequences of the three internal domains of Gc revealed 19-23% amino acid sequence identity and the localization of three homology blocks with 40-44% nucleotide sequence identity. The deduced amino acid sequence of Gc furnished data for comparing its molecular configuration, based on the predicted secondary structure, with those predicted for human albumin and alpha-fetoprotein. Utilization of Gc cDNA has also led to the identification of its genomic DNA and detection of a human DNA polymorphism.

10.
Pig plasma gelsolin (Mr = 81595; 739 residues) contains 704 identical residues out of a maximum 730 when compared to the cytoplasmic form of human gelsolin. The cDNA sequence also codes for a peptide of 33 residues N-terminal to the nine-residue plasma extension sequence previously reported: these 33 residues are highly homologous to the human signal peptide and plasma extension. Comparison of the gelsolin sequences with chicken brush border villin, severin from Dictyostelium discoideum and fragmin from Physarum polycephalum shows a strong evolutionary relationship between all these proteins. There are six large repeating segments in gelsolin and villin, and three similar segments in severin and fragmin. Although these multiple repeats cannot be related to any known function of these actin-severing proteins, this superfamily of proteins appears to have evolved from an ancestral sequence of 120 to 130 amino acid residues.  相似文献   

11.
Crystal structure of the ribosomal protein S6 from Thermus thermophilus.   Total citations: 1, self-citations: 1, citations by others: 0
The amino acid sequence and crystal structure of the ribosomal protein S6 from the small ribosomal subunit of Thermus thermophilus have been determined. S6 is a small protein with 101 amino acid residues. The 3D structure, which was determined to 2.0 Å resolution, consists of a four-stranded anti-parallel beta-sheet with two alpha-helices packed on one side. Similar folding patterns have been observed for other ribosomal proteins and may suggest an original RNA-interacting motif. Related topologies are also found in several other nucleic acid-interacting proteins. Based on the assumption that the structure of the ribosome was established early in molecular evolution, the possibility should be considered that an ancestral RNA-interacting motif in ribosomal proteins is the evolutionary origin of the nucleic acid-interacting domain in large classes of ribonucleic acid binding proteins.

12.
Immunological comparisons of higher plant plastocyanins   Total citations: 1, self-citations: 0, citations by others: 1
Antisera were prepared in rabbits to purified plastocyanins of Spinacia oleracea and Urtica dioica. Using the method of micro-complement fixation, the immunological cross-reactivity of these antisera with plastocyanins from 37 species of plants was determined. Cross-reactivity between antisera to spinach plastocyanin and 11 plastocyanins from other plant species showed a positive correlation with distance on an ancestral amino acid sequence affinity tree constructed by the method of Dayhoff and Eck [1]. The importance of serological data as a supplement to amino acid sequence data in evolutionary studies is discussed.  相似文献   

13.
MOTIVATION: Knowledge of how proteomic amino acid composition has changed over time is important for constructing realistic models of protein evolution and increasing our understanding of molecular evolutionary history. The proteomic amino acid composition of the Last Universal Ancestor (LUA) of life is of particular interest, since that might provide insight into the early evolution of proteins and the nature of the LUA itself. RESULTS: We introduce a method to estimate ancestral amino acid composition that is based on expectation-maximization. On simulated data, the approach was found to be very effective in estimating ancestral amino acid composition, with accuracy improving as the number of residues in the dataset was increased. The method was then used to infer the amino acid composition of a set of proteins in the LUA. In general, as compared with the modern protein set, LUA proteins were found to be richer in amino acids that are believed to have been most abundant in the prebiotic environment and poorer in those believed to have been unavailable or scarce. Additionally, we found the inferred amino acid composition of this protein set in the LUA to be more similar to the observed composition of the same set in extant thermophilic species than in extant mesophilic species, supporting the idea that the LUA lived in a thermophilic environment. AVAILABILITY: The program is available at http://compbio.cs.princeton.edu/ancestralaa  相似文献   

14.
To examine further the dependence of immunological cross-reactivity on sequence resemblance among proteins, we carried out micro-complement fixation studies with rabbit antisera to bacterial azurins of known amino acid sequence. There is a strong correlation (r = 0.9) between number of amino acid substitutions and degree of antigenic difference (immunological distance) among these azurins. The antigenic effects of amino acid substitutions are thus approximately equal and approximately additive. Similar observations and inferences were made before with a series of bird lysozymes. Indeed, the same approximate relationship between immunological distance (y) and percent difference in amino acid sequence (x) holds for both azurins and lysozymes, namely y ≈ 5x. An explanation is given for the dependence of immunological cross-reactivity on sequence resemblance among proteins. This entails reviewing evidence regarding the nature and number of antigenic sites on globular protein antigens as well as evidence for the existence of evolutionary biases against substitutions that are internal or cause large conformational changes. The explanation we give may apply only to those naturally occurring, globular, monomeric, isofunctional proteins whose sequences differ substantially from that of any rabbit protein.
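
The approximate linear relationship reported here, with immunological distance (y) roughly five times the percent sequence difference (x), can be worked through numerically. The sketch below is an illustration only: the function names and example sequences are hypothetical, the sequences are assumed to be already aligned and of equal length, and the slope of 5 is the abstract's approximate empirical value, not an exact constant.

```python
def percent_difference(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions at which two sequences differ."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be aligned, equal-length, and non-empty")
    differing = sum(a != b for a, b in zip(seq_a, seq_b))
    return 100.0 * differing / len(seq_a)


def approx_immunological_distance(percent_diff: float, slope: float = 5.0) -> float:
    """Rough estimate y = slope * x; slope 5 is the approximate value reported."""
    return slope * percent_diff


# Made-up ten-residue sequences differing at one position.
x = percent_difference("ACDEFGHIKL", "ACDEFGHIKV")
y = approx_immunological_distance(x)
print(x, y)
```

Because the abstract reports a correlation of about r = 0.9, any value computed this way is a rough estimate, and the authors restrict the relationship to globular, monomeric, isofunctional proteins.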

15.
16.
17.
Despite similarities in their enzymic properties, diphtheria toxin (DT) and exotoxin A (ETA) of Pseudomonas aeruginosa have major differences in structure and action: consequently, the question of possible evolutionary relatedness of these two proteins remains unanswered. Here we report the existence of significant amino acid sequence homology between the enzymic domain of DT and that of ETA. Major segments of sequence may be aligned with high percentages of identity and of conservative substitutions. The homologous stretches in ETA form much of the active-site cleft in the X-ray crystallographic structure. This evidence implies that these domains, at least, have diverged from a common ancestral protein and that active-site residues have been strongly conserved.  相似文献   

18.
We have recently developed a new method for designing thermostable proteins using phylogenetic trees of enzymes. In this study, we investigated a method for designing proteins with improved stability using 3-isopropylmalate dehydrogenase (IPMDH) from Thermus thermophilus as a model enzyme. We designed 12 mutant enzymes, each having an ancestral amino acid residue that was present in the common ancestor of Bacteria and Archaea. At least six of the 12 ancestral mutants tested showed thermal stability higher than that of the original enzyme. The results supported the hyperthermophilic universal ancestor hypothesis. The effect of ancestral residues on IPMDHs of several organisms and on the related enzyme isocitrate dehydrogenase was summarised and analysed. The effect of an ancestral residue on thermostability did not depend on the degree of conservation of the residue at the site, suggesting that the stabilisation of these mutant proteins is not related to sequence conservation but to the antiquity of the introduced residues. The results suggest also that this method could be an efficient way of designing mutant enzymes with higher thermostability based only on the primary structure and a phylogenetic tree.  相似文献   

19.
DNA-binding proteins are crucial for various cellular processes and hence have become an important target for both basic research and drug development. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to establish an automated method for rapidly and accurately identifying DNA-binding proteins based on their sequence information alone. Owing to the fact that all biological species have developed beginning from a very limited number of ancestral species, it is important to take into account the evolutionary information in developing such a high-throughput tool. In view of this, a new predictor was proposed by incorporating the evolutionary information into the general form of pseudo amino acid composition via the top-n-gram approach. It was observed by comparing the new predictor with the existing methods via both jackknife test and independent data-set test that the new predictor outperformed its counterparts. It is anticipated that the new predictor may become a useful vehicle for identifying DNA-binding proteins. It has not escaped our notice that the novel approach to extract evolutionary information into the formulation of statistical samples can be used to identify many other protein attributes as well.  相似文献   

20.
In this study, we used a computational approach to investigate the early evolutionary history of a system of proteins that, together, embed and translocate other proteins across cell membranes. Cell membranes comprise the basis for cellularity, which is an ancient, fundamental organizing principle shared by all organisms and a key innovation in the evolution of life on Earth. Two related requirements for cellularity are that organisms are able to both embed proteins into membranes and translocate proteins across membranes. One system that accomplishes these tasks is the signal recognition particle (SRP) system, in which the core protein components are the paralogs, FtsY and Ffh. Complementary to the SRP system is the Sec translocation channel, in which the primary channel-forming protein is SecY. We performed phylogenetic analyses that strongly supported prior inferences that FtsY, Ffh, and SecY were all present by the time of the last universal common ancestor of life, the LUCA, and that the ancestor of FtsY and Ffh existed before the LUCA. Further, we combined ancestral sequence reconstruction and protein structure and function prediction to show that the LUCA had an SRP system and Sec translocation channel that were similar to those of extant organisms. We also show that the ancestor of Ffh and FtsY that predated the LUCA was more similar to FtsY than Ffh but could still have comprised a rudimentary protein translocation system on its own. Duplication of the ancestor of FtsY and Ffh facilitated the specialization of FtsY as a membrane bound receptor and Ffh as a cytoplasmic protein that could bind nascent proteins with specific membrane-targeting signal sequences. Finally, we analyzed amino acid frequencies in our ancestral sequence reconstructions to infer that the ancestral Ffh/FtsY protein likely arose prior to or just after the completion of the canonical genetic code. Taken together, our results offer a window into the very early evolutionary history of cellularity.
