Similar articles
1.
Estimation of evolutionary distances between nucleotide sequences (cited 11 times: 0 self-citations, 11 citations by others)
A formal mathematical analysis of the substitution process in nucleotide sequence evolution was carried out in terms of a Markov process. Using matrix algebra, the theoretical foundation of the methods of Barry and Hartigan (Stat. Sci. 2:191–210, 1987) and Lanave et al. (J. Mol. Evol. 20:86–93, 1984) was provided. Extensive computer simulation was used to compare the accuracy and effectiveness of various methods for estimating the evolutionary distance between two nucleotide sequences. It was shown that the multiparameter methods of Lanave et al. (J. Mol. Evol. 20:86–93, 1984), Gojobori et al. (J. Mol. Evol. 18:414–422, 1982), and Barry and Hartigan (Stat. Sci. 2:191–210, 1987) are preferable to others for the purpose of phylogenetic analysis when the sequences are long. However, when sequences are short and the evolutionary distance is large, Tajima and Nei's (Mol. Biol. Evol. 1:269–285, 1984) method is superior to others.

2.
A method for estimating the numbers of synonymous (Ks) and nonsynonymous (Ka) substitutions per site is proposed. The method is based on those of Li (J. Mol. Evol. 36:96–99, 1993) and Pamilo and Bianchi (Mol. Biol. Evol. 10:271–281, 1993), but corrects a putative source of bias. It is proposed that the number of synonymous substitutions that are actually transitions or transversions should be computed by separating the twofold degenerate sites into two types of sites, 2S-fold and 2V-fold, where only transitional and transversional substitutions are synonymous, respectively. Kimura's (J. Mol. Evol. 16:111–120, 1980) two-parameter method of correcting for multiple substitutions at a site is then applied using the overall observed synonymous transversion frequency to estimate both the numbers of synonymous transversional (Bs) and transitional (As) substitutions per site. This approach, therefore, also minimizes stochastic errors. Computer simulations indicate that the method presented gives more accurate Ks and Ka estimates than the aforementioned methods. Furthermore, it is proposed that confidence intervals for divergence estimates be obtained by computer simulation.
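
The Kimura two-parameter correction mentioned in this abstract can be illustrated with a minimal Python sketch. It shows only the generic correction applied to an observed transition proportion `p` and transversion proportion `q`; it is not the authors' full Ks/Ka procedure, which additionally partitions sites by degeneracy class. The function name `k2p_components` and the argument names are assumptions made for illustration.

```python
import math


def k2p_components(p, q):
    """Kimura two-parameter correction for multiple substitutions.

    p: observed proportion of sites differing by a transition
    q: observed proportion of sites differing by a transversion
    Returns (A, B, K): transitional, transversional and total
    substitutions per site. Names are illustrative assumptions.
    """
    a = 1.0 - 2.0 * p - q
    b = 1.0 - 2.0 * q
    if a <= 0.0 or b <= 0.0:
        # The logarithms are undefined: the sequences are too divergent.
        raise ValueError("correction undefined for these proportions")
    transversional = 0.5 * math.log(1.0 / b)
    transitional = 0.5 * math.log(1.0 / a) - 0.25 * math.log(1.0 / b)
    return transitional, transversional, transitional + transversional


if __name__ == "__main__":
    A, B, K = k2p_components(0.10, 0.05)
    print(f"A={A:.4f} B={B:.4f} K={K:.4f}")
```

The total `K` returned here equals the familiar closed form -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)], so the split into transitional and transversional parts can be checked against it.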

3.
Recent studies of the structure of Type I collagen fibrils (Piez and Trus, Biosci. Rep. 1:801–810, 1981; Fraser, MacRae, Miller and Suzuki, J. Mol. Biol. 167:497–521, 1983) suggest that the segments of the collagen molecule which comprise the gap region are more mobile than those which comprise the overlap region. We have analyzed the distribution of amino acid residues and triplet types between the two regions, and find significantly non-uniform distributions for Ala, Gln, His, Hyp, Leu, Phe, and Tyr, and for triplets containing two imino acid residues. Taken together with the lower packing density in the gap region, these observations provide a basis for understanding the greater mobility of the molecular segments in the gap region. In addition, we have examined the linear distribution of residue types in the two regions and also the hydropathy profile (Kyte and Doolittle, J. Mol. Biol. 157:105–113, 1982). These reveal a segment of the gap region comprising helical residues 165–173, 399–407, 633–641 and 867–875 which has the highest hydropathy index, is devoid of charged residues, and contains very high proportions of Ala, Hyp and Phe.
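
A hydropathy profile of the kind referred to in this abstract is a sliding-window average of per-residue hydropathy values. The sketch below uses the standard Kyte and Doolittle scale for the 20 common amino acids; the function name `hydropathy_profile` and the default window of 9 residues are assumptions chosen for illustration, and the abstract does not say how hydroxyproline (Hyp), which is not on the standard scale, was scored.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Return the mean hydropathy of each window along the sequence.

    sequence: one-letter amino acid string (standard residues only;
              any other symbol raises KeyError).
    window:   window length in residues (an illustrative default).
    """
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence.upper()]
    return [
        sum(scores[start:start + window]) / window
        for start in range(len(scores) - window + 1)
    ]


if __name__ == "__main__":
    # Hypothetical Gly-X-Y style fragment, not taken from the paper.
    for value in hydropathy_profile("GPAGFAGPPGADGQPGAK", window=9):
        print(f"{value:.2f}")
```

A segment with the highest windowed mean would correspond to the "highest hydropathy index" region described above, subject to the choice of window length.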

4.
A number of the Alu and L1 elements present within the centromeric regions of the human chromosomes have been analyzed by polymerase chain reaction amplification. The oligonucleotide primers were homologous to the 3′ end consensus sequences of either Alu or L1, used in conjunction with an oligonucleotide primer homologous to alphoid sequences specific to different chromosomes. This made it possible to detect an unusual number of Alu and L1 polymorphisms at different loci. It is proposed that this results from molecular rearrangements which occur within the α-satellite DNA in which they are embedded (Marçais et al., J. Mol. Evol. 33:42–48, 1991) and not because the centromeric regions are targets for new insertions of such elements. The same analyses were made on cosmids and YACs originating from the centromeric region of chromosome 21 as well as on a collection of somatic hybrids containing the chromosome 21 centromere as their only common human genetic material. The results were consistent with the above hypothesis. Correspondence to: G. Roizès

5.
Summary. In at least two instances involving serine proteinase inhibitors it has been shown that functionally important sites evolve faster and exhibit more interspecific variability than functionally neutral sites. Because these phenomena are difficult to reconcile with the neutral theory of molecular evolution, it has been suggested that the accelerated rate of amino acid substitution at the reactive sites is brought about by positive Darwinian selection. We show that differences in the amino acid composition in the different regions of proteinase inhibitors can account for the differences in the rates of amino acid substitution. By using an index of protein mutability [D. Graur (1985) J Mol Evol 22:53–62], we show that the amino acid composition of the reactive center in the ovomucoids and Spi-2 gene products is such that, regardless of function, they are expected to evolve more rapidly than any other polypeptide for which the rate of substitution is known. In addition, the reactive region in the Spi-2 proteins is shown to be free of compositional constraint. Positive Darwinian selection need not be invoked at the present time in these cases.

6.
Summary. NMR spectral studies on the HCN oligomers suggest the presence of carboxamide and urea groupings. The release of CO2, H2O, HCN, CH3CN, HCONH2 and pyridine on pyrolysis is consistent with the presence of these groupings as well as carboxylic acid groups. No basic primary amine groupings could be detected with fluorescamine. Hydrazinolysis of the HCN oligomers releases 10% of the amino acids normally released by acid hydrolysis. The oligomers give a positive biuret test but this is not due to the presence of peptide bonds. There is no conclusive evidence for the presence of peptide bonds in the HCN oligomers. No diglycine was detected on partial hydrolysis of the HCN oligomers at pH 8.5, suggesting that HCN oligomers were not a source of prebiotic peptides. Chemical Evolution 38. For the previous papers see Ferris JP, Rao RV, Newton TA (1979) J Org Chem 44:4378–4381, 4381–4385; Ferris JP, Edelson EH, Mount NM, Sullivan AE (1979) J Mol Evol 13:317–330

7.
The genetic code is implemented by aminoacyl-tRNA synthetases (aaRS). These 20 enzymes are divided into two classes that, despite performing the same function, have nothing in common structurally. The mystery of this striking partition of aaRSs might have been concealed in their sterically complementary modes of tRNA recognition that, as we have found recently, protect the tRNAs with complementary anticodons from confusion in translation. This finding implies that, in the beginning, life increased its coding repertoire by pairs of complementary codons (rather than one by one) and used both complementary strands of genes as templates for translation. The class I and class II aaRSs may represent one of the most important examples of such primordial sense–antisense (SAS) coding (Rodin and Ohno, Orig Life Evol Biosph 25:565–589, 1995). In this report, we address the issue of SAS coding in a wider scope. We suggest a variety of advantages that such coding would have had in exploring a wider sequence space before translation became highly specific. In particular, we confirm that in Achlya klebsiana a single gene might have originally coded for an HSP70 chaperonin (class II aaRS homolog) and an NAD-specific GDH-like enzyme (class I aaRS homolog) via its sense and antisense strands. Thus, in contrast to the conclusions in Williams et al. (Mol Biol Evol 26:445–450, 2009), this could indeed be a “Rosetta stone” gene (Carter and Duax, Mol Cell 10:705–708, 2002), albeit a somewhat eroded one, for the SAS origin of the two aaRS classes.

8.
Although mitochondria and chloroplasts have their own genomes, 99% of mitochondrial proteins (Rehling et al., Nat Rev Mol Cell Biol 5:519–530, 2004) and more than 95% of chloroplast proteins (Soll, Curr Opin Plant Biol 5:529–535, 2002) are encoded by nuclear DNA, synthesised in the cytosol and imported post-translationally. Protein targeting to these organelles depends on cytosolic targeting factors, which bind to the precursor, and then interact with membrane receptors to deliver the precursor into a translocase. The molecular chaperones Hsp70 and Hsp90 have been widely implicated in protein targeting to mitochondria and chloroplasts, and receptors capable of recognising these chaperones have been identified at the surface of both these organelles (Schlegel et al., Mol Biol Evol 24:2763–2774, 2007). The role of these chaperone receptors is not fully understood, but they have been shown to increase the efficiency of protein targeting (Young et al., Cell 112:41–50, 2003; Qbadou et al., EMBO J 25:1836–1847, 2006). Whether these receptors contribute to the specificity of targeting is less clear. A class of chaperone receptors bearing tetratricopeptide repeat domains is able to specifically bind the highly conserved C terminus of Hsp70 and/or Hsp90. Interestingly, at least one of these chaperone receptors can be found on each organelle (Schlegel et al., Mol Biol Evol 24:2763–2774, 2007), which suggests a universal role in protein targeting for these chaperone receptors. This review will investigate the role that chaperone receptors play in targeting efficiency and specificity, as well as examining recent in silico approaches to find novel chaperone receptors.

9.
We have recently reported a method to identify the shortest possible phylogenetic tree for a set of protein sequences [Foulds, Hendy & Penny (1979) J. Mol. Evol. 13:127–150; Foulds, Penny & Hendy (1979) J. Mol. Evol. 13:151–166]. The present paper discusses issues that arise during the construction of minimal phylogenetic trees from protein-sequence data. The conversion of the data from amino acid sequences into nucleotide sequences is shown to be advantageous. A new variation of a method for constructing a minimal tree is presented. Our previous methods have involved first constructing a tree and then either proving that it is minimal or transforming it into a minimal tree. The approach presented in the present paper progressively builds up a tree, taxon by taxon. We illustrate this approach by using it to construct a minimal tree for ten mammalian haemoglobin alpha-chain sequences. Finally we define a measure of the complexity of the data and illustrate a method to derive a directed phylogenetic tree from the minimal tree.

10.
We previously reported the sequence of a 9260-bp fragment of mitochondrial (mt) DNA of the cephalopod Loligo bleekeri [J. Sasuga et al. (1999) J. Mol. Evol. 48:692–702]. To clarify further the characteristics of Loligo mtDNA, we have sequenced an 8148-bp fragment to reveal the complete mt genome sequence. Loligo mtDNA is 17,211 bp long and possesses a standard set of metazoan mt genes. Its gene arrangement is not identical to any other metazoan mt gene arrangement reported so far. Three of the 19 noncoding regions longer than 10 bp are 515, 507, and 509 bp long, and their sequences are nearly identical, suggesting that multiplication of these noncoding regions occurred in an ancestral Loligo mt genome. Comparison of the gene arrangements of Loligo, Katharina tunicata, and Littorina saxatilis mt genomes revealed that 17 tRNA genes of the Loligo mt genome are adjacent to noncoding regions. A majority (15 tRNA genes) of their counterparts are found in two tRNA gene clusters of the Katharina mt genome. Therefore, the tRNA genes of the Loligo mt genome (17 tRNA genes) may have spread over the genome, and this may have been coupled with the multiplication of the noncoding regions. Maximum likelihood analysis of mt protein genes supports the clade Mollusca + Annelida + Brachiopoda but fails to infer the relationships among Katharina, Loligo, and three gastropod species. Received: 9 May 2001 / Accepted: 3 October 2001

11.
Six new species are described: Gagea anonyma, G. Staintonii, G. siphonantha, G. Grey-Wilsonii, G. chloroneura. All belong to subgen. Platyspermum (Boiss.) Miscz. Florae Iranicae praecursores 63–68. Praecursores praecurrentes: Pl. Syst. Evol. 151, 281–293 (1986).

12.
The insertion-deletion model developed by Thorne, Kishino and Felsenstein (1991, J. Mol. Evol., 33, 114–124; the TKF91 model) provides a statistical framework for the alignment of two sequences. The statistical alignment of a set of sequences related by a star tree is a generalization of this model. The known algorithm computes the probability of a set of such sequences in $O(l^{2k})$ time, where $l$ is the geometric mean of the sequence lengths and $k$ is the number of sequences. An improved algorithm is presented whose running time is only $O(2^{2k} l^{k})$.

13.
A recently identified Alu element (Leeflang et al., J. Mol. Evol. 37:559–565, 1993), referred to as the putative founder of the HS (PV) subfamily, was found to be present at orthologous loci in the human, chimpanzee, gorilla, and gibbon lineages. The evolution of this Alu suggested that it is a source gene in the evolution of Alu family repeats for one of the most recent subfamilies, HS. We have determined that this putative founder of the HS subfamily was not present at the orthologous loci in older primates, including Old World and New World monkeys. Thus, this particular Alu locus has only been responsible for the establishment of a very small subfamily of Alu sequences. We have further demonstrated that this putative founder Alu was not responsible for the de novo Alu insertion into the neurofibromatosis-1 gene of an individual causing neurofibromatosis. Our data demonstrate that although the putative founder of the HS subfamily found by Leeflang et al. (1993) probably gave rise to one of the most recent subfamilies of Alu sequences, it has not been very active in retroposition. Correspondence to: T.H. Shaikh

14.
The kinetics of synonymous codon change and species divergence is described in a matrix formalism that is equally applicable to all levels of codon degeneracy and all levels of codon or nucleotide bias. Based on the formalism it is possible to calculate the sum of all the synonymous substitution rate constants from the observed sequence differences between two species. This sum, the relaxation rate, is equivalent to the LogDet transformation that has recently been proposed as a new measure of evolutionary distance (Lockhart et al., Mol. Biol. Evol. 11(4):605–612, 1994). The relationship between this measure and the average number of base changes per site (K) is discussed. The formalism is tested on some sets of simulated sequence divergence data.
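
As a rough illustration of the LogDet idea mentioned in this abstract, the sketch below builds the 4 × 4 matrix of joint nucleotide frequencies for two aligned sequences and takes the negative logarithm of its determinant. This is one common unnormalized form of the transformation and is an assumption here; it is not the codon-level matrix formalism of the paper, and in this form the value for two identical sequences is not zero. The function names `determinant` and `logdet_distance` are illustrative.

```python
import math

BASES = "ACGT"


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    n = len(m)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-15:
            return 0.0
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for row in range(col + 1, n):
            factor = m[row][col] / m[col][col]
            for k in range(col, n):
                m[row][k] -= factor * m[col][k]
    return det


def logdet_distance(seq1, seq2):
    """Unnormalized LogDet value, -ln det(F), for two aligned sequences.

    F[i][j] is the proportion of sites with base i in seq1 and base j
    in seq2; sites with gaps or ambiguity codes are skipped.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to equal length")
    index = {base: i for i, base in enumerate(BASES)}
    counts = [[0.0] * 4 for _ in range(4)]
    sites = 0
    for x, y in zip(seq1.upper(), seq2.upper()):
        if x in index and y in index:
            counts[index[x]][index[y]] += 1.0
            sites += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    freqs = [[c / sites for c in row] for row in counts]
    det = determinant(freqs)
    if det <= 0.0:
        raise ValueError("determinant is not positive; LogDet undefined")
    return -math.log(det)


if __name__ == "__main__":
    print(logdet_distance("ACGTACGT", "ACGTACGA"))
```

The determinant is written out by hand only to keep the sketch free of third-party dependencies; with very short sequences some bases may be absent, in which case the determinant is zero and the value is undefined.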

15.
Class II genes of the human major histocompatibility complex (MHC) are polymorphic. Allelic variation of the coding region of these genes is involved in antigen presentation and is associated with susceptibility to certain autoimmune diseases. The DR region is unique among human class II regions in that multiple DRB genes are expressed. Differential expression of the different DRB loci has been demonstrated. We sequenced the proximal promoter region of the HLA-DRB genes, known to be involved in the regulation of these genes, to look for nucleotide variations in their regulatory regions, and we determined the relationship between the regulatory regions of HLA-DRB genes. The polymorphism found in the conserved regulatory boxes could be involved in the observed differential expression of DRB loci. In addition, we found a polymorphism between the regulatory regions of DRB1 alleles which might be involved in allele-specific regulation and therefore could be considered as an additional factor in susceptibility to autoimmune diseases. The nucleotide sequence data reported in this paper have been submitted to the EMBL nucleotide sequence database and have been assigned the accession numbers X64436–X64442, X64544, X64546–X64549, X65558–X65569, and X65585–X65587. Correspondence to: J. F. Eliaou.

16.
The human dynactin 1 gene (DCTN1) is positioned on chromosome 2p13, the candidate region for various diseases including Alström syndrome, limb-girdle muscular dystrophy, and Miyoshi myopathy. Here, we report the exon–intron structure of DCTN1 along with characterization of the 5′ upstream sequence and alternative splice variants previously identified by Tokito et al. (1996, Mol. Biol. Cell 7:1167–1180). Knowledge of the genomic structure of DCTN1 allowed us to design intronic primers necessary for analyzing mutations in families segregating for diseases linked to this gene. These primers were tested on a French Acadian kindred segregating for Alström syndrome. No mutations were observed within the coding region of DCTN1 in this family. However, the intronic primers should allow for the rapid amplification of the coding region for mutational analysis of additional Alström families and other diseases tightly linked to the DCTN1 locus on chromosome 2p13.

17.
Schultz and Yarus (J. Mol. Biol. 235:1377–1380, 1994) have proposed that reassignment of codons in the genetic code passes through a stage in which the codons are ambiguously translated. In contrast, we state that such ambiguity would be deleterious, and that, to be reassigned, a codon, together with the tRNA that translates the codon, must first disappear from coding sequences, after which a tRNA appears with a mutated anticodon, and this enables the codon to reappear with a changed meaning. In the case of a stop codon, the relevant release factor must change so as not to recognize it. Correspondence to: T.H. Jukes

18.
Estimation of the ratio of the rates of transitions to transversions (TI:TV ratio) for a collection of aligned nucleotide sequences is important because it provides insight into the process of molecular evolution and because such estimates may be used to further model the evolutionary process for the sequences under consideration. In this paper, we compare several methods for estimating the TI:TV ratio, including the pairwise method [TREE 11 (1996) 158], a modification of the pairwise method due to Ina [J. Mol. Evol. 46 (1998) 521], a method based on parsimony [TREE 11 (1996) 158], a method due to Purvis and Bromham [J. Mol. Evol. 44 (1997) 112] that uses phylogenetically independent pairs of sequences, the maximum likelihood method, and a Bayesian method [Bioinformatics 17 (2001) 754]. We examine the performance of each estimator under several conditions using both simulated and real data.
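
The basic ingredient shared by pairwise approaches to the TI:TV ratio is a count of observed transitions and transversions between two aligned sequences. The sketch below shows only that raw count; it applies no correction for multiple substitutions and is not any of the specific estimators compared in the paper. The function name `count_ti_tv` is an assumption made for illustration.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_ti_tv(seq1, seq2):
    """Count observed transitions and transversions between two
    aligned nucleotide sequences.

    Sites containing gaps or ambiguity codes are ignored.
    Returns a (transitions, transversions) tuple.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to equal length")
    transitions = 0
    transversions = 0
    for x, y in zip(seq1.upper(), seq2.upper()):
        if x == y or x not in "ACGT" or y not in "ACGT":
            continue
        pair = {x, y}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1   # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    return transitions, transversions


if __name__ == "__main__":
    ti, tv = count_ti_tv("ACGTTGCA", "GCGTCGAA")
    print(ti, tv)
```

The ratio of the two counts is a naive observed TI:TV value; it is undefined when no transversions are observed, which is one reason model-based estimators are preferred for divergent sequences.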

19.
Twelve of 30 species examined in the ant genus Polyrhachis carry single nucleotide insertions at one or two positions within the mitochondrial cytochrome b (cytb) gene. Two of the sites are present in more than one species. Nucleotide substitutions in taxa carrying insertions show the strong codon position bias expected of functional protein coding genes, with substitutions concentrated in the third positions of the original reading frame. This pattern of evolution of the sequences strongly suggests that they are functional cytb sequences. This result is not the first report of +1 frameshift insertions in animal mitochondrial genes. A similar site was discovered in vertebrates, where single nucleotide frameshift insertions in many birds and a turtle were reported by Mindell et al. (Mol Biol Evol 15:1568, 1998). They hypothesized that the genes are correctly decoded by a programmed frameshift during translation. The discovery of four additional sites gives us the opportunity to look for common features that may explain how programmed frameshifts can arise. The common feature appears to be the presence of two consecutive rare codons at the insertion site. We hypothesize that the second of these codons is not efficiently translated, causing a pause in the translation process. During the stall the weak wobble pairing of the tRNA bound in the peptidyl site of the ribosome, together with an exact Watson–Crick codon–anticodon pairing in the +1 position, allows translation to continue in the +1 reading frame. The result of these events is an adequate level of translation of a full-length and fully functional protein. A model is presented for decoding of these mitochondrial genes, consistent with known features of programmed translational frameshifting in the yeast TY1 and TY3 retrotransposons. Reviewing Editor: Dr. W. Ford Doolittle

20.
The nucleotide (nt) sequences of the Sc3 and Sc4 genes of the filamentous fungus Schizophyllum commune, and the deduced amino acid (aa) sequences, were determined; moreover, the previously published sequence for the Sc1 gene [Dons et al., EMBO J. 3 (1984) 2101–2106] was corrected. All three independently isolated genes were found to have similar structures and nt sequences of their coding regions. At the aa level the homology is 43–62% (63–69% in the C-terminal parts of the proteins), hydrophobic aa predominate and the hydrophobicity patterns are similar. All three proteins contain leader sequences and eight cysteines among about 110 aa, conserved at the same positions. Yet these genes are differentially regulated: Sc1 and Sc4 are only expressed at high levels in fruiting dikaryons, whereas Sc3 is highly expressed in both monokaryons and dikaryons, independent of fruiting.
