期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Apparent Selection Intensity for the Cytochrome Oxidase Subunit I Gene Varies with Mode of Reproduction in Echinoderms

Foltz DW Hrincevich AW Rocha-Olivares A 《Genetica》2004,122(2):115-125

When most amino acid substitutions in protein-coding genes are slightly deleterious rather than selectively neutral, life history differences can potentially modify the effective population size or the selective regime, resulting in altered ratios of non-synonymous to synonymous substitutions among taxa. We studied substitution patterns for the mitochondrial cytochrome oxidase subunit I (COI) gene in a sea star genus (Leptasterias spp.) with an obligate brood-protecting mode of reproduction and small-scale population genetic subdivision, and compared the results to available COI sequences in nine other genera of echinoderms with pelagic larvae: three sea stars, five sea urchins and one brittle star. We predicted that this life history difference would be associated with differences in the ratio of non-synonymous (dN) to synonymous (dS) substitution rates. Leptasterias had a significantly greater dN/dS ratio (both between species and within species), a significantly smaller transition/transversion rate ratio, and a significantly lower average nucleotide diversity within species, than did the non-brooding genera. Other explanations for the results, such as altered mutation rates or selective sweeps, were not supported by the data analysis. These findings highlight the potential influence of reproductive traits and other life history factors on patterns of nucleotide substitution within and between species. 相似文献

2.

ADAPTSITE: detecting natural selection at single amino acid sites. 总被引：12，自引：0，他引：12

Y Suzuki T Gojobori M Nei 《Bioinformatics (Oxford, England)》2001,17(7):660-661

ADAPTSITE is a program package for detecting natural selection at single amino acid sites, using a multiple alignment of protein-coding sequences for a given phylogenetic tree. The program infers ancestral codons at all interior nodes, and computes the total numbers of synonymous (c(S)) and nonsynonymous (c(N)) substitutions as well as the average numbers of synonymous (s(S)) and nonsynonymous (s(N)) sites for each codon site. The probabilities of occurrence of synonymous and nonsynonymous substitutions are approximated by s(S) / (s(S) + s(N)) and s(N) / (s(S) + s(N)), respectively. The null hypothesis of selective neutrality is tested for each codon site, assuming a binomial distribution for the probability of obtaining c(S) and c(N). AVAILABILITY: ADAPTSITE is available free of charge at the World-Wide Web sites http://mep.bio.psu.edu/adaptivevol.html and http://www.cib.nig.ac.jp/dda/yossuzuk/welcome.html. The package includes the source code written in C, binary files for UNIX operating systems, manual, and example files. 相似文献

3.

Single Nucleotide polymorphisms and their relationship to codon usage bias in the Pacific oyster Crassostrea gigas 总被引：3，自引：0，他引：3

Sauvage C Bierne N Lapègue S Boudry P 《Gene》2007,406(1-2):13-22

DNA sequence polymorphism and codon usage bias were investigated in a set of 41 nuclear loci in the Pacific oyster Crassostrea gigas. Our results revealed a very high level of DNA polymorphism in oysters, in the order of magnitude of the highest levels reported in animals to date. A total of 290 single nucleotide polymorphisms (SNPs) were detected, 76 of which being localised in exons and 214 in non-coding regions. Average density of SNPs was estimated to be one SNP every 60 bp in coding regions and one every 40 bp in non-coding regions. Non-synonymous substitutions contributed substantially to the polymorphism observed in coding regions. The non-synonymous to silent diversity ratio was 0.16 on average, which is fairly higher to the ratio reported in other invertebrate species recognised to display large population sizes. Therefore, purifying selection does not appear to be as strong as it could have been expected for a species with a large effective population size. The level of non-synonymous diversity varied greatly from one gene to another, in accordance with varying selective constraints. We examined codon usage bias and its relationship with DNA polymorphism. The table of optimal codons was deduced from the analysis of an EST dataset, using EST counts as a rough assessment of gene expression. As recently observed in some other taxa, we found a strong and significant negative relationship between codon bias and non-synonymous diversity suggesting correlated selective constraints on synonymous and non-synonymous substitutions. Codon bias as measured by the frequency of optimal codons for expression might therefore provide a useful indicator of the level of constraint upon proteins in the oyster genome. 相似文献

4.

A Sliding Window-Based Method to Detect Selective Constraints in Protein-Coding Genes and Its Application to RNA Viruses 总被引：4，自引：0，他引：4

Fares MA Elena SF Ortiz J Moya A Barrio E 《Journal of molecular evolution》2002,55(5):509-521

Here we present a new sliding window-based method specially designed to detect selective constraints in specific regions of a multiple protein-coding sequence alignment. In contrast to previous window-based procedures, our method is based on a nonarbitrary statistical approach to find the appropriate codon-window size to test deviations of synonymous (dS) and nonsynonymous (dN) nucleotide substitutions from the expectation. The probabilities of dN and dS are obtained from simulated data and used to detect significant deviations of dN and dS in a specific window region of the real sequence alignment. The nonsynonymous-to-synonymous rate ratio (w = dN/dS) was used to highlight selective constraints in any window wherein dS or dN was significantly different from the expectation. In these significant windows, w and its variance [V(w)] were calculated and used to test the neutral hypothesis. Computer simulations showed that the method is accurate even for highly divergent sequences. The main advantages of the new method are that it (i) uses a statistically appropriate window size to detect different selective patterns, (ii) is computationally less intensive than maximum likelihood methods, and (iii) detects saturation of synonymous sites, which can give deviations from neutrality. Hence, it allows the analysis of highly divergent sequences and the test of different alternative hypothesis as well. The application of the method to different human immunodeficiency virus type 1 and to foot-and-mouth disease virus genes confirms the action of positive selection on previously described regions as well as on new regions. 相似文献

5.

Codon-based tests of positive selection,branch lengths,and the evolution of mammalian immune system genes 总被引：4，自引：1，他引：3

Hughes AL Friedman R 《Immunogenetics》2008,60(9):495-506

Using basic probability theory, we show that there is a substantial likelihood that even in the presence of strong purifying selection, there will be a number of codons in which the number of synonymous nucleotide substitutions per site (d (S)) exceeds the number of non-synonymous nucleotide substitutions per site (d (N)). In an empirical study, we examined the numbers of synonymous (b (S)) and non-synonymous substitutions (b (N)) along branches of the phylogenies of 69 single-copy orthologous genes from seven species of mammals. A pattern of b (N) > b (S) was most commonly seen in the shortest branches of the tree and was associated with a high coefficient of variation in both b (N) and b (S), suggesting that high stochastic error in b (N) and b (S) on short branches, rather than positive Darwinian selection, is the explanation of most cases where b (N) is greater than b (S) on a given branch. The branch-site method of Zhang et al. (Zhang, Nielsen, Yang, Mol Biol Evol, 22:2472-2479, 2005) identified 117 codons on 35 branches as "positively selected," but a majority of these codons lacked synonymous substitutions, while in the others, synonymous and non-synonymous differences per site occurred in approximately equal frequencies. Thus, it was impossible to rule out the hypothesis that chance variation in the pattern of mutation across sites, rather than positive selection, accounted for the observed pattern. Our results showed that b (N)/b (S) was consistently elevated in immune system genes, but neither the search for branches with b (N) > b (S) nor the branch-site method revealed this trend. 相似文献

6.

Evolutionary simulations to detect functional lineage-specific genes

Dupanloup I Kaessmann H 《Bioinformatics (Oxford, England)》2006,22(15):1815-1822

MOTIVATION: Supporting the functionality of recent duplicate gene copies is usually difficult, owing to high sequence similarity between duplicate counterparts and shallow phylogenies, which hamper both the statistical and experimental inference. RESULTS: We developed an integrated evolutionary approach to identify functional duplicate gene copies and other lineage-specific genes. By repeatedly simulating neutral evolution, our method estimates the probability that an ORF was selectively conserved and is therefore likely to represent a bona fide coding region. In parallel, our method tests whether the accumulation of non-synonymous substitutions reveals signatures of selective constraint. We show that our approach has high power to identify functional lineage-specific genes using simulated and real data. For example, a coding region of average length (approximately 1400 bp), restricted to hominoids, can be predicted to be functional in approximately 94-100% of cases. Notably, the method may support functionality for instances where classical selection tests based on the ratio of non-synonymous to synonymous substitutions fail to reveal signatures of selection. Our method is available as an automated tool, ReEVOLVER, which will also be useful to systematically detect functional lineage-specific genes of closely related species on a large scale. AVAILABILITY: ReEVOLVER is available at http://www.unil.ch/cig/page7858.html. 相似文献

7.

Diversity in the thrombospondin-related adhesive protein gene (TRAP) of Plasmodium vivax

Putaporntip C Jongwutiwes S Tia T Ferreira MU Kanbara H Tanabe K 《Gene》2001,268(1-2):97-104

We analyzed 22 clinical isolates of Plasmodium vivax from Thailand and 17 from Brazil to investigate the extent of sequence variation in the thrombospondin-related adhesive protein of Plasmodium vivax (PvTRAP), a homologue of P. falciparum TRAP (PfTRAP) which has been considered to be a promising vaccine candidate. In total 54 haplotypes were identified from 73 distinct gene clones. Coexistence of different PvTRAP in circulation occurred in 10 and 13 isolates from Thailand and Brazil, respectively. Forty out of 48 substituted nucleotides are non-synonymous changes. Most of the substituted residues reside in the von Willebrand factor type A-domain (region II), a sulfated glycosaminoglycan-binding domain (region III) and a proline-rich region (region IV). All nucleotide substitutions are dimorphic. Two haplotypes from Thailand contain an inserted sequence encoding aspartic acid-serine-proline in the proline-rich region. Sequence analysis has revealed that nucleotide diversity in PvTRAP is low although Brazilian isolates display a higher degree of variation than those from Thailand. Phylogenetic construction using the neighbor joining method has shown that most of the Thai and the Brazilian isolates appear to be mainly clustered into distinct groups. Significantly greater than expected values of the mean number of non-synonymous (d(n)) than synonymous (d(s)) nucleotide substitutions per site were observed in regions II and III of PvTRAP. Analysis of the published PfTRAP sequences has shown a similar finding in regions II and IV suggesting that positive selection operates on the regions. Hence, different regions in PvTRAP and PfTRAP could be under different pressures in terms of immune selection, structural and/or functional constraints. 相似文献

8.

Close correlation of streptococcal DNase B (sdaB) alleles with emm genotypes in Streptococcus pyogenes

Matsumoto M Sakae K Hashikawa S Torii K Hasegawa T Horii T Endo M Okuno R Murayama S Hirasawa K Suzuki R Isobe J Tanaka D Katsukawa C Tamaru A Tomita M Ogata K Ikebe T Watanabe H Ohta M;Working Group for Group A Streptococci in Japan 《Microbiology and immunology》2005,49(10):925-929

DNase B is a major nuclease and a possible virulence factor in Streptococcus pyogenes. The allelic diversity of streptococcal DNase B (sdaB) gene was investigated in 83 strains with 14 emm genotypes. Of the 15 alleles identified, 11 alleles carried only synonymous nucleotide substitutions. On the other hand, 4 alleles had a non-synonymous substitution other than synonymous substitutions, resulting in the substitution of a single amino acid. The distribution of each allele was generally emm genotype-specific. Only sdaB7 was found in both emm2 and emm4. The promoter region was highly conserved and DNase B protein was similarly expressed in all alleles. 相似文献

9.

藏鸡主要组织相容性复合体B-LBⅡ基因第二外显子序列遗传变异分析

徐日福李奎陈国宏强巴央宗张玉波林丽樊斌刘榜《遗传学报》2005,32(11):1136-1146

根据鸡主要组织相容性复合体B-LBⅡ基因序列设计特异性引物,在藏鸡基因组中扩增了一个包括其第二外显子和第二内含子在内长度为374 bp的片段,并通过克隆和PCR直接测序获得了该片段的核苷酸序列。发现了15个B-LBⅡ新等位基因。对18个B-LBⅡ等位基因核苷酸序列和其所编码的MHCB-LBⅡ分子β1结构域的氨基酸序列分析显示,第二外显子核苷酸序列遗传多态性异常丰富,存在着62个多态变异位点(共包括80个突变),其中41个为简约性多态位点;衡量该序列遗传多样性的π值为0.0718;反映其群体内遗传变异度的平均遗传距离为0.056±0.008,低于在5个外来品种所估算的平均遗传距离。该编码区核苷酸相对异义替换率(15.61±2.69%)显著高于其同义替换率(3.25±0.94%),进一步分析表明,基因重组和平衡选择机制可能是引起B-LBⅡ基因序列变异的主要因素。在β1结构域氨基酸序列中,存在11个同义替换和27个异义替换;在24个肽结合位点中有12个变异位点;与其他6个中国地方鸡品种和一个外来品种比较发现,有11个异义氨基酸替换仅出现在藏鸡群体中,并被认为与藏鸡的免疫特异性有关,可为鸡的抗病力研究提供分子依据。相似文献

10.

Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions 总被引：137，自引：26，他引：111

Nei M; Gojobori T 《Molecular biology and evolution》1986,3(5):418-426

Two simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions are presented. Although they give no weights to different types of codon substitutions, these methods give essentially the same results as those obtained by Miyata and Yasunaga's and by Li et al.'s methods. Computer simulation indicates that estimates of synonymous substitutions obtained by the two methods are quite accurate unless the number of nucleotide substitutions per site is very large. It is shown that all available methods tend to give an underestimate of the number of nonsynonymous substitutions when the number is large. 相似文献

11.

Whole-Genome Sequencing of Theileria parva Strains Provides Insight into Parasite Migration and Diversification in the African Continent

Kyoko Hayashida Takashi Abe William Weir Ryo Nakao Kimihito Ito Kiichi Kajino Yutaka Suzuki Frans Jongejan Dirk Geysen Chihiro Sugimoto 《DNA research》2013,20(3):209-220

The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814–121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent. 相似文献

12.

Selective pressures at a codon-level predict deleterious mutations in human disease genes

Arbiza L Duchi S Montaner D Burguet J Pantoja-Uceda D Pineda-Lucena A Dopazo J Dopazo H 《Journal of molecular biology》2006,358(5):1390-1404

Deleterious mutations affecting biological function of proteins are constantly being rejected by purifying selection from the gene pool. The non-synonymous/synonymous substitution rate ratio (omega) is a measure of selective pressure on amino acid replacement mutations for protein-coding genes. Different methods have been developed in order to predict non-synonymous changes affecting gene function. However, none has considered the estimation of selective constraints acting on protein residues. Here, we have used codon-based maximum likelihood models in order to estimate the selective pressures on the individual amino acid residues of a well-known model protein: p53. We demonstrate that the number of residues under strong purifying selection in p53 is much higher than those that are strictly conserved during the evolution of the species. In agreement with theoretical expectations, residues that have been noted to be of structural relevance, or in direct association with DNA, were among those showing the highest signals of purifying selection. Conversely, those changing according to a neutral, or nearly neutral mode of evolution, were observed to be irrelevant for protein function. Finally, using more than 40 human disease genes, we demonstrate that residues evolving under strong selective pressures (omega<0.1) are significantly associated (p<0.01) with human disease. We hypothesize that non-synonymous change on amino acids showing omega<0.1 will most likely affect protein function. The application of this evolutionary prediction at a genomic scale will provide an a priori hypothesis of the phenotypic effect of non-synonymous coding single nucleotide polymorphisms (SNPs) in the human genome. 相似文献

13.

How to calculate the non-synonymous to synonymous rate ratio of protein-coding genes under the Fisher–Wright mutation–selection framework

Mario dos Reis 《Biology letters》2015,11(4)

First principles of population genetics are used to obtain formulae relating the non-synonymous to synonymous substitution rate ratio to the selection coefficients acting at codon sites in protein-coding genes. Two theoretical cases are discussed and two examples from real data (a chloroplast gene and a virus polymerase) are given. The formulae give much insight into the dynamics of non-synonymous substitutions and may inform the development of methods to detect adaptive evolution. 相似文献

14.

Evolutionary analysis of S-RNase genes from Rosaceae species 总被引：7，自引：0，他引：7

Ma RC Oliveira MM 《Molecular genetics and genomics : MGG》2002,267(1):71-78

Eight new cDNA sequences for S-RNases were cloned and analysed from almond (Prunus dulcis) cultivars of European origin, and compared to published sequences from other Rosaceae species. Insertions/deletions of 10-20 amino acid residues were detected in the RC4 and C5 domains of S-RNases from almond and sweet cherry. The S-RNases of the Prunus species and those of the genera Malus and Pyrus formed two distinct groups on phylogenetic analysis. Nucleotide substitutions were analysed in the S-RNase genes of these species. The S-genes of almond and sweet cherry have a lower Ka/Ks value than those of apple, pear and wild apple do. The fact that there is no fixed difference between the S-RNase genes of almond and sweet cherry, or between apple and pear, suggests that nucleotide substitutions only introduce transient polymorphism into the two groups, and rarely became fixed and contribute to divergence. Through the comparative study of 17 S-RNase genes from the genus Prunus and 18 from the genera Malus and Pyrus, some fixed nucleotide differences between the two groups were identified. These differences do not appear to be the result of selection for adaptive mutations, since the number of replacement substitutions is not significantly greater than the number of synonymous substitutions. S-RNase genes of almond and sweet cherry, and of apple and pear, showed little heterogeneity in nucleotide substitution rates. However, heterogeneity was observed between the two groups of S-alleles, with the Prunus alleles exhibiting a lower rate of non-synonymous substitutions than alleles from Malus and Pyrus. The evolutionary relationships between these species are discussed. 相似文献

15.

Characterization of three VERNALIZATION INSENSITIVE3-like (VIL) homologs in wild wheat, Aegilops tauschii Coss

Koyama K Hatano H Nakamura J Takumi S 《Hereditas》2012,149(2):62-71

Control of flowering time is an adaptive trait of plants for different growth habitats. A vernalization requirement is a major genetic component determining wheat flowering time. Arabidopsis VERNALIZATION INSENSITIVE3 (VIN3) and VIN3-like 1 (VIL1) play critical roles in the vernalization pathway of flowering, and three wheat VIL homologs are upregulated by vernalization in einkorn wheat. To study the relationship between vernalization and wheat VIL homologs in Aegilops tauschii, the D-genome progenitor of common wheat, we isolated three cDNAs orthologous to the einkorn wheat VIL genes. The three Ae. tauschii VIL genes showed many single nucleotide polymorphisms including non-synonymous substitutions relative to the einkorn orthologs. In addition, high rates of non-synonymous and synonymous substitutions were revealed by intraspecific variation analysis of the AetVIL sequences, suggesting adaptive evolution at the AetVIL loci. Quantitative RT-PCR analysis was conducted to examine the time course of expression of the VIL genes during vernalization. Of the three AetVIL genes, AetVIL2 was upregulated after one week of low-temperature treatment, and its expression pattern was distinct for winter and spring habit accessions. These observations strongly suggest that AetVIL2 is associated with the vernalization-responsive pathway in Ae. tauschii. 相似文献

16.

The varying microsporidian genome: existence of long-terminal repeat retrotransposon in domesticated silkworm parasite Nosema bombycis

Xu J Pan G Fang L Li J Tian X Li T Zhou Z Xiang Z 《International journal for parasitology》2006,36(9):1049-1056

Microsporidia are a group of intracellular parasites with an extremely compact genome and there is no confirmed evidence that retroelements are parasitised in these organisms. Using the dataset of 200,000 genomic shotgun reads of the silkworm pebrine Nosema bombycis, we have identified the eight complete N. bombycis long-terminal repeat retrotransposon (Nbr) elements. All of the Nbr elements are Ty3/gypsy members and have close relationships to Saccharomycetes long-terminal repeat retrotransposons identified previously, providing further evidence of their relationship to fungi. To explore the effect of retrotransposons in microsporidian genome evolution, their distribution was characterised by comparisons between two N. bombycis contigs containing the Nbr elements with the completed genome of the human parasite Encephalitozoon cuniculi, which is closely related to N. bombycis. The Nbr elements locate between or beside syntenic blocks, which are often clustered with other transposable-like sequences, indicating that they are associated with genome size variation and syntenic discontinuities. The ratios of the number of non-synonymous substitutions per non-synonymous site to the number of synonymous substitutions per synonymous site of the open reading frames among members of each of the eight Nbr families were estimated, which reveal the purifying selection acted on the N. bombycis long-terminal repeat retrotransposons. These results strongly suggest that retrotransposons play a major role in reorganization of the microsporidian genome and they might be active. The present study presents an initial characterization of some transposable elements in the N. bombycis genome and provides some insight into the evolutionary mechanism of microsporidian genomes. 相似文献

17.

A rationale for the symmetries by base substitutions of degeneracy in the genetic code

Jean-Luc Jestin 《Bio Systems》2010

The first symmetry by base substitutions of degeneracy in the genetic code was described by Rumer (1966) and the other symmetries were identified later by Jestin (2006) and Jestin and Soulé (2007). Here, a rationale accounting for these symmetries is reported. The number of non-synonymous substitutions over the replicated coding sequence is written as a function of the substitution matrix, whose elements are the number of substitutions from any codon to any other codon. The p-adic distance used as a similarity measure and applied to this matrix is shown to be biologically relevant. The rationale indicates that symmetries by base substitutions of degeneracy in the genetic code are symmetries of the measures of the number of non-synonymous substitutions for sets of synonymous codons. 相似文献

18.

Effective population size and the rate and pattern of nucleotide substitutions

Megan Woolfit 《Biology letters》2009,5(3):417-420

Both the overall rate of nucleotide substitution and the relative proportions of synonymous and non-synonymous substitutions are predicted to vary between species that differ in effective population size (N_e). Our understanding of the genetic processes underlying these lineage-specific differences in molecular evolution is still developing. Empirical analyses indicate that variation in substitution rates and patterns caused by differences in N_e is often substantial, however, and must be accounted for in analyses of molecular evolution. 相似文献

19.

Codon Substitution in Evolution and the "Saturation" of Synonymous Changes 总被引：4，自引：1，他引：3

Takashi Gojobori 《Genetics》1983,105(4):1011-1027

A mathematical model for codon substitution is presented, taking into account unequal mutation rates among different nucleotides and purifying selection. This model is constructed by using a 61 X 61 transition probability matrix for the 61 nonterminating codons. Under this model, a computer simulation is conducted to study the numbers of silent (synonymous) and amino acid-altering (nonsynonymous) nucleotide substitutions when the underlying mutation rates among the four kinds of nucleotides are not equal. It is assumed that the substitution rates are constant over evolutionary time, the codon frequencies being in equilibrium, and, thus, the numbers of synonymous and nonsynonymous substitutions both increase linearly with evolutionary time. It is shown that, when the mutation rates are not equal, the estimate of synonymous substitutions obtained by F. Perler, A. Efstratiadis, P. Lomedico, W. Gilbert, R. Kolodner and J. Dodgson's "Percent Corrected Divergence" method increases nonlinearly, although the true number of synonymous substitutions increases linearly. It is, therefore, possible that the "saturation" of synonymous substitutions observed by Perler et al. is due to the inefficiency of their method to detect all synonymous substitutions. 相似文献

20.

Overdispersed molecular clock at the major histocompatibility complex loci.

N Takahata 《Proceedings. Biological sciences / The Royal Society》1991,243(1306):13-18

The extent of amino acid differences of major histocompatibility complex molecules within species is unusually high, consistent with the finding that some pairs of alleles have persisted for more than ten million years and the view that the polymorphism has been maintained by natural selection. The disparity between synonymous and non-synonymous substitutions in the antigen recognition site, however, suggests that some non-synonymous sites have undergone a number of substitutions whereas others have little or none. To describe statistically such an overdispersed underlying process, commonly used Poisson processes are inadequate. An alternative process leads to the surprising conclusion that each non-synonymous site has accumulated as many as 2.6 substitutions, on the average, in the two lineages leading to humans and mice. The standard deviation is also very large (6.6) and the dispersion index (the ratio of the variance to the mean) is at least 17. The substitution process thus inferred qualitatively agrees with the disposition (a boomerang pattern) of substitutions between HLA-A2 and Aw68 alleles, and quantitatively agrees well with that expected where the evolution of major histocompatibility complex molecules has long been driven mostly by balancing selection. 相似文献