首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 281 毫秒
1.
The mouse cadherin-related neuronal receptor/protocadherin (CNR/Pcdh) gene clusters are located on chromosome 18. We sequenced single-nucleotide polymorphisms (SNPs) of the CNR/Pcdh(alpha)-coding region among 12 wild-derived and four laboratory strains; these included the four major subspecies groups of Mus musculus: domesticus, musculus, castaneus, and bactrianus. We detected 883 coding SNPs (cSNPs) in the CNR/Pcdh(alpha) variable exons and three in the constant exons. Among all the cSNPs, 586 synonymous (silent) and 297 nonsynonymous (amino acid exchanged) substitutions were found; therefore, the K(a)/K(s) ratio (nonsynonymous substitutions per synonymous substitution) was 0.51. The synonymous cSNPs were relatively concentrated in the first and fifth extracellular cadherin domain-encoding regions (ECs) of CNR/Pcdh(alpha). These regions have high nucleotide homology among the CNR/Pcdh(alpha) paralogs, suggesting that gene conversion events in synonymous and homologous regions of the CNR/Pcdh(alpha) cluster are related to the generation of cSNPs. A phylogenetic analysis revealed gene conversion events in the EC1 and EC5 regions. Assuming that the common sequences between rat and mouse are ancestral, the GC content of the third codon position has increased in the EC1 and EC5 regions, although biased substitutions from GC to AT were detected in all the codon positions. In addition, nonsynonymous substitutions were extremely high (11 of 13, K(a)/K(s) ratio 5.5) in the laboratory mouse strains. The artificial environment of laboratory mice may allow positive selection for nonsynonymous amino acid variations in CNR/Pcdh(alpha) during inbreeding. In this study, we analyzed the direction of cSNP generation, and concluded that subspecies-specific nucleotide substitutions and region-restricted gene conversion events may have contributed to the generation of genetic variations in the CNR/Pcdh genes within and between species.  相似文献   

2.
In order to understand the impact of overlapping reading frames on natural selection by host CD8+ T lymphocytes (CD8(+)-TL), we analyzed the pattern of nucleotide substitution in simian immunodeficiency virus (SIV) genomes sampled from populations at time of death in 35 rhesus monkeys. Both the mean number of nonsynonymous nucleotide substitutions per nonsynonymous site (d(N)) and the mean number of synonymous nucleotide substitutions per synonymous site (d(S)) were elevated in overlap regions in comparison to non-overlap regions. Mean d(N) exceeded mean d(S) in CD8(+)-TL epitopes restricted by the host's class I major histocompatibility complex molecules. This pattern, which is indicative of positive Darwinian selection favoring amino acid changes in these epitopes, was seen in both overlap and non-overlap regions; but mean d(N) was particularly elevated in restricted CD8(+)-TL epitopes encoded in overlap regions. Amino acid changes from the inoculum were defined as parallel if the same amino acid change occurred at the same site independently in two or more monkeys, and a surprisingly high proportion (71.9%) of observed amino acid changes throughout the SIV genome occurred in parallel in different monkeys. The proportion of parallel changes in restricted epitopes encoded by overlapping reading frames was still higher (80%), supporting the hypothesis that the interaction of positive selection and overlapping reading frames enhances the probability of convergent or parallel amino acid change.  相似文献   

3.
4.
The sporozoite threonine-asparagine-rich protein (STARP) of Plasmodium falciparum is an attractive target for a pre-erythrocytic stage malaria vaccine because both naturally acquired and experimentally induced anti-STARP antibodies can block sporozoite invasion of hepatocytes. To explore the extent of sequence variation, we surveyed nucleotide polymorphism across the entire gene, encompassing 2 exons and an intron, of 124 P. falciparum-infected blood samples from Thailand and 10 from 4 other endemic areas. In total 24 haplotypes were identified despite low-level nucleotide diversity at this locus. The mean number of nonsynonymous substitutions per nonsynonymous site (d(N)) significantly exceeded that of synonymous substitutions per synonymous site (d(S)), suggesting that the STARP gene has evolved under positive selection, probably from host immune pressure. The preponderance of conservative amino acid exchanges and a strongly biased T-nucleotide toward the third position of codons in repeat arrays have reflected simultaneous constraints on this molecule, probably from its respective unknown function and nucleotide composition. Sequence conservation in the STARP locus among clinical isolates from different disease endemic areas would not compromise vaccine incorporation.  相似文献   

5.
Pattern recognition proteins play an important role in the innate immune response of invertebrates. Herein we report the evolutionary relationships among Gram-negative bacteria binding proteins (GNBPs) that were previously identified and characterized from a wide array of invertebrates. Our results, together with those obtained in previous studies, indicate that decapod lipopolysaccharide- and beta-1,3-glucan binding protein (LGBP/BGBP) has retained the crucial components for glucanase activity, and shares a common ancestor with GNBPs, as well as with the glucanase proteins of a wide range of invertebrates, rather than with GNBPs of some arthropods. However, experimental evidence of earlier studies suggested a lack of glucanase activity by these proteins, thus implying that during evolutionary time these proteins might have lost their glucan binding protein, but retained their glucan binding activity. The present results have also revealed that although a vast majority of the decapod LGBP/BGBP codons are constrained to purifying selection, certain codons are shown to have a higher rate of nonsynonymous substitutions per nonsynonymous site (dN) than synonymous substitutions per synonymous site (dS), indicating these codons have evolved adaptively (dN/dS>1). Although purifying selection (dN/dS<1) appears to be the major driving force in the evolution of a vast majority of LGBP/BGBP codons in decapods, the findings of several hotspots for nonsynonymous substitutions in this protein indicate host immune selection might play an important role in maintaining diversity among these ecologically diversified decapod species.  相似文献   

6.
7.
A number of statistical tests have been proposed to detect positive Darwinian selection affecting a few amino acid sites in a protein, exemplified by an excess of nonsynonymous nucleotide substitutions. These tests are often more powerful than pairwise sequence comparison, which averages synonymous (d(S)) and nonsynonymous (d(N)) rates over the whole gene. In a recent study, however, Hughes AL and Friedman R (2005. Variation in the pattern of synonymous and nonsynonymous difference between two fungal genomes. Mol Bio Evol. 22: 1320-1324) argue that d(S) and d(N) are expected to fluctuate along the sequence by chance and that an excess of nonsynonymous differences in individual codons is no evidence for positive selection. The authors compared codons in protein-coding genes from the genomes of 2 yeast species, Saccharomyces cerevisiae and Saccharomyces paradoxus. They calculated the proportions of synonymous and nonsynonymous differences per site (p(S) and p(N)) in every codon and discovered that p(N) is often greater than p(S) and that among some codons p(S) and p(N) are negatively correlated. The authors argued that these results invalidate previous tests of codons under positive selection. Here I discuss several errors of statistics in the analysis of Hughes and Friedman, including confusion of statistics with parameters, arbitrary data filtering, and derivation of hypotheses from data. I also apply likelihood ratio tests of positive selection to the yeast data and illustrate empirically that Hughes and Friedman's criticisms on such tests are not valid.  相似文献   

8.
A new method is proposed for estimating the number of synonymous and nonsynonymous nucleotide substitutions between homologous genes. In this method, a nucleotide site is classified as nondegenerate, twofold degenerate, or fourfold degenerate, depending on how often nucleotide substitutions will result in amino acid replacement; nucleotide changes are classified as either transitional or transversional, and changes between codons are assumed to occur with different probabilities, which are determined by their relative frequencies among more than 3,000 changes in mammalian genes. The method is applied to a large number of mammalian genes. The rate of nonsynonymous substitution is extremely variable among genes; it ranges from 0.004 X 10(-9) (histone H4) to 2.80 X 10(-9) (interferon gamma), with a mean of 0.88 X 10(-9) substitutions per nonsynonymous site per year. The rate of synonymous substitution is also variable among genes; the highest rate is three to four times higher than the lowest one, with a mean of 4.7 X 10(-9) substitutions per synonymous site per year. The rate of nucleotide substitution is lowest at nondegenerate sites (the average being 0.94 X 10(-9), intermediate at twofold degenerate sites (2.26 X 10(-9)). and highest at fourfold degenerate sites (4.2 X 10(-9)). The implication of our results for the mechanisms of DNA evolution and that of the relative likelihood of codon interchanges in parsimonious phylogenetic reconstruction are discussed.  相似文献   

9.
The pattern and extent of DNA sequence variability at the rplX locus (encoding ribosomal protein L24) has been investigated in nine strains of Bacillus subtilis. Overall, there is a very low level of nucleotide diversity, even at silent sites, which is probably due to selection among synonymous codons. By analogy with Escherichia coli, there may also be some effect of the relative proximity of rplX to the chromosomal origin of replication. The small number of nucleotide substitutions are non-randomly distributed: all of the synonymous changes are in valine codons. From the sequence differences the strains can be divided into two groups, which are not coincident with their previous classification; this observation is consistent with recombination among strains.  相似文献   

10.
Mammalian gene evolution: Nucleotide sequence divergence between mouse and rat   总被引:16,自引:0,他引:16  
As a paradigm of mammalian gene evolution, the nature and extent of DNA sequence divergence between homologous protein-coding genes from mouse and rat have been investigated. The data set examined includes 363 genes totalling 411 kilobases, making this by far the largest comparison conducted between a single pair of species. Mouse and rat genes are on average 93.4% identical in nucleotide sequence and 93.9% identical in amino acid sequence. Individual genes vary substantially in the extent of nonsynonymous nucleotide substitution, as expected from protein evolution studies; here the variation is characterized. The extent of synonymous (or silent) substitution also varies considerably among genes, though the coefficient of variation is about four times smaller than for nonsynonymous substitutions. A small number of genes mapped to the X-chromosome have a slower rate of molecular evolution than average, as predicted if molecular evolution is male-driven. Base composition at silent sites varies from 33% to 95% G + C in different genes; mouse and rat homologues differ on average by only 1.7% in silent-site G + C, but it is shown that this is not necessarily due to any selective constraint on their base composition. Synonymous substitution rates and silent site base composition appear to be related (genes at intermediate G + C have on average higher rates), but the relationship is not as strong as in our earlier analyses. Rates of synonymous and nonsynonymous substitution are correlated, apparently because of an excess of substitutions involving adjacent pairs of nucleotides. Several factors suggest that synonymous codon usage in rodent genes is not subject to selection.  相似文献   

11.
A method for detecting positive selection at single amino acid sites   总被引:23,自引:0,他引:23  
A method was developed for detecting the selective force at single amino acid sites given a multiple alignment of protein-coding sequences. The phylogenetic tree was reconstructed using the number of synonymous substitutions. Then, the neutrality was tested for each codon site using the numbers of synonymous and nonsynonymous changes throughout the phylogenetic tree. Computer simulation showed that this method accurately estimated the numbers of synonymous and nonsynonymous substitutions per site, as long as the substitution number on each branch was relatively small. The false-positive rate for detecting the selective force was generally low. On the other hand, the true-positive rate for detecting the selective force depended on the parameter values. Within the range of parameter values used in the simulation, the true-positive rate increased as the strength of the selective force and the total branch length (namely the total number of synonymous substitutions per site) in the phylogenetic tree increased. In particular, with the relative rate of nonsynonymous substitutions to synonymous substitutions being 5.0, most of the positively selected codon sites were correctly detected when the total branch length in the phylogenetic tree was > or = 2.5. When this method was applied to the human leukocyte antigen (HLA) gene, which included antigen recognition sites (ARSs), positive selection was detected mainly on ARSs. This finding confirmed the effectiveness of the present method with actual data. Moreover, two amino acid sites were newly identified as positively selected in non-ARSs. The three-dimensional structure of the HLA molecule indicated that these sites might be involved in antigen recognition. Positively selected amino acid sites were also identified in the envelope protein of human immunodeficiency virus and the influenza virus hemagglutinin protein. This method may be helpful for predicting functions of amino acid sites in proteins, especially in the present situation, in which sequence data are accumulating at an enormous speed.  相似文献   

12.
To determine the relative importance of gene conversion followed by natural selection and of natural selection for point mutation in generating variability in immunoglobulins, the numbers of synonymous and nonsynonymous substitutions in immunoglobulin sequences of various subgroups were estimated for complementarity-determining regions (CDRs) and for framework regions (FRs). Both the number of synonymous substitutions and the number of nonsynonymous substitutions in the CDR were found to exceed the corresponding numbers in the FR. Therefore, gene conversion is likely to be an important mechanism for providing variability in the CDR of immunoglobulins. The correlation coefficients between the number of synonymous substitutions and the number of nonsynonymous substitutions and between the substitution number in the CDR and that in the FR were found to be very low. Again, gene conversion is thought to be responsible for this finding.  相似文献   

13.
The two mechanisms for generating hypervariability at the reactive center of serine proteases and their inhibitors are gene conversion followed by natural selection and natural selection for point mutation. One way to clarify the effects of these two mechanisms is to calculate separately the number of nonsynonymous substitutions and that of synonymous substitutions at the variable regions and at the conserved regions. Our data analysis shows that not only the number of nonsynonymous substitutions but also the number of synonymous substitutions at the variable regions exceed the corresponding numbers at the conserved regions. Thus gene conversion has provided needed variability at the variable regions of serine proteases and their inhibitors. Natural selection has helped perpetuate such variability.  相似文献   

14.
The nearly neutral theory of molecular evolution predicts larger generation-time effects for synonymous than for nonsynonymous substitutions. This prediction is tested using the sequences of 49 single-copy genes by calculating the average and variance of synonymous and nonsynonymous substitutions in mammalian star phylogenies (rodentia, artiodactyla, and primates). The average pattern of the 49 genes supports the prediction of the nearly neutral theory, with some notable exceptions.The nearly neutral theory also predicts that the variance of the evolutionary rate is larger than the value predicted by the completely neutral theory. This prediction is tested by examining the dispersion index (ratio of the variance to the mean), which is positively correlated with the average substitution number. After weighting by the lineage effects, this correlation almost disappears for nonsynonymous substitutions, but not quite so for synonymous substitutions. After weighting, the dispersion indices of both synonymous and nonsynonymous substitutions still exceed values expected under the simple Poisson process. The results indicate that both the systematic bias in evolutionary rate among the lineages and the episodic type of rate variation are contributing to the large variance. The former is more significant to synonymous substitutions than to nonsynonymous substitutions. Isochore evolution may be similar to synonymous substitutions. The rate and pattern found here are consistent with the nearly neutral theory, such that the relative contributions of drift and selection differ between the two types of substitutions. The results are also consistent with Gillespie's episodic selection theory.  相似文献   

15.
N G Smith  L D Hurst 《Genetics》1999,153(3):1395-1402
Nonsynonymous substitutions in DNA cause amino acid substitutions while synonymous substitutions in DNA leave amino acids unchanged. The cause of the correlation between the substitution rates at nonsynonymous (K(A)) and synonymous (K(S)) sites in mammals is a contentious issue, and one that impacts on many aspects of molecular evolution. Here we use a large set of orthologous mammalian genes to investigate the causes of the K(A)-K(S) correlation in rodents. The strength of the K(A)-K(S) correlation exceeds the neutral theory expectation when substitution rates are estimated using algorithmic methods, but not when substitution rates are estimated by maximum likelihood. Irrespective of this methodological uncertainty the strength of the K(A)-K(S) correlation appears mostly due to tandem substitutions, an excess of which is generated by substitutional nonindependence. Doublet mutations cannot explain the excess of tandem synonymous-nonsynonymous substitutions, and substitution patterns indicate that selection on silent sites is the likely cause. We find no evidence for selection on codon usage. The nature of the relationship between synonymous divergence and base composition is unclear because we find a significant correlation if we use maximum-likelihood methods but not if we use algorithmic methods. Finally, we find that K(S) is reduced at the start of genes, which suggests that selection for RNA structure may affect silent sites in mammalian protein-coding genes.  相似文献   

16.
Genes that have undergone positive or diversifying selection are likely to be associated with adaptive divergence between species. One indicator of adaptive selection at the molecular level is an excess of amino acid replacement fixed differences per replacement site relative to the number of synonymous fixed differences per synonymous site (omega = K(a)/K(s)). We used an evolutionary expressed sequence tag (EST) approach to estimate the distribution of omega among 304 orthologous loci between Arabidopsis thaliana and A. lyrata to identify genes potentially involved in the adaptive divergence between these two Brassicaceae species. We find that 14 of 304 genes (approximately 5%) have an estimated omega > 1 and are candidates for genes with increased selection intensities. Molecular population genetic analyses of 6 of these rapidly evolving protein loci indicate that, despite their high levels of between-species nonsynonymous divergence, these genes do not have elevated levels of intraspecific replacement polymorphisms compared to previously studied genes. A hierarchical Bayesian analysis of protein-coding region evolution within and between species also indicates that the selection intensities of these genes are elevated compared to previously studied A. thaliana nuclear loci.  相似文献   

17.
Investigating ancient duplication events in the Arabidopsis genome   总被引:10,自引:0,他引:10  
The complete genomic analysis of Arabidopsis thaliana has shown that a major fraction of the genome consists of paralogous genes that probably originated through one or more ancient large-scale gene or genome duplication events. However, the number and timing of these duplications still remains unclear, and several different hypotheses have been put forward recently. Here, we reanalyzed duplicated blocks found in the Arabidopsis genome described previously and determined their date of divergence based on silent substitution estimations between the paralogous genes and, where possible, by phylogenetic reconstruction. We show that methods based on averaging protein distances of heterogeneous classes of duplicated genes lead to unreliable conclusions and that a large fraction of blocks duplicated much more recently than assumed previously. We found clear evidence for one large-scale gene or even complete genome duplication event somewhere between 70 to 90 million years ago. Traces pointing to a much older (probably more than 200 million years) large-scale gene duplication event could be detected. However, for now it is impossible to conclude whether these old duplicates are the result of one or more large-scale gene duplication events. abbreviations dA, fraction of amino acid substitutions; Kn, number of nonsynonymous substitutions per nonsynonymous site; Ks, number of synonymous substitutions per synonymous site; MYA, million years ago  相似文献   

18.
A hallmark of positive selection (adaptive evolution) in protein-coding regions is a d(N)/d(S) ratio >1, where d(N) is the number of nonsynonymous substitutions/nonsynonymous sites and d(S) is the number of synonymous substitutions/synonymous sites. Zonadhesin is a male reproductive protein localized on the sperm head, comprising many domains known to be involved in cell-cell interaction or cell adhesion. Previous studies have shown that VWD domains (homologous to the D domains of the von Willebrand factor) are involved directly in binding to the female zona pellucida (ZP) in a species-specific manner. In this study, we sequenced 47 coding exons in 12 primate species and, by using maximum-likelihood methods to determine sites under positive selection, we show that VWD2, membrane/A5 antigen mu receptor, and mucin-like domains in zonadhesin are rapidly evolving and, thus, may be involved in binding to the ZP in a species-specific manner in primates. In addition, polymorphism data from 48 human individuals revealed significant polymorphism-to-divergence heterogeneity and a significant departure from equilibrium-neutral expectations in the frequency spectrum, suggesting balancing selection and positive selection occurring in zonadhesin (ZAN) within human populations. Finally, we observe adaptive evolution in haplotypes segregating for a frameshift mutation that was previously thought to indicate that ZAN was a potential pseudogene.  相似文献   

19.
SRY基因在人猿超科和旧大陆猴中具有不同的进化规律   总被引:1,自引:0,他引:1  
王晓霞  吕雪梅  张亚平 《遗传学报》2000,27(10):847-852
通过PCR扩增、测序,得到了白臀叶猴和红面猴的SRY基因全序列。结合现有的灵长类其他物种序列进行分析,验证了HMG盒的保守性。通过构建系统发育树,比较旧大陆猴和人猿超科两个类群内和类群间HMG盒侧翼序列Ka/Ks的比率。有趣的是,人猿超科两物种比较呈现较高的Ka/Ks比值,但在旧大陆猴中及旧大陆猴与狨猴间的Ka/Ks比值显著低于人猿超科的,呈现很不同的格局。同时,对于HMG盒序列,Ka/Ks比值在  相似文献   

20.
An excess of nonsynonymous substitutions over synonymous ones is an important indicator of positive selection at the molecular level. A lineage that underwent Darwinian selection may have a nonsynonymous/synonymous rate ratio (dN/dS) that is different from those of other lineages or greater than one. In this paper, several codon-based likelihood models that allow for variable dN/dS ratios among lineages were developed. They were then used to construct likelihood ratio tests to examine whether the dN/dS ratio is variable among evolutionary lineages, whether the ratio for a few lineages of interest is different from the background ratio for other lineages in the phylogeny, and whether the dN/dS ratio for the lineages of interest is greater than one. The tests were applied to the lysozyme genes of 24 primate species. The dN/dS ratios were found to differ significantly among lineages, indicating that the evolution of primate lysozymes is episodic, which is incompatible with the neutral theory. Maximum- likelihood estimates of parameters suggested that about nine nonsynonymous and zero synonymous nucleotide substitutions occurred in the lineage leading to hominoids, and the dN/dS ratio for that lineage is significantly greater than one. The corresponding estimates for the lineage ancestral to colobine monkeys were nine and one, and the dN/dS ratio for the lineage is not significantly greater than one, although it is significantly higher than the background ratio. The likelihood analysis thus confirmed most, but not all, conclusions Messier and Stewart reached using reconstructed ancestral sequences to estimate synonymous and nonsynonymous rates for different lineages.   相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号