Similar articles
1.
Retracing the trajectories of past genetic events is crucial to understanding the structure of the genome, both in individuals and across populations. A haplotype describes a string of polymorphic sites along a DNA segment. Haplotype diversity is due to mutations creating new variants, and to recombinations and gene conversions that mix and redistribute these variants among individual chromosomes in populations. A number of studies have revealed a relatively simple pattern of haplotype diversity in the human genome, dominated by a few common haplotypes representing founder ancestral ones. New haplotypes are usually rare and have a limited geographic distribution. We propose a method to derive a new haplotype from a set of putative ancestral haplotypes, once mutations are in place, through minimal recombination and gene conversion pathways. We describe classes of pathways that represent the whole set of minimal pathways leading to a new haplotype. We show that obtaining this set of pathways can be represented as a problem of finding "secondary structures" of minimum energy. We present a polynomial algorithm solving this folding problem.

2.
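Nuclear and chloroplast genes were used to analyze the genetic diversity and genetic structure of 20 samples from 2 populations of Isoetes taiwanensis Devol, and recommendations for the conservation of the species are proposed. The nuclear gene results showed 18 haplotypes across the 2 populations, with a haplotype diversity of 0.9842, a nucleotide diversity of 0.00215, between-population gene flow (Nm) of 4.26, and a between-population differentiation coefficient (Gst) of 0.05543. The chloroplast DNA results showed 6 haplotypes, with a haplotype diversity of 0.6211 and a nucleotide diversity of 0.00326. Both haplotype diversity and nucleotide diversity were higher in the Taipei population than in the Kinmen population. AMOVA of the nuclear and chloroplast genes showed that genetic variation within populations was far greater than that between populations. The nucleotide diversity of I. taiwanensis is low relative to other Isoetes species, which may be related to its ploidy and narrow distribution. The formation of population structure may be related to gene flow and founder effects. Both in situ and ex situ conservation should be used to protect the populations.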
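The haplotype diversity values reported in this abstract are the kind of statistic that can be computed from a list of haplotype labels. The following is a minimal sketch, assuming the common unbiased estimator h = n/(n-1) · (1 - Σ p_i²); the abstract does not state which estimator or software was used, and the function name and sample data are hypothetical.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Unbiased haplotype (gene) diversity: n/(n-1) * (1 - sum of p_i^2).

    `haplotypes` is a list with one haplotype label per sampled sequence.
    The estimator is an assumption; the abstract does not name its method.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled sequences")
    counts = Counter(haplotypes)
    # Sum of squared haplotype frequencies (expected homozygosity).
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical sample of six sequences carrying three haplotypes.
sample = ["H1", "H1", "H2", "H3", "H3", "H3"]
hd = haplotype_diversity(sample)
```

A sample in which every sequence is distinct gives a value of 1, and a sample with a single haplotype gives 0, which is why values close to 1 (such as the nuclear estimate above) indicate that most sampled sequences differ.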

3.
Simple but exact statistical tests for detecting a cluster of associated nucleotide changes in DNA are presented. The tests are based on the linear distribution of a set of s sites among a total of n sites, where the s sites may be the variable sites, sites of insertion/deletion, or sites categorized in some other way. These tests are especially useful for detecting gene conversion and intragenic recombination in a sample of DNA sequences. In this case, the sites of interest are those that correspond to particular ways of splitting the sequences into two groups (e.g., sequences A and D vs. sequences B, C, and E-J). Each such split is termed a phylogenetic partition. Application of these methods to a well-documented case of gene conversion in human gamma-globin genes shows that sites corresponding to two of the three observed partitions are significantly clustered, whereas application to hominoid mitochondrial DNA sequences (among which no recombination is expected to occur) shows no evidence of such clustering. This indicates that clustering of partition-specific sites is largely due to intragenic recombination or gene conversion. Alternative hypotheses explaining the observed clustering of sites, such as biased selection or mutation, are discussed.

4.
The Mhc is a highly conserved gene region especially interesting to geneticists because of the rapid evolution of gene families found within it. High levels of Mhc genetic diversity often exist within populations. The chicken Mhc is the focus of considerable interest because of the strong, reproducible infectious disease associations found with particular Mhc-B haplotypes. Sequence data for Mhc-B haplotypes have been lacking, thereby hampering efforts to systematically resolve which genes within the Mhc-B region contribute to well-defined Mhc-B-associated disease responses. To better understand the genetic factors that generate and maintain genomic diversity in the Mhc-B region, we determined the complete genomic sequence for 14 Mhc-B haplotypes across a region of 59 kb that encompasses 14 gene loci ranging from BG1 to BF2. We compared the sequences using alignment, phylogenetic, and genome profiling methods. We identified gene structural changes, synonymous and non-synonymous polymorphisms, insertions and deletions, and allelic gene rearrangements or exchanges that contribute to haplotype diversity. Mhc-B haplotype diversity appears to be generated by a number of mutational events. We found evidence that some Mhc-B haplotypes are derived by whole- and partial-allelic gene conversion and homologous reciprocal recombination, in addition to nucleotide mutations. These data provide a framework for further analyses of disease associations found among these 14 haplotypes and additional haplotypes segregating and evolving in wild and domesticated populations of chickens.

5.
RFLP haplotypes at the alpha-globin gene complex have been examined in 190 individuals from the Niokolo Mandenka population of Senegal: haplotypes were assigned unambiguously for 210 chromosomes. The Mandenka share with other African populations a sample size-independent haplotype diversity that is much greater than that in any non-African population: the number of haplotypes observed in the Mandenka is typically twice that seen in the non-African populations sampled to date. Of these haplotypes, 17.3% had not been observed in any previous surveys, and a further 19.1% have previously been reported only in African populations. The haplotype distribution shows clear differences between African and non-African peoples, but this is on the basis of population-specific haplotypes combined with haplotypes common to all. The relationship of the newly reported haplotypes to those previously recorded suggests that several mutation processes, particularly recombination as homologous exchange or gene conversion, have been involved in their production. A computer program based on the expectation-maximization (EM) algorithm was used to obtain maximum-likelihood estimates of haplotype frequencies for the entire data set: good concordance between the unambiguous and EM-derived sets was seen for the overall haplotype frequencies. Some of the low-frequency haplotypes reported by the estimation algorithm differ greatly, in structure, from those haplotypes known to be present in human populations, and they may not represent haplotypes actually present in the sample.
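To illustrate the EM idea mentioned in this abstract, here is a minimal sketch for the simplest case of two biallelic loci, where only double heterozygotes have ambiguous phase. It is not the program used in the study: the genotype encoding (count of allele "1" at each locus), the function name, and the toy data are all assumptions made for illustration, and Hardy-Weinberg proportions are assumed when weighting the two possible phases.

```python
def em_haplotype_freqs(genotypes, n_iter=100, tol=1e-10):
    """Estimate two-locus haplotype frequencies by EM.

    `genotypes` is a list of (g1, g2) pairs, each the count (0, 1, 2) of
    allele "1" at locus 1 and locus 2 (hypothetical encoding).
    Returns a dict mapping "00", "01", "10", "11" to frequencies.
    """
    haps = ["00", "01", "10", "11"]
    alleles = {0: (0, 0), 1: (0, 1), 2: (1, 1)}
    freqs = {h: 0.25 for h in haps}  # uniform starting point
    n_chrom = 2 * len(genotypes)

    for _ in range(n_iter):
        counts = {h: 0.0 for h in haps}
        for g1, g2 in genotypes:
            if g1 == 1 and g2 == 1:
                # E-step: double heterozygote is either 00/11 or 01/10.
                w_cis = freqs["00"] * freqs["11"]
                w_trans = freqs["01"] * freqs["10"]
                total = w_cis + w_trans
                p_cis = w_cis / total if total > 0 else 0.5
                counts["00"] += p_cis
                counts["11"] += p_cis
                counts["01"] += 1.0 - p_cis
                counts["10"] += 1.0 - p_cis
            else:
                # Phase is unambiguous when at most one locus is heterozygous.
                a, b = alleles[g1], alleles[g2]
                counts[f"{a[0]}{b[0]}"] += 1.0
                counts[f"{a[1]}{b[1]}"] += 1.0
        # M-step: expected haplotype counts become new frequencies.
        new_freqs = {h: counts[h] / n_chrom for h in haps}
        delta = max(abs(new_freqs[h] - freqs[h]) for h in haps)
        freqs = new_freqs
        if delta < tol:
            break
    return freqs


# Toy data: two unambiguous homozygotes and one double heterozygote.
freqs = em_haplotype_freqs([(0, 0), (2, 2), (1, 1)])
```

In this toy data set the unambiguous individuals carry only "00" and "11", so the EM assigns the double heterozygote almost entirely to the 00/11 phase. The same mechanism can also assign small nonzero frequencies to haplotypes that no individual demonstrably carries, which is consistent with the caution in the abstract about low-frequency estimated haplotypes.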

6.
The killer cell Ig-like receptor (KIR) gene family encodes MHC class I-specific receptors, which regulate NK cell responses and are also expressed on subpopulations of T cells. KIR haplotypes vary in gene content, which, in combination with allelic polymorphism, extensively diversifies the KIR genotype both within and between human populations. Species comparison indicates that formation of new KIR genes and loss of old ones are frequent events, so that few genes are conserved even between closely related species. In this regard, the hominoids define a time frame that is particularly informative for understanding the processes of KIR evolution and its potential impact on killer cell biology. KIR cDNA were characterized from PBMC of three gorillas, and genomic DNA were characterized for six additional individuals. Eleven gorilla KIR genes were defined. With attainment of these data, a set of 75 KIR sequences representing five hominoid species was assembled, which also included rhesus monkey, cattle, and rodent KIR. Searching this data set for recombination events, and phylogenetic analysis using Bayesian methods, demonstrated that new KIR were usually the result of recombination between loci in which complete protein domains were shuffled. Further phylogenetic analysis of the KIR sequences after removal of confounding recombined segments showed that only two KIR genes, KIR2DL4 and KIR2DL5, have been preserved throughout hominoid evolution, and one of them, KIR2DL4, is also common to rhesus monkey and hominoids. Other KIR genes represent recombinant forms present in a minority of species, often only one, as exemplified by 8 of the 11 gorilla KIR genes.

7.
Population-genetic studies have been remarkably productive and successful in the last decade following the invention of PCR technology and the introduction of mitochondrial and microsatellite DNA markers. While mitochondrial DNA has proven powerful for genealogical and evolutionary studies of animal populations, and microsatellite sequences are the most revealing DNA markers available so far for inferring population structure and dynamics, they both have important and unavoidable limitations. To obtain a fuller picture of the history and evolutionary potential of populations, genealogical data from nuclear loci are essential, and the inclusion of other nuclear markers, i.e. single copy nuclear polymorphic (scnp) sequences, is clearly needed. Four major uncertainties have faced nuclear DNA analyses of populations: the availability of scnp markers for carrying out such analysis; technical laboratory hurdles for resolving haplotypes; difficulty in data analysis because of recombination, low divergence levels and intraspecific multifurcation evolution; and the utility of scnp markers for addressing population-genetic questions. In this review, we discuss the availability of highly polymorphic single copy DNA in the nuclear genome, describe patterns and rate of evolution of nuclear sequences, summarize past empirical and theoretical efforts to recover and analyse data from scnp markers, and examine the difficulties, challenges and opportunities faced in such studies. We show that although challenges still exist, the above-mentioned obstacles are now being removed. Recent advances in technology and increases in statistical power provide the prospect of nuclear DNA analyses becoming routine practice, allowing allele-discriminating characterization of scnp loci and microsatellite loci. This certainly will increase our ability to address more complex questions, and thereby the sophistication of genetic analyses of populations.

8.
Nosema ceranae is currently one of the major pathogens of honeybees, related to the worldwide colony losses phenomenon. The genotyping of strains based on ribosomal DNA (rDNA) can be misleading if the repeated units are not identical. The analysis of cloned rDNA fragments containing the intergenic spacer (IGS) and part of the rDNA small-subunit (SSU) gene, from N. ceranae isolates from different European and Central Asian populations, revealed a high diversity of sequences. The variability involved single-nucleotide polymorphisms and insertions/deletions, resulting in 79 different haplotypes. Two sequences from the same isolate could be as different as any pair of sequences from different samples; in contrast, identical haplotypes were also found in very different geographical origins. Consequently, haplotypes cannot be organized in a consistent phylogenetic tree, clearly indicating that rDNA is not a reliable marker for the differentiation of N. ceranae strains. The results indicate that recombination between different sequences may produce new variants, which is quite surprising in microsporidia, usually considered to have an asexual mode of reproduction. The diversity of sequences and their geographical distribution indicate that haplotypes of different lineages may occasionally be present in the same cell and undergo homologous recombination, therefore suggesting a sexual haplo-diploid cycle.

9.
Previous analyses of diploid nuclear genotypes have concluded that recombination has occurred in populations of the yeast Candida albicans. To address the possibilities of clonality and recombination in an effectively haploid genome, we sequenced seven regions of mitochondrial DNA (mtDNA) in 45 strains of C. albicans from human immunodeficiency virus-positive patients in Toronto, Canada, and 3 standard reference isolates of C. albicans, CA, CAI4, and WO-1. Among a total of 2,553 nucleotides in the seven regions, 62 polymorphic nucleotide sites and seven indels defined nine distinct mtDNA haplotypes among the 48 strains. Five of these haplotypes occurred in more than one strain, indicating clonal proliferation of mtDNA. Phylogenetic analysis of mtDNA haplotypes resulted in one most-parsimonious tree. Most of the nucleotide sites undergoing parallel change in this tree were clustered in blocks that corresponded to sequenced regions. Because of the existence of these blocks, the apparent homoplasy can be attributed to infrequent, past genetic exchange and recombination between individuals and cannot be attributed to parallel mutation. Among strains sharing the same mtDNA haplotypes, multilocus nuclear genotypes were more similar than expected from a random comparison of nuclear DNA genotypes, suggesting that clonal proliferation of the mitochondrial genome was accompanied by clonal proliferation of the nuclear genome.

10.
The evolutionary histories and relationships among African, Eurasian, and Pacific Island populations are investigated by using observations on five polymorphic restriction sites in the beta-globin gene cluster. We present new data on 222 chromosomes from a global sample and combine these with previously published observations on 591 chromosomes. It is shown that the data are rich in rare haplotypes and that rare variants are not helpful for standard methods of population structure analysis. Consequently, a new approach is developed. We first consider the phylogeny of beta-globin haplotypes. The roles of mutation, gene conversion, and recombination in the generation of haplotype diversity are specifically focused upon. The relationships among human populations are then inferred from the phylogenetic relationships among the haplotypes, their presence or absence, and frequencies within populations. Questions regarding whether or not a phyletic process can account for relationships among the major geographical populations and whether or not an extant human population exhibits the qualities that would be expected of an ancestral group are addressed. The results of this analysis support an African origin for modern Homo sapiens and a phyletic structuring of the major geographical regions. However, it is shown that divergence times for the various populations cannot be determined from these data.

11.
Large amounts of population-scale genetic variation data are being collected. One potentially important biological problem is to infer the population genealogical history from these genetic variation data. Partly due to recombination, the genealogical history of a set of DNA sequences in a population usually cannot be represented by a single tree. Instead, genealogy is better represented by a genealogical network, which is a compact representation of a set of correlated local genealogical trees, each for a short region of the genome and possibly with a different topology. Inference of genealogical history for a set of DNA sequences under recombination has many potential applications, including association mapping of complex diseases. In this paper, we present two new methods for reconstructing local tree topologies in the presence of recombination, which extend and improve on previous work. We first show that the "tree scan" method can be converted to a probabilistic inference method based on a hidden Markov model. We then focus on developing a novel local tree inference method called RENT that is both accurate and scalable to larger data. Through simulation, we demonstrate the usefulness of our methods by showing that the hidden-Markov-model-based method is comparable with the original method in terms of accuracy. We also show that RENT is competitive with other methods in terms of inference accuracy: its inference error rate is often lower, and it can handle large data.

12.
Population bottlenecks can restrict variation at functional genes, reducing the ability of populations to adapt to new and changing environments. Understanding how populations generate adaptive genetic variation following bottlenecks is therefore central to evolutionary biology. Genes of the major histocompatibility complex (MHC) are ideal models for studying adaptive genetic variation due to their central role in pathogen recognition. While de novo MHC sequence variation is generated by point mutation, gene conversion can generate new haplotypes by transferring sections of DNA within and across duplicated MHC loci. However, the extent to which gene conversion generates new MHC haplotypes in wild populations is poorly understood. We developed a 454 sequencing protocol to screen MHC class I exon 3 variation across all 13 island populations of Berthelot's pipit (Anthus berthelotii). We reveal that just 11-15 MHC haplotypes were retained when the Berthelot's pipit dispersed across its island range in the North Atlantic ca. 75,000 years ago. Since then, at least 26 new haplotypes have been generated in situ across populations. We show that most of these haplotypes were generated by gene conversion across divergent lineages, and that the rate of gene conversion exceeded that of point mutation by an order of magnitude. Gene conversion resulted in significantly more changes at nucleotide sites directly involved with pathogen recognition, indicating selection for functional variants. We suggest that the creation of new variants by gene conversion is the predominant mechanism generating MHC variation in genetically depauperate populations, thus allowing them to respond to pathogenic challenges.

13.
Two outstanding problems pertaining to the population dynamics and evolution of the t complex in mice concern the frequency of t haplotypes in the wild and the degree to which these haplotypes recombine with their wild-type homologs. To address these problems, the frequency and distribution of several t complex-associated restriction fragment variants in wild mice were estimated. Sixty-four versions of chromosome 17 from wild-derived Mus musculus musculus and Mus musculus domesticus were examined with DNA probes for six loci within the t complex that exhibit restriction fragment variation. All six probes detect variants that have heretofore been found exclusively associated with the t complex. Haplotype analysis of wild-derived chromosomes revealed a high frequency (45.3%) of "mosaic" haplotypes with a mixture of t-specific and wild-type variants and only one haplotype with t-specific variants at all six loci. When 12 well-characterized t haplotypes isolated from diverse geographic regions were analyzed, only three had a complete set of t-specific restriction fragments for the six loci examined. The preponderance of mosaic haplotypes in both groups of mice can be explained by any one of the following hypotheses: genetic recombination between t haplotypes and their wild-type homologs; the persistence in wild populations of haplotypes that have descended from ancestral partial t haplotypes; or fixation, in some wild-type haplotypes, of the restriction fragment variants that were fixed in the ancestral t haplotype. There is evidence to support all three of these hypotheses in our data. The allelic composition of some mosaic haplotypes indicates that they may have been formed by segmental recombination, either double crossing over or gene conversion, rather than by simple single crossovers. The occurrence of indistinguishable mosaic haplotypes in both M. m. musculus and M. m. domesticus suggests that these haplotypes are ancestral rather than recently derived.

14.
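To understand the genetic diversity and phylogeny of different geographic populations of wolves in China, samples were collected from four regions within the main range of wolves in China (Qinghai, Xinjiang, Inner Mongolia, and Jilin). Molecular techniques were used to obtain sequences of hypervariable region I (HVR I) of the mitochondrial DNA control region from 44 individuals, and 40 partial mitochondrial Cyt b sequences. A total of 51 variable sites were detected in the control region HVR I, a site variation rate of 8.76%; 31 variable sites were found in the partial Cyt b sequences, a site variation rate of 5.33%. No insertions or deletions were observed, and all variation consisted of base substitutions. Sixteen mitochondrial HVR I haplotypes were defined; the Jilin and Inner Mongolia populations share haplotypes, suggesting that these two populations are closely related. Among the four geographic populations, the Xinjiang population had relatively high genetic diversity (0.94). The overall mean nucleotide diversity of Chinese wolf populations was 2.27%, which is relatively high compared with wolf populations in other countries and regions. The phylogenetic tree constructed from mitochondrial HVR I haplotypes shows that Chinese wolves fall into two major clades, with the Qinghai population on the Qinghai-Tibet Plateau forming a separate clade, suggesting that it may have evolved as an independent population over a long period. Based on the mitochondrial Cyt b genetic distances between the Qinghai population and the Xinjiang and Inner Mongolia populations, the two lineages of Chinese wolves may have diverged after the rapid geological uplift of the Qinghai-Tibet Plateau during the Pleistocene glacial period, with a divergence time of about 1.1 MY ago.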
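Two of the summary statistics in this abstract, the site variation rate (fraction of aligned sites that vary) and nucleotide diversity, can be computed directly from an alignment. The sketch below assumes that nucleotide diversity means the average number of pairwise differences per site, which is the usual definition but is not spelled out in the abstract; the function names and the toy alignment are hypothetical.

```python
from itertools import combinations


def variable_site_fraction(seqs):
    """Fraction of alignment columns that contain more than one state."""
    length = len(seqs[0])
    variable = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    return variable / length


def nucleotide_diversity(seqs):
    """Mean number of pairwise differences per site (assumed definition).

    `seqs` must be equal-length aligned sequences without gaps.
    """
    n = len(seqs)
    length = len(seqs[0])
    if n < 2 or any(len(s) != length for s in seqs):
        raise ValueError("need at least two aligned sequences of equal length")
    # Count mismatching positions over all unordered sequence pairs.
    total_diffs = sum(
        sum(a != b for a, b in zip(s, t)) for s, t in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) // 2
    return total_diffs / (n_pairs * length)


# Hypothetical toy alignment of three four-base sequences.
toy = ["ACGT", "ACGA", "TCGA"]
frac = variable_site_fraction(toy)
pi = nucleotide_diversity(toy)
```

In the toy alignment two of the four columns vary, and the three sequence pairs differ at 1, 2, and 1 positions, so the per-site average is 4 / (3 × 4).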

15.
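Repeated climatic changes during the Quaternary glaciations had a major impact on the geographic distribution and population genetic structure of plants on the Qinghai-Tibet Plateau and in adjacent regions. In this study, the chloroplast rpl32-trnL intergenic spacer and a nuclear gene (phytochelatin synthase, PS) were sequenced in 148 individuals from 15 populations of Typha laxmannii on the northeastern Qinghai-Tibet Plateau and in adjacent regions. Two chloroplast haplotypes and 8 nuclear haplotypes were found. All haplotypes were shared, and the plateau populations had no private haplotypes. Chloroplast and nuclear genetic diversity in the adjacent-region populations were 4 times and 2 times those of the plateau populations, respectively. Genetic differentiation among the plateau populations was markedly higher than among the adjacent-region populations, and it lay mainly between the eastern and western plateau populations. The results indicate that postglacial recolonization of the plateau platform from multiple refugia, and the resulting founder effects, produced the present genetic diversity and phylogeographic pattern of T. laxmannii on the northeastern Qinghai-Tibet Plateau and in adjacent regions.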

16.
An accurate estimate of the extent of recombination is important whenever phylogenetic methods are applied to potentially recombining nucleotide sequences. Here, data sets from viruses, bacteria, and mitochondria were examined for deviations from clonality using a new approach for detecting and measuring recombination. The apparent rate heterogeneity (ARH) among sites in a sequence alignment can be inflated as an artifact of recombination. However, the composition of polymorphic sites will differ in a data set with recombination-generated ARH versus a clonal data set that exhibits the equivalent degree of rate heterogeneity. This is because recombinant data sets, encompassing regions of conflicting phylogenetic history, tend to yield "starlike" trees that are superficially similar to those inferred from clonal data sets with weak phylogenetic signal throughout. Specifically, a recombinant data set will be unexpectedly rich in conflicting phylogenetic information compared with clonally generated data sets supporting the same tree shape. Its value of q (defined as the proportion of two-state parsimony-informative sites to all polymorphic sites) will be greater than that expected for nonrecombinant data. The method proposed here, the informative-sites test, compares the value of q against a null distribution of values found using Monte Carlo-simulated data evolved under the null hypothesis of clonality. A significant excess of q indicates that the assumption of clonality is not valid and hence that the ARH in the data is at least partly an artifact of recombination. Investigations of the procedure using simulated sequences indicated that it can successfully detect and measure recombination and that it is unlikely to produce "false positives." Simulations also showed that for recombinant data, naïve use of maximum-likelihood models incorporating rate heterogeneity can lead to overestimation of the time to the most recent common ancestor.
Application of the test to real data revealed for the first time that populations of viruses, like those of bacteria, can be brought close to complete linkage equilibrium by pervasive recombination. On the other hand, the test did not reject the hypothesis of clonality when applied to a data set from the coding region of human mitochondrial DNA, despite its high level of ARH and homoplasy.
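The statistic q defined in this abstract can be sketched in a few lines. The code below only computes the observed q from an alignment; it assumes that a two-state parsimony-informative site is a column with exactly two states, each present in at least two sequences, and it does not implement the Monte Carlo null distribution that the informative-sites test compares q against. The function name and toy alignment are hypothetical.

```python
from collections import Counter


def informative_sites_q(alignment):
    """Proportion of two-state parsimony-informative sites among polymorphic sites.

    `alignment` is a list of equal-length sequences. A column counts as
    two-state parsimony-informative when it has exactly two states and each
    state occurs in at least two sequences (assumed reading of the abstract).
    """
    polymorphic = 0
    two_state_informative = 0
    for column in zip(*alignment):
        counts = Counter(column)
        if len(counts) < 2:
            continue  # invariant site
        polymorphic += 1
        if len(counts) == 2 and min(counts.values()) >= 2:
            two_state_informative += 1
    if polymorphic == 0:
        raise ValueError("alignment has no polymorphic sites")
    return two_state_informative / polymorphic


# Hypothetical toy alignment: one invariant column, two informative
# two-state columns, and one three-state column.
toy_alignment = ["AAGT", "AAGC", "ATCT", "ATCA"]
q = informative_sites_q(toy_alignment)
```

To turn this into a test as the abstract describes, the observed q would be compared with values of q from data simulated under clonality; that simulation step depends on a substitution model and tree and is deliberately left out here.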

17.
Recombination is an important evolutionary mechanism responsible for creating the patterns of haplotype variation observable in human populations. Recently, there has been extensive research on understanding the fine-scale variation in recombination across the human genome using DNA polymorphism data. Historical recombination events leave signature patterns in haplotype data. A nonparametric approach for estimating the number of historical recombination events is to compute the minimum number of recombination events in the history of a set of haplotypes. In this paper, we provide new and improved methods for computing lower bounds on the minimum number of recombination events. These methods are shown to detect a higher number of recombination events for a haplotype dataset from a region in the lipoprotein lipase gene than previous lower bounds. We apply our methods to two datasets for which recombination hotspots have been experimentally determined and demonstrate a high density of detectable recombination events in the regions annotated as recombination hotspots. The programs implementing the methods in this paper are available at www.cs.ucsd.edu/users/vibansal/RecBounds/.
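For context on what a lower bound on the minimum number of recombination events looks like, the sketch below implements the classical four-gamete-test bound of Hudson and Kaplan, which is one of the earlier bounds that methods like those in this abstract aim to improve on. It is not the method of the paper or of the RecBounds programs. It assumes biallelic sites coded as "0"/"1" and no recurrent mutation; the function names and toy data are hypothetical.

```python
from itertools import combinations


def four_gamete_incompatible(haps, i, j):
    """True if sites i and j show all four gametes 00, 01, 10, 11."""
    return len({(h[i], h[j]) for h in haps}) == 4


def hudson_kaplan_bound(haps):
    """Classical lower bound on the number of recombination events.

    Each incompatible site pair (i, j) implies at least one recombination
    somewhere between site i and site j. The bound is the largest number of
    such intervals that do not overlap, found greedily by right endpoint.
    """
    n_sites = len(haps[0])
    intervals = [
        (i, j)
        for i, j in combinations(range(n_sites), 2)
        if four_gamete_incompatible(haps, i, j)
    ]
    intervals.sort(key=lambda ij: ij[1])
    bound = 0
    last_right = 0
    for i, j in intervals:
        # Intervals sharing only an endpoint site need separate events.
        if i >= last_right:
            bound += 1
            last_right = j
    return bound


# Hypothetical toy data: every pair of the three sites is incompatible.
toy_haps = ["000", "011", "101", "110"]
rm = hudson_kaplan_bound(toy_haps)
```

Because this bound only looks at pairs of sites, it can miss events that leave a signal only when several sites are considered jointly, which is the kind of gap that improved lower bounds are designed to close.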

18.
The major histocompatibility complex (MHC) is recognised as one of the most important genetic regions in relation to common human disease. Advancement in identification of MHC genes that confer susceptibility to disease requires greater knowledge of sequence variation across the complex. Highly duplicated and polymorphic regions of the human genome such as the MHC are, however, somewhat refractory to some whole-genome analysis methods. To address this issue, we are employing a bacterial artificial chromosome (BAC) cloning strategy to sequence entire MHC haplotypes from consanguineous cell lines as part of the MHC Haplotype Project. Here we present 4.25 Mb of the human haplotype QBL (HLA-A26-B18-Cw5-DR3-DQ2) and compare it with the MHC reference haplotype and with a second haplotype, COX (HLA-A1-B8-Cw7-DR3-DQ2), that shares the same HLA-DRB1, -DQA1, and -DQB1 alleles. We have defined the complete gene, splice variant, and sequence variation contents of all three haplotypes, comprising over 259 annotated loci and over 20,000 single nucleotide polymorphisms (SNPs). Certain coding sequences vary significantly between different haplotypes, making them candidates for functional and disease-association studies. Analysis of the two DR3 haplotypes allowed delineation of the shared sequence between two HLA class II-related haplotypes differing in disease associations and the identification of at least one of the sites that mediated the original recombination event. The levels of variation across the MHC were similar to those seen for other HLA-disparate haplotypes, except for a 158-kb segment that contained the HLA-DRB1, -DQA1, and -DQB1 genes and showed very limited polymorphism compatible with identity-by-descent and relatively recent common ancestry (<3,400 generations). 
These results indicate that the differential disease associations of these two DR3 haplotypes are due to sequence variation outside this central 158-kb segment, and that shuffling of ancestral blocks via recombination is a potential mechanism whereby certain DR-DQ allelic combinations, which presumably have favoured immunological functions, can spread across haplotypes and populations.

19.
Miscanthus sinensis is a dominant perennial C4 grass with the potential to be a feedstock crop in North America, Europe, and China. Variation in chloroplast DNA sequence was used to obtain information regarding the genetic diversity and structure of populations of M. sinensis in southwest China. Chloroplast DNA trnL-F and rpl20-rps12 sequences from seventy-five individuals representing 14 populations of M. sinensis were used to study the sequence variation. Seven haplotypes and 16 polymorphic sites (2.7%) were identified. Five substitutions, 6 indels, and 5 sites with both substitutions and indels were detected by joining these two gene segments. The genetic diversity within the studied populations (haplotype diversity, h = 0.561; nucleotide diversity, π = 0.00504) was low; this may reflect the relatively larger effect of genetic drift on chloroplast DNA, which has a smaller effective population size than nuclear DNA. Genetic variance within the populations was higher than that between the populations, suggesting that higher gene flow may exist within these populations. The parsimony network of the seven haplotypes indicated that H1 and H2 may be ancient haplotypes, which may help guide future research on the origin of M. sinensis. Our results provide information on the genetic diversity and structure of M. sinensis and may assist future studies on the phylogeography of M. sinensis.

20.
beta-Globin gene haplotypes obtained in Polynesian Samoans were similar to those described in Southern Chinese. An atypical HindIII restriction fragment length polymorphism detected with pRK29, a 3' beta-globin gene probe, was present at a gene frequency of 7% in Samoans. Haplotype patterns suggest that this polymorphism may have arisen by 1 or 2 mutational events. DNA haplotypes derived from the beta-globin gene cluster confirm nuclear and mitochondrial DNA data that Polynesian precursor populations were East Asian in origin.
