首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 140 毫秒
1.
DNA Polymorphism Detectable by Restriction Endonucleases   总被引:82,自引:15,他引:67       下载免费PDF全文
Data on DNA polymorphisms detected by restriction endonucleases are rapidly accumulating. With the aim of analyzing these data, several different measures of nucleon (DNA segment) diversity within and between populations are proposed, and statistical methods for estimating these quantities are developed. These statistical methods are applicable to both nuclear and nonnuclear DNAs. When evolutionary change of nucleons occurs mainly by mutation and genetic drift, all the measures can be expressed in terms of the product of mutation rate per nucleon and effective population size. A method for estimating nucleotide diversity from nucleon diversity is also presented under certain assumptions. It is shown that DNA divergence between two populations can be studied either by the average number of restriction site differences or by the average number of nucleotide differences. In either case, a large number of different restriction enzymes should be used for studying phylogenetic relationships among related organisms, since the effect of stochastic factors on these quantities is very large. The statistical methods developed have been applied to data of Shah and Langley on mitochondrial (mt)DNA from Drosophila melanogaster, simulans and virilis. This application has suggested that the evolutionary change of mtDNA in higher animals occurs mainly by nucleotide substitution rather than by deletion and insertion. The evolutionary distances among the three species have also been estimated.  相似文献   

2.
Evolutionary Relationship of DNA Sequences in Finite Populations   总被引:74,自引:27,他引:47       下载免费PDF全文
Fumio Tajima 《Genetics》1983,105(2):437-460
With the aim of analyzing and interpreting data on DNA polymorphism obtained by DNA sequencing or restriction enzyme technique, a mathematical theory on the expected evolutionary relationship among DNA sequences (nucleons) sampled is developed under the assumption that the evolutionary change of nucleons is determined solely by mutation and random genetic drift. The statistical property of the number of nucleotide differences between randomly chosen nucleons and that of heterozygosity or nucleon diversity is investigated using this theory. These studies indicate that the estimates of the average number of nucleotide differences and nucleon diversity have a large variance, and a large part of this variance is due to stochastic factors. Therefore, increasing sample size does not help reduce the variance significantly. The distribution of sample allele (nucleomorph) frequencies is also studied, and it is shown that a small number of samples are sufficient in order to know the distribution pattern.  相似文献   

3.
We show that the number of lineages ancestral to a sample, as a function of time back into the past, which we call the number of lineages as a function of time (NLFT), is a nearly deterministic property of large-sample gene genealogies. We obtain analytic expressions for the NLFT for both constant-sized and exponentially growing populations. The low level of stochastic variation associated with the NLFT of a large sample suggests using the NLFT to make estimates of population parameters. Based on this, we develop a new computational method of inferring the size and growth rate of a population from a large sample of DNA sequences at a single locus. We apply our method first to a sample of 1,212 mitochondrial DNA (mtDNA) sequences from China, confirming a pattern of recent population growth previously identified using other techniques, but with much smaller confidence intervals for past population sizes due to the low variation of the NLFT. We further analyze a set of 63 mtDNA sequences from blue whales (BWs), concluding that the population grew in the past. This calls for reevaluation of previous studies that were based on the assumption that the BW population was fixed.  相似文献   

4.
Although a major component of fitness, male reproductive success is generally extremely difficult to estimate. As a result, genetic methods and maximum likelihood models have been developed to estimate male parentage, but all are limited in practice by the degree of genetic variation observable. Scoring individuals phenotypically at a large number of random loci exhibiting dominance (e.g. RAPD markers) may provide a means of detecting sufficient genetic variation. Dominance, however, represents a loss of information and therefore greater variation in the estimate of paternity. A mixture model describing mating in a population is presented to quantify the trade-off between marker types when estimates of male fertility are sought. A sample size 1.5-2.0 times greater is required for dominant markers under some conditions to obtain the same confidence in fertility estimates as for codominant markers, although with large sample sizes the fertility estimates are similar for either marker type. Since the number of dominant DN A markers is not limited in the same manner as is the number of codominant protein markers, one's confidence in the estimates can be increased above that possible from proteins by surveying additional loci. However, for a fixed sample size a trade-off exists between the number of progeny assayed per female and the number of loci surveyed. In many cases more progeny per female provide better estimates of fertility than more loci.  相似文献   

5.
The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.  相似文献   

6.
We have surveyed the region of the X chromosome of Drosophila melanogaster which encodes the yellow, achaete and scute genes for restriction map variation. Two natural populations, one from North Carolina, U.S.A. and the other from southern Spain were screened for variation at about 70 restriction sites and for variation due to DNA insertion or deletion events in 120 kilobases of DNA. Mean heterozygosity per nucleotide was estimated to be 0.0024 and 15 large insertions were found in the 49 chromosomes screened. Extensive disequilibrium between polymorphic sites were found across much of the region in the North Carolina population. The frequency of large insertions, which usually correspond to transposable genetic elements, is significantly lower than has been observed in autosomal regions of the genome. This is predicted for X-linked loci by certain models of transposable element evolution, where copy number is restricted by virtue of the recessive deleterious effects of the insertions. Our results appear to support such models. The deficiency of insertions may in this case be enhanced by hitch-hiking effects arising from the high level of disequilibrium.  相似文献   

7.
Population genetic studies provide insights into the evolutionary processes that influence the distribution of sequence variants within and among wild populations. FST is among the most widely used measures for genetic differentiation and plays a central role in ecological and evolutionary genetic studies. It is commonly thought that large sample sizes are required in order to precisely infer FST and that small sample sizes lead to overestimation of genetic differentiation. Until recently, studies in ecological model organisms incorporated a limited number of genetic markers, but since the emergence of next generation sequencing, the panel size of genetic markers available even in non-reference organisms has rapidly increased. In this study we examine whether a large number of genetic markers can substitute for small sample sizes when estimating FST. We tested the behavior of three different estimators that infer FST and that are commonly used in population genetic studies. By simulating populations, we assessed the effects of sample size and the number of markers on the various estimates of genetic differentiation. Furthermore, we tested the effect of ascertainment bias on these estimates. We show that the population sample size can be significantly reduced (as small as n = 4–6) when using an appropriate estimator and a large number of bi-allelic genetic markers (k>1,000). Therefore, conservation genetic studies can now obtain almost the same statistical power as studies performed on model organisms using markers developed with next-generation sequencing.  相似文献   

8.
A method is presented for the estimation of nucleotide diversity and genetic structure of populations from RAPD (random amplified polymorphic DNA) data. It involves a modification of the technique developed by Lynch and Crease (1990) for the case of restriction sites as survey data. As new elements the method incorporates (i) dominance correction, (ii) values of asexual reproduction of the populations sampled, and (iii) an analytical variance of the number of nucleotide substitutions per site. Sampling was carried out at two geographic scales for three aphid species. At a macrogeographic scale, populations of Rhopalosiphum padi did not show statistical genetic differentiation. Aphis gossypii and Myzus persicae, which were sampled at a microgeographic scale, showed a higher genetic differentiation than R. padi, it being statistically significant in M. persicae. The major sources of sampling variance within- and between-populations were found to be nucleotide (i.e., the number of alleles used as a function of the number of primers used) and population (i.e., sample size) sampling. Extremely low estimates of nucleotide diversity were obtained for the species studied here. This result is consistent with previous reports on genetic diversity for the same or other aphid species which were based on allozyme polymorphism, mitochondrial DNA variation and qualitative analyses of RAPDs.  相似文献   

9.
This study compares estimates of the census size of the spawning population with genetic estimates of effective current and long-term population size for an abundant and commercially important marine invertebrate, the brown tiger prawn (Penaeus esculentus). Our aim was to focus on the relationship between genetic effective and census size that may provide a source of information for viability analyses of naturally occurring populations. Samples were taken in 2001, 2002 and 2003 from a population on the east coast of Australia and temporal allelic variation was measured at eight polymorphic microsatellite loci. Moments-based and maximum-likelihood estimates of current genetic effective population size ranged from 797 to 1304. The mean long-term genetic effective population size was 9968. Although small for a large population, the effective population size estimates were above the threshold where genetic diversity is lost at neutral alleles through drift or inbreeding. Simulation studies correctly predicted that under these experimental conditions the genetic estimates would have non-infinite upper confidence limits and revealed they might be overestimates of the true size. We also show that estimates of mortality and variance in family size may be derived from data on average fecundity, current genetic effective and census spawning population size, assuming effective population size is equivalent to the number of breeders. This work confirms that it is feasible to obtain accurate estimates of current genetic effective population size for abundant Type III species using existing genetic marker technology.  相似文献   

10.
Estimation of allele frequencies for VNTR loci   总被引:9,自引:4,他引:5       下载免费PDF全文
VNTR loci provide valuable information for a number of fields of study involving human genetics, ranging from forensics (DNA fingerprinting and paternity testing) to linkage analysis and population genetics. Alleles of a VNTR locus are simply fragments obtained from a particular portion of the DNA molecule and are defined in terms of their length. The essential element of a VNTR fragment is the repeat, which is a short sequence of basepairs. The core of the fragment is composed of a variable number of identical repeats that are linked in tandem. A sample of fragments from a population of individuals exhibits substantial variation in length because of variation in the number of repeats. Each distinct fragment length defines an allele, but any given fragment is measured with error. Therefore the observed distribution of fragment lengths is not discrete but is continuous, and determination of distinct allele classes is not straightforward. A mixture model is the natural statistical method for estimating the allele frequencies of VNTR loci. In this article we develop nonparametric methods for obtaining the distribution of allele sizes and estimates of their frequencies. Methods for obtaining maximum-likelihood estimates are developed. In addition, we suggest an empirical Bayes method to improve the maximum-likelihood estimates of the gene frequencies; the empirical Bayes procedure effects a local smoothing. The latter method works particularly well when measurement error is large relative to the repeat size, because the estimated distribution of allele frequencies when maximum likelihood is used is unreliable because of an alternating pattern of over- and underestimation. We define alleles and estimate the allele frequencies for two VNTR loci from the human genome (D17S79 and D2S44), from data obtained from Lifecodes, Inc.  相似文献   

11.
Haney RA  Silliman BR  Rand DM 《Heredity》2007,98(5):294-302
The pelagic larval stage of most coral reef fishes might allow extensive dispersal or, alternatively, some level of local recruitment might be important. Molecular markers can be used to obtain indirect estimates of dispersal to evaluate these alternatives, yet the extent of gene flow among populations is known for only a small number of species. The use of such markers must take into account the properties of the markers and the demographic history of the population when making inferences about current gene flow. In the Caribbean bluehead wrasse, Thalassoma bifasciatum, previous studies have found both substantial levels of local recruitment, in studies interpreting otolith microchemistry and, conversely, a lack of genetic differentiation inferred from mitochondrial DNA (mtDNA) restriction-fragment length polymorphism (RFLP) data and allozymes. However, if subtle differentiation exists, larger sample sizes and highly variable markers may be required to discern it. Here we present results from mitochondrial control region sequence and microsatellite data that indicate a lack of genetic differentiation at both small and large spatial scales. However, historical processes, such as changes in population size, may have affected the current distribution of genetic variation.  相似文献   

12.
To empirically determine the effects of sample size on commonly used measures of average genetic diversity, we genotyped 200 song sparrows Melospiza melodia from two populations, one genetically depauperate (n=100) and the other genetically diverse (n=100), using eight microsatellite loci. These genotypes were used to randomly create 10,000 datasets of differing sizes (5 to 50) for each population to determine what the effects of sample size might be on several estimates of genetic diversity (number of alleles per locus, average observed heterozygosity, and unbiased average expected heterozygosity) in natural populations of conservation concern. We found that at small sample sizes of 5 to 10 individuals, estimates of unbiased heterozygosity outperformed those based on observed heterozygosity or allelic diversity for both low- and high-diversity populations. We also found that when comparing across populations in which different numbers of individuals were sampled, rarefaction provided a useful way to compare estimates of allelic diversity. We recommend that standard errors should be reported for all diversity estimators, especially when sample sizes are small. We also recommend that at least 20 to 30 individuals be sampled in microsatellite studies that assess genetic diversity when working in a population that has an unknown level of diversity. However, research on critically endangered populations (where large sample sizes are impossible or extremely difficult to obtain) should include measures of genetic diversity even if sample sizes are less than ideal. These estimates can be useful in assessing the genetic diversity of the population.  相似文献   

13.
Recent controversies surrounding models of modern human origins have focused on among-group variation, particularly the reconstruction of phylogenetic trees from mitochondrial DNA (mtDNA) and, the dating of population divergence. Problems in tree estimation have been seen as weakening the case for a replacement model and favoring a multiregional evolution model. There has been less discussion of patterns of within-group variation, although the mtDNA evidence has consistently shown the greatest diversity within African populations. Problems of interpretation abound given the numerous factors that can influence within-group variation, including the possibility of earlier divergence, differences in population size, patterns of population expansion, and variation in migration rates. We present a model of within-group phenotypic variation and apply it to a large set of craniometric data representing major Old World geographic regions (57 measurements for 1,159 cases in four regions: Europe, Sub-Saharan Africa, Australasia, and the Far East). The model predicts a linear relationship between variation within populations (the average within-group variance) and variation between populations (the genetic distance of populations to pooled phenotypic means). On a global level this relationship should hold if the long-term effective population sizes of each region are correctly specified. Other potential effects on withingroup variation are accounted for by the model. Comparison of observed and expected variances under the assumption of equal effective sizes for four regions indicates significantly greater within-group variation in Africa and significantly less within-group variation in Europe. These results suggest that the long-term effective population size was greatest in Africa. Closer examination of the model suggests that the long-term African effective size was roughly three times that of any other geographic region. Using these estimates of relative population size, we present a method for analyzing ancient population structure, which provides estimates of ancient migration. This method allows us to reconstruct migration history between geographic regions after adjustment for the effect of genetic drift on interpopulational distances. Our results show a clear isolation of Africa from other regions. We then present a method that allows direct estimation of the ancient migration matrix, thus providing us with information on the actual extent of interregional migration. These methods also provide estimates of time frames necessary to reach genetic equilibrium. The ultimate goal is extracting as much information from present-day patterns of human variation relevannt to issues of human origins. Our results are in agreement with mismatch distribution analysis of mtDNA, and they support a “weak Garden o Eden” model. In this model, modern-day variation can be explained by divergence from an initial source (perhaps Africa) into a number o small isolated populations, followed by later population expansion throughout our species. The major populationn expansions of Homo sapiens during and after the late Pleistocene have had the effect of “freezing” ancient patterns of population structure. While this is not the only possible scenario, we do note the close agreement with ecent analyses of mtDNA mismatch distibutions. © 1994 Wiley-Liss, Inc.  相似文献   

14.
The genetic variation at a compound nonrecombining haplotype system, consisting of the previously reported SB19.3 Alu insertion polymorphism and a newly identified adjacent short tandem repeat (STR), was studied in population samples from Portugal and S?o Tomé (Gulf of Guinea, West Africa). Age estimates based on the linked microsatellite variation suggest that the Alu insertion occurred about 190,000 years ago. In accordance with the global patterns of distribution of human genetic variation, the highest haplotype diversity was found in the African sample. This excess in African diversity was due to both a substantial reduction in heterozygosity at the Alu polymorphism and a lower STR variability associated with the predominant Alu insertion allele in the Portuguese sample. The high level of interpopulation differentiation observed at the Alu locus (F(ST) = 0.43) was interpreted under alternative selective and demographic scenarios. The need for compatibility between patterns of variation at the STR and Alu loci could be used to restrict the range of selection coefficients in selection-driven genetic hitchhiking frameworks and to favor demographic scenarios dominated by larger pre-expansion African population sizes. Taken together, the data show that the SB19.3 Alu-STR system is an informative marker that can be included in more extended batteries of compound haplotypes used in human evolutionary studies.  相似文献   

15.
The Norris Farms No. 36 cemetery in central Illinois has been the subject of considerable archaeological and genetic research. Both mitochondrial DNA (mtDNA) and nuclear DNA have been examined in this 700-year-old population. DNA preservation at the site was good, with about 70% of the samples producing mtDNA results and approximately 15% yielding nuclear DNA data. All four of the major Amerindian mtDNA haplogroups were found, in addition to a fifth haplogroup. Sequences of the first hypervariable region of the mtDNA control region revealed a high level of diversity in the Norris Farms population and confirmed that the fifth haplogroup associates with Mongolian sequences and hence is probably authentic. Other than a possible reduction in the number of rare mtDNA lineages in many populations, it does not appear as if European contact significantly altered patterns of Amerindian mtDNA variation, despite the large decrease in population size that occurred. For nuclear DNA analysis, a novel method for DNA-based sex identification that uses nucleotide differences between the X and Y copies of the amelogenin gene was developed and applied successfully in approximately 20 individuals. Despite the well-known problems of poor DNA preservation and the ever-present possibility of contamination with modern DNA, genetic analysis of the Norris Farms No. 36 population demonstrates that ancient DNA can be a fruitful source of new insights into prehistoric populations.  相似文献   

16.
The Finnish wolf population (Canis lupus) was sampled during three different periods (1996-1998, 1999-2001 and 2002-2004), and 118 individuals were genotyped with 10 microsatellite markers. Large genetic variation was found in the population despite a recent demographic bottleneck. No spatial population subdivision was found even though a significant negative relationship between genetic relatedness and geographic distance suggested isolation by distance. Very few individuals did not belong to the local wolf population as determined by assignment analyses, suggesting a low level of immigration in the population. We used the temporal approach and several statistical methods to estimate the variance effective size of the population. All methods gave similar estimates of effective population size, approximately 40 wolves. These estimates were slightly larger than the estimated census size of breeding individuals. A Bayesian model based on Markov chain Monte Carlo simulations indicated strong evidence for a long-term population decline. These results suggest that the contemporary wolf population size is roughly 8% of its historical size, and that the population decline dates back to late 19th century or early 20th century. Despite an increase of over 50% in the census size of the population during the whole study period, there was only weak evidence that the effective population size during the last period was higher than during the first. This may be caused by increased inbreeding, diminished dispersal within the population, and decreased immigration to the population during the last study period.  相似文献   

17.
W. Stephan  S. J. Mitchell 《Genetics》1992,132(4):1039-1045
We have estimated DNA sequence variation within and between two populations of Drosophila ananassae, using six-cutter restriction site variation at vermilion (v) and furrowed (fw). These two gene regions are located close to the centromere on the left and right X chromosome arms, respectively. In the fw region, no DNA polymorphism was detected within each population. In the v region, average heterozygosity per nucleotide was very low in both populations (pi = 0.0005 in the Burma population, and 0.0009 in the India population). These estimates are significantly lower than those from loci in more distal gene regions. The distribution of DNA polymorphisms between both populations was also striking. At fw, three fixed differences between the Burma and India populations were detected (two restriction site differences and one insertion/deletion of approximately 2 kb). At v, each DNA polymorphism in high frequency in the total sample was nearly fixed in one or the other population, although none of them reached complete fixation. The observed pattern of reduced variation within populations and fixed differences between populations appears to correlate with recombination rate. We conclude that recent hitchhiking associated with directional selection is the best explanation for this pattern. The data indicate that different selective sweeps have occurred in the two populations. The possible role of genetic hitchhiking in rapid population differentiation in gene regions of restricted recombination is discussed.  相似文献   

18.
We investigated the distribution of genetic variation and the relationship between population size and genetic variation in the rare plant Gentianella germanica using RAPD (random amplified polymorphic DNA) profiles. Plants for the analysis were grown from seeds sampled from 72 parent plants in 11 G. germanica populations of different size (40-5000 fruiting individuals). In large populations, seeds were sampled from parents in two spatially distinct subpopulations comparable in area to the total area covered by small populations. Analysis of molecular variance revealed significant genetic variation among populations (P <0.001), while genetic variation among subpopulations was marginally significant (P <0.06). Average molecular variance within subpopulations in large populations did not differ significantly from whole-population values. There was a positive correlation between genetic variation and population size (P <0.01). Genetic variation was also positively correlated with the number of seeds per plant in the field (P <0.02) and the number of flowers per planted seed in a common garden experiment (P <0.051). We conclude that gene flow among natural populations is very limited and that reduced plant fitness in small populations of G. germanica most likely has genetic causes. Management should aim to increase the size of small populations to minimize further loss of genetic variation. Because a large proportion of genetic variation is among populations, even small populations are worth preserving.  相似文献   

19.
ABSTRACT Estimating black bear (Ursus americanus) population size is a difficult but important requirement when justifying harvest quotas and managing populations. Advancements in genetic techniques provide a means to identify individual bears using DNA contained in tissue and hair samples, thereby permitting estimates of population abundance based on established mark-capture-recapture methodology. We expand on previous noninvasive population-estimation work by geographically extending sampling areas (36,848 km2) to include the entire Northern Lower Peninsula (NLP) of Michigan, USA. We selected sampling locations randomly within biologically relevant bear habitat and used barbed wire hair snares to collect hair samples. Unlike previous noninvasive studies, we used tissue samples from harvested bears as an additional sampling occasion to increase recapture probabilities. We developed subsampling protocols to account for both spatial and temporal variance in sample distribution and variation in sample quality using recently published quality control protocols using 5 microsatellite loci. We quantified genotyping errors using samples from harvested bears and estimated abundance using statistical models that accounted for genotyping error. We estimated the population of yearling and adult black bears in the NLP to be 1,882 bears (95% CI = 1,389-2,551 bears). The derived population estimate with a 15% coefficient of variation was used by wildlife managers to examine the sustainability of harvest over a large geographic area.  相似文献   

20.
Next-Generation Sequencing (NGS) technologies have dramatically revolutionised research in many fields of genetics. The ability to sequence many individuals from one or multiple populations at a genomic scale has greatly enhanced population genetics studies and made it a data-driven discipline. Recently, researchers have proposed statistical modelling to address genotyping uncertainty associated with NGS data. However, an ongoing debate is whether it is more beneficial to increase the number of sequenced individuals or the per-sample sequencing depth for estimating genetic variation. Through extensive simulations, I assessed the accuracy of estimating nucleotide diversity, detecting polymorphic sites, and predicting population structure under different experimental scenarios. Results show that the greatest accuracy for estimating population genetics parameters is achieved by employing a large sample size, despite single individuals being sequenced at low depth. Under some circumstances, the minimum sequencing depth for obtaining accurate estimates of allele frequencies and to identify polymorphic sites is , where both alleles are more likely to have been sequenced. On the other hand, inferences of population structure are more accurate at very large sample sizes, even with extremely low sequencing depth. This all points to the conclusion that under various experimental scenarios, in cost-limited population genetics studies, large sample sizes at low sequencing depth are desirable to achieve high accuracy. These findings will help researchers design their experimental set-ups and guide further investigation on the effect of protocol design for genetic research.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号