首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Estimating recombination rates from population genetic data.   总被引:21,自引:0,他引:21  
P Fearnhead  P Donnelly 《Genetics》2001,159(3):1299-1318
We introduce a new method for estimating recombination rates from population genetic data. The method uses a computationally intensive statistical procedure (importance sampling) to calculate the likelihood under a coalescent-based model. Detailed comparisons of the new algorithm with two existing methods (the importance sampling method of Griffiths and Marjoram and the MCMC method of Kuhner and colleagues) show it to be substantially more efficient. (The improvement over the existing importance sampling scheme is typically by four orders of magnitude.) The existing approaches not infrequently led to misleading results on the problems we investigated. We also performed a simulation study to look at the properties of the maximum-likelihood estimator of the recombination rate and its robustness to misspecification of the demographic model.  相似文献   

2.
As recombination plays an important role in evolution, its estimation and the identification of hotspot positions is of considerable interest. We propose a novel approach for estimating population recombination rates based on genotyping or sequence data that involves a sequential multiscale change point estimator. Our method also permits demography to be taken into account. It uses several summary statistics within a regression model fitted on suitable scenarios. Our proposed method is accurate, computationally fast, and provides a parsimonious solution by ensuring a type I error control against too many changes in the recombination rate. An application to human genome data suggests a good congruence between our estimated and experimentally identified hotspots. Our method is implemented in the R ‐package LDJump, which is freely available at https://github.com/PhHermann/LDJump .  相似文献   

3.
4.
Obtaining an accurate measure of how recombination rates vary across the genome has implications for understanding the molecular basis of recombination, its evolutionary significance and the distribution of linkage disequilibrium in natural populations. Although measuring the recombination rate is experimentally challenging, good estimates can be obtained by applying population-genetic methods to DNA sequences taken from natural populations. Statistical methods are now providing insights into the nature and scale of variation in the recombination rate, particularly in humans. Such knowledge will become increasingly important owing to the growing use of population-genetic methods in biomedical research.  相似文献   

5.
6.
Wang J  Whitlock MC 《Genetics》2003,163(1):429-446
In the past, moment and likelihood methods have been developed to estimate the effective population size (N(e)) on the basis of the observed changes of marker allele frequencies over time, and these have been applied to a large variety of species and populations. Such methods invariably make the critical assumption of a single isolated population receiving no immigrants over the study interval. For most populations in the real world, however, migration is not negligible and can substantially bias estimates of N(e) if it is not accounted for. Here we extend previous moment and maximum-likelihood methods to allow the joint estimation of N(e) and migration rate (m) using genetic samples over space and time. It is shown that, compared to genetic drift acting alone, migration results in changes in allele frequency that are greater in the short term and smaller in the long term, leading to under- and overestimation of N(e), respectively, if it is ignored. Extensive simulations are run to evaluate the newly developed moment and likelihood methods, which yield generally satisfactory estimates of both N(e) and m for populations with widely different effective sizes and migration rates and patterns, given a reasonably large sample size and number of markers.  相似文献   

7.
Estimating hypermutation rates from clonal tree data   总被引:3,自引:0,他引:3  
To understand the mechanisms underlying the varying patterns of mutations that occur during immune and autoimmune responses, estimates of the somatic hypermutation rate are critical. However, despite its significance, precise estimates of the mutation rate do not currently exist. Microdissection studies of mutating B cell clones provide an opportunity to measure this rate more accurately than previously possible. Each microdissection provides a number of clonally related sequences that, through the analysis of shared mutations, can be genealogically related to each other. The shape of these clonal trees is influenced by many processes, including the hypermutation rate. We have developed two different methods to estimate the mutation rate based on these data. These methods are applied to two sets of experimental data, one from an autoimmune response and one from the antihapten response to (4-hydroxy-3-nitrophenyl)acetyl (NP). Comparable mutation rates are estimated for both responses, 0.7-0.9 x 10(-3) and 0.9-1.1 x 10(-3) bp(-1) division(-1) for the autoimmune and NP responses, respectively. In addition to comparing the results of the two procedures, we investigate the effect on our estimate of assumptions, such as the fraction of lethal mutations.  相似文献   

8.
Stochastic versions of exponential growth models predict that even when r or λ values calculated from mean vital statistics indicate exponential growth, most of the individual populations may become extinct. Several recent papers have considered this problem and some misunderstanding has arisen due to the difference between mathematical expectation of population size and most likely course of population growth. We replicated Boyce's (1977, 1979) simulations of population growth with age structure and a single randomly varying vital statistic, and reconciled some of these differences. Mean number can be projected using the dominant eigenvalue of the mean Leslie matrix, but the modal number may be considerably lower. We compared several measures of the rate of growth of the geometric mean or median of numbers and conclude that Tuljapurkar's α is an acceptable measure.  相似文献   

9.
10.
We consider a method of approximating Weir and Cockerham's theta, an unbiased estimator of genetic population structure, using values readily available from published studies using biased estimators (Wright's F(ST) or Nei's G(ST)). The estimation algorithm is shown to be useful for both model populations and real-world avian populations. However, the correlation between Wright's F(ST) and Weir and Cockerham's theta is strong when compared among 39 empirical avian datasets. Thus, the advantage of approximating an unbiased estimator is unclear considering the small actual effect of theta's bias-removing power on empirical datasets.  相似文献   

11.
Chan KC  Wang MC 《Biometrics》2012,68(2):521-531
A prevalent sample consists of individuals who have experienced disease incidence but not failure event at the sampling time. We discuss methods for estimating the distribution function of a random vector defined at baseline for an incident disease population when data are collected by prevalent sampling. Prevalent sampling design is often more focused and economical than incident study design for studying the survival distribution of a diseased population, but prevalent samples are biased by design. Subjects with longer survival time are more likely to be included in a prevalent cohort, and other baseline variables of interests that are correlated with survival time are also subject to sampling bias induced by the prevalent sampling scheme. Without recognition of the bias, applying empirical distribution function to estimate the population distribution of baseline variables can lead to serious bias. In this article, nonparametric and semiparametric methods are developed for distribution estimation of baseline variables using prevalent data.  相似文献   

12.
Estimating growth and mortality rates from size data   总被引:1,自引:0,他引:1  
Thomas A. Ebert 《Oecologia》1973,11(3):281-298
Summary A method is presented for estimating rates of individual growth and population mortality utilizing average individual size at two times during a year. The model assumes a constant rate of mortality, Brody-Bertalanffy growth, a stationary age distribution, and recruitment confined to one month each year. A hypothetical example is presented to show the interrelationships of the growth and mortality constants, size at recruitment, asymptotic size and average individual size. Three examples are presented using data from the literature: Flathead sole (Hippoglossoides elassodon), a sea urchin(Echinus esculentus), and the crown-of-thorns starfish(Acanthaster planci). The method appears to be a means of obtaining reasonable approximations of growth and mortality rates for a variety of organisms.  相似文献   

13.
Fidelity of meiotic gene conversion in yeast   总被引:6,自引:0,他引:6  
Summary Gene conversion was studied in a sample of 3869 unselected meiotic tetrads obtained from three diploids, respectively; heterozygous for a single ochre mutant, heteroallelic for a pair of ochre alleles, and heterozygous for an ochre specific suppressor. Although the genetic system were sufficiently sensitive to detect single base changes at the mutant codon level, none were found among 36 conversions (1+:3m) of the ochre mutants and 153 conversions (3S:1+) of the suppressor locus. These findings lead to the conclusion that the informational transfer in gene conversion occurred with complete fidelity. Gene conversion conserved and did not generate new genetic information. The error level of conversion was estimated as less than 10-2/N.P.  相似文献   

14.
Mutations affecting meiotic gene conversion in yeast   总被引:6,自引:0,他引:6  
  相似文献   

15.
16.
Measurement of temporal change in allele frequencies represents an indirect method for estimating the genetically effective size of populations. When allele frequencies are estimated for gene markers that display dominant gene expression, such as, e.g. random amplified polymorphic DNA (RAPD) and amplified fragment length polymorphism (AFLP) markers, the estimates can be seriously biased. We quantify bias for previous allele frequency estimators and present a new expression that is generally less biased and provides a more precise assessment of temporal allele frequency change. We further develop an estimator for effective population size that is appropriate when dealing with dominant gene markers. Comparison with estimates based on codominantly expressed genes, such as allozymes or microsatellites, indicates that about twice as many loci or sampled individuals are required when using dominant markers to achieve the same precision.  相似文献   

17.
Several methods have been developed to estimate the selfing rate of a population from a sample of individuals genotyped for several marker loci. These methods can be based on homozygosity excess (or inbreeding), identity disequilibrium, progeny array (PA) segregation or population assignment incorporating partial selfing. Progeny array-based method is generally the best because it is not subject to some assumptions made by other methods (such as lack of misgenotyping, absence of biparental inbreeding and presence of inbreeding equilibrium), and it can reveal other facets of a mixed-mating system such as patterns of shared paternity. However, in practice, it is often difficult to obtain PAs, especially for animal species. In this study, we propose a method to reconstruct the pedigree of a sample of individuals taken from a monoecious diploid population practicing mixed mating, using multilocus genotypic data. Selfing and outcrossing events are then detected when an individual derives from identical parents and from two distinct parents, respectively. Selfing rate is estimated by the proportion of selfed offspring in the reconstructed pedigree of a sample of individuals. The method enjoys many advantages of the PA method, but without the need of a priori family structure, although such information, if available, can be utilized to improve the inference. Furthermore, the new method accommodates genotyping errors, estimates allele frequencies jointly and is robust to the presence of biparental inbreeding and inbreeding disequilibrium. Both simulated and empirical data were analysed by the new and previous methods to compare their statistical properties and accuracies.  相似文献   

18.
Thomas SC  Hill WG 《Genetics》2000,155(4):1961-1972
Previous techniques for estimating quantitative genetic parameters, such as heritability in populations where exact relationships are unknown but are instead inferred from marker genotypes, have used data from individuals on a pairwise level only. At this level, families are weighted according to the number of pairs within which each family appears, hence by size rather than information content, and information from multiple relationships is lost. Estimates of parameters are therefore not the most efficient achievable. Here, Markov chain Monte Carlo techniques have been used to partition the population into complete sibships, including, if known, prior knowledge of the distribution of family sizes. These pedigrees have then been used with restricted maximum likelihood under an animal model to estimate quantitative genetic parameters. Simulations to compare the properties of parameter estimates with those of existing techniques indicate that the use of sibship reconstruction is superior to earlier methods, having lower mean square errors and showing nonsignificant downward bias. In addition, sibship reconstruction allows the estimation of population allele frequencies that account for the relationships within the sample, so prior knowledge of allele frequencies need not be assumed. Extensions to these techniques allow reconstruction of half sibships when some or all of the maternal genotypes are known.  相似文献   

19.
Kuhner MK  Yamato J  Felsenstein J 《Genetics》2000,156(3):1393-1401
We describe a method for co-estimating r = C/mu (where C is the per-site recombination rate and mu is the per-site neutral mutation rate) and Theta = 4N(e)mu (where N(e) is the effective population size) from a population sample of molecular data. The technique is Metropolis-Hastings sampling: we explore a large number of possible reconstructions of the recombinant genealogy, weighting according to their posterior probability with regard to the data and working values of the parameters. Different relative rates of recombination at different locations can be accommodated if they are known from external evidence, but the algorithm cannot itself estimate rate differences. The estimates of Theta are accurate and apparently unbiased for a wide range of parameter values. However, when both Theta and r are relatively low, very long sequences are needed to estimate r accurately, and the estimates tend to be biased upward. We apply this method to data from the human lipoprotein lipase locus.  相似文献   

20.
A model for the analysis of insect stage-frequency data is developed which includes stage-specific variable developmental periods and stage-specific daily survival rates. The model can predict the development of an insect population through its developmental stages and consequently may form the basis for a simulation model of the population.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号