首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Nuclear SSRs are notorious for having relatively high frequencies of null alleles, i.e. alleles that fail to amplify and are thus recessive and undetected in heterozygotes. In this paper, we compare two kinds of approaches for estimating null allele frequencies at seven nuclear microsatellite markers in three French Fagus sylvatica populations: (1) maximum likelihood methods that compare observed and expected homozygote frequencies in the population under the assumption of Hardy-Weinberg equilibrium and (2) direct null allele frequency estimates from progeny where parent genotypes are known. We show that null allele frequencies are high in F. sylvatica (7.0% on average with the population method, 5.1% with the progeny method), and that estimates are consistent between the two approaches, especially when the number of sampled maternal half-sib progeny arrays is large. With null allele frequencies ranging between 5% and 8% on average across loci, population genetic parameters such as genetic differentiation (F ST) may be mostly unbiased. However, using markers with such average prevalence of null alleles (up to 15% for some loci) can be seriously misleading in fine scale population studies and parentage analysis.  相似文献   

2.
In spite of the usefulness of codominant markers in population genetics, the existence of null alleles raises challenging estimation issues in natural populations that are characterized by positive inbreeding coefficients (F > 0). Disregarding the possibility of > 0 in a population will generally lead to overestimates of null allele frequencies. Conversely, estimates of inbreeding coefficients (F) may be strongly biased upwards (excess homozygotes), in the presence of nontrivial frequencies of null alleles. An algorithm has been presented for the estimation of null allele frequencies in inbred populations (van Oosterhout method), using external estimates of the F‐statistics. The goal of this study is to introduce a modification of this method and to provide a formal comparison with an alternative likelihood‐based method (Chybicki‐Burczyk). Using simulated data, we illustrate the strengths and limitations of these competing methods. Under most circumstances, the likelihood method is preferable, but for highly inbred organisms, a modified van Oosterhout method offers some advantages.  相似文献   

3.
Null alleles are alleles that for various reasons fail to amplify in a PCR assay. The presence of null alleles in microsatellite data is known to bias the genetic parameter estimates. Thus, efficient detection of null alleles is crucial, but the methods available for indirect null allele detection return inconsistent results. Here, our aim was to compare different methods for null allele detection, to explain their respective performance and to provide improvements. We applied several approaches to identify the ‘true’ null alleles based on the predictions made by five different methods, used either individually or in combination. First, we introduced simulated ‘true’ null alleles into 240 population data sets and applied the methods to measure their success in detecting the simulated null alleles. The single best‐performing method was ML‐NullFreq_frequency. Furthermore, we applied different noise reduction approaches to improve the results. For instance, by combining the results of several methods, we obtained more reliable results than using a single one. Rule‐based classification was applied to identify population properties linked to the false discovery rate. Rules obtained from the classifier described which population genetic estimates and loci characteristics were linked to the success of each method. We have shown that by simulating ‘true’ null alleles into a population data set, we may define a null allele frequency threshold, related to a desired true or false discovery rate. Moreover, using such simulated data sets, the expected null allele homozygote frequency may be estimated independently of the equilibrium state of the population.  相似文献   

4.
The two alleles an individual carries at a locus are identical by descent (ibd) if they have descended from a single ancestral allele in a reference population, and the probability of such identity is the inbreeding coefficient of the individual. Inbreeding coefficients can be predicted from pedigrees with founders constituting the reference population, but estimation from genetic data is not possible without data from the reference population. Most inbreeding estimators that make explicit use of sample allele frequencies as estimates of allele probabilities in the reference population are confounded by average kinships with other individuals. This means that the ranking of those estimates depends on the scope of the study sample and we show the variation in rankings for common estimators applied to different subdivisions of 1000 Genomes data. Allele-sharing estimators of within-population inbreeding relative to average kinship in a study sample, however, do have invariant rankings across all studies including those individuals. They are unbiased with a large number of SNPs. We discuss how allele sharing estimates are the relevant quantities for a range of empirical applications.Subject terms: Population genetics, Evolutionary biology, Molecular ecology  相似文献   

5.
Nonamplified (null) alleles are a common feature of microsatellite genotyping and can bias estimates of allele and genotype frequencies, thereby hindering population genetic analyses. The frequency of microsatellite null alleles in diploid populations can be estimated for populations that are in Hardy–Weinberg equilibrium. However, many microsatellite data sets are from nonequilibrium populations, often with known inbreeding coefficients (F) or fixation indices (FIS or FST). Here, we propose a novel null allele estimator that can be used to estimate the null allele frequency and adjust visible allele frequencies in populations for which independent estimates of F, FIS or FST are available. The algorithm is currently available as an Excel macro that can be downloaded at no cost from http://www.microchecker.hull.ac.uk/ and will be incorporated into the software micro ‐checker .  相似文献   

6.
Microsatellite null alleles and estimation of population differentiation   总被引:20,自引:0,他引:20  
Microsatellite null alleles are commonly encountered in population genetics studies, yet little is known about their impact on the estimation of population differentiation. Computer simulations based on the coalescent were used to investigate the evolutionary dynamics of null alleles, their impact on F(ST) and genetic distances, and the efficiency of estimators of null allele frequency. Further, we explored how the existing method for correcting genotype data for null alleles performed in estimating F(ST) and genetic distances, and we compared this method with a new method proposed here (for F(ST) only). Null alleles were likely to be encountered in populations with a large effective size, with an unusually high mutation rate in the flanking regions, and that have diverged from the population from which the cloned allele state was drawn and the primers designed. When populations were significantly differentiated, F(ST) and genetic distances were overestimated in the presence of null alleles. Frequency of null alleles was estimated precisely with the algorithm presented in Dempster et al. (1977). The conventional method for correcting genotype data for null alleles did not provide an accurate estimate of F(ST) and genetic distances. However, the use of the genetic distance of Cavalli-Sforza and Edwards (1967) corrected by the conventional method gave better estimates than those obtained without correction. F(ST) estimation from corrected genotype frequencies performed well when restricted to visible allele sizes. Both the proposed method and the traditional correction method have been implemented in a program that is available free of charge at http://www.montpellier.inra.fr/URLB/. We used 2 published microsatellite data sets based on original and redesigned pairs of primers to empirically confirm our simulation results.  相似文献   

7.
Estimation of the purging of detrimental effects through inbreeding and selection is an important issue in conservation genetics opening new perspectives for the management of small populations. In 1997 Ballou proposed the ancestral inbreeding coefficient, which is calculated recursively via pedigree inbreeding coefficients, as a tool for evaluating the purging of deleterious alleles in zoo populations. The formula of Ballou assumes independence of inbreeding and ancestral inbreeding coefficients at any stage of the recursion. This study investigates the consequences of this inaccuracy on the estimation of true ancestral inbreeding, i.e. the proportion of alleles within a genome that has undergone inbreeding in the past. As an alternative we propose the estimation of ancestral inbreeding by the method of gene dropping. The methods are compared by stochastic simulation for various models with respect to mode of inheritance (neutral, detrimental and lethal alleles) and different settings for population size and initial allele frequencies. In all scenarios the proportion of alleles within a genome that has undergone inbreeding in the past was overestimated by Ballou’s formula. The overestimation was more pronounced in smaller populations but was not affected by genetic model or initial allele frequency. In contrast, the ancestral inbreeding coefficient calculated by gene dropping provided a robust estimate of ancestral inbreeding in most models and settings. A marginal overestimation was observed only in models with lethal alleles. Therefore, we recommend applying the gene dropping approach to estimate ancestral inbreeding coefficients.  相似文献   

8.
We provide experimental evidence showing that, during the restriction-enzyme digestion of DNA samples, some of the HaeIII-digested DNA fragments are small enough to prevent their reliable sizing on a Southern gel. As a result of such nondetectability of DNA fragments, individuals who show a single-band DNA profile at a VNTR locus may not necessarily be true homozygotes. In a population database, when the presence of such nondetectable alleles is ignored, we show that a pseudodependence of alleles within as well as across loci may occur. Using a known statistical method, under the hypothesis of independence of alleles within loci, we derive an efficient estimate of null allele frequency, which may be subsequently used for testing allelic independence within and across loci. The estimates of null allele frequencies, thus derived, are shown to agree with direct experimental data on the frequencies of HaeIII-null alleles. Incorporation of null alleles into the analysis of the forensic VNTR database suggests that the assumptions of allelic independence within and between loci are appropriate. In contrast, a failure to incorporate the occurrence of null alleles would provide a wrong inference regarding the independence of alleles within and between loci.  相似文献   

9.
Argopecten nucleus is a functional hermaphroditic pectinid species that exhibits self‐fertilization, whose natural populations have usually very low densities. In the present study, the genetic diversity of a wild population from Neguanje Bay, Santa Marta (Colombia), was estimated using microsatellite markers, and the effect of the presence of null alleles on this estimation was assessed. A total of 8 microsatellite markers were developed, the first described for this species, and their amplification conditions were standardized. They were used to determine the genotype of 48 wild individuals from Naguanje Bay, and 1,010 individuals derived from the offspring of 38 directed crosses. For each locus, the frequencies of the identified alleles, including null alleles, were estimated using the statistical package Micro‐Checker, and the parental genotypes were confirmed using segregation analysis. Three to 8 alleles per locus with frequencies from 0.001 to 0.632 were detected. The frequencies of null alleles ranged from 0.10 to 0.45, with Ho from 0.0 to 0.79, and He from 0.53 to 0.80. All loci were in H‐W disequilibrium. The null allele frequencies values were high, with lower estimations using segregation analysis than estimated using Micro‐Checker. The present results show high levels of population genetic diversity and indicate that null alleles were not the only cause of deviation from H‐W equilibrium in all loci, suggesting that the wild population under study presents signs of inbreeding and Wahlund effect.  相似文献   

10.
Allelic dropout is a commonly observed source of missing data in microsatellite genotypes, in which one or both allelic copies at a locus fail to be amplified by the polymerase chain reaction. Especially for samples with poor DNA quality, this problem causes a downward bias in estimates of observed heterozygosity and an upward bias in estimates of inbreeding, owing to mistaken classifications of heterozygotes as homozygotes when one of the two copies drops out. One general approach for avoiding allelic dropout involves repeated genotyping of homozygous loci to minimize the effects of experimental error. Existing computational alternatives often require replicate genotyping as well. These approaches, however, are costly and are suitable only when enough DNA is available for repeated genotyping. In this study, we propose a maximum-likelihood approach together with an expectation-maximization algorithm to jointly estimate allelic dropout rates and allele frequencies when only one set of nonreplicated genotypes is available. Our method considers estimates of allelic dropout caused by both sample-specific factors and locus-specific factors, and it allows for deviation from Hardy–Weinberg equilibrium owing to inbreeding. Using the estimated parameters, we correct the bias in the estimation of observed heterozygosity through the use of multiple imputations of alleles in cases where dropout might have occurred. With simulated data, we show that our method can (1) effectively reproduce patterns of missing data and heterozygosity observed in real data; (2) correctly estimate model parameters, including sample-specific dropout rates, locus-specific dropout rates, and the inbreeding coefficient; and (3) successfully correct the downward bias in estimating the observed heterozygosity. We find that our method is fairly robust to violations of model assumptions caused by population structure and by genotyping errors from sources other than allelic dropout. Because the data sets imputed under our model can be investigated in additional subsequent analyses, our method will be useful for preparing data for applications in diverse contexts in population genetics and molecular ecology.  相似文献   

11.
DNA degradation, low DNA concentrations and primer‐site mutations may result in the incorrect assignment of microsatellite genotypes, potentially biasing population genetic analyses. micro ‐checker is windows ®‐based software that tests the genotyping of microsatellites from diploid populations. The program aids identification of genotyping errors due to nonamplified alleles (null alleles), short allele dominance (large allele dropout) and the scoring of stutter peaks, and also detects typographic errors. micro ‐checker estimates the frequency of null alleles and, importantly, can adjust the allele and genotype frequencies of the amplified alleles, permitting their use in further population genetic analysis. micro ‐checker can be freely downloaded from http://www.microchecker.hull.ac.uk/ .  相似文献   

12.
Rannala B  Qiu WG  Dykhuizen DE 《Genetics》2000,155(2):499-508
Recent breakthroughs in molecular technology, most significantly the polymerase chain reaction (PCR) and in situ hybridization, have allowed the detection of genetic variation in bacterial communities without prior cultivation. These methods often produce data in the form of the presence or absence of alleles or genotypes, however, rather than counts of alleles. Using relative allele frequencies from presence-absence data as estimates of population allele frequencies tends to underestimate the frequencies of common alleles and overestimate those of rare ones, potentially biasing the results of a test of neutrality in favor of balancing selection. In this study, a maximum-likelihood estimator (MLE) of bacterial allele frequencies designed for use with presence-absence data is derived using an explicit stochastic model of the host infection (or bacterial sampling) process. The performance of the MLE is evaluated using computer simulation and a method is presented for evaluating the fit of estimated allele frequencies to the neutral infinite alleles model (IAM). The methods are applied to estimate allele frequencies at two outer surface protein loci (ospA and ospC) of the Lyme disease spirochete, Borrelia burgdorferi, infecting local populations of deer ticks (Ixodes scapularis) and to test the fit to a neutral IAM.  相似文献   

13.
Restriction‐site associated DNA sequencing (RADSeq) facilitates rapid generation of thousands of genetic markers at relatively low cost; however, several sources of error specific to RADSeq methods often lead to biased estimates of allele frequencies and thereby to erroneous population genetic inference. Estimating the distribution of sample allele frequencies without calling genotypes was shown to improve population inference from whole genome sequencing data, but the ability of this approach to account for RADSeq‐specific biases remains unexplored. Here we assess in how far genotype‐free methods of allele frequency estimation affect demographic inference from empirical RADSeq data. Using the well‐studied pied flycatcher (Ficedula hypoleuca) as a study system, we compare allele frequency estimation and demographic inference from whole genome sequencing data with that from RADSeq data matched for samples using both genotype‐based and genotype free methods. The demographic history of pied flycatchers as inferred from RADSeq data was highly congruent with that inferred from whole genome resequencing (WGS) data when allele frequencies were estimated directly from the read data. In contrast, when allele frequencies were derived from called genotypes, RADSeq‐based estimates of most model parameters fell outside the 95% confidence interval of estimates derived from WGS data. Notably, more stringent filtering of the genotype calls tended to increase the discrepancy between parameter estimates from WGS and RADSeq data, respectively. The results from this study demonstrate the ability of genotype‐free methods to improve allele frequency spectrum‐ (AFS‐) based demographic inference from empirical RADSeq data and highlight the need to account for uncertainty in NGS data regardless of sequencing method.  相似文献   

14.
Genetic exchange with a neighboring village of Ye'cuana Indians had introduced two alleles, Dia and ACPa, into the Yanomama Indian Village of Borabuk. After several generations, these alleles had reached frequencies of 0.08 and 0.10, respectively. These frequencies are puzzling because they are higher in Borabuk than in the Ye'cuana village from which they were derived. Single allele estimates of ancestral proportions obtained from either of these traits are biologically unrealistic and suggest that admixture is not a good explanation for genetic variation in Borabuk. Nevertheless, multiallelic admixture models are seen to produce credible estimates of ancestral proportions and to explain a large amount of allele frequency variation in Borabuk. When these results are compared with expectations derived froma formal pedigree analysis, good agreement is seen. Comparison of single allele estimates of ancestral proportions obtained from alleles at 11 loci, with multiallelic estimates obtained from the same 11 loci and with the pedigree-derived estimates, demonstrates the superiority of the multiallelic approach.  相似文献   

15.
Besides haplo-diploid sex determination, where females develop from fertilized diploid eggs and males from unfertilized haploid eggs, some Hymenoptera have a secondary system called complementary sex determination (CSD). This depends on genotypes of a 'sex locus' with numerous sex-determining alleles. Diploid heterozygotes develop as females, but diploid homozygotes become sterile or nonviable diploid males. Thus, when females share sex-determining alleles with their mates and produce low fitness diploid males, CSD creates a genetic load. The parasitoid wasp Habrobracon hebetor has CSD and displays mating behaviours that lessen CSD load, including mating at aggregations of males and inbreeding avoidance by females. To examine the influence of population structure and the mating system on CSD load, we conducted genetic analyses of an H. hebetor population in Wisconsin. Given the frequency of diploid males, we estimated that the population harboured 10-16 sex-determining alleles. Overall, marker allele frequencies did not differ between subpopulations, but frequencies changed dramatically between years. This reduced estimates of effective size of subpopulations to only N3 approximately 20-50, which probably reflected annual fluctuations of abundance of H. hebetor. We also determined that the mating system is effectively monogamous. Models relating sex-determining allele diversity and the mating system to female productivity showed that inbreeding avoidance always decreased CSD loads, but multiple mating only reduced loads in populations with fewer than five sex-determining alleles. Populations with N3 less than 100 should have fewer sex-determining alleles than we found, but high diversity could be maintained by a combination of frequency-dependent selection and gene flow between populations.  相似文献   

16.
We describe a genetic polymorphism of human neutral alpha-glucosidase C, detected in lymphoid cells by a combination of starch gel electrophoresis and isoelectric focusing. The seven phenotypes observed appear to result from the expression of four different alleles. The distribution of the observed phenotypes fits the expected distribution predicted from calculated gene frequencies in Hardy-Weinberg equilibrium. Family studies are consistent with autosomal inheritance of the gene. The product of one of the alleles is unusual in that it is "silent," with an estimated gene frequency of .174 in an outbred white population. Approximately one-third of the population is heterozygous "null." Homozygosity for the allele has not been associated with any obvious disease state. This is the third example of a "null" allele which has a substantial gene frequency in an outbred population but does not appear to result in disease in the homozygous state.  相似文献   

17.
Microsatellite loci are widely used in population genetic studies, but the presence of null alleles may lead to biased results. Here, we assessed five methods that indirectly detect null alleles and found large inconsistencies among them. Our analysis was based on 20 microsatellite loci genotyped in a natural population of Microtus oeconomus sampled during 8 years, together with 1200 simulated populations without null alleles, but experiencing bottlenecks of varying duration and intensity, and 120 simulated populations with known null alleles. In the natural population, 29% of positive results were consistent between the methods in pairwise comparisons, and in the simulated data set, this proportion was 14%. The positive results were also inconsistent between different years in the natural population. In the null‐allele‐free simulated data set, the number of false positives increased with increased bottleneck intensity and duration. We also found a low concordance in null allele detection between the original simulated populations and their 20% random subsets. In the populations simulated to include null alleles, between 22% and 42% of true null alleles remained undetected, which highlighted that detection errors are not restricted to false positives. None of the evaluated methods clearly outperformed the others when both false‐positive and false‐negative rates were considered. Accepting only the positive results consistent between at least two methods should considerably reduce the false‐positive rate, but this approach may increase the false‐negative rate. Our study demonstrates the need for novel null allele detection methods that could be reliably applied to natural populations.  相似文献   

18.
The effective population size (N(e)) is notoriously difficult to accurately estimate in wild populations as it is influenced by a number of parameters that are difficult to delineate in natural systems. The different methods that are used to estimate N(e) are affected variously by different processes at the population level, such as the life-history characteristics of the organism, gene flow, and population substructure, as well as by the frequency patterns of genetic markers used and the sampling design. Here, we compare N(e) estimates obtained by different genetic methods and from demographic data and elucidate how the estimates are affected by various factors in an exhaustively sampled and comprehensively described natural brown trout (Salmo trutta) system. In general, the methods yielded rather congruent estimates, and we ascribe that to the adequate genotyping and exhaustive sampling. Effects of violating the assumptions of the different methods were nevertheless apparent. In accordance with theoretical studies, skewed allele frequencies would underestimate temporal allele frequency changes and thereby upwardly bias N(e) if not accounted for. Overlapping generations and iteroparity would also upwardly bias N(e) when applied to temporal samples taken over short time spans. Gene flow from a genetically not very dissimilar source population decreases temporal allele frequency changes and thereby acts to increase estimates of N(e). Our study reiterates the importance of adequate sampling, quantification of life-history parameters and gene flow, and incorporating these data into the N(e) estimation.  相似文献   

19.
Studies of genetics and ecology often require estimates of relatedness coefficients based on genetic marker data. However, with the presence of null alleles, an observed genotype can represent one of several possible true genotypes. This results in biased estimates of relatedness. As the numbers of marker loci are often limited, loci with null alleles cannot be abandoned without substantial loss of statistical power. Here, we show how loci with null alleles can be incorporated into six estimators of relatedness (two novel). We evaluate the performance of various estimators before and after correction for null alleles. If the frequency of a null allele is <0.1, some estimators can be used directly without adjustment; if it is >0.5, the potency of estimation is too low and such a locus should be excluded. We make available a software package entitled PolyRelatedness v1.6, which enables researchers to optimize these estimators to best fit a particular data set.  相似文献   

20.
Recent advances in the application of the polymerase chain reaction make it possible to score individuals at a large number of loci. The RAPD (random amplified polymorphic DNA) method is one such technique that has attracted widespread interest. The analysis of population structure with RAPD data is hampered by the lack of complete genotypic information resulting from dominance, since this enhances the sampling variance associated with single loci as well as induces bias in parameter estimation. We present estimators for several population-genetic parameters (gene and genotype frequencies, within- and between-population heterozygosities, degree of inbreeding and population subdivision, and degree of individual relatedness) along with expressions for their sampling variances. Although completely unbiased estimators do not appear to be possible with RAPDs, several steps are suggested that will insure that the bias in parameter estimates is negligible. To achieve the same degree of statistical power, on the order of 2 to 10 times more individuals need to be sampled per locus when dominant markers are relied upon, as compared to codominant (RFLP, isozyme) markers. Moreover, to avoid bias in parameter estimation, the marker alleles for most of these loci should be in relatively low frequency. Due to the need for pruning loci with low-frequency null alleles, more loci also need to be sampled with RAPDs than with more conventional markers, and some problems of bias cannot be completely eliminated.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号