首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Analyses of pedigrees and pedigree-derived parameters (e.g. relatedness and fitness) provide some of the most informative types of studies in evolutionary biology. The r package pedantics implements tools to facilitate power and sensitivity analyses of pedigree-related studies of natural populations. Functions are available to permute pedigree data in various ways with the goal of mimicking patterns of pedigree errors and missingness that occur in studies of natural populations. Another set of functions simulates genetic and phenotypic data based on arbitrary pedigrees. Finally, functions are also available with which visual and numerical representations of pedigree structure can be generated.  相似文献   

2.
Paterson T  Law A 《Animal genetics》2011,42(5):560-562
Datapoint errors in pedigree genotype data sets are difficult to identify and adversely affect downstream genetic analyses. We present GenotypeChecker, a desktop software tool for assisting data cleansing. The application identifies likely data errors in pedigree/genotype data sets by performing an inheritance-checking algorithm for each marker across the pedigree, and highlights inconsistently inherited genotypes in an exploratory user interface. By 'masking' suspect datapoints and rechecking inheritance consistency, erroneous datapoints can be confirmed and cleansed from the data set. The software, examples and documentation are freely available at http://bioinformatics.roslin.ac.uk/genotypechecker.  相似文献   

3.
Error detection for genetic data, using likelihood methods.   总被引:6,自引:3,他引:3       下载免费PDF全文
As genetic maps become denser, the effect of laboratory typing errors becomes more serious. We review a general method for detecting errors in pedigree genotyping data that is a variant of the likelihood-ratio test statistic. It pinpoints individuals and loci with relatively unlikely genotypes. Power and significance studies using Monte Carlo methods are shown by using simulated data with pedigree structures similar to the CEPH pedigrees and a larger experimental pedigree used in the study of idiopathic dilated cardiomyopathy (DCM). The studies show the index detects errors for small values of theta with high power and an acceptable false positive rate. The method was also used to check for errors in DCM laboratory pedigree data and to estimate the error rate in CEPH-chromosome 6 data. The errors flagged by our method in the DCM pedigree were confirmed by the laboratory. The results are consistent with estimated false-positive and false-negative rates obtained using simulation.  相似文献   

4.
Macgregor S  Knott SA  White I  Visscher PM 《Genetics》2005,171(3):1365-1376
There is currently considerable interest in genetic analysis of quantitative traits such as blood pressure and body mass index. Despite the fact that these traits change throughout life they are commonly analyzed only at a single time point. The genetic basis of such traits can be better understood by collecting and effectively analyzing longitudinal data. Analyses of these data are complicated by the need to incorporate information from complex pedigree structures and genetic markers. We propose conducting longitudinal quantitative trait locus (QTL) analyses on such data sets by using a flexible random regression estimation technique. The relationship between genetic effects at different ages is efficiently modeled using covariance functions (CFs). Using simulated data we show that the change in genetic effects over time can be well characterized using CFs and that including parameters to model the change in effect with age can provide substantial increases in power to detect QTL compared with repeated measure or univariate techniques. The asymptotic distributions of the methods used are investigated and methods for overcoming the practical difficulties in fitting CFs are discussed. The CF-based techniques should allow efficient multivariate analyses of many data sets in human and natural population genetics.  相似文献   

5.
Having found evidence for segregation at a major locus for a quantitative trait, a logical next step is to identify those pedigrees in which major-locus segregation is occurring. If the quantitative trait is a risk factor for an associated disease, identifying such segregating pedigrees can be important in classifying families by etiology, in risk assessment, and in suggesting treatment modalities. Identifying segregating pedigrees can also be helpful in selecting pedigrees to include in a subsequent linkage study to map the major locus. Here, we describe a strategy to identify pedigrees segregating at a major locus for a quantitative trait. We apply this pedigree selection strategy to simulated data generated under a major-locus or mixed model with a rare dominant allele and sampled according to one of several fixed-structure or sequential sampling designs. We demonstrate that for the situations considered, the pedigree selection strategy is sensitive and specific and that a linkage study based only on the pedigrees classified as segregating extracts essentially all the linkage information in the entire sample of pedigrees. Our results suggest that for large-scale linkage studies involving many genetic markers, the savings from this strategy can be substantial and that, compared with fixed-structure sampling, sequential sampling of pedigrees can greatly improve the efficiency for linkage analysis of a quantitative trait.  相似文献   

6.
E. A. Thompson  R. G. Shaw 《Genetics》1992,131(4):971-978
We have developed algorithms for the likelihood estimation of additive genetic models for quantitative traits on large pedigrees. The approach uses the expectation L-maximization (EM) algorithm, but avoids intensive computation. In this paper, we focus on extensions of previous work to the case of multivariate data. We exemplify the approach by analyses of bivariate data on a four-generation, 949-member pedigree of the snail Lymnaea elodes, and on a three-generation pedigree of the guppy Poecilia reticulata containing about 400 individuals.  相似文献   

7.
Recent advances in genomics resources and tools are facilitating quantitative trait locus mapping. We developed a crossbreed pedigree for mapping quantitative trait loci for hip dysplasia in dogs by crossing dysplastic Labrador Retrievers and normal Greyhounds. We show that one advantage to using a crossbreed pedigree is the increased marker informativeness in the backcross/F2 population relative to the founder populations. We also discuss three factors that affect the detection power in the context of this crossbreed pedigree: being able to detect and correct genotyping errors, increasing marker density for chromosomes with a sparse coverage, and adding individuals to the mapping population as soon as they become available.  相似文献   

8.
Nine microsatellite DNA markers (simple sequence repeats, SSRs) were used to estimate pairwise relationships among 597 Scots pine (Pinus sylvestris) trees as well as to generate a sibship structure for quantitative genetic parameters’ estimation comparison. The studied trees were part of an open-pollinated progeny test of 102 first-generation parents. Three methods were used to estimate variance components and heritabilities, namely, structured pedigree (half- and full-sib), marker-based pairwise relationships (four pairwise estimators), and a combined pedigree and marker-based relationship. In each of the three methods, the same animal model was used to compute variances except when marker-based relationship was used wherein we substituted the average numerator relationship matrix (i.e., pedigree-based matrix) with that computed based on markers’ pairwise relationships. Our results showed a high correlation in estimated breeding values between the pedigree (full-sib) and the combined marker-pedigree estimates. The marker-based relationship method produced high correlations when individual site data were analyzed. In contrast, the marker-based relationship method resulted in a significant decrease in both variance estimation and their standard errors which were in concordance with earlier published results; however, no estimates were produced when across-site analyses were attempted. We concluded that the combined pedigree method is the best approach as it represents the historical (pairwise) and contemporary (pedigree) relationships among the tested individuals, a situation that cannot be attained by any of the used methods individually. This method is dependent on the number and informativeness of the markers used.  相似文献   

9.
Quantitative genetic analysis is often fundamental for understanding evolutionary processes in wild populations. Avian populations provide a model system due to the relative ease of inferring relatedness among individuals through observation. However, extra‐pair paternity (EPP) creates erroneous links within the social pedigree. Previous work has suggested this causes minor underestimation of heritability if paternal misassignment is random and hence not influenced by the trait being studied. Nevertheless, much literature suggests numerous traits are associated with EPP and the accuracy of heritability estimates for such traits remains unexplored. We show analytically how nonrandom pedigree errors can influence heritability estimates. Then, combining empirical data from a large great tit (Parus major) pedigree with simulations, we assess how heritability estimates derived from social pedigrees change depending on the mode of the relationship between EPP and the focal trait. We show that the magnitude of the underestimation is typically small (<15%). Hence, our analyses suggest that quantitative genetic inference from pedigrees derived from observations of social relationships is relatively robust; our approach also provides a widely applicable method for assessing the consequences of nonrandom EPP.  相似文献   

10.
Errors in genotype calling can have perverse effects on genetic analyses, confounding association studies, and obscuring rare variants. Analyses now routinely incorporate error rates to control for spurious findings. However, reliable estimates of the error rate can be difficult to obtain because of their variance between studies. Most studies also report only a single estimate of the error rate even though genotypes can be miscalled in more than one way. Here, we report a method for estimating the rates at which different types of genotyping errors occur at biallelic loci using pedigree information. Our method identifies potential genotyping errors by exploiting instances where the haplotypic phase has not been faithfully transmitted. The expected frequency of inconsistent phase depends on the combination of genotypes in a pedigree and the probability of miscalling each genotype. We develop a model that uses the differences in these frequencies to estimate rates for different types of genotype error. Simulations show that our method accurately estimates these error rates in a variety of scenarios. We apply this method to a dataset from the whole-genome sequencing of owl monkeys (Aotus nancymaae) in three-generation pedigrees. We find significant differences between estimates for different types of genotyping error, with the most common being homozygous reference sites miscalled as heterozygous and vice versa. The approach we describe is applicable to any set of genotypes where haplotypic phase can reliably be called and should prove useful in helping to control for false discoveries.  相似文献   

11.
Variance-component methods are popular and flexible analytic tools for elucidating the genetic mechanisms of complex quantitative traits from pedigree data. However, variance-component methods typically assume that the trait of interest follows a multivariate normal distribution within a pedigree. Studies have shown that violation of this normality assumption can lead to biased parameter estimates and inflations in type-I error. This limits the application of variance-component methods to more general trait outcomes, whether continuous or categorical in nature. In this paper, we develop and apply a general variance-component framework for pedigree analysis of continuous and categorical outcomes. We develop appropriate models using generalized-linear mixed model theory and fit such models using approximate maximum-likelihood procedures. Using our proposed method, we demonstrate that one can perform variance-component pedigree analysis on outcomes that follow any exponential-family distribution. Additionally, we also show how one can modify the method to perform pedigree analysis of ordinal outcomes. We also discuss extensions of our variance-component framework to accommodate pedigrees ascertained based on trait outcome. We demonstrate the feasibility of our method using both simulated data and data from a genetic study of ovarian insufficiency.  相似文献   

12.
13.
Individual‐based estimates of the degree of inbreeding or parental relatedness from pedigrees provide a critical starting point for studies of inbreeding depression, but in practice wild pedigrees are difficult to obtain. Because inbreeding increases the proportion of genomewide loci that are identical by descent, inbreeding variation within populations has the potential to generate observable correlations between heterozygosity measured using molecular markers and a variety of fitness related traits. Termed heterozygosity‐fitness correlations (HFCs), these correlations have been observed in a wide variety of taxa. The difficulty of obtaining wild pedigree data, however, means that empirical investigations of how pedigree inbreeding influences HFCs are rare. Here, we assess evidence for inbreeding depression in three life‐history traits (hatching and fledging success and juvenile survival) in an isolated population of Stewart Island robins using both pedigree‐ and molecular‐derived measures of relatedness. We found results from the two measures were highly correlated and supported evidence for significant but weak inbreeding depression. However, standardized effect sizes for inbreeding depression based on the pedigree‐based kin coefficients (k) were greater and had smaller standard errors than those based on molecular genetic measures of relatedness (RI), particularly for hatching and fledging success. Nevertheless, the results presented here support the use of molecular‐based measures of relatedness in bottlenecked populations when information regarding inbreeding depression is desired but pedigree data on relatedness are unavailable.  相似文献   

14.
Hao K  Li C  Rosenow C  Hung Wong W 《Genomics》2004,84(4):623-630
Currently, most analytical methods assume all observed genotypes are correct; however, it is clear that errors may reduce statistical power or bias inference in genetic studies. We propose procedures for estimating error rate in genetic analysis and apply them to study the GeneChip Mapping 10K array, which is a technology that has recently become available and allows researchers to survey over 10,000 SNPs in a single assay. We employed a strategy to estimate the genotype error rate in pedigree data. First, the "dose-response" reference curve between error rate and the observable error number were derived by simulation, conditional on given pedigree structures and genotypes. Second, the error rate was estimated by calibrating the number of observed errors in real data to the reference curve. We evaluated the performance of this method by simulation study and applied it to a data set of 30 pedigrees genotyped using the GeneChip Mapping 10K array. This method performed favorably in all scenarios we surveyed. The dose-response reference curve was monotone and almost linear with a large slope. The method was able to estimate accurately the error rate under various pedigree structures and error models and under heterogeneous error rates. Using this method, we found that the average genotyping error rate of the GeneChip Mapping 10K array was about 0.1%. Our method provides a quick and unbiased solution to address the genotype error rate in pedigree data. It behaves well in a wide range of settings and can be easily applied in other genetic projects. The robust estimation of genotyping error rate allows us to estimate power and sample size and conduct unbiased genetic tests. The GeneChip Mapping 10K array has a low overall error rate, which is consistent with the results obtained from alternative genotyping assays.  相似文献   

15.
Family data for 14 biochemical genetic markers of squirrel monkeys (genusSaimiri) were derived from 73 pedigreed progeny and both parents of each, as well as from 16 additional progeny and one parent of each. The data for each marker and the phenotypic patterns were consistent with autosomal codominant inheritance. It was concluded from the genetic marker data that the pedigree records of seven progeny were incorrect. Retrospective investigations of colony records followed by typing of animals that might possibly have been a parent enabled five of the pedigree records to be corrected. Although five of the pedigree errors were cases of mistaken paternity, the other two apparently were the consequence of infant swapping between dams shortly after birth. Because squirrel monkeys exhibit a high degree of allomaternal behavior, infant swapping between dams may occur more frequently than in many other nonhuman primate species.This research was supported in part by NIH Grant P40 RR01254.  相似文献   

16.
Evolutionary biologists increasingly use pedigree‐based quantitative genetic methods to address questions about the evolutionary dynamics of traits in wild populations. In many cases, phenotypic data may have been collected only for recent parts of the study. How does this influence the performance of the models used to analyse these data? Here we explore how data depth (number of years) and completeness (number of observations) influence estimates of genetic variance and covariance within the context of an existing pedigree. Using long‐term data from the great tit Parus major and the mute swan Cygnus olor, species with different life‐histories, we examined the effect of manipulating the amount of data included on quantitative genetic parameter estimates. Manipulating data depth and completeness had little influence on estimated genetic variances, heritabilities, or genetic correlations, but (as expected) did influence confidence in these estimates. Estimated breeding values in the great tit were not influenced by data depth but were in the mute swan, probably because of differences in pedigree structure. Our analyses suggest the ‘rule of thumb’ that data from 3 years and a minimum of 100 individuals per year are needed to estimate genetic parameters with acceptable confidence, and that using pedigree data is worthwhile, even if phenotypes are only available toward the tips of the pedigree.  相似文献   

17.
Several programs are currently available for the detection of genotyping error that may or may not be Mendelianly inconsistent. However, no systematic study exists that evaluates their performance under varying pedigree structures and sizes, marker spacing, and allele frequencies. Our simulation study compares four multipoint methods: Merlin, Mendel4, SimWalk2, and Sibmed. We look at empirical thresholds, power, and false-positive rates on 7 small pedigree structures that included sibships with and without genotyped parents, and a three-generation pedigree, using 11 microsatellite markers with 3 different map spacings. Simulated data includes 5,000 replicates of each pedigree structure and marker map, with random genotyping errors in about 4% of the middle marker's genotypes. We found that the default thresholds used by these programs provide low power (47-72%). Power is improved more by adding genotyped siblings than by using more closely spaced markers. Some mistyping methods are sensitive to the frequencies of the observed alleles. Siblings of mistyped individuals have elevated false-positive rates, as do markers close to the mistyped marker. We conclude that thresholds should be decided based on the pedigree and marker data and that greater focus should be placed on modeling genotyping error when computing likelihoods, rather than on detecting and eliminating genotyping errors.  相似文献   

18.
Quantitative genetic parameters are nowadays more frequently estimated with restricted maximum likelihood using the 'animal model' than with traditional methods such as parent-offspring regressions. These methods have however rarely been evaluated using equivalent data sets. We compare heritabilities and genetic correlations from animal model and parent-offspring analyses, respectively, using data on eight morphological traits in the great reed warbler (Acrocephalus arundinaceus). Animal models were run using either mean trait values or individual repeated measurements to be able to separate between effects of including more extended pedigree information and effects of replicated sampling from the same individuals. We show that the inclusion of more pedigree information by the use of mean traits animal models had limited effect on the standard error and magnitude of heritabilities. In contrast, the use of repeated measures animal model generally had a positive effect on the sampling accuracy and resulted in lower heritabilities; the latter due to lower additive variance and higher phenotypic variance. For most trait combinations, both animal model methods gave genetic correlations that were lower than the parent-offspring estimates, whereas the standard errors were lower only for the mean traits animal model. We conclude that differences in heritabilities between the animal model and parent-offspring regressions were mostly due to the inclusion of individual replicates to the animal model rather than the inclusion of more extended pedigree information. Genetic correlations were, on the other hand, primarily affected by the inclusion of more pedigree information. This study is to our knowledge the most comprehensive empirical evaluation of the performance of the animal model in relation to parent-offspring regressions in a wild population. Our conclusions should be valuable for reconciliation of data obtained in earlier studies as well as for future meta-analyses utilizing estimates from both traditional methods and the animal model.  相似文献   

19.
Kang SJ  Finch SJ  Haynes C  Gordon D 《Human heredity》2004,58(3-4):139-144
Kang et al. [Genet Epidemiol 2004;26:132-141] addressed the question of which genotype misclassification errors are most costly, in terms of minimum percentage increase in sample size necessary (%MSSN) to maintain constant asymptotic power and significance level, when performing case/control studies of genetic association in a genetic model-free setting. They answered the question for single nucleotide polymorphisms (SNPs) using the 2 x 3 chi2 test of independence. We address the same question here for a genetic model-based framework. The genetic model parameters considered are: disease model (dominant, recessive), genotypic relative risk, SNP (marker) and disease allele frequency, and linkage disequilibrium. %MSSN coefficients of each of the six possible error rates are determined by expanding the non-centrality parameter of the asymptotic distribution of the 2 x 3 chi2 test under a specified alternative hypothesis to approximate %MSSN using a linear Taylor series in the error rates. In this work we assume errors misclassifying one homozygote as another homozygote are 0, since these errors are thought to rarely occur in practice. Our findings are that there are settings of the genetic model parameters that lead to large total %MSSN for both dominant and recessive models. As SNP minor allele approaches 0, total %MSSN increases without bound, independent of other genetic model parameters. In general, %MSSN is a complex function of the genetic model parameters. Use of SNPs with small minor allele frequency requires careful attention to frequency of genotyping errors to insure that power specifications are met. Software to perform these calculations for study design is available, and an example of its use to study a disease is given.  相似文献   

20.
Pedigree data are useful for a wealth of research purposes in human population biology and genetics. The collection of extended pedigrees represents the most powerful sampling design for quantitative genetic and linkage studies of both normal and disease-related quantitative traits. In this paper we outline an approach for collecting pedigree data in stable isolate populations. As an example, the pedigree for the Jirel population, which was obtained using the methods presented, is described. The Jirel pedigree contains 2,000 study participants and more than 62,000 pairwise relationships that are informative for genetic analysis. Once such pedigrees are genetically characterized by a genome scan for a given trait, they become an invaluable resource for future genetic studies of any quantitative trait.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号