首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
ABSTRACT: BACKGROUND: In the last years GWA studies have successfully identified common SNPs associated with complex diseases. However, most of the variants found this way account for only a small portion of the trait variance. This fact leads researchers to focus on rare-variant mapping with large scale sequencing, which can be facilitated by using linkage information. The question arises why linkage analysis often fails to identify genes when analyzing complex diseases. Using simulations we have investigated the power of parametric and nonparametric linkage statistics (KC-LOD, NPL, LOD and MOD scores), to detect the effect of genes responsible for complex diseases using different pedigree structures. RESULTS: As expected, a small number of pedigrees with less than three affected individuals has low power to map disease genes with modest effect. Interestingly, the power decreases when unaffected individuals are included in the analysis, irrespective of the true mode of inheritance. Furthermore, we found that the best performing statistic depends not only on the type of pedigrees but also on the true mode of inheritance. CONCLUSIONS: When applied in a sensible way linkage is an appropriate and robust technique to map genes for complex disease. Unlike association analysis, linkage analysis is not hampered by allelic heterogeneity. So, why does linkage analysis often fail with complex diseases? Evidently, when using an insufficient number of small pedigrees, one might miss a true genetic linkage when actually a real effect exists. Furthermore, we show that the test statistic has an important effect on the power to detect linkage as well. Therefore, a linkage analysis might fail if an inadequate test statistic is employed. We provide recommendations regarding the most favorable test statistics, in terms of power, for a given mode of inheritance and type of pedigrees under study, in order to reduce the probability to miss a true linkage.  相似文献   

2.
The power to detect linkage for likelihood and nonparametric (Haseman-Elston, affected-sib-pair, and affected-pedigree-member) methods is compared for the case of a common, dichotomous trait resulting from the segregation of two loci. Pedigree data for several two-locus epistatic and heterogeneity models have been simulated, with one of the loci linked to a marker locus. Replicate samples of 20 three-generation pedigrees (16 individuals/pedigree) were simulated and then ascertained for having at least 6 affected individuals. The power of linkage detection calculated under the correct two-locus model is only slightly higher than that under a single locus model with reduced penetrance. As expected, the nonparametric linkage methods have somewhat lower power than does the lod-score method, the difference depending on the mode of transmission of the linked locus. Thus, for many pedigree linkage studies, the lod-score method will have the best power. However, this conclusion depends on how many times the lod score will be calculated for a given marker. The Haseman-Elston method would likely be preferable to calculating lod scores under a large number of genetic models (i.e., varying both the mode of transmission and the penetrances), since such an analysis requires an increase in the critical value of the lod criterion. The power of the affected-pedigree-member method is lower than the other methods, which can be shown to be largely due to the fact that marker genotypes for unaffected individuals are not used.  相似文献   

3.

Background

Both common and rare genetic variants have been shown to contribute to the etiology of complex diseases. Recent genome-wide association studies (GWAS) have successfully investigated how common variants contribute to the genetic factors associated with common human diseases. However, understanding the impact of rare variants, which are abundant in the human population (one in every 17 bases), remains challenging. A number of statistical tests have been developed to analyze collapsed rare variants identified by association tests. Here, we propose a haplotype-based approach. This work inspired by an existing statistical framework of the pedigree disequilibrium test (PDT), which uses genetic data to assess the effects of variants in general pedigrees. We aim to compare the performance between the haplotype-based approach and the rare variant-based approach for detecting rare causal variants in pedigrees.

Results

Extensive simulations in the sequencing setting were carried out to evaluate and compare the haplotype-based approach with the rare variant methods that drew on a more conventional collapsing strategy. As assessed through a variety of scenarios, the haplotype-based pedigree tests had enhanced statistical power compared with the rare variants based pedigree tests when the disease of interest was mainly caused by rare haplotypes (with multiple rare alleles), and vice versa when disease was caused by rare variants acting independently. For most of other situations when disease was caused both by haplotypes with multiple rare alleles and by rare variants with similar effects, these two approaches provided similar power in testing for association.

Conclusions

The haplotype-based approach was designed to assess the role of rare and potentially causal haplotypes. The proposed rare variants-based pedigree tests were designed to assess the role of rare and potentially causal variants. This study clearly documented the situations under which either method performs better than the other. All tests have been implemented in a software, which was submitted to the Comprehensive R Archive Network (CRAN) for general use as a computer program named rvHPDT.  相似文献   

4.
Segregation and linkage analyses of 72 leprosy pedigrees   总被引:4,自引:0,他引:4  
Data on 72 families with multiple cases of leprosy were analyzed for a susceptibility gene linked to the HLA loci. We conducted segregation analysis with the program POINTER and identity of HLA types by descent analysis to determine the most likely mode of inheritance. We then conducted linkage analysis with the program LINKAS, first assuming linkage equilibrium and then allowing for linkage disequilibrium and etiological heterogeneity. Segregation results suggest a recessive mode of inheritance, especially for the tuberculoid forms of leprosy. The linkage results, limited to tuberculoid forms and assuming a recessive model, suggest a hypothesis of loose linkage with no unlinked locus. When an additive model is assumed, the best fit is obtained with a hypothesis of complete linkage (theta = 0.0) with heterogeneity. We currently favor the additive model as the more plausible one.  相似文献   

5.
Dense SNP maps can be highly informative for linkage studies. But when parental genotypes are missing, multipoint linkage scores can be inflated in regions with substantial marker-marker linkage disequilibrium (LD). Such regions were observed in the Affymetrix SNP genotypes for the Genetic Analysis Workshop 14 (GAW14) Collaborative Study on the Genetics of Alcoholism (COGA) dataset, providing an opportunity to test a novel simulation strategy for studying this problem. First, an inheritance vector (with or without linkage present) is simulated for each replicate, i.e., locations of recombinations and transmission of parental chromosomes are determined for each meiosis. Then, two sets of founder haplotypes are superimposed onto the inheritance vector: one set that is inferred from the actual data and which contains the pattern of LD; and one set created by randomly selecting parental alleles based on the known allele frequencies, with no correlation (LD) between markers. Applying this strategy to a map of 176 SNPs (66 Mb of chromosome 7) for 100 replicates of 116 sibling pairs, significant inflation of multipoint linkage scores was observed in regions of high LD when parental genotypes were set to missing, with no linkage present. Similar inflation was observed in analyses of the COGA data for these affected sib pairs with parental genotypes set to missing, but not after reducing the marker map until r2 between any pair of markers was 相似文献   

6.
7.
Multipoint quantitative-trait linkage analysis in general pedigrees.   总被引:37,自引:12,他引:37       下载免费PDF全文
Multipoint linkage analysis of quantitative-trait loci (QTLs) has previously been restricted to sibships and small pedigrees. In this article, we show how variance-component linkage methods can be used in pedigrees of arbitrary size and complexity, and we develop a general framework for multipoint identity-by-descent (IBD) probability calculations. We extend the sib-pair multipoint mapping approach of Fulker et al. to general relative pairs. This multipoint IBD method uses the proportion of alleles shared identical by descent at genotyped loci to estimate IBD sharing at arbitrary points along a chromosome for each relative pair. We have derived correlations in IBD sharing as a function of chromosomal distance for relative pairs in general pedigrees and provide a simple framework whereby these correlations can be easily obtained for any relative pair related by a single line of descent or by multiple independent lines of descent. Once calculated, the multipoint relative-pair IBDs can be utilized in variance-component linkage analysis, which considers the likelihood of the entire pedigree jointly. Examples are given that use simulated data, demonstrating both the accuracy of QTL localization and the increase in power provided by multipoint analysis with 5-, 10-, and 20-cM marker maps. The general pedigree variance component and IBD estimation methods have been implemented in the SOLAR (Sequential Oligogenic Linkage Analysis Routines) computer package.  相似文献   

8.
Sung YJ  Wijsman EM 《Human heredity》2007,63(2):144-152
Complex traits are generally believed to be influenced by multiple loci. Identification of loci involved in complex traits is more difficult for interacting than for additive loci. Here we describe an extension of the program lm_twoqtl in the package MORGAN to handle two quantitative trait loci (QTLs) with gene-gene interaction. We investigate whether parametric linkage analysis that accounts for such epistasis improves prospects for linkage detection and accuracy of localization of QTLs. Through use of simulated data we show that analysis that accounts for epistasis provides higher lod scores and better localization than does analysis without epistasis. In addition, we demonstrate that the difference between lod scores in the presence vs. absence of use of an interaction model in analysis is greater in extended than in nuclear pedigrees.  相似文献   

9.
We present a new method of quantitative-trait linkage analysis that combines the simplicity and robustness of regression-based methods and the generality and greater power of variance-components models. The new method is based on a regression of estimated identity-by-descent (IBD) sharing between relative pairs on the squared sums and squared differences of trait values of the relative pairs. The method is applicable to pedigrees of arbitrary structure and to pedigrees selected on the basis of trait value, provided that population parameters of the trait distribution can be correctly specified. Ambiguous IBD sharing (due to incomplete marker information) can be accommodated in the method by appropriate specification of the variance-covariance matrix of IBD sharing between relative pairs. We have implemented this regression-based method and have performed simulation studies to assess, under a range of conditions, estimation accuracy, type I error rate, and power. For normally distributed traits and in large samples, the method is found to give the correct type I error rate and an unbiased estimate of the proportion of trait variance accounted for by the additive effects of the locus-although, in cases where asymptotic theory is doubtful, significance levels should be checked by simulations. In large sibships, the new method is slightly more powerful than variance-components models. The proposed method provides a practical and powerful tool for the linkage analysis of quantitative traits.  相似文献   

10.

Background

We investigate the power of heterogeneity LOD test to detect linkage when a trait is determined by several major genes using Genetic Analysis Workshop 13 simulated data. We consider three traits, two of which are disease-causing traits: 1) the rate of change in body mass index (BMI); and 2) the maximum BMI; and 3) the disease itself (hypertension). Of interest is the power of "HLOD2", the maximum heterogeneity LOD obtained upon maximizing over the two genetic models.

Results

Using a trait phenotype Obesity Slope, we observe that the power to detect the two markers closest to the two genes (S1, S2) at the 0.05 level using HLOD2 is 13% and 10%. The power of HLOD2 for Max BMI phenotype is 12% and 9%. The corresponding values for the Hypertension phenotype are 8% and 6%.

Conclusion

The power to detect linkage to the slope genes is quite low. But the power using disease-related traits as a phenotype is greater than the power using the disease (hypertension) phenotype.
  相似文献   

11.
P K Andersen  M Vaeth 《Biometrics》1989,45(2):523-535
This paper studies two classes of hazard-rate-based models for the mortality in a group of individuals taking normal life expectancy into account. In a multiplicative hazard model, the estimate for the relative mortality generalises the standardised mortality ratio, and the adequacy of a model with constant relative mortality can be tested using a type of total time on test statistic. In an additive hazard model, continuous-time generalisations of a "corrected" survival curve and a "normal" survival curve are obtained, and the adequacy of a model with constant excess mortality can again be tested using a type of total time on test statistic. A model including both the multiplicative hazard model and the additive hazard model is briefly considered. The use of the models is illustrated on a set of data concerning survival after operation for malignant melanoma.  相似文献   

12.
Genomic imprinting is a mechanism in which only one of the two copies of a gene is expressed. Some genes that affect development and behavior in mammals are known to be imprinted. Deregulation of imprinted genes has been found in a number of human diseases. Incorporating imprinting information into linkage analysis results in a more powerful test for linkage. Here, we propose an efficient method to test for linkage and imprinting of quantitative traits in extended pedigrees. We compared the results obtained by using the extended-pedigree-analysis approach proposed in this study with other existing approaches. We found that the proposed method is more powerful and uses extended-pedigree information most efficiently.  相似文献   

13.
To assess evidence for genetic linkage from pedigrees, I developed a limited variance-components approach. In this method, variability among trait observations from individuals within pedigrees is expressed in terms of fixed effects from covariates and effects due to an unobservable trait-affecting major locus, random polygenic effects, and residual nongenetic variance. The effect attributable to a locus linked to a marker is a function of the additive and dominance components of variance of the locus, the recombination fraction, and the proportion of genes identical by descent at the marker locus for each pair of sibs. For unlinked loci, the polygenic variance component depends only on the relationship between the relative pair. Parameters can be estimated by either maximum-likelihood methods or quasi-likelihood methods. The forms of quasi-likelihood estimators are provided. Hypothesis tests derived from the maximum-likelihood approach are constructed by appeal to asymptotic theory. A simulation study showed that the size of likelihood-ratio tests was appropriate but that the monogenic component of variance was generally underestimated by the likelihood approach.  相似文献   

14.
Parametric linkage analysis is usually used to find chromosomal regions linked to a disease (phenotype) that is described with a specific genetic model. This is done by investigating the relations between the disease and genetic markers, that is, well-characterized loci of known position with a clear Mendelian mode of inheritance. Assume we have found an interesting region on a chromosome that we suspect is linked to the disease. Then we want to test the hypothesis of no linkage versus the alternative one of linkage. As a measure we use the maximal lod score Z(max). It is well known that the maximal lod score has asymptotically a (2 ln 10)(-1) x (1/2 chi2(0) + 1/2 chi2(1)) distribution under the null hypothesis of no linkage when only one point (one marker) on the chromosome is studied. In this paper, we show, both by simulations and theoretical arguments, that the null hypothesis distribution of Zmax has no simple form when more than one marker is used (multipoint analysis). In fact, the distribution of Zmax depends on the number of families, their structure, the assumed genetic model, marker denseness, and marker informativity. This means that a constant critical limit of Zmax leads to tests associated with different significance levels. Because of the above-mentioned problems, from the statistical point of view the maximal lod score should be supplemented by a p-value when results are reported.  相似文献   

15.
We present here four nonparametric statistics for linkage analysis that test whether pairs of affected relatives share marker alleles more often than expected. These statistics are based on simulating the null distribution of a given statistic conditional on the unaffecteds' marker genotypes. Each statistic uses a different measure of marker sharing: the SimAPM statistic uses the simulation-based affected-pedigree-member measure based on identity-by-state (IBS) sharing. The SimKIN (kinship) measure is 1.0 for identity-by-descent (IBD) sharing, 0.0 for no IBD status sharing, and the kinship coefficient when the IBD status is ambiguous. The simulation-based IBD (SimIBD) statistic uses a recursive algorithm to determine the probability of two affecteds sharing a specific allele IBD. The SimISO statistic is identical to SimIBD, except that it also measures marker similarity between unaffected pairs. We evaluated our statistics on data simulated under different two-locus disease models, comparing our results to those obtained with several other nonparametric statistics. Use of IBD information produces dramatic increases in power over the SimAPM method, which uses only IBS information. The power of our best statistic in most cases meets or exceeds the power of the other nonparametric statistics. Furthermore, our statistics perform comparisons between all affected relative pairs within general pedigrees and are not restricted to sib pairs or nuclear families.  相似文献   

16.
Methods based on variance components are powerful tools for linkage analysis of quantitative traits, because they allow simultaneous consideration of all pedigree members. The central idea is to identify loci making a significant contribution to the population variance of a trait, by use of allele-sharing probabilities derived from genotyped marker loci. The technique is only as powerful as the methods used to infer these probabilities, but, to date, no implementation has made full use of the inheritance information in mapping data. Here we present a new implementation that uses an exact multipoint algorithm to extract the full probability distribution of allele sharing at every point in a mapped region. At each locus in the region, the program fits a model that partitions total phenotypic variance into components due to environmental factors, a major gene at the locus, and other unlinked genes. Numerical methods are used to derive maximum-likelihood estimates of the variance components, under the assumption of multivariate normality. A likelihood-ratio test is then applied to detect any significant effect of the hypothesized major gene. Simulations show the method to have greater power than does traditional sib-pair analysis. The method is freely available in a new release of the software package GENEHUNTER.  相似文献   

17.
Different approaches to analytical calculation of the power in linkage analysis whose efficiencies have been demonstrated in small pedigrees are described. The possibility of applying them to large pedigrees has been estimated. Equivalence of the analytical estimates of power obtained using different approaches based on analysis of individual pairs of relatives has been shown. The accuracy of the resultant analytical estimation of the power has been analyzed for three large pedigrees and a wide range of quantitative trait inheritance models; the accuracy has been demonstrated to be sufficiently high and equal to the accuracy in the case of small pedigrees.  相似文献   

18.
The linkage of polymorphic DNA markers on chromosome 7 to cystic fibrosis (CF) was examined in two pedigrees and a number of smaller nuclear families. The pedigrees are multigenerational and together consist of more than 300 members including 30 affected individuals, while the nuclear families each have two generations and either two or three children with CF. Tight linkage was observed between the CF locus and the met oncogene locus theta = 0, zeta = 15.45), pJ3.11 (theta = 0, zeta = 10.07), and 7C22 (theta = 0, zeta = 6.64) in both the pedigrees and nuclear families with no evidence for recombination between CF and any of the DNA markers. Weaker linkage between the CF locus and the locus for the serum enzyme activity marker paraoxonase (PON) was detected, theta = 0.18, zeta = 0.76. The two pedigrees were sufficiently informative to detect significant linkage between CF and each of the three DNA markers previously shown to be tightly linked to the CF locus. These results establish a locus for CF in these pedigrees in the region of chromosome 7 nearest the three DNA markers met, pJ3.11, and 7C22 and are consistent with locus homogeneity for the defect causing CF in these populations and others that have been examined to date.  相似文献   

19.
We have compared the power of a large number of allele-sharing statistics for "nonparametric" linkage analysis with affected sibships. Our rationale was that there is an extensive literature comparing statistics for sibling pairs but that there has not been much guidance on how to choose statistics for studies that include sibships of various sizes. We concentrated on statistics that can be described as assigning scores to each identity-by-descent-sharing configuration that a pedigree might take on (Whittemore and Halpern 1994). We considered sibships of sizes two through five, 27 different genetic models, and varying recombination fractions between the marker and the trait locus. We tried to identify statistics whose power was robust over a wide variety of models. We found that the statistic that is probably used most often in such studies-S(all)-performs quite well, although it is not necessarily the best. We also found several other statistics (such as the R criterion, S(robdom), and the Sobel-and-Lange statistic C) that perform well in most situations, a few (such as S(-#geno) and the Feingold-and-Siegmund version of S(pairs)) that have high power only in very special situations, and a few (such as S(-#geno), the N criterion, and the Sobel-and-Lange statistic B) that seem to have low power for the majority of the trait models. For the most part, the same statistics performed well for all sibship sizes. We also used our results to give some suggestions regarding how to weight sibships of different sizes, in forming an overall statistic.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号