首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 93 毫秒
1.
Nuclear families with multiple affected sibs are often collected for genetic linkage analysis of complex diseases. Once linkage evidence is established, dense markers are often typed in the linked region for genetic association analysis based on linkage disequilibrium (LD). Detection of association in the presence of linkage localizes disease genes more accurately than the methods that rely on linkage alone. However, test of association due to LD in the linked region needs to account for dependency of the allele transmissions to different sibs within a family. In this paper, we define a joint model for genetic linkage and association and derive the corresponding joint survival function of age of onset for the sibs within a sibship. The joint survival function is a function of both the inheritance vector and the genotypes at the candidate marker locus. Based on this joint survival function, we derive score tests for genetic association. The proposed methods utilize the phenotype data of all the sibs and have the advantages of family-based designs which can avoid the potential spurious association caused by population admixture. In addition, the methods can account for variable age of onset or age at censoring and possible covariate effects, and therefore provide important tools for modelling disease heterogeneity. Simulation studies and application to the data sets from the 12th Genetic Analysis Workshop indicate that the proposed methods have correct type 1 error rates and increased power over other existing methods for testing allelic association.  相似文献   

2.
Fan R  Jung J 《Human heredity》2003,56(4):166-187
This paper proposes variance component models for high resolution joint linkage disequilibrium (LD) and linkage mapping of quantitative trait loci (QTL) based on sibship data; this can include population data if independent individuals are treated as single sibships. One application of these models is late onset complex disease gene mapping, when parental data are not available. The models simultaneously incorporate both LD and linkage information. The LD information is contained in mean coefficients of sibship data. The linkage information is contained in the variance-covariance matrices of trait values for sibships with at least two siblings. We derive formulas for calculating the probability of sharing two trait alleles identical by descent (IBD) for sibpairs in interval mapping of QTL; this is the coefficient of dominant variance of the trait covariance of sibpairs on major QTL. To investigate the performance of the formulas, we calculate the numerical values via the formulas and get satisfactory approximations. We compare the power and sample sizes for both LD and linkage mapping. By simulation and theoretical analysis, we compare the results with those of Fulker and Abecasis "AbAw" approach. It is well known that the resolution of linkage analysis can be low for complex disease gene mapping. LD mapping, on the other hand, can increase mapping precision and is useful in high resolution mapping. Linkage analysis is less sensitive to population subdivisions and admixtures. The level of LD is sensitive to population stratification which may easily lead to spurious association. Performing a joint analysis of LD and linkage mapping can help to overcome the limits of both approaches. Moreover, the advantages of the two complementary strategies can be utilized maximally. In practice, linkage analysis may be performed using pedigree data to identify suggestive linkage between markers and trait loci based on a sparse marker map. In the presence of linkage, joint LD and linkage mapping can be carried out to do fine gene mapping based on a dense genetic map using both pedigree and population data. Population and pedigree data of any type can be combined to perform a joint analysis of high resolution LD and linkage mapping of QTL by generalizing the method.  相似文献   

3.
In this paper we present a novel method for selecting optimally informative sibships of any size for quantitative trait locus (QTL) linkage analysis. The method allocates a quantitative index of potential informativeness to each sibship on the basis of observed trait scores and an assumed true QTL model. Any sample of phenotypically screened sibships can therefore be easily rank-ordered for selective genotyping. The quantitative index is the sibship's expected contribution to the non-centrality parameter. This expectation represents the weighted sum of chi(2) test statistics that would be obtained given the observed trait values over all possible sibship genotypic configurations; each configuration is weighted by the likelihood of it occurring given the assumed true genetic model. The properties of this procedure are explored in relation to the accuracy of the assumed true genetic model and sibship size. In comparison to previous methods of selecting phenotypically extreme sibships for genotyping, the proposed method is considerably more efficient and is robust with regard to the specification of the genetic model.  相似文献   

4.
Zhao Y  Yu H  Zhu Y  Ter-Minassian M  Peng Z  Shen H  Diao N  Chen F 《PloS one》2012,7(2):e31134
Family based association study (FBAS) has the advantages of controlling for population stratification and testing for linkage and association simultaneously. We propose a retrospective multilevel model (rMLM) approach to analyze sibship data by using genotypic information as the dependent variable. Simulated data sets were generated using the simulation of linkage and association (SIMLA) program. We compared rMLM to sib transmission/disequilibrium test (S-TDT), sibling disequilibrium test (SDT), conditional logistic regression (CLR) and generalized estimation equations (GEE) on the measures of power, type I error, estimation bias and standard error. The results indicated that rMLM was a valid test of association in the presence of linkage using sibship data. The advantages of rMLM became more evident when the data contained concordant sibships. Compared to GEE, rMLM had less underestimated odds ratio (OR). Our results support the application of rMLM to detect gene-disease associations using sibship data. However, the risk of increasing type I error rate should be cautioned when there is association without linkage between the disease locus and the genotyped marker.  相似文献   

5.
Familial clustering and linkage disequilibrium studies suggest that genetic factors predispose to vitiligo, although a clear transmission pattern and cosegregation of vitiligo with specific mutations have not been demonstrated. We collected pedigree data on vitiligo from a set of 56 multigeneration families belonging to the Paisa community from Antioquia, Colombia, with the goal of applying the unified model of complex segregation and linkage disequilibrium analyses to test the hypotheses of the existence of a major gene predisposing to vitiligo and that allelic or haplotype polymorphisms of microsatellite loci at 6p21.3-21.4 spanning HLA (D6S276, D6S265, D6S273, and D6S291) are associated with this predisposition. Minimum sibship sample size to discriminate dominant and recessive inheritance models was largely accomplished. Between the 15 models of complex segregation used, the one that best fitted the data was that of a major dominant gene and the existence of strong environmental effects acting on the recessive genotype. The penetrance and risk estimations discriminated two sets of vitiligo patients: those with early onset of vitiligo cosegregating with a dominant mode of inheritance without environmental effects, and those with late onset of vitiligo cosegregating with the recessive genotype and being influenced by environmental effects. After establishing the normal distribution of allelic frequencies and performing multiple comparisons correction, the linkage disequilibrium analysis suggested that a major genetic factor could be located at 6p21.3-21.4, because we detected significant case-control differences for allele 122 at D6S265 ( Pc=0.0264) and significant linkage disequilibrium between loci D6S276 and D6S273 in the cases but not in the controls. We cannot explain these results as a consequence of evolutionary forces or as genetic stratification acting differentially on cases and controls, because there was neither deviation from the Hardy-Weinberg expectations nor genetic subdivision between cases and controls, as theta; (non-biased F(ST)) was not significantly different from 0.  相似文献   

6.
Liu PY  Lu Y  Deng HW 《Genetics》2006,174(1):499-509
Sibships are commonly used in genetic dissection of complex diseases, particularly for late-onset diseases. Haplotype-based association studies have been advocated as powerful tools for fine mapping and positional cloning of complex disease genes. Existing methods for haplotype inference using data from relatives were originally developed for pedigree data. In this study, we proposed a new statistical method for haplotype inference for multiple tightly linked single-nucleotide polymorphisms (SNPs), which is tailored for extensively accumulated sibship data. This new method was implemented via an expectation-maximization (EM) algorithm without the usual assumption of linkage equilibrium among markers. Our EM algorithm does not incur extra computational burden for haplotype inference using sibship data when compared with using unrelated parental data. Furthermore, its computational efficiency is not affected by increasing sibship size. We examined the robustness and statistical performance of our new method in simulated data created from an empirical haplotype data set of human growth hormone gene 1. The utility of our method was illustrated with an application to the analyses of haplotypes of three candidate genes for osteoporosis.  相似文献   

7.
A new method for segregation and linkage analysis, with pedigree data, is described. Reversible jump Markov chain Monte Carlo methods are used to implement a sampling scheme in which the Markov chain can jump between parameter subspaces corresponding to models with different numbers of quantitative-trait loci (QTL's). Joint estimation of QTL number, position, and effects is possible, avoiding the problems that can arise from misspecification of the number of QTL's in a linkage analysis. The method is illustrated by use of a data set simulated for the 9th Genetic Analysis Workshop; this data set had several oligogenic traits, generated by use of a 1,497-member pedigree. The mixing characteristics of the method appear to be good, and the method correctly recovers the simulated model from the test data set. The approach appears to have great potential both for robust linkage analysis and for the answering of more general questions regarding the genetic control of complex traits.  相似文献   

8.
We present a method for the multivariate linkage analysis of the age of onset of a disease. The approach allows the incorporation of covariates for the study of gene by environment interactions. It is applicable to general pedigrees. The likelihood of the data is expressed as a function of the number of alleles identical by descent at a marker, the censored ages of onset and disease status, and environmental exposures. In a simulation study, we compare the power to detect linkage under different sampling schemes for either a dominant or recessive trait when approximately 10% of individuals are gene carriers. The majority of the linkage information from a sample of randomly selected sib pairs was retained when the analyses were limited to sibships with one sibling having early-onset disease (<59 years old). Incorporating parental phenotypes could improve the power to detect the gene. When the sample consists of affected sib pairs (ASPs) having variable age of onset, the likelihood ratio (LR) test had higher power than the means (t(2)) test for detecting a locus with a large genetic relative risk (R(g) = 20). However, the power of the two tests was similar when ASPs are selected so that the proband has an early onset of disease. Lastly, the LR test had more power than the t(2) test to detect linkage in the presence of gene by environment interactions.  相似文献   

9.
Won S  Elston RC  Park T 《Human heredity》2006,61(2):111-119
We propose an extension to longitudinal data of the Haseman and Elston regression method for linkage analysis. The proposed model is a mixed model having several random effects. As response variable, we investigate the sibship sample mean corrected cross-product (smHE) and the BLUP-mean corrected cross product (pmHE), comparing them with the original squared difference (oHE), the overall mean corrected cross-product (rHE), and the weighted average of the squared difference and the squared mean-corrected sum (wHE). The proposed model allows for the correlation structure of longitudinal data. Also, the model can test for gene x time interaction to discover genetic variation over time. The model was applied in an analysis of the Genetic Analysis Workshop 13 (GAW13) simulated dataset for a quantitative trait simulating systolic blood pressure. Independence models did not preserve the test sizes, while the mixed models with both family and sibpair random effects tended to preserve size well.  相似文献   

10.
A simulation module is built into the software package colony to simulate marker genotype data of individuals with a predefined parentage and sibship structure. The simulated data can then be used to compare the accuracy, robustness and computational efficiency of different methods for sibship and parentage reconstruction, to examine the impact of different parameter options in a software on its accuracy and computational efficiency and to assess the information sufficiency of a given set of markers for a sibship and parentage analysis. This computer note describes the method used for simulating genotype data with a pedigree and its possible applications. The method can quickly generate genotype data for a one‐ or two‐generation pedigree of virtually any complexity with up to 30k offspring, at up to 30k codominant or dominant loci with an arbitrary degree of linkage and a user‐defined mistyping rate. The data can be fed directly into the colony program for analysis by three sibship and parentage reconstruction methods and can also be imported into other programs such as Excel and R. With slight modification, the data can be analysed by other relationship analysis software.  相似文献   

11.
It is usually difficult to localize genes that cause diseases with late ages at onset. These diseases frequently exhibit complex modes of inheritance, and only recent generations are available to be genotyped and phenotyped. In this situation, multipoint analysis using traditional exact linkage analysis methods, with many markers and full pedigree information, is a computationally intractable problem. Fortunately, Monte Carlo Markov chain sampling provides a tool to address this issue. By treating age at onset as a right-censored quantitative trait, we expand the methods used by Heath (1997) and illustrate them using an Alzheimer disease (AD) data set. This approach estimates the number, sizes, allele frequencies, and positions of quantitative trait loci (QTLs). In this simultaneous multipoint linkage and segregation analysis method, the QTLs are assumed to be diallelic and to interact additively. In the AD data set, we were able to localize correctly, quickly, and accurately two known genes, despite the existence of substantial genetic heterogeneity, thus demonstrating the great promise of these methods for the dissection of late-onset oligogenic diseases.  相似文献   

12.
Regional association analysis is one of the most powerful tools for gene mapping because instead analysis of individual variants it simultaneously considers all variants in the region. Recent development of the models for regional association analysis involves functional data analysis approach. In the framework of this approach, genotypes of variants within region as well as their effects are described by continuous functions. Such approach allows us to use information about both linkage and linkage disequilibrium and reduce the influence of noise and/or observation errors. Here we define a functional linear mixed model to test association on independent and structured samples. We demonstrate how to test fixed and random effects of a set of genetic variants in the region on quantitative trait. Estimation of statistical properties of new methods shows that type I errors are in accordance with declared values and power is high especially for models with fixed effects of genotypes. We suppose that new functional regression linear models facilitate identification of rare genetic variants controlling complex human and animal traits. New methods are implemented in computer software FREGAT which is available for free download at http://mga.bionet.nsc.ru/soft/FREGAT/.  相似文献   

13.
Huang J  Jiang Y 《Human heredity》2001,52(2):83-98
We study the properties of a modified lod score method for testing linkage that incorporates linkage disequilibrium (LD-lod). By examination of its score statistic, we show that the LD-lod score method adaptively combines two sources of information: (a) the IBD sharing score which is informative for linkage regardless of the existence of LD and (b) the contrast between allele-specific IBD sharing scores which is informative for linkage only in the presence of LD. We also consider the connection between the LD-lod score method and the transmission-disequilibrium test (TDT) for triad data and the mean test for affected sib pair (ASP) data. We show that, for triad data, the recessive LD-lod test is asymptotically equivalent to the TDT; and for ASP data, it is an adaptive combination of the TDT and the ASP mean test. We demonstrate that the LD-lod score method has relatively good statistical efficiency in comparison with the ASP mean test and the TDT for a broad range of LD and the genetic models considered in this report. Therefore, the LD-lod score method is an interesting approach for detecting linkage when the extent of LD is unknown, such as in a genome-wide screen with a dense set of genetic markers.  相似文献   

14.
Many plants and some animal species are polyploids. Nondisomically inherited markers (e.g. microsatellites) in such species cannot be analysed directly by standard population genetics methods developed for diploid species. One solution is to transform the polyploid codominant genotypes to pseudodiploid‐dominant genotypes, which can then be analysed by standard methods for various purposes such as spatial genetic structure, individual relatedness and relationship. Although this data transformation approach has been used repeatedly in the literature, no systematic study has been conducted to investigate how efficient it is, how much marker information is lost and thus how much analysis accuracy is reduced. More specifically, it is unknown whether or not the transformed data can be used to infer parentage and sibship jointly, and how different sampling schemes (number and polymorphism of markers, number of individuals) and ploidy level affect the inference accuracy. This study analyses both simulated and empirical data to examine the effects of polyploid levels, actual pedigree structures and marker number and polymorphism on the accuracy of joint parentage and sibship assignments in polyploid species. We show that sibship, parentage and selfing rates in polyploids can be inferred accurately from a typical set of microsatellite loci. We also show that inferences can be substantially improved by allowing for a small genotyping error rate to accommodate the distortion in assumed Mendelian inheritance of the converted markers when large sibship groups are involved. The results are discussed in the context of polyploid data analysis in molecular ecology.  相似文献   

15.
We propose a general likelihood-based approach to the linkage analysis of qualitative and quantitative traits using identity by descent (IBD) data from sib-pairs. We consider the likelihood of IBD data conditional on phenotypes and test the null hypothesis of no linkage between a marker locus and a gene influencing the trait using a score test in the recombination fraction theta between the two loci. This method unifies the linkage analysis of qualitative and quantitative traits into a single inferential framework, yielding a simple and intuitive test statistic. Conditioning on phenotypes avoids unrealistic random sampling assumptions and allows sib-pairs from differing ascertainment mechanisms to be incorporated into a single likelihood analysis. In particular, it allows the selection of sib-pairs based on their trait values and the analysis of only those pairs having the most informative phenotypes. The score test is based on the full likelihood, i.e. the likelihood based on all phenotype data rather than just differences of sib-pair phenotypes. Considering only phenotype differences, as in Haseman and Elston (1972) and Kruglyak and Lander (1995), may result in important losses in power. The linkage score test is derived under general genetic models for the trait, which may include multiple unlinked genes. Population genetic assumptions, such as random mating or linkage equilibrium at the trait loci, are not required. This score test is thus particularly promising for the analysis of complex human traits. The score statistic readily extends to accommodate incomplete IBD data at the test locus, by using the hidden Markov model implemented in the programs MAPMAKER/SIBS and GENEHUNTER (Kruglyak and Lander, 1995; Kruglyak et al., 1996). Preliminary simulation studies indicate that the linkage score test generally matches or outperforms the Haseman-Elston test, the largest gains in power being for selected samples of sib-pairs with extreme phenotypes.  相似文献   

16.
We describe the use of multivariate regression for testing allelic association in the presence of linkage, using marker genotype data from sibships. The test is valid, provided that the correct mean structure is modeled but does not require the correlation structure within families to be specified. The test can be implemented using standard statistical software such as the SAS programming language. In a simulation study, we evaluated this new test in comparison with one from a standard, matched-case-control analysis. First, we noted that the genetic effect needed to be quite extreme before residual familial correlation due to linkage led to false inference using the standard, matched-pair analysis. Second, we showed that under examples of extreme residual familial correlation, the new test had the correct test size. Third, we found that the test was more powerful than the sibship disequilibrium test of Horvath and Laird. Finally, we concluded that although the standard analysis may lead to correct inference for practical purposes, the new test is valid, even under extreme residual familial correlation and with no cost in power at the causal locus.  相似文献   

17.
Suppose that many polymorphic sites have been identified and genotyped in a region showing strong linkage with a trait. A key question of interest is which site (or combination of sites) in the region influences susceptibility to the trait. We have developed a novel statistical approach to this problem, in the context of qualitative-trait mapping, in which we use linkage data to identify the polymorphic sites whose genotypes could fully explain the observed linkage to the region. The information provided by this analysis is different from that provided by tests of either linkage or association. Our approach is based on the observation that if a particular site is the only site in the region that influences the trait, then-conditional on the genotypes at that site for the affected relatives-there should be no unexplained oversharing in the region among affected individuals. We focus on the affected sib-pair study design and develop test statistics that are variations on the usual allele-sharing methods used in linkage studies. We perform hypothesis tests and derive a confidence set for the true causal polymorphic site, under the assumption that there is only one site in the region influencing the trait. Our method is appropriate under a very general model for how the site influences the trait, including epistasis with unlinked loci, correlated environmental effects within families, and gene-environment interaction. We extend our method to larger sibships and apply it to an NIDDM1 data set.  相似文献   

18.
Once genetic linkage has been identified for a complex disease, the next step is often association analysis, in which single-nucleotide polymorphisms (SNPs) within the linkage region are genotyped and tested for association with the disease. If a SNP shows evidence of association, it is useful to know whether the linkage result can be explained, in part or in full, by the candidate SNP. We propose a novel approach that quantifies the degree of linkage disequilibrium (LD) between the candidate SNP and the putative disease locus through joint modeling of linkage and association. We describe a simple likelihood of the marker data conditional on the trait data for a sample of affected sib pairs, with disease penetrances and disease-SNP haplotype frequencies as parameters. We estimate model parameters by maximum likelihood and propose two likelihood-ratio tests to characterize the relationship of the candidate SNP and the disease locus. The first test assesses whether the candidate SNP and the disease locus are in linkage equilibrium so that the SNP plays no causal role in the linkage signal. The second test assesses whether the candidate SNP and the disease locus are in complete LD so that the SNP or a marker in complete LD with it may account fully for the linkage signal. Our method also yields a genetic model that includes parameter estimates for disease-SNP haplotype frequencies and the degree of disease-SNP LD. Our method provides a new tool for detecting linkage and association and can be extended to study designs that include unaffected family members.  相似文献   

19.
Haseman and Elston (H-E) proposed a robust test to detect linkage between a quantitative trait and a genetic marker. In their method the squared sib-pair trait difference is regressed on the estimated proportion of alleles at a locus shared identical by descent by sib pairs. This method has recently been improved by changing the dependent variable from the squared difference to the mean-corrected product of the sib-pair trait values, a significantly positive regression indicating linkage. Because situations arise in which the original test is more powerful, a further improvement of the H-E method occurs when the dependent variable is changed to a weighted average of the squared sib-pair trait difference and the squared sib-pair mean-corrected trait sum. Here we propose an optimal method of performing this weighting for larger sibships, allowing for the correlation between pairs within a sibship. The optimal weights are inversely proportional to the residual variances obtained from the two different regressions based on the squared sib-pair trait differences and the squared sib-pair mean-corrected trait sums, respectively, allowing for correlations among sib pairs. The proposed method is compared with the existing extension of the H-E approach for larger sibships. Control of the type I error probabilities for sibships of any size can be improved by using a generalized estimating equation approach and the robust sandwich estimate of the variance, or a Monte-Carlo permutation test.  相似文献   

20.
We have developed a recursive-partitioning (RP) algorithm for identifying phenotype and covariate groupings that interact with the evidence for linkage. This data-mining approach for detecting gene x environment interactions uses genotype and covariate data on affected relative pairs to find evidence for linkage heterogeneity across covariate-defined subgroups. We adapted a likelihood-ratio based test of linkage parameterized with relative risks to a recursive partitioning framework, including a cross-validation based deviance measurement for choosing optimal tree size and a bootstrap sampling procedure for choosing robust tree structure. ALDX2 category 5 individuals were considered affected, categories 1 and 3 unaffected, and all others unknown. We sampled non-overlapping affected relative pairs from each family; therefore, we used 144 affected pairs in the RP model. Twenty pair-level covariates were defined from smoking status, maximum drinks, ethnicity, sex, and age at onset. Using the all-pairs score in GENEHUNTER, the nonparametric linkage tests showed no regions with suggestive linkage evidence. However, using the RP model, several suggestive regions were found on chromosomes 2, 4, 6, 14, and 20, with detection of associated covariates such as sex and age at onset.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号