首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In studies of complex diseases, a common paradigm is to conduct association analysis at markers in regions identified by linkage analysis, to attempt to narrow the region of interest. Family-based tests for association based on parental transmissions to affected offspring are often used in fine-mapping studies. However, for diseases with late onset, parental genotypes are often missing. Without parental genotypes, family-based tests either compare allele frequencies in affected individuals with those in their unaffected siblings or use siblings to infer missing parental genotypes. An example of the latter approach is the score test implemented in the computer program TRANSMIT. The inference of missing parental genotypes in TRANSMIT assumes that transmissions from parents to affected siblings are independent, which is appropriate when there is no linkage. However, using computer simulations, we show that, when the marker and disease locus are linked and the data set consists of families with multiple affected siblings, this assumption leads to a bias in the score statistic under the null hypothesis of no association between the marker and disease alleles. This bias leads to an inflated type I error rate for the score test in regions of linkage. We present a novel test for association in the presence of linkage (APL) that correctly infers missing parental genotypes in regions of linkage by estimating identity-by-descent parameters, to adjust for correlation between parental transmissions to affected siblings. In simulated data, we demonstrate the validity of the APL test under the null hypothesis of no association and show that the test can be more powerful than the pedigree disequilibrium test and family-based association test. As an example, we compare the performance of the tests in a candidate-gene study in families with Parkinson disease.  相似文献   

2.
Parametric linkage analysis is usually used to find chromosomal regions linked to a disease (phenotype) that is described with a specific genetic model. This is done by investigating the relations between the disease and genetic markers, that is, well-characterized loci of known position with a clear Mendelian mode of inheritance. Assume we have found an interesting region on a chromosome that we suspect is linked to the disease. Then we want to test the hypothesis of no linkage versus the alternative one of linkage. As a measure we use the maximal lod score Z(max). It is well known that the maximal lod score has asymptotically a (2 ln 10)(-1) x (1/2 chi2(0) + 1/2 chi2(1)) distribution under the null hypothesis of no linkage when only one point (one marker) on the chromosome is studied. In this paper, we show, both by simulations and theoretical arguments, that the null hypothesis distribution of Zmax has no simple form when more than one marker is used (multipoint analysis). In fact, the distribution of Zmax depends on the number of families, their structure, the assumed genetic model, marker denseness, and marker informativity. This means that a constant critical limit of Zmax leads to tests associated with different significance levels. Because of the above-mentioned problems, from the statistical point of view the maximal lod score should be supplemented by a p-value when results are reported.  相似文献   

3.
Lemire M 《BMC genetics》2005,6(Z1):S159
A simple multipoint procedure to test for parent-of-origin effects in samples of affected siblings is discussed. The procedure consists of artificially changing all full sibs to half-sibs, with distinct mothers or fathers depending on the parental origin to be evaluated, then analyzing these families with commonly used statistics and software. The procedure leads to tests for linkage through mothers or fathers and also leads to a test for imprinting effects in the presence of linkage. Moreover, simulations illustrate that in regions unlinked to susceptibility genes this multipoint procedure does not have an inflated type I error if a sex-averaged genetic map is used, even when large differences exist between male-specific and female-specific maps. In regions linked with susceptibility genes, the test of imprinting is biased under the null hypothesis if differences exist between sex-specific maps, irrespective of the map used in the analysis. The procedure is applied to the Collaborative Study on the Genetics of Alcoholism dataset from the Genetic Analysis Workshop 14. Results indicate that brothers categorized as affected according to the DMS-III-R and Feighner classification show evidence of linkage through fathers to the 6q25 region (p = 0.00038) as well as modest evidence of imprinting (p = 0.018). This region harbors OPRM1, a candidate gene for substance dependence.  相似文献   

4.
Tests for linkage and association in nuclear families.   总被引:12,自引:4,他引:8       下载免费PDF全文
The transmission/disequilibrium test (TDT) originally was introduced to test for linkage between a genetic marker and a disease-susceptibility locus, in the presence of association. Recently, the TDT has been used to test for association in the presence of linkage. The motivation for this is that linkage analysis typically identifies large candidate regions, and further refinement is necessary before a search for the disease gene is begun, on the molecular level. Evidence of association and linkage may indicate which markers in the region are closest to a disease locus. As a test of linkage, transmissions from heterozygous parents to all of their affected children can be included in the TDT; however, the TDT is a valid chi2 test of association only if transmissions to unrelated affected children are used in the analysis. If the sample contains independent nuclear families with multiple affected children, then one procedure that has been used to test for association is to select randomly a single affected child from each sibship and to apply the TDT to those data. As an alternative, we propose two statistics that use data from all of the affected children. The statistics give valid chi2 tests of the null hypothesis of no association or no linkage and generally are more powerful than the TDT with a single, randomly chosen, affected child from each family.  相似文献   

5.
Genes involved in cellular mechanisms to repair oxidative damage are strong candidates as etiologic factors for Alzheimer's disease (AD). One important enzyme involved in this mechanism is superoxide dismutase 2 (SOD2). The gene for this enzyme lies within a single haplotype block at 6q25.3, a region showing evidence for linkage to AD in a genome scan. We genotyped four single nucleotide polymorphisms (SNPs) in SOD2 in families of the National Institute of Mental Health-AD Genetics Initiative (ADGI): rs2758346 in the 5' untranslated region (UTR), rs4880 in exon 2, rs2855116 in intron 3 and rs5746136 in the 3'UTR. Under a dominant model, family-based association tests showed significant evidence for association of AD with the first three loci in a candidate gene set of families with individuals having age of onset of at least 50 years and two affected and one unaffected sibling, and in a late-onset subset of families (families with all affected individuals having age of onset of at least 65 years) from the full ADGI sample. The alleles transmitted more frequently to cases than expected under the null hypothesis were T, C, G, and G. Global tests of the transmission of haplotypes indicate that the first two loci have the most consistent association with risk of AD. Because of the high linkage disequilibrium in this small (14 kb) gene, and the presence of 100 SNPs in this gene, 26 of which may have functional significance, additional genotyping and sequencing are needed to identify the functionally relevant SNP. We discuss the importance of our findings and the relevance of SOD2 to AD risk.  相似文献   

6.
E. Zouros 《Genetica》1993,89(1-3):35-46
Expressions are obtained for the expected phenotypic values of homozygous and heterozygous genotypes for a neutral marker locus linked to a locus segregating for a recessive deleterious gene. The phenotypic values are functions of the allele frequencies at the marker locus, the inbreeding coefficient and the degree of association of the deleterious gene with the marker alleles. The analysis is extended to more than two alleles at the marker locus. Either linkage disequilibrium or inbreeding alone can produce an apparent superiority of heterozygotes for the marker locus (unless specified otherwise, the terms ‘homozygote’ and ‘heterozygote’ will refer to the marker locus). The effect of linkage disequilibrium on the difference between the heterozygote and homozygote values can be positive (associative overdominance) or negative (associative underdominance), depending on the frequencies of the marker alleles and the degree of their association with the deleterious gene. Inbreeding has always a positive effect. In general, the expected value of a homozygote is a positive function of its allele frequency. When the various homozygous genotypes are combined into one class and the various heterozygous genotypes into another, the phenotypic difference of the two classes is a function of the evenness of the allelic frequency distribution. Inbreeding is a more likely explanation of associative overdominance if the frequency of the deleterious gene is low, but its effect on the character high. Conversely, linkage disequilibrium is more likely if the frequency is high and the effect low. The degrees of association between marker alleles and the deleterious gene can, in principle, be estimated from the observed phenotypic scores and used to calculate expected multi-locus genotype scores. This could provide the basis for statistical tests of the associative overdominance hypothesis as an explanation of observed correlations between multi-locus heterozygosity and phenotypic traits.  相似文献   

7.
A population association has consistently been observed between insulin-dependent diabetes mellitus (IDDM) and the "class 1" alleles of the region of tandem-repeat DNA (5'' flanking polymorphism [5''FP]) adjacent to the insulin gene on chromosome 11p. This finding suggests that the insulin gene region contains a gene or genes contributing to IDDM susceptibility. However, several studies that have sought to show linkage with IDDM by testing for cosegregation in affected sib pairs have failed to find evidence for linkage. As means for identifying genes for complex diseases, both the association and the affected-sib-pairs approaches have limitations. It is well known that population association between a disease and a genetic marker can arise as an artifact of population structure, even in the absence of linkage. On the other hand, linkage studies with modest numbers of affected sib pairs may fail to detect linkage, especially if there is linkage heterogeneity. We consider an alternative method to test for linkage with a genetic marker when population association has been found. Using data from families with at least one affected child, we evaluate the transmission of the associated marker allele from a heterozygous parent to an affected offspring. This approach has been used by several investigators, but the statistical properties of the method as a test for linkage have not been investigated. In the present paper we describe the statistical basis for this "transmission test for linkage disequilibrium" (transmission/disequilibrium test [TDT]). We then show the relationship of this test to tests of cosegregation that are based on the proportion of haplotypes or genes identical by descent in affected sibs. The TDT provides strong evidence for linkage between the 5''FP and susceptibility to IDDM. The conclusions from this analysis apply in general to the study of disease associations, where genetic markers are usually closely linked to candidate genes. When a disease is found to be associated with such a marker, the TDT may detect linkage even when haplotype-sharing tests do not.  相似文献   

8.
Genomewide association (GWA) studies assay hundreds of thousands of single nucleotide polymorphisms (SNPs) simultaneously across the entire genome and associate them with diseases, other biological or clinical traits. The association analysis usually tests each SNP as an independent entity and ignores the biological information such as linkage disequilibrium. Although the Bonferroni correction and other approaches have been proposed to address the issue of multiple comparisons as a result of testing many SNPs, there is a lack of understanding of the distribution of an association test statistic when an entire genome is considered together. In other words, there are extensive efforts in hypothesis testing, and almost no attempt in estimating the density under the null hypothesis. By estimating the true null distribution, we can apply the result directly to hypothesis testing; better assess the existing approaches of multiple comparisons; and evaluate the impact of linkage disequilibrium on the GWA studies. To this end, we estimate the empirical null distribution of an association test statistic in GWA studies using simulated population data. We further propose a convenient and accurate method based on adaptive spline to estimate the empirical value in GWA studies and validate our findings using a real data set. Our method enables us to fully characterize the null distribution of an association test that not only can be used to test the null hypothesis of no association, but also provides important information about the impact of density of the genetic markers on the significance of the tests. Our method does not require users to perform computationally intensive permutations, and hence provides a timely solution to an important and difficult problem in GWA studies.  相似文献   

9.
The transmission/disequilibrium test (TDT) [Spielman et al.: Am J Hum Genet 1993;52:506-516] has been postulated as the future of gene mapping for complex diseases, provided one is able to genotype a dense enough map of markers across the genome. Risch and Merikangas [Science 1996;273:1516-1517] suggested a million-marker screen in affected sibpair (ASP) families, demonstrating that the TDT is a more powerful test of linkage than traditional linkage tests based on allele-sharing when there is also association between marker and disease alleles. While the future of genotyping has arrived, successes in family-based association studies have been modest. This is often attributed to excessive false positives in candidate gene studies. This problem is only exacerbated by the increasing numbers of whole genome association (WGA) screens. When applied in ASPs, the TDT statistic, which assumes transmissions to siblings are independent, is not expected to have a constant variance in the presence of variable linkage. This results in generally more extreme statistics, hence will further aggravate the problem of having a large number of positive results to sort through. So an important question is how many positive TDT results will show up on a chromosome containing a disease gene due only to linkage, and will they obfuscate the true disease gene location. To answer this question we combined theory and computer simulations. These studies show that in ASPs the normal version of the TDT statistic has a mean of 0 and a variance of 1 in unlinked regions, but has a variance larger than 1 in linked regions. In contrast, the pedigree disequilibrium test (PDT) statistic adjusts for correlation between siblings due to linkage and maintains a constant variance of 1 at unassociated markers irrespective of linkage. The TDT statistic is generally larger than the PDT statistic across linked regions. This is true for unassociated as well as associated markers. To compare the two tests we ranked both statistics at the disease locus, or an associated marker, among statistics at all other markers. The TDT did better job than PDT placing the score of the associated marker near the top. Though, strictly speaking, the TDT in ASPs should be interpreted as a test of linkage and not a test of association, there is a good chance that if a marker stands out, the marker is associated as well as linked. In conclusion, our results suggest that TDT is an effective screening tool for WGA studies, especially in multiplex families.  相似文献   

10.

Background

Canine atopic dermatitis (AD) is a common, heritable, chronic allergic skin condition prevalent in the West Highland White Terrier (WHWT). In canine AD, environmental allergens trigger an inflammatory response causing visible skin lesions and chronic pruritus that can lead to secondary bacterial and yeast infections. The disorder shares many of the clinical and histopathological characteristics of human AD and represents an animal model of this disorder that could be used to further elucidate genetic causes of human AD. Microsatellite markers genotyped in families of WHWTs affected with AD were used to perform a genome-wide linkage study in order to isolate chromosomal regions associated with the disorder.

Results

Blood samples and health questionnaires were collected from 108 WHWTs spanning three families. A linkage simulation using these 108 dogs showed high power to detect a highly penetrant mutation. Ninety WHWTs were genotyped using markers from the Minimal Screening Set 2 (MSS-2). Two hundred and fifty six markers were informative and were used for linkage analysis. Using a LOD score of 2.7 as a significance threshold, no chromosomal regions were identified with significant linkage to AD. LOD scores greater than 1.0 were located in a 56 cM region of chromosome 7.

Conclusions

The study was unable to detect any chromosomal regions significantly linked to canine AD. This could be a result of factors such as environmental modification of phenotype, incorrect assignment of phenotype, a mutation of low penetrance, or incomplete genome coverage. A genome-wide SNP association study in a larger cohort of WHWTs may prove more successful by providing higher density coverage and higher statistical power.  相似文献   

11.
The genetic basis of the transmission disequilibrium test (TDT) for two-marker loci is explored from first principles. In this case, parents doubly heterozygous for a given haplotype at the pair of marker loci that are each in linkage disequilibrium with the disease gene with the further possibility of a second-order linkage disequilibrium are considered. The number of times such parents transmit the given haplotype to their affected offspring is counted and compared with the frequencies of haplotypes that are not transmitted. This is done separately for the coupling and repulsion phases of doubly heterozygous genotypes. Expectations of the counts for each of the sixteen cells possible with four-marker gametic types (transmitted vs not transmitted) are derived. Based on a test of symmetry in a square 4 × 4 contingency table, chi-square tests are proposed for the null hypothesis of no linkage between the markers and the disease gene. The power of the tests is discussed in terms of the corresponding non-centrality parameters for the alternative hypothesis that both the markers are linked with the disease locus. The results indicate that the power increases with the decrease in recombination probability and that it is higher for a lower frequency of the disease gene. Taking a pair of markers in an interval for exploring the linkage with the disease gene seems to be more informative than the single-marker case since the values of the non-centrality parameters tend to be consistently higher than their counterparts in the single-marker case. Limitations of the proposed test are also discussed.  相似文献   

12.
The genetic basis of the transmission disequilibrium test (TDT) for two-marker loci is explored from first principles. In this case, parents doubly heterozygous for a given haplotype at the pair of marker loci that are each in linkage disequilibrium with the disease gene with the further possibility of a second-order linkage disequilibrium are considered. The number of times such parents transmit the given haplotype to their affected offspring is counted and compared with the frequencies of haplotypes that are not transmitted. This is done separately for the coupling and repulsion phases of doubly heterozygous genotypes. Expectations of the counts for each of the sixteen cells possible with four-marker gametic types (transmitted vs not transmitted) are derived. Based on a test of symmetry in a square 4 x 4 contingency table, chi-square tests are proposed for the null hypothesis of no linkage between the markers and the disease gene. The power of the tests is discussed in terms of the corresponding non-centrality parameters for the alternative hypothesis that both the markers are linked with the disease locus. The results indicate that the power increases with the decrease in recombination probability and that it is higher for a lower frequency of the disease gene. Taking a pair of markers in an interval for exploring the linkage with the disease gene seems to be more informative than the single-marker case since the values of the non-centrality parameters tend to be consistently higher than their counterparts in the single-marker case. Limitations of the proposed test are also discussed.  相似文献   

13.
14.
There is increasing evidence that pleiotropy, the association of multiple traits with the same genetic variants/loci, is a very common phenomenon. Cross-phenotype association tests are often used to jointly analyze multiple traits from a genome-wide association study (GWAS). The underlying methods, however, are often designed to test the global null hypothesis that there is no association of a genetic variant with any of the traits, the rejection of which does not implicate pleiotropy. In this article, we propose a new statistical approach, PLACO, for specifically detecting pleiotropic loci between two traits by considering an underlying composite null hypothesis that a variant is associated with none or only one of the traits. We propose testing the null hypothesis based on the product of the Z-statistics of the genetic variants across two studies and derive a null distribution of the test statistic in the form of a mixture distribution that allows for fractions of variants to be associated with none or only one of the traits. We borrow approaches from the statistical literature on mediation analysis that allow asymptotic approximation of the null distribution avoiding estimation of nuisance parameters related to mixture proportions and variance components. Simulation studies demonstrate that the proposed method can maintain type I error and can achieve major power gain over alternative simpler methods that are typically used for testing pleiotropy. PLACO allows correlation in summary statistics between studies that may arise due to sharing of controls between disease traits. Application of PLACO to publicly available summary data from two large case-control GWAS of Type 2 Diabetes and of Prostate Cancer implicated a number of novel shared genetic regions: 3q23 (ZBTB38), 6q25.3 (RGS17), 9p22.1 (HAUS6), 9p13.3 (UBAP2), 11p11.2 (RAPSN), 14q12 (AKAP6), 15q15 (KNL1) and 18q23 (ZNF236).  相似文献   

15.
We applied three approaches for the identification of polymorphisms explaining the linkage evidence to the Genetic Analysis Workshop 14 simulated data: 1) the genotype-IBD sharing test (GIST); 2) an approach suggested by Horikawa and colleagues; and 3) the homozygote sharing test (HST). These tests were compared with a family-based association test. Two linked regions with highest nonparametric linkage scores were selected to apply these methods. In the first region, Horikawa's method identified the most SNPs within the region containing the disease susceptibility locus, while HST performed best in the second region. However, Horikawa's method also had the most type I errors. These methods show potential as additional tools to complement family-based association tests for the identification of disease susceptibility variants.  相似文献   

16.
Selective genotyping is common because it can increase the expected correlation between QTL genotype and phenotype and thus increase the statistical power of linkage tests (i.e., regression-based tests). Linkage can also be tested by assessing whether the marginal genotypic distribution conforms to its expectation, a marginal-based test. We developed a class of joint tests that, by constraining intercepts in regression-based analyses, capitalize on the information available in both regression-based and marginal-based tests. We simulated data corresponding to the null hypothesis of no QTL effect and the alternative of some QTL effect at the locus for a backcross and an F2 intercross between inbred strains. Regression-based and marginal-based tests were compared to corresponding joint tests. We studied the effects of random sampling, selective sampling from a single tail of the phenotypic distribution, and selective sampling from both tails of the phenotypic distribution. Joint tests were nearly as powerful as all competing alternatives for random sampling and two-tailed selection under both backcross and F2 intercross situations. Joint tests were generally more powerful for one-tailed selection under both backcross and F2 intercross situations. However, joint tests cannot be recommended for one-tailed selective genotyping if segregation distortion is suspected.  相似文献   

17.
Genetic factors influence the development of type II diabetes mellitus, but genetic loci for the most common forms of diabetes have not been identified. A genomic scan was conducted to identify loci linked to diabetes and body-mass index (BMI) in Pima Indians, a Native American population with a high prevalence of type II diabetes. Among 264 nuclear families containing 966 siblings, 516 autosomal markers with a median distance between adjacent markers of 6.4 cM were genotyped. Variance-components methods were used to test for linkage with an age-adjusted diabetes score and with BMI. In multipoint analyses, the strongest evidence for linkage with age-adjusted diabetes (LOD = 1.7) was on chromosome 11q, in the region that was also linked most strongly with BMI (LOD = 3.6). Bivariate linkage analyses strongly rejected both the null hypothesis of no linkage with either trait and the null hypothesis of no contribution of the locus to the covariation among the two traits. Sib-pair analyses suggest additional potential diabetes-susceptibility loci on chromosomes 1q and 7q.  相似文献   

18.
We obtained conclusive linkage of Alzheimer disease (AD) with a candidate region of 19.7 cM at 7q36 in an extended multiplex family, family 1270, ascertained in a population-based study of early-onset AD in the northern Netherlands. Single-nucleotide polymorphism and haplotype association analyses of a Dutch patient-control sample further supported the linkage at 7q36. In addition, we identified a shared haplotype at 7q36 between family 1270 and three of six multiplex AD-affected families from the same geographical region, which is indicative of a founder effect and defines a priority region of 9.3 cM. Mutation analysis of coding exons of 29 candidate genes identified one linked synonymous mutation, g.38030G-->C in exon 10, that affected codon 626 of the PAX transactivation domain interacting protein gene (PAXIP1). It remains to be determined whether PAXIP1 has a functional role in the expression of AD in family 1270 or whether another mutation at this locus explains the observed linkage and sharing. Together, our linkage data from the informative family 1270 and the association data in the population-based early-onset AD patient-control sample strongly support the identification of a novel AD locus at 7q36 and re-emphasize the genetic heterogeneity of AD.  相似文献   

19.
Zhu X  Elston RC  Cooper RS 《Human heredity》2001,51(4):183-191
Zhu and Elston developed a transmission disequilibrium test for quantitative traits by defining a linear transformation to condition out founder information. The method tests the null hypothesis of no linkage or association and can be applied to general pedigree structures. However, this method requires both genotype and phenotype parental information, which may be difficult to obtain. In this paper, we describe parametric and non-parametric methods to relax this requirement when only nuclear families are sampled. We show that neither method is affected by population stratification in the absence of linkage. The statistical power and validity of the tests are investigated by simulation. A simple simulation method to calculate the power of the nonparametric method is also discussed. In practice, the data may have some families with parental phenotype and genotype information available and some without. We briefly discuss how all the data may be analyzed jointly.  相似文献   

20.
A robust statistical method to detect linkage or association between a genetic marker and a set of distinct phenotypic traits is to combine univariate trait-specific test statistics for a more powerful overall test. This procedure does not need complex modeling assumptions, can easily handle the problem with partially missing trait values, and is applicable to the case with a mixture of qualitative and quantitative traits. In this note, we propose a simple test procedure along this line, and show its advantages over the standard combination tests for linkage or association in the literature through a data set from Genetic Analysis Workshop 12 (GAW12) and an extensive simulation study.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号