首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 343 毫秒
1.
OBJECTIVE: Linkage analysis using 22 Canadian pedigrees identified a promising schizophrenia candidate region on 1q23 with a maximum 2-point HLOD under a recessive model of 5.8 [Brzustowicz et al. 2000]. In the current study, we revisited this data set using a Bayesian linkage analysis technique, namely the posterior probability of linkage (PPL). METHODS: The PPL has been developed as an alternative to traditional linkage analysis. It differs from both LOD scores and 'non-parametric' methods in that it directly measures the probability of linkage given the data, and incorporates prior genomic information. RESULTS: As expected, PPL results for 1q23 supported the previously observed linkage, with an estimated multipoint PPL of 99.7%. However, the PPL supported two further results: a second peak on chromosome 1 at 1p13 with a multipoint with PPL of 70% and a chromosome 17 marker (D17S784 at 17q25) with a multipoint PPL of 44%. CONCLUSIONS: The PPL-based analysis presented has the advantage over other likelihood-based linkage methods in that it avoids maximization and produces a less complex view of the strength of evidence for linkage.  相似文献   

2.
This Genetic Analysis Workshop 13 contribution presents a linkage analysis of hypertension in the Framingham data based on the posterior probability of linkage, or PPL. We dichotomized the phenotype, coding individuals who had been treated for hypertension at any time, as well as those with repeated high blood pressure measurements, as affected. Here we use a new variation on the multipoint PPL that incorporates integration over the genetic model. PPLs were computed for chromosomes 1 through 5, 11, 14, and 17 and remained below the 2% assumed prior probability of linkage for 73% of the locations examined. The maximum PPL of 4.5% was obtained on chromosome 1 at 178 cM. Although this is more than twice the assumed prior probability of linkage, it is well below a level at which we would recommend committing substantial additional resources to molecular follow-up. While the PPL analysis of this data remains inconclusive, Bayesian methodology gives us a clear mechanism for using the information gained here in further studies.  相似文献   

3.
The posterior probability of linkage (PPL) statistic has been developed as a method for the rigorous accumulation of evidence for or against linkage allowing for both intra- and inter-sample heterogeneity. To date, the method has assumed linkage equilibrium between alleles at the trait locus and the marker locus. We now generalize the PPL to allow for linkage disequilibrium (LD), by incorporating variable phase probabilities into the underlying linkage likelihood. This enables us to recover the marginal posterior density of the recombination fraction, integrating out nuisance parameters of the trait model, including the locus heterogeneity (admixture) parameter, as well as a vector of LD parameters. The marginal posterior density can then be updated across data subsets or new data as they become available, while allowing parameters of the trait model to vary between data sets. The method applies immediately to general pedigree structures and to markers with multiple alleles. In the case of SNPs, the likelihood is parameterized in terms of the standard single LD parameter D'; and it therefore affords a mechanism for estimation of D' between the marker and the trait, again, without fixing the parameters of the trait model and allowing for updating across data sets. It is even possible to allow for a different associated allele in different populations, while accumulating information regarding the strength of LD. While a computationally efficient implementation for multi-allelic markers is still in progress, we have implemented a version of this new LD-PPL for SNPs and evaluated its performance in nuclear families. Our simulations show that LD-PPLs tend to be larger than PPLs (stronger evidence in favor of linkage/LD) with increased LD level, under a variety of generating models; while in the absence of linkage and LD, LD-PPLs tend to be smaller than PPLs (stronger evidence against linkage). The estimate of D' also behaves well even in relatively small, heterogeneous samples.  相似文献   

4.
OBJECTIVE: We continue statistical development of the posterior probability of linkage (PPL). We present a two-point PPL allowing for unequal male and female recombination fractions, thetaM and thetaF, and consider alternative priors on thetaM, thetaF. METHODS: We compare the sex-averaged PPL (PPLSA), assuming thetaM = thetaF, to the sex-specific PPL (PPLSS) in (thetaM, thetaF), in a series of simulations; we also compute the PPLSS using alternative priors on (thetaM, thetaF). RESULTS: The PPLSS based on a prior that ignores prior genomic information on sex specific recombination rates performs essentially identically to the PPLSA, even in the presence of large thetaM, thetaF differences. Moreover, adaptively skewing the prior, to incorporate (correct) genomic information on thetaM, thetaF differences, actually worsens performance of the PPLSS. We demonstrate that this has little to do with the PPLSS per se, but is rather due to extremely high levels of variability in the location of the maximum likelihood estimates of (thetaM, thetaF) in realistic data sets. CONCLUSIONS: Incorporating (correct) prior genomic information is not always helpful. We recommend that the PPLSA be used as the standard form of the PPL regardless of the sex-specific recombination rates in the region of the marker in question.  相似文献   

5.
Children who fail to develop language normally-in the absence of explanatory factors such as neurological disorders, hearing impairment, or lack of adequate opportunity-are clinically described as having specific language impairment (SLI). SLI has a prevalence of approximately 7% in children entering school and is associated with later difficulties in learning to read. Research indicates that genetic factors are important in the etiology of SLI. Studies have consistently demonstrated that SLI aggregates in families. Increased monozygotic versus dizygotic twin concordance rates indicate that heredity, not just shared environment, is the cause of the familial clustering. We have collected five pedigrees of Celtic ancestry that segregate SLI, and we have conducted genomewide categorical linkage analysis, using model-based LOD score techniques. Analysis was conducted under both dominant and recessive models by use of three phenotypic classifications: clinical diagnosis, language impairment (spoken language quotient <85) and reading discrepancy (nonverbal IQ minus non-word reading >15). Chromosome 13 yielded a maximum multipoint LOD score of 3.92 under the recessive reading discrepancy model. Simulation to correct for multiple models and multiple phenotypes indicated that the genomewide empirical P value is <.01. As an alternative measure, we also computed the posterior probability of linkage (PPL), obtaining a PPL of 53% in the same region. One other genomic region yielded suggestive results on chromosome 2 (multipoint LOD score 2.86, genomic P value <.06 under the recessive language impairment model). Our findings underscore the utility of traditional LOD-score-based methods in finding genes for complex diseases, specifically, SLI.  相似文献   

6.
In complex disease studies, it is crucial to perform multipoint linkage analysis with many markers and to use robust nonparametric methods that take account of all pedigree information. Currently available methods fall short in both regards. In this paper, we describe how to extract complete multipoint inheritance information from general pedigrees of moderate size. This information is captured in the multipoint inheritance distribution, which provides a framework for a unified approach to both parametric and nonparametric methods of linkage analysis. Specifically, the approach includes the following: (1) Rapid exact computation of multipoint LOD scores involving dozens of highly polymorphic markers, even in the presence of loops and missing data. (2) Non-parametric linkage (NPL) analysis, a powerful new approach to pedigree analysis. We show that NPL is robust to uncertainty about mode of inheritance, is much more powerful than commonly used nonparametric methods, and loses little power relative to parametric linkage analysis. NPL thus appears to be the method of choice for pedigree studies of complex traits. (3) Information-content mapping, which measures the fraction of the total inheritance information extracted by the available marker data and points out the regions in which typing additional markers is most useful. (4) Maximum-likelihood reconstruction of many-marker haplotypes, even in pedigrees with missing data. We have implemented NPL analysis, LOD-score computation, information-content mapping, and haplotype reconstruction in a new computer package, GENEHUNTER. The package allows efficient multipoint analysis of pedigree data to be performed rapidly in a single user-friendly environment.  相似文献   

7.
Sib-pair analysis is an increasingly important tool for genetic dissection of complex traits. Current methods for sib-pair analysis are primarily based on studying individual genetic markers one at a time and thus fail to use the full inheritance information provided by multipoint linkage analysis. In this paper, we describe how to extract the complete multipoint inheritance information for each sib pair. We then describe methods that use this information to map loci affecting traits, thereby providing a unified approach to both qualitative and quantitative traits. Specifically, complete multipoint approaches are presented for (1) exclusion mapping of qualitative traits; (2) maximum-likelihood mapping of qualitative traits; (3) information-content mapping, showing the extent to which all inheritance information has been extracted at each location in the genome; and (4) quantitative-trait mapping, by two parametric methods and one nonparametric method. In addition, we explore the effects of marker density, marker polymorphism, and availability of parents on the information content of a study. We have implemented the analysis methods in a new computer package, MAPMAKER/SIBS. With this computer package, complete multipoint analysis with dozens of markers in hundreds of sib pairs can be carried out in minutes.  相似文献   

8.
Computational constraints currently limit exact multipoint linkage analysis to pedigrees of moderate size. We introduce new algorithms that allow analysis of larger pedigrees by reducing the time and memory requirements of the computation. We use the observed pedigree genotypes to reduce the number of inheritance patterns that need to be considered. The algorithms are implemented in a new version (version 2.1) of the software package GENEHUNTER. Performance gains depend on marker heterozygosity and on the number of pedigree members available for genotyping, but typically are 10-1,000-fold, compared with the performance of the previous release (version 2.0). As a result, families with up to 30 bits of inheritance information have been analyzed, and further increases in family size are feasible. In addition to computation of linkage statistics and haplotype determination, GENEHUNTER can also perform single-locus and multilocus transmission/disequilibrium tests. We describe and implement a set of permutation tests that allow determination of empirical significance levels in the presence of linkage disequilibrium among marker loci.  相似文献   

9.
Multipoint quantitative-trait linkage analysis in general pedigrees.   总被引:49,自引:12,他引:37       下载免费PDF全文
Multipoint linkage analysis of quantitative-trait loci (QTLs) has previously been restricted to sibships and small pedigrees. In this article, we show how variance-component linkage methods can be used in pedigrees of arbitrary size and complexity, and we develop a general framework for multipoint identity-by-descent (IBD) probability calculations. We extend the sib-pair multipoint mapping approach of Fulker et al. to general relative pairs. This multipoint IBD method uses the proportion of alleles shared identical by descent at genotyped loci to estimate IBD sharing at arbitrary points along a chromosome for each relative pair. We have derived correlations in IBD sharing as a function of chromosomal distance for relative pairs in general pedigrees and provide a simple framework whereby these correlations can be easily obtained for any relative pair related by a single line of descent or by multiple independent lines of descent. Once calculated, the multipoint relative-pair IBDs can be utilized in variance-component linkage analysis, which considers the likelihood of the entire pedigree jointly. Examples are given that use simulated data, demonstrating both the accuracy of QTL localization and the increase in power provided by multipoint analysis with 5-, 10-, and 20-cM marker maps. The general pedigree variance component and IBD estimation methods have been implemented in the SOLAR (Sequential Oligogenic Linkage Analysis Routines) computer package.  相似文献   

10.
We performed multipoint linkage analysis using 83 markers from the SNP Consortium (TSC) SNP linkage map in 3 regions covering 190 cM previously scanned with microsatellite markers and found to be linked to type 2 diabetes. Since the average linkage disequilibrium present in the TSC SNP marker clusters is relatively low, we assumed the intracluster genetic distances were a reasonable small nonzero distance (0.03 cM) and performed linkage analysis using GENEHUNTER PLUS and ASM linkage analysis software. We found that for the pedigree structures and missing data patterns in our samples the average information content in all three regions and the LOD score curves in two regions obtained from the TSC SNP markers were similar to results obtained from microsatellite marker maps with 10 cM average spacing. We also give an algorithm which extends the Lander-Green algorithm to permit multipoint linkage analysis of clusters of tightly linked markers with arbitrarily high levels of intracluster linkage disequilibrium.  相似文献   

11.
Both theoretical and applied studies have proven that the utility of single nucleotide polymorphism (SNP) markers in linkage analysis is more powerful and cost-effective than current microsatellite marker assays. Here we performed a whole-genome scan on 115 White, non-Hispanic families segregating for alcohol dependence, using one 10.3-cM microsatellite marker set and two SNP data sets (0.33-cM, 0.78-cM spacing). Two definitions of alcohol dependence (ALDX1 and ALDX2) were used. Our multipoint nonparametric linkage analysis found alcoholism was nominal linked to 12 genomic regions. The linkage peaks obtained by using the microsatellite marker set and the two SNP sets had a high degree of correspondence in general, but the microsatellite marker set was insufficient to detect some nominal linkage peaks. The presence of linkage disequilibrium between markers did not significantly affect the results. Across the entire genome, SNP datasets had a much higher average linkage information content (0.33 cM: 0.93, 0.78 cM: 0.91) than did microsatellite marker set (0.57). The linkage peaks obtained through two SNP datasets were very similar with some minor differences. We conclude that genome-wide linkage analysis by using approximately 5,000 SNP markers evenly distributed across the human genome is sufficient and might be more powerful than current 10-cM microsatellite marker assays.  相似文献   

12.
Obesity is an increasingly serious health problem in the world. Body mass index (BMI), percentage fat mass, and body fat mass are important indices of obesity. For a sample of pedigrees that contains >10,000 relative pairs (including 1,249 sib pairs) that are useful for linkage analyses, we performed a whole-genome linkage scan, using 380 microsatellite markers to identify genomic regions that may contain quantitative-trait loci (QTLs) for obesity. Each pedigree was ascertained through a proband who has extremely low bone mass, which translates into a low BMI. A major QTL for BMI was identified on 2q14 near the marker D2S347 with a LOD score of 4.04 in two-point analysis and a maximum LOD score (MLS) of 4.44 in multipoint analysis. The genomic region near 2q14 also achieved an MLS >2.0 for percentage of fat mass and body fat mass. For the putative QTL on 2q14, as much as 28.2% of BMI variation (after adjustment for age and sex) may be attributable to this locus. In addition, several other genomic regions that may contain obesity-related QTLs are suggested. For example, 1p36 near the marker D1S468 may contain a QTL for BMI variation, with a LOD score of 2.75 in two-point analysis and an MLS of 2.09 in multipoint analysis. The genomic regions identified in this and earlier reports are compared for further exploration in extension studies that use larger samples and/or denser markers for confirmation and fine-mapping studies, to eventually identify major functional genes involved in obesity.  相似文献   

13.
In the present study we propose a multipoint approach, for the mapping of genes, that is based on the case-parent trio design. We first derive an expression for the expected preferential-allele-transmission statistics for transmission, from either parent to an affected child, for an arbitrary location within a chromosomal region demarcated by several genetic markers. No assumption about genetic mechanism is needed in this derivation, beyond the assumption that no more than one disease gene lies in the region framed by the markers. When one builds on this representation, the way in which one may maximize the genetic information from multiple markers becomes obvious. This proposed method differs from the popular transmission/disequilibrium test (TDT) approach for fine mapping, in the following ways: First, in contrast with the TDT approach, all markers contribute information, regardless of whether the parents are heterozygous at any one marker, and incomplete trio data can be utilized in our approach. Second, rather than performing the TDT at each marker separately, we propose a single test statistic that follows a chi(2) distribution with 1 df, under the null hypothesis of no linkage or linkage disequilibrium to the region. Third, in the presence of linkage evidence, we offer a means to estimate the location of the disease locus along with its sampling uncertainty. We illustrate the proposed method with data from a family study of asthma, conducted in Barbados.  相似文献   

14.
This paper describes the software package KELVIN, which supports the PPL (posterior probability of linkage) framework for the measurement of statistical evidence in human (or more generally, diploid) genetic studies. In terms of scope, KELVIN supports two-point (trait-marker or marker-marker) and multipoint linkage analysis, based on either sex-averaged or sex-specific genetic maps, with an option to allow for imprinting; trait-marker linkage disequilibrium (LD), or association analysis, in case-control data, trio data, and/or multiplex family data, with options for joint linkage and trait-marker LD or conditional LD given linkage; dichotomous trait, quantitative trait and quantitative trait threshold models; and certain types of gene-gene interactions and covariate effects. Features and data (pedigree) structures can be freely mixed and matched within analyses. The statistical framework is specifically tailored to accumulate evidence in a mathematically rigorous way across multiple data sets or data subsets while allowing for multiple sources of heterogeneity, and KELVIN itself utilizes sophisticated software engineering to provide a powerful and robust platform for studying the genetics of complex disorders.  相似文献   

15.
It is well known that genotyping errors lead to loss of power in gene-mapping studies and underestimation of the strength of correlations between trait- and marker-locus genotypes. In two-point linkage analysis, these errors can be absorbed in an inflated recombination-fraction estimate, leaving the test statistic quite robust. In multipoint analysis, however, genotyping errors can easily result in false exclusion of the true location of a disease-predisposing gene. In a companion article, we described a "complex-valued" extension of the recombination fraction to accommodate errors in the assignment of trait-locus genotypes, leading to a multipoint LOD score with the same robustness to errors in trait-locus genotypes that is seen with the conventional two-point LOD score. Here, a further extension of this model to "hypercomplex-valued" recombination fractions (hereafter referred to as "hypercomplex recombination fractions") is presented, to handle random and systematic sources of marker-locus genotyping errors. This leads to a multipoint method (either "model-based" or "model-free") with the same robustness to marker-locus genotyping errors that is seen with conventional two-point analysis but with the advantage that multiple marker loci can be used jointly to increase meiotic informativeness. The cost of this increased robustness is a decrease in fine-scale resolution of the estimated map location of the trait locus, in comparison with traditional multipoint analysis. This probability model further leads to algorithms for the estimation of the lower bounds for the error rates for genomewide and locus-specific genotyping, based on the null-hypothesis distribution of the LOD-score statistic in the presence of such errors. It is argued that those genome scans in which the LOD score is 0 for >50% of the genome are likely to be characterized by high rates of genotyping errors in general.  相似文献   

16.
Specific language impairment is a neurodevelopmental disorder characterized by impairments essentially restricted to the domain of language and language learning skills. This contrasts with autism, which is a pervasive developmental disorder defined by multiple impairments in language, social reciprocity, narrow interests and/or repetitive behaviors. Genetic linkage studies and family data suggest that the two disorders may have genetic components in common. Two samples, from Canada and the US, selected for specific language impairment were genotyped at loci where such common genes are likely to reside. Significant evidence for linkage was previously observed at chromosome 13q21 in our Canadian sample (HLOD 3.56) and was confirmed in our US sample (HLOD 2.61). Using the posterior probability of linkage (PPL) to combine evidence for linkage across the two samples yielded a PPL over 92%. Two additional loci on chromosome 2 and 7 showed weak evidence for linkage. However, a marker in the cystic fibrosis transmembrane conductance regulator (7q31) showed evidence for association to SLI, confirming results from another group (O'Brien et al. 2003). Our results indicate that using samples selected for components of the autism phenotype may be a useful adjunct to autism genetics.  相似文献   

17.
Results of autism linkage studies have been difficult to interpret across research groups, prompting the use of ever-increasing sample sizes to increase power. However, increasing sample size by pooling disparate collections for a single analysis may, in fact, not increase power in the face of genetic heterogeneity. Here, we applied the posterior probability of linkage (PPL), a method designed specifically to analyze multiple heterogeneous data sets, to the Autism Genetic Resource Exchange collection of families by analyzing six clinically defined subsets of the data and updating the PPL sequentially over the subsets. Our results indicate a substantial probability of linkage to chromosome 1, which had been previously overlooked; our findings also provide a further characterization of the possible parent-of-origin effects at the 17q11 locus that were previously described in this sample. This analysis illustrates that the way in which heterogeneity is addressed in linkage analysis can dramatically affect the overall conclusions of a linkage study.  相似文献   

18.
Linkage disequilibrium testing when linkage phase is unknown   总被引:2,自引:0,他引:2  
Schaid DJ 《Genetics》2004,166(1):505-512
Linkage disequilibrium, the nonrandom association of alleles from different loci, can provide valuable information on the structure of haplotypes in the human genome and is often the basis for evaluating the association of genomic variation with human traits among unrelated subjects. But, linkage phase of genetic markers measured on unrelated subjects is typically unknown, and so measurement of linkage disequilibrium, and testing whether it differs significantly from the null value of zero, requires statistical methods that can account for the ambiguity of unobserved haplotypes. A common method to test whether linkage disequilibrium differs significantly from zero is the likelihood-ratio statistic, which assumes Hardy-Weinberg equilibrium of the marker phenotype proportions. We show, by simulations, that this approach can be grossly biased, with either extremely conservative or liberal type I error rates. In contrast, we use simulations to show that a composite statistic, proposed by Weir and Cockerham, maintains the correct type I error rates, and, when comparisons are appropriate, has similar power as the likelihood-ratio statistic. We extend the composite statistic to allow for more than two alleles per locus, providing a global composite statistic, which is a strong competitor to the usual likelihood-ratio statistic.  相似文献   

19.
OBJECTIVES: Describe the inflation in nonparametric multipoint LOD scores due to inter-marker linkage disequilibrium (LD) across many markers with varied allele frequencies. METHOD: Using simulated two-generation families with and without parents, we conducted nonparametric multipoint linkage analysis with 2 to 10 markers with minor allele frequencies (MAF) of 0.5 and 0.1. RESULTS: Misspecification of population haplotype frequencies by assuming linkage equilibrium caused inflated multipoint LOD scores due to inter-marker LD when parental genotypes were not included. Inflation increased as more markers in LD were included and decreased as markers in equilibrium were added. When marker allele frequencies were unequal, the r2 measure of LD was a better predictor of inflation than D'. CONCLUSION: This observation strongly supports the evaluation of LD in multipoint linkage analyses, and further suggests that unaccounted for LD may be suspected when two-point and multipoint linkage analyses show a marked disparity in regions with elevated r2 measures of LD. Given the increasing popularity of high-density genome-wide SNP screens, inter-marker LD should be a concern in future linkage studies.  相似文献   

20.
The identification of genes contributing to complex diseases and quantitative traits requires genetic data of high fidelity, because undetected errors and mutations can profoundly affect linkage information. The recent emphasis on the use of the sibling-pair design eliminates or decreases the likelihood of detection of genotyping errors and marker mutations through apparent Mendelian incompatibilities or close double recombinants. In this article, we describe a hidden Markov method for detecting genotyping errors and mutations in multilocus linkage data. Specifically, we calculate the posterior probability of genotyping error or mutation for each sibling-pair-marker combination, conditional on all marker data and an assumed genotype-error rate. The method is designed for use with sibling-pair data when parental genotypes are unavailable. Through Monte Carlo simulation, we explore the effects of map density, marker-allele frequencies, marker position, and genotype-error rate on the accuracy of our error-detection method. In addition, we examine the impact of genotyping errors and error detection and correction on multipoint linkage information. We illustrate that even moderate error rates can result in substantial loss of linkage information, given efforts to fine-map a putative disease locus. Although simulations suggest that our method detects 相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号