首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Family data teamed with the transmission/disequilibrium test (TDT), which simultaneously evaluates linkage and association, is a powerful means of detecting disease-liability alleles. To increase the information provided by the test, various researchers have proposed TDT-based methods for haplotype transmission. Haplotypes indeed produce more-definitive transmissions than do the alleles comprising them, and this tends to increase power. However, the larger number of haplotypes, relative to alleles at individual loci, tends to decrease power, because of the additional degrees of freedom required for the test. An optimal strategy would focus the test on particular haplotypes or groups of haplotypes. In this report we develop such an approach by combining the theory of TDT with that of measured haplotype analysis (MHA). MHA uses the evolutionary relationships among haplotypes to produce a limited set of hypothesis tests and to increase the interpretability of these tests. The theory of our approach, called the "evolutionary tree" (ET)-TDT, is developed for two cases: when haplotype transmission is certain and when it is not. Simulations show the ET-TDT can be more powerful than other proposed methods under reasonable conditions. More importantly, our results show that, when multiple polymorphisms are found within the gene, the ET-TDT can be useful for determining which polymorphisms affect liability.  相似文献   

2.
Transmission/Disequilibrium Tests for Extended Marker Haplotypes   总被引:11,自引:0,他引:11       下载免费PDF全文
A generalization of the transmission/disequilibrium test to detect association between polymorphic markers and discrete or quantitative traits is discussed, with particular emphasis on marker haplotypes formed by several adjacent loci. Furthermore, strategies for testing haplotype association, using methods from spatial statistics, are developed. This approach compares the "similarity" of transmitted and untransmitted haplotypes, with the aim of determining the regions where there is greater similarity within the transmitted set. This arises from the fact that, although the original haplotypes carrying the mutation will be broken down by recombination, there may be a subset of markers near the mutation that are common to many of the recombinant haplotypes. Thus, by examination of each marker in turn and by measurement of the average size of the region shared identically by state in the transmitted and untransmitted haplotypes, it may be possible to detect regions of linkage disequilibrium that encompass the susceptibility gene.  相似文献   

3.
It has been demonstrated in the literature that the transmission/disequilibrium test (TDT) has higher power than the affected-sib-pair (ASP) mean test when linkage disequilibrium (LD) is strong but that the mean test has higher power when LD is weak. Thus, for ASP data, it seems clear that the TDT should be used when LD is strong but that the mean test or other linkage tests should be used when LD is weak or absent. However, in practice, it may be difficult to follow such a guideline, because the extent of LD is often unknown. Even with a highly dense genetic-marker map, in which some markers should be located near the disease-predisposing mutation, strong LD is not inevitable. Besides the genetic distance, LD is also affected by many factors, such as the allelic heterogeneity at the disease locus, the initial LD, the allelic frequencies at both disease locus and marker locus, and the age of the mutation. Therefore, it is of interest to develop methods that are adaptive to the extent of LD. In this report, we propose a disequilibrium maximum-binomial-likelihood (DMLB) test that incorporates LD in the maximum-binomial-likelihood (MLB) test. Examination of the corresponding score statistics shows that this method adaptively combines two sources of information: (a) the identity-by-descent (IBD) sharing score, which is informative for linkage regardless of the existence of LD, and (b) the contrast between allele-specific IBD sharing score, which is informative for linkage only in the presence of LD. For ASP data, the proposed test has higher power than either the TDT or the mean test when the extent of LD ranges from moderate to strong. Only when LD is very weak or absent is the DMLB slightly less powerful than the mean test; in such cases, the TDT has essentially no power to detect linkage. Therefore, the DMLB test is an interesting approach to linkage detection when the extent of LD is unknown.  相似文献   

4.
Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies.  相似文献   

5.
Haplotype-based risk models can lead to powerful methods for detecting the association of a disease with a genomic region of interest. In population-based studies of unrelated individuals, however, the haplotype status of some subjects may not be discernible without ambiguity from available locus-specific genotype data. A score test for detecting haplotype-based association using genotype data has been developed in the context of generalized linear models for analysis of data from cross-sectional and retrospective studies. In this article, we develop a test for association using genotype data from cohort and nested case-control studies where subjects are prospectively followed until disease incidence or censoring (end of follow-up) occurs. Assuming a proportional hazard model for the haplotype effects, we derive an induced hazard function of the disease given the genotype data, and hence propose a test statistic based on the associated partial likelihood. The proposed test procedure can account for differential follow-up of subjects, can adjust for possibly time-dependent environmental co-factors and can make efficient use of valuable age-at-onset information that is available on cases. We provide an algorithm for computing the test statistic using readily available statistical software. Utilizing simulated data in the context of two genomic regions GPX1 and GPX3, we evaluate the validity of the proposed test for small sample sizes and study its power in the presence and absence of missing genotype data.  相似文献   

6.
Individuals who share a disease mutation from a common ancestor often share alleles at genetic markers adjacent to the mutation, even if the common ancestor is remote. The alleles at these adjacent markers, called the haplotype, can be visualized as a string of realizations of random variables, which may be dependent when individuals are related in some fashion. Ideally, for a sample of individuals all having the same (genetic) disease, this dependence-measured as haplotype-sharing-will be greater in the vicinity of disease genes than in other regions of the genome. In this paper we present a semiparametric test for haplotype-sharing. We begin by developing a model assuming that the ancestral haplotype is known and thus the extent of haplotype-sharing from a common ancestor can be determined unambiguously. The amount of overlap at markers far from the disease is treated as a random variable with an unknown distribution F, which we estimate non-parametrically. Overlap of markers surrounding disease genes are modeled as a mixture pF(x - theta) + (1 - p)F(x), in which p is the fraction of subjects with the disease mutation. Testing for a disease gene then amounts to testing whether p = 0. Next we drop the assumption that the ancestral haplotype is known. To detect excess clustering of haplotypes, we measure the pairwise overlap of a set of haplotypes. As in the simpler scenario, this distribution is modeled as a location-shift mixture. To test the hypothesis we construct a score test with a simple limiting distribution.  相似文献   

7.
In case-control studies, genetic associations for complex diseases may be probed either with single-locus tests or with haplotype-based tests. Although there are different views on the relative merits and preferences of the two test strategies, haplotype-based analyses are generally believed to be more powerful to detect genes with modest effects. However, a main drawback of haplotype-based association tests is the large number of distinct haplotypes, which increases the degrees of freedom for corresponding test statistics and thus reduces the statistical power. To decrease the degrees of freedom and enhance the efficiency and power of haplotype analysis, we propose an improved haplotype clustering method that is based on the haplotype cladistic analysis developed by Durrant et al. In our method, we attempt to combine the strengths of single-locus analysis and haplotype-based analysis into one single test framework. Novel in our method is that we develop a more informative haplotype similarity measurement by using p-values obtained from single-locus association tests to construct a measure of weight, which to some extent incorporates the information of disease outcomes. The weights are then used in computation of similarity measures to construct distance metrics between haplotype pairs in haplotype cladistic analysis. To assess our proposed new method, we performed simulation analyses to compare the relative performances of (1) conventional haplotype-based analysis using original haplotype, (2) single-locus allele-based analysis, (3) original haplotype cladistic analysis (CLADHC) by Durrant et al., and (4) our weighted haplotype cladistic analysis method, under different scenarios. Our weighted cladistic analysis method shows an increased statistical power and robustness, compared with the methods of haplotype cladistic analysis, single-locus test, and the traditional haplotype-based analyses. The real data analyses also show that our proposed method has practical significance in the human genetics field.  相似文献   

8.
Studies using haplotypes of multiple tightly linked markers are more informative than those using a single marker. However, studies based on multimarker haplotypes have some difficulties. First, if we consider each haplotype as an allele and use the conventional single-marker transmission/disequilibrium test (TDT), then the rapid increase in the degrees of freedom with an increasing number of markers means that the statistical power of the conventional tests will be low. Second, the parental haplotypes cannot always be unambiguously reconstructed. In the present article, we propose a haplotype-sharing TDT (HS-TDT) for linkage or association between a disease-susceptibility locus and a chromosome region in which several tightly linked markers have been typed. This method is applicable to both quantitative traits and qualitative traits. It is applicable to any size of nuclear family, with or without ambiguous phase information, and it is applicable to any number of alleles at each of the markers. The degrees of freedom (in a broad sense) of the test increase linearly as the number of markers considered increases but do not increase as the number of alleles at the markers increases. Our simulation results show that the HS-TDT has the correct type I error rate in structured populations and that, in most cases, the power of HS-TDT is higher than the power of the existing single-marker TDTs and haplotype-based TDTs.  相似文献   

9.
Type 1 diabetes is a T-cell-mediated chronic disease characterized by the autoimmune destruction of pancreatic insulin-producing beta cells and complete insulin deficiency. It is the result of a complex interrelation of genetic and environmental factors, most of which have yet to be identified. Simultaneous identification of these genetic factors, through use of unphased genotype data, has received increasing attention in the past few years. Several approaches have been described, such as the modified transmission/disequilibrium test procedure, the conditional extended transmission/disequilibrium test, and the stepwise logistic-regression procedure. These approaches are limited either by being restricted to family data or by ignoring so-called "haplotype interactions" between alleles. To overcome this limit, the present study provides a general method to identify, on the basis of unphased genotype data, the haplotype blocks that interact to define the risk for a complex disease. The principle underpinning the proposal is minimal entropy. The performance of our procedure is illustrated for both simulated and real data. In particular, for a set of Dutch type 1 diabetes data, our procedure suggests some novel evidence of the interactions between and within haplotype blocks that are across chromosomes 1, 2, 3, 4, 5, 6, 7, 8, 11, 12, 15, 16, 17, 19, and 21. The results demonstrate that, by considering interactions between potential disease haplotype blocks, we may succeed in identifying disease-predisposing genetic variants that might otherwise have remained undetected.  相似文献   

10.
When the mode of inheritance of a disease is unknown, the LOD-score method of linkage analysis must take into account uncertainties in model parameters. We have previously proposed a parametric linkage test called "MFLOD," which does not require specification of disease model parameters. In the present study, we introduce two new model-free parametric linkage tests, known as "MLOD" and "MALOD." These tests are defined, respectively, as the LOD score and the admixture LOD score, maximized (subject to the same constraints as MFLOD) over disease-model parameters. We compared the power of these three parametric linkage tests and that of two nonparametric linkage tests, NPLall and NPLpairs, which are implemented in GENEHUNTER. With the use of small pedigrees and a fully informative marker, we found the powers of MLOD, NPLall, and NPLpairs to be almost equivalent to each other and not far below that of a LOD-score analysis performed under the assumption the correct genetic parameters. Thus, linkage analysis is not much hindered by uncertain mode of inheritance. The results also suggest that both parametric and nonparametric methods are suitable for linkage analysis of complex disorders in small pedigrees. However, whether these results apply to large pedigrees remains to be answered.  相似文献   

11.
Genetic determinants of the degree of obesity and body fat distribution have been demonstrated by family studies. The heritability has been estimated to be in the range 0.2–0.7. Mutation leading to obesity in humans has been described for only two genes, one of them the leptin gene. The leptin gene codes for a cytokine secreted by fat cells that binds to the leptin receptor (Lep-R), which exerts some of its biological functions by expression in the brain. Hence, the Lep-R gene appears to be a promising candidate for the determination of obesity in humans. We isolated genomic DNA clones from the Lep-R gene region and identified a new polymorphic microsatellite marker (OBR-CA) within 80 kb of the translation start of Lep-R. We genotyped this and a second, intragenic microsatellite marker (D1S2852) in 130 nuclear families consisting of extremely obese children and adolescents and both parents. Using the most frequent parental allele of both markers, our analysis revealed a significant transmission disequilibrium for the 266-bp allele of D1S2852 (corrected P-value=0.042). No significant result was obtained with the most frequent allele of OBR-CA (corrected P-value=1.0). However, two rare alleles showed transmission disequilibrium and were subsequently used for constructing a haplotype with the 266-bp allele. This haplotype had a transmission rate of 80% (nominal P-value=0.02). In order to identify the underlying mutation, we sequenced all coding exons of Lep-R and the partially overlapping gene encoding the obese receptor gene-related protein (ob-rgrp) in individuals carrying this haplotype. We found one new mutation (Ser675Thr) in the Lep-R gene in one proband and several other mutations known to be not associated with obesity in other study groups. As this new mutation cannot explain our positive linkage result, the transmission disequilibrium of the 266-bp allele and the high transmission rate of the identified haplotype point towards a mutation in close proximity to marker D1S2852. Received: 9 March 1998 / Accepted: 17 July 1998  相似文献   

12.
This work introduces the use of an interval representation of fluxes. This representation can be useful in two common situations: (a) when fluxes are uncertain due to the lack of accurate measurements and (b) when the flux distribution is partially unknown. In addition, the interval representation can be used for other purposes such as dealing with inconsistency or representing a range of behaviour. Two main problems are addressed. On the one hand, the translation of a metabolic flux distribution into an elementary modes or extreme pathways activity pattern is analysed. In general, there is not a unique solution for this problem but a range of solutions. To represent the whole solution region in an easy way, it is possible to compute the alpha-spectrum (i.e., the range of possible values for each elementary mode or extreme pathway activity). Herein, a method is proposed which, based on the interval representation of fluxes, makes it possible to compute the alpha-spectrum from an uncertain or even partially unknown flux distribution. On the other hand, the concept of the flux-spectrum is introduced as a variant of the metabolic flux analysis methodology that presents some advantages: applicable when measurements are insufficient (underdetermined case), integration of uncertain measurements, inclusion of irreversibility constraints and an alternative procedure to deal with inconsistency. Frequently, when applying metabolic flux analysis the available measurements are insufficient and/or uncertain and the complete flux distribution cannot be uniquely calculated. The method proposed here allows the determination of the ranges of possible values for each non-calculable flux, resulting in a flux region called flux-spectrum. In order to illustrate the proposed methods, the example of the metabolic network of CHO cells cultivated in stirred flasks is used.  相似文献   

13.
Cells can sense and respond to mechanical signals over relatively long distances across fibrous extracellular matrices. Recently proposed models suggest that long-range force transmission can be attributed to the nonlinear elasticity or fibrous nature of collagen matrices, yet the mechanism whereby fibers align remains unknown. Moreover, cell shape and anisotropy of cellular contraction are not considered in existing models, although recent experiments have shown that they play crucial roles. Here, we explore all of the key factors that influence long-range force transmission in cell-populated collagen matrices: alignment of collagen fibers, responses to applied force, strain stiffening properties of the aligned fibers, aspect ratios of the cells, and the polarization of cellular contraction. A constitutive law accounting for mechanically driven collagen fiber reorientation is proposed. We systematically investigate the range of collagen-fiber alignment using both finite-element simulations and analytical calculations. Our results show that tension-driven collagen-fiber alignment plays a crucial role in force transmission. Small critical stretch for fiber alignment, large fiber stiffness and fiber strain-hardening behavior enable long-range interaction. Furthermore, the range of collagen-fiber alignment for elliptical cells with polarized contraction is much larger than that for spherical cells with diagonal contraction. A phase diagram showing the range of force transmission as a function of cell shape and polarization and matrix properties is presented. Our results are in good agreement with recent experiments, and highlight the factors that influence long-range force transmission, in particular tension-driven alignment of fibers. Our work has important relevance to biological processes including development, cancer metastasis, and wound healing, suggesting conditions whereby cells communicate over long distances.  相似文献   

14.
Recently, several authors have proposed that the availability of intermediate hosts (IHs) for definitive hosts (DHs) may contribute to determining the dynamics and evolutionary ecology of parasites with facultative complex life cycles. The protozoa Toxoplasma gondii may be transmitted to DHs either via predation of infected IHs through a complex life cycle (CLC) or directly from a contaminated environment through a simple life cycle (SLC). This parasite is also present in contrasting host density environments. We tested the hypothesis that the relative contributions of the CLC and SLC along an urban-rural gradient depend on the IH supply. We built and analysed a deterministic model of the T. gondii transmission cycle. The SLC relative contribution is important only in urban-type environments, i.e., with low predation rate on IHs. In contrast, the parasite is predominantly transmitted through a CLC in suburban and rural environments. The association of the two cycles enables the parasite to spread in situations of low IH availability and low DH population size for which each cycle alone is insufficient.  相似文献   

15.
OBJECTIVES: Linkage disequilibrium (LD) between closely spaced SNPs can be accommodated in linkage analysis by specifying the multi-SNP haplotype frequencies, if known. Phased haplotypes in candidate regions can provide gold standard haplotype frequency estimates, and may be of inherent interest as markers. We evaluated the effects of different methods of haplotype frequency estimation, and the use of marker phase information, on linkage analysis of a multi-SNP cluster in a candidate region for Alzheimer's disease (AD). METHODS: We performed parametric linkage analysis of a five-SNP cluster in extended pedigrees to compare the use of: (1) haplotype frequencies estimated by molecular phase determination, maximum likelihood estimation, or by assuming linkage equilibrium (LE); (2) AD families or controls as the frequency source; and (3) unphased or molecularly phased SNP data. RESULTS: There was moderate to strong pairwise LD among the five SNPs. Falsely assuming LE substantially inflated the LOD score, but the method of haplotype frequency estimation and particular sample used made little difference provided that LD was accommodated. Use of phased haplotypes produced a modest increase in the LOD score over unphased SNPs. CONCLUSIONS: Ignoring LD between markers can lead to substantially inflated evidence for linkage in LOD score analysis of extended pedigrees with missing data. Use of marker phase information in linkage analysis may be important in disease studies where the costs of family recruitment and phenotyping greatly exceed the costs of phase determination.  相似文献   

16.
Transmission ratio distortion is a characteristic of complete t-haplotypes, such that heterozygous males preferentially transmit the t-haplotype bearing chromosome 17 to the majority of their progeny. At least two genes contained within the t-haplotype have been identified as being required for such high transmission ratios. In this study we examine the effects of the genetic background and the chromosome homologous to the t-haplotype on transmission ratio distortion. We use two different congenic lines: BTBRTF/Nev.Ttf/t12, in which the t12 haplotype has a transmission ratio of 52%, and C3H/DiSn.Ttf/t12, in which the t12 haplotype has a transmission ratio of 99%. By intercrossing these two strains to produce reciprocal F1 and F2 generations, we have isolated the effects of the homologous chromosome 17 from the effects of the genetic background. We demonstrate that both the homologous chromosome and the genetic background have profound effects on t-haplotype transmission ratio distortion. Furthermore, it is evident that the t-haplotype transmission ratio behaves as a quantitative character rather than an intrinsic property of t-haplotypes.  相似文献   

17.
An allele of the mouse brachyury locus, T22H, had been shown previously to involve a deletion of several markers in the proximal part of chromosome 17, and almost certainly includes deletion of the t-complex distorter gene Tcd-1. The effects of T22H on transmission ratio distortion and male sterility caused by the t-complex were compared with those of a partial t-haplotype th51, which carries the t-form of the distorter Tcd-1t. In combination with the complete haplotype tw32, T22H caused severe impairment of male fertility, but males of genotype T22H/t6 or T22H/th51 were normally fertile. These results were very similar to those obtained when th51 was in combination with the same haplotypes. In effect on transmission ratio T22H was again similar to th51, in that it produced a marked increase in the transmission of the haplotype t6. To test whether the effects of T22H were due to deletion of elements other than Tcd-1, the effect of T22H on transmission of the partial haplotype th2 was compared with that of the deletion Thp. Again T22H markedly increased transmission of the t-haplotype and the effect was significantly greater than the small effect produced by Thp. It is concluded that deletion of the distorter Tcd-1 has an effect like that of the t-form of this distorter, Tcd-1t, and hence that Tcd-1t must be an amorph or hypomorph. It is speculated that other t-complex distorters, Tcd-2t and Tcd-3t, may also be amorphs or hypomorphs.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

18.
Many microparasites infect new hosts with specialized life stages, requiring a subset of the parasite population to forgo proliferation and develop into transmission forms. Transmission stage production influences infectivity, host exploitation, and the impact of medical interventions like drug treatment. Predicting how parasites will respond to public health efforts on both epidemiological and evolutionary timescales requires understanding transmission strategies. These strategies can rarely be observed directly and must typically be inferred from infection dynamics. Using malaria as a case study, we test previously described methods for inferring transmission stage investment against simulated data generated with a model of within-host infection dynamics, where the true transmission investment is known. We show that existing methods are inadequate and potentially very misleading. The key difficulty lies in separating transmission stages produced by different generations of parasites. We develop a new approach that performs much better on simulated data. Applying this approach to real data from mice infected with a single Plasmodium chabaudi strain, we estimate that transmission investment varies from zero to 20%, with evidence for variable investment over time in some hosts, but not others. These patterns suggest that, even in experimental infections where host genetics and other environmental factors are controlled, parasites may exhibit remarkably different patterns of transmission investment.  相似文献   

19.
When the transmission/disequilibrium test (TDT) is applied to multilocus haplotypes, a bias may be introduced in some families for which both parents have the same heterozygous genotype at some locus. The bias occurs because haplotypes can only be deduced from certain offspring, with the result that the transmissions of the two parental haplotypes are not independent. We obtain an unbiased TDT for individual haplotypes by calculating the correct variance for the transmission count within a family, using information from multiple siblings if they are available. An existing correction for dependence between siblings in the presence of linkage is retained. To obtain an unbiased multihaplotype TDT, we must either count transmissions from one randomly chosen parent or count all transmissions and estimate the significance level empirically. Alternatively, we may use missing-data techniques to estimate uncertain haplotypes, but these methods are not robust to population stratification. An illustration using data from the insulin-gene region in type 1 diabetes shows that the validity and power of the TDT may vary by an order of magnitude, depending on the method of analysis.  相似文献   

20.
Is Pneumocystis pneumonia (PcP) a transmissible fungal disease? Does nosocomial PcP occur? Is there Pneumocystis transmission in the community? These questions, which could not be tackled before the 2000s, may at present be approached using either noninvasive detection methods or experimental transmission models. Represented by a unique entity (P.?carinii) for almost one century, the Pneumocystis genus was shown to contain several species, being P.?jirovecii the sole species identified in humans hitherto. Molecular methods combined with cross infection experiments revealed strong host specificity that precludes Pneumocystis inter-species transmission. In contrast, respiratory transmission between mammals of a same species is usually highly active, even between immunocompetent hosts. Other transmission ways could also exist. New data show that human being is the unique P.?jirovecii reservoir; it would constitute the sole infection source in both hospital and community.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号