期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Powerful allele sharing statistics for nonparametric linkage analysis

Lange EM Lange K 《Human heredity》2004,57(1):49-58

Nonparametric linkage analysis is widely used to map susceptibility genes for complex diseases. This paper introduces six nonparametric statistics for measuring marker allele sharing among the affected members of a pedigree. We compare the power of these new statistics and three previous statistics to detect linkage with Mendelian diseases having recessive, additive, and dominant modes of inheritance. The nine statistics represent all possible combinations of three different IBD scoring functions and three different schemes for sampling genes among affecteds. Our results strongly suggest that the statistic T(rec)(blocks) is best for recessive traits, while the two statistics T(kin)(pairs) and T(all)(kin) vie for best for an additive trait. The best statistic for a dominant trait is less clear. The statistics T(kin)(pairs) and T(all)(kin) are equally promising for small sibships, but in extended pedigrees the statistics T(dom)(blocks) and T(dom)(pairs) appear best. For a complex trait, we advocate computing several of these statistics. 相似文献

2.

Bias toward the null hypothesis in model-free linkage analysis is highly dependent on the test statistic used

下载免费PDF全文

Cordell HJ 《American journal of human genetics》2004,74(6):1294-1302

Recently, it has been suggested that traditional nonparametric multipoint-linkage procedures can show a "bias" toward the null hypothesis of no effect when there is incomplete information about allele sharing at genotyped marker loci (or at positions in between marker loci). Here, I investigate the extent of this bias for a variety of test statistics commonly used in qualitative- ("affecteds only") and quantitative-trait linkage analysis. Through simulation and analytical derivation, I show that many of the test statistics available in standard linkage analysis packages (such as Genehunter, Merlin, and Allegro) are, in fact, not affected by this bias problem. A few test statistics--most notably the nonparametric linkage statistic and, to a lesser extent, the Aspex-MLS and Haseman-Elston statistics--are affected by the bias. Variance-components procedures, although unbiased, can show inflation or deflation of the test statistic attributable to the inclusion of pairs with incomplete identity-by-descent information. Results obtained--for instance, in genome scans--using these methods might therefore be worth revisiting to see if greater power can be obtained by use of an alternative statistic or by eliminating or downweighting uninformative relative pairs. 相似文献

3.

Composite statistics for QTL mapping with moderately discordant sibling pairs

下载免费PDF全文

Forrest WF Feingold E 《American journal of human genetics》2000,66(5):1642-1660

Extreme discordant sibling-pair (EDSP) designs have been shown in theory to be very powerful for mapping quantitative-trait loci (QTLs) in humans. However, their practical applicability has been somewhat limited by the need to phenotype very large populations to find enough pairs that are extremely discordant. In this paper, we demonstrate that there is also substantial power in pairs that are only moderately discordant, and that designs using moderately discordant pairs can yield a more practical balance between phenotyping and genotyping efforts. The power we demonstrate for moderately discordant pairs stems from a new statistical result. Statistical analysis in discordant-pair studies is generally done by testing for reduced identity by descent (IBD) sharing in the pairs. By contrast, the most commonly-used statistical methods for more standard QTL mapping are Haseman-Elston regression and variance-components analysis. Both of these use statistics that are functions of the trait values given IBD information for the pedigree. We show that IBD sharing statistics and "trait value given IBD" statistics contribute complementary rather than redundant information, and thus that statistics of the two types can be combined to form more powerful tests of linkage. We propose a simple composite statistic, and test it with simulation studies. The simulation results show that our composite statistic increases power only minimally for extremely discordant pairs. However, it boosts the power of moderately discordant pairs substantially and makes them a very practical alternative. Our composite statistic is straightforward to calculate with existing software; we give a practical example of its use by applying it to a Genetic Analysis Workshop (GAW) data set. 相似文献

4.

Efficient strategies for genomic searching using the affected-pedigree-member method of linkage analysis. 总被引：4，自引：3，他引：1

下载免费PDF全文

D. L. Brown M. B. Gorin D. E. Weeks 《American journal of human genetics》1994,54(3):544-552

The affected-pedigree-member (APM) method of linkage analysis is a nonparametric statistic that tests for nonrandom cosegregation of a disease and marker loci. The APM statistic is based on the observation that if a marker locus is near a disease-susceptibility locus, then affected individuals within a family should be more similar at the marker locus than is expected by chance. The APM statistic measures marker similarity in terms of identity by state (IBS) of marker alleles; that is, two alleles are IBS if they are the same, regardless of their ancestral origin. Since the APM statistic measures increased marker similarity, it makes no assumptions concerning how the disease is inherited; this can be an advantage when dealing with complex diseases for which the mode of inheritance is difficult to determine. We investigate here the power of the APM statistic to detect linkage in the context of a genomewide search. In such a search, the APM statistic is evaluated at a grid of markers. Then regions with high APM statistics are investigated more thoroughly by typing more markers in the region. Using simulated data, we investigate various search strategies and recommend an optimal search strategy that maximizes the power to detect linkage while minimizing the false-positive rate and number of markers. We determine an optimal series of three increasing cut-points and an independent criterion for significance. 相似文献

5.

Nonparametric trend statistic incorporating dispersion differences in sib pair linkage for quantitative traits

Kim MK Hong YJ Song HH 《Human heredity》2006,62(1):1-11

The Haseman-Elston (HE) regression method and its extensions are widely used in genetic studies for detecting linkage to quantitative trait loci (QTL) using sib pairs. The principle underlying the simple HE regression method is that the similarity in phenotypes between two siblings increases as they share an increasing number of alleles identical by descent (IBD) from their parents at a particular marker locus. In such a procedure, similarity was identified with the locations, that is, means of groups of sib pairs sharing 0, 1, and 2 alleles IBD. A more powerful, rank-based nonparametric test to detect increasing similarity in sib pairs is presented by combining univariate trend statistics not only of locations, but also of dispersions of the squared phenotypic differences of two siblings for three groups. This trend test does not rely on distributional assumptions, and is applicable to the skewed or leptokurtic phenotypic distributions, in addition to normal or near normal phenotypic distributions. The performances of nonparametric trend statistics, including nonparametric regression slope, are compared with the HE regression methods as genetic linkage strategies. 相似文献

6.

Heterogeneity in IBD allele sharing among covariate-defined subgroups: issues and findings for affected relatives

Bull SB Mirea L Briollais L Logan AG 《Human heredity》2003,56(1-3):94-106

OBJECTIVES: Modelling of variation in identical-by-descent (IBD) allele sharing using covariates can increase power to detect linkage, identify covariate-defined subgroups linked to particular marker regions, and improve the design of subsequent studies to localize genes and characterize their effects. In this report, we highlight issues that arise in studies of families with affected relatives. METHODS: Mirea et al. [Genet Epidemiol 2003, in press] extended linear and exponential linkage likelihood models [Kong and Cox, Am J Hum Genet 1997;61: 1179-1188] to model variation in NPL scores among covariate-defined groups of families, and proposed likelihood ratio (LR) and t statistics to detect differences in allele sharing between groups defined by a binary covariate. Here we evaluate factors affecting the power of these tests analytically and by example, as well as effects of constraints, nuisance parameters, and incomplete data on test validity by simulation of locus heterogeneity in families with affected siblings or affected cousins. RESULTS: Provided constraints on the parameters are avoided, these tests are particularly useful when one subgroup has less than expected IBD sharing. The distribution of the LR statistic depends on the extent of linkage, particularly in the presence of constraints. The t statistic may be biased by group differences in information content. CONCLUSIONS: We recommend that constraints be applied cautiously, and covariate effects in IBD allele sharing models interpreted with care. 相似文献

7.

Which strategy is better for linkage analysis: single-nucleotide polymorphisms or microsatellites? Evaluation by identity-by-state – identity-by-descent transformation affected sib-pair method on GAW14 data

Yue Q Apprey V Bonney GE 《BMC genetics》2005,6(Z1):S16

The central issue for Genetic Analysis Workshop 14 (GAW14) is the question, which is the better strategy for linkage analysis, the use of single-nucleotide polymorphisms (SNPs) or microsatellite markers? To answer this question we analyzed the simulated data using Duffy's SIB-PAIR program, which can incorporate parental genotypes, and our identity-by-state – identity-by-descent (IBS-IBD) transformation method of affected sib-pair linkage analysis which uses the matrix transformation between IBS and IBD. The advantages of our method are as follows: the assumption of Hardy-Weinberg equilibrium is not necessary; the parental genotype information maybe all unknown; both IBS and its related IBD transformation can be used in the linkage analysis; the determinant of the IBS-IBD transformation matrix provides a quantitative measure of the quality of the marker in linkage analysis. With the originally distributed simulated data, we found that 1) for microsatellite markers there are virtually no differences in types I and II error rates when parental genotypes were or were not used; 2) on average, a microsatellite marker has more power than a SNP marker does in linkage detection; 3) if parental genotype information is used, SNP markers show lower type I error rates than microsatellite markers; and 4) if parental genotypes are not available, SNP markers show considerable variation in type I error rates for different methods. 相似文献

8.

A survey of affected-sibship statistics for nonparametric linkage analysis

Sengul H Weeks DE Feingold E 《American journal of human genetics》2001,69(1):179-190

We have compared the power of a large number of allele-sharing statistics for "nonparametric" linkage analysis with affected sibships. Our rationale was that there is an extensive literature comparing statistics for sibling pairs but that there has not been much guidance on how to choose statistics for studies that include sibships of various sizes. We concentrated on statistics that can be described as assigning scores to each identity-by-descent-sharing configuration that a pedigree might take on (Whittemore and Halpern 1994). We considered sibships of sizes two through five, 27 different genetic models, and varying recombination fractions between the marker and the trait locus. We tried to identify statistics whose power was robust over a wide variety of models. We found that the statistic that is probably used most often in such studies-S(all)-performs quite well, although it is not necessarily the best. We also found several other statistics (such as the R criterion, S(robdom), and the Sobel-and-Lange statistic C) that perform well in most situations, a few (such as S(-#geno) and the Feingold-and-Siegmund version of S(pairs)) that have high power only in very special situations, and a few (such as S(-#geno), the N criterion, and the Sobel-and-Lange statistic B) that seem to have low power for the majority of the trait models. For the most part, the same statistics performed well for all sibship sizes. We also used our results to give some suggestions regarding how to weight sibships of different sizes, in forming an overall statistic. 相似文献

9.

Statistical tests for detection of misspecified relationships by use of genome-screen data

下载免费PDF全文

McPeek MS Sun L 《American journal of human genetics》2000,66(3):1076-1094

Misspecified relationships can have serious consequences for linkage studies, resulting in either reduced power or false-positive evidence for linkage. If some individuals in the pedigree are untyped, then Mendelian errors may not be observed. Previous approaches to detection of misspecified relationships by use of genotype data were developed for sib and half-sib pairs. We extend the likelihood calculations of G?ring and Ott and Boehnke and Cox to more-general relative pairs, for which identity-by-descent (IBD) status is no longer a Markov chain, and we propose a likelihood-ratio test. We also extend the identity-by-state (IBS)-based test of Ehm and Wagner to nonsib relative pairs. The likelihood-ratio test has high power, but its drawbacks include the need to construct and apply a separate Markov chain for each possible alternative relationship and the need for simulation to assess significance. The IBS-based test is simpler but has lower power. We propose two new test statistics-conditional expected IBD (EIBD) and adjusted IBS (AIBS)-designed to retain the simplicity of IBS while increasing power by taking into account chance sharing. In simulations, the power of EIBD is generally close to that of the likelihood-ratio test. The power of AIBS is higher than that of IBS, in all cases considered. We suggest a strategy of initial screening by use of EIBD and AIBS, followed by application of the likelihood-ratio test to only a subset of relative pairs, identified by use of EIBD and AIBS. We apply the methods to a Genetic Analysis Workshop 11 data set from the Collaborative Study on the Genetics of Alcoholism. 相似文献

10.

Identity-by-descent estimation and mapping of qualitative traits in large, complex pedigrees

下载免费PDF全文

Abney M 《Genetics》2008,179(3):1577-1590

Computing identity-by-descent sharing between individuals connected through a large, complex pedigree is a computationally demanding task that often cannot be done using exact methods. What I present here is a rapid computational method for estimating, in large complex pedigrees, the probability that pairs of alleles are IBD given the single-point genotype data at that marker for all individuals. The method can be used on pedigrees of essentially arbitrary size and complexity without the need to divide the individuals into separate subpedigrees. I apply the method to do qualitative trait linkage mapping using the nonparametric sharing statistic S(pairs). The validity of the method is demonstrated via simulation studies on a 13-generation 3028-person pedigree with 700 genotyped individuals. An analysis of an asthma data set of individuals in this pedigree finds four loci with P-values <10(-3) that were not detected in prior analyses. The mapping method is fast and can complete analyses of approximately 150 affected individuals within this pedigree for thousands of markers in a matter of hours. 相似文献

11.

Comparison of Nonparametric Statistics for Detection of Linkage in Nuclear Families: Single-Marker Evaluation 总被引：6，自引：2，他引：4

下载免费PDF全文

Sean Davis Daniel E. Weeks 《American journal of human genetics》1997,61(6):1431-1444

We have evaluated 23 different statistics, from a total of 10 popular software packages for model-free linkage analysis of nuclear-family data, by applying them to single-marker data simulated under several two-locus disease models. The statistics that we examined fall into two broad categories: (1) those that test directly for increased identity-by-state or identity-by-descent sharing (by use of the programs APM, Genetic Analysis System [GAS] SIBSTATE and SIBDES, SAGE SIBPAL, ERPA, SimIBD, and Genehunter NPL) and (2) those that are based on likelihood-ratio tests and that report LOD scores (by use of the programs Splink, SIBPAIR, Mapmaker/Sibs, ASPEX, and GAS SIBMLS). For each of eight two-locus disease models, we analyzed six data sets; the first three data sets consisted of two-child families with both sibs affected and zero, one, or both parents typed, whereas the other three data sets consisted of four-child families with at least two affected sibs and zero, one, or both parents typed. We report false-positive rates, overall rank by power, and the power for each statistic. We give rough recommendations regarding which programs provide the most powerful tests for linkage, as well as the programs to be avoided under certain conditions. For the likelihood-ratio-based statistics, we examined the effects of various treatments of sibships with multiple affected individuals. Finally, we explored the use of some simple two-of-three composite statistics and found that such tests are of only marginal benefit over the most powerful single statistic. 相似文献

12.

Power studies for the transmission/disequilibrium tests with multiple alleles. 总被引：6，自引：1，他引：5

下载免费PDF全文

N L Kaplan E R Martin B S Weir 《American journal of human genetics》1997,60(3):691-702

Case-control studies compare marker-allele distributions in affected and unaffected individuals, and significant results suggest linkage but may simply reflect population structure. For markers with m alleles (m > or = 2), a McNemar-like statistic, I, estimates the level of population association between marker and disease loci. To test for linkage after significant case-control tests, within-family tests are performed. These operate on the contingency table, with i, jth element equal to the number of parents that transmit marker allele Mi and do not transmit marker allele Mi to an affected offspring. The dimension of the table is the number of alleles at the marker locus. Three test statistics have recently been proposed in the literature: Tc compares symmetric pairs of cells (i, j) and (j, i), Tm compares row and column totals for the same marker allele, and a likelihood ratio statistic Tl uses all the cells in the table. In addition, we consider a new statistic, Tmhet, that uses only the heterozygous parents and is approximately chi2 with (m - 1) df. We use a Monte Carlo test to guarantee valid tests and to demonstrate the inferiority of Tc and the equality of Tm and Tl in terms of power. The power of the Tmhet test is close but not always equal to the power of the Tm test. We also show that under the alternative hypothesis of linkage, Tm is approximately noncentral chi2 with (m - 1) df and noncentrality parameter 2NT(1 - 2theta)2I*, when data on single affecteds in NT families are used. If the disease has a low population frequency, then I* is estimated using the case-control statistic I. This offers a basis for choosing sample size, or choosing a marker system. 相似文献

13.

Nonparametric longitudinal allele-sharing model

Kulle B Köhler K Rosenberger A Loesgen S Bickeböller H 《BMC genetics》2003,4(Z1):S85

Basically no methods are available for the analysis of quantitative traits in longitudinal genetic epidemiological studies. We introduce a nonparametric factorial design for longitudinal data on independent sib pairs, modelling the phenotypic quadratic differences as the dependent variable. Factors are the number of alleles shared identically by descent (IBD) and the age categories at which the dependent variable is measured, allowing for dependence due to age. To identify a linked marker a rank statistic tests the influence of IBD group on phenotypic quadratic differences. No assumptions are made on normality or variances of the dependent variable. We apply our method to 71 sib pairs from the Framingham Heart Study data provided at the Genetic Analysis Workshop 13. For all 15 available markers on chromosome 17 we analyzed the influence on systolic blood pressure. In addition, different selection strategies to sample from the whole data are discussed. 相似文献

14.

Construction of a confidence set of markers for the location of a disease gene using affected sib pair data

Lin S 《Human heredity》2002,53(2):103-112

We have previously proposed a confidence set approach for finding tightly linked genomic regions under the setting of parametric linkage analysis. In this article, we extend the confidence set approach to nonparametric linkage analysis of affected sib pair (ASP) data based on their identity-by-descent (IBD) information. Two well-known statistics in nonparametric linkage analysis, the Two-IBD test (proportion of ASPs sharing two alleles IBD), and the Mean test (average number of alleles shared IBD in the ASPs), are used for constructing confidence sets. Some numerical analyses as well as a simulation study were carried out to demonstrate the utility of the methods. Our results show that the fundamental advantages of the confidence set approach in parametric linkage analysis are retained when the method is generalized to nonparametric analysis. Our study on the accuracy of confidence sets, in terms of choice of tests, underlying disease incidence data, and amount of data available, leads us to conclude, among other things, that the Mean test outperforms the Two-IBD test in most situations, with the reverse being true only for traits with small additive variance. Although we describe how to construct confidence sets based on only two familiar tests, one can construct confidence sets similarly using other allele sharing statistics. 相似文献

15.

The linkage information content value of polymorphism genetic markers in model-free linkage analysis

Guo X Olson JM Elston RC Niu T 《Human heredity》2002,53(1):45-48

Guo and Elston [Hum Hered 1999;49:112-118] developed a linkage information content (LIC) value to measure the informativeness of a marker for identity-by-descent (IBD) sharing status of relative pairs. LIC values were derived for five types of relative pairs: full sib, half sib, grandparent-grandchild, first cousin and avuncular. In this paper, we give corrected LIC values for full sib, grandparent-grandchild, first cousin and avuncular pairs, and indicate the availability of a computer program to calculate them. 相似文献

16.

Consanguinity and the sib-pair method: an approach using identity by descent between and within individuals.

E. Génin F. Clerget-Darpoux 《American journal of human genetics》1996,59(5):1149-1162

To test for linkage between a trait and a marker, one can consider identical marker alleles in related individuals, for instance, sibs. For recessive diseases, it has been shown that some information may be gained from the identity by descent (IBD) of the two alleles of an affected inbred individual at the marker locus. The aim of this paper is to extend the sib-pair method of linkage analysis to the situation of sib pairs sampled from consanguineous populations. This extension takes maximum advantage of the information provided by both the IBD pattern between sibs and allelic identity within each sib of the pair. This is possible through the use of the condensed identity coefficients. Here, we propose a new test of linkage based on a chi2. We compare the performance of this test with that of the classical chi2 test based on the distribution of sib pairs sharing 0, 1, or 2 alleles IBD. For sib pairs from first-cousin matings, the proposed test can better detect the role of a disease-susceptibility (DS) locus. Its power is shown to be greater than that of the classical test, especially for models where the DS allele may be common and incompletely penetrant; that is to say for situations that may be encountered in multifactorial diseases. A study of the impact of inbreeding on the expected proportions of sib pairs sharing 0, 1, or 2 alleles IBD is also performed here. Ignoring inbreeding, when in fact inbreeding exists, increases the rate of type I errors in tests of linkage. 相似文献

17.

Modeling the effect of inbreeding among founders in linkage analysis

Hössjer O 《Theoretical population biology》2006,70(2):146-163

In this paper, we present a unified mathematical model for linkage analysis that allows for inbreeding among founders in all families. The identical by descent (IBD) configuration of each pedigree is modeled as a Markov process containing two parameters; the inverse inbreeding and kinship coefficient and a rate parameter proportional to the inverse expected length of chromosome segments shared IBD by two different founder haplotypes. We use hidden Markov models and define a forward-backward algorithm for computing the conditional IBD-distribution given marker data, thereby extending the multipoint method of Lander and Green [1987. Construction of multilocus genetic maps in humans, Proc. Natl. Acad. Sci. USA 84, 2363-2367] to situations where founders are inbred. Our methodology is valid for arbitrary pedigree structures. Simulation and theoretical approximations for nonparametric linkage (NPL) analysis based on affected sib pairs reveal that NPL scores are inflated and type 1 errors increased when the inbreeding coefficient or rate parameter is underestimated. When the parents are genotyped, we present a general way of modifying the score function to drastically reduce this effect. 相似文献

18.

A multilocus extension of the affected-pedigree-member method of linkage analysis.

下载免费PDF全文

D E Weeks K Lange 《American journal of human genetics》1992,50(4):859-868

The affected-pedigree-member (APM) method of linkage analysis is designed to detect departures from independent segregation of disease and marker phenotypes. The underlying statistic of the APM method operates on the identity-by-state relations implied by the marker phenotypes of the affected within a pedigree. Here we generalize the APM statistic to multiple linked markers. This generalization relies on recursive computation of two-locus kinship coefficients by an algorithm of Thompson. The distributional properties of the extended APM statistic are investigated theoretically and by simulation in the context of one real and one artificial data set. In both examples, the multilocus statistic tends to reject, more strongly than the single-locus statistics do, the null hypothesis of independent segregation between the disease locus and the marker loci. 相似文献

19.

Multipoint quantitative-trait linkage analysis in general pedigrees. 总被引：49，自引：12，他引：37

下载免费PDF全文

L Almasy J Blangero 《American journal of human genetics》1998,62(5):1198-1211

Multipoint linkage analysis of quantitative-trait loci (QTLs) has previously been restricted to sibships and small pedigrees. In this article, we show how variance-component linkage methods can be used in pedigrees of arbitrary size and complexity, and we develop a general framework for multipoint identity-by-descent (IBD) probability calculations. We extend the sib-pair multipoint mapping approach of Fulker et al. to general relative pairs. This multipoint IBD method uses the proportion of alleles shared identical by descent at genotyped loci to estimate IBD sharing at arbitrary points along a chromosome for each relative pair. We have derived correlations in IBD sharing as a function of chromosomal distance for relative pairs in general pedigrees and provide a simple framework whereby these correlations can be easily obtained for any relative pair related by a single line of descent or by multiple independent lines of descent. Once calculated, the multipoint relative-pair IBDs can be utilized in variance-component linkage analysis, which considers the likelihood of the entire pedigree jointly. Examples are given that use simulated data, demonstrating both the accuracy of QTL localization and the increase in power provided by multipoint analysis with 5-, 10-, and 20-cM marker maps. The general pedigree variance component and IBD estimation methods have been implemented in the SOLAR (Sequential Oligogenic Linkage Analysis Routines) computer package. 相似文献

20.

The affected-pedigree-member method of linkage analysis. 总被引：67，自引：45，他引：22

下载免费PDF全文

D E Weeks K Lange 《American journal of human genetics》1988,42(2):315-326

This paper describes a generalization of the affected-sib-pair method of linkage analysis to pedigrees. By substituting identity-by-state relations for identity-by-descent relations, we develop a test statistic for detecting departures from independent segregation of disease and marker phenotypes. The statistic is based on the marker phenotypes of affected pedigree members only. Since it is more striking for distantly affected relatives to share a rare marker allele than a common marker allele, the statistic also includes a weighting factor based on allele frequency. The distributional properties of the statistic are investigated theoretically and by simulation. Part of the theoretical treatment entails generalizing Karigl's multiple-person kinship coefficients. When the test statistic is applied to pedigree data on Huntington disease, the null hypothesis of independent segregation between the marker locus and the disease locus is firmly rejected. In this case, as expected, there is a loss of power when compared with standard lod-score analysis. However, our statistic possesses the advantage of requiring no explicit assumptions about the mode of inheritance of the disease. This point is illustrated by application of the test statistic to data on rheumatoid arthritis. 相似文献