首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
To avoid problems related to unknown population substructure, association studies may be conducted in founder populations. In such populations, however, the relatedness among individuals may be considerable. Neglecting such correlations among individuals can lead to seriously spurious associations. Here, we propose a method for case-control association studies of binary traits that is suitable for any set of related individuals, provided that their genealogy is known. Although we focus here on large inbred pedigrees, this method may also be used in outbred populations for case-control studies in which some individuals are relatives. We base inference on a quasi-likelihood score (QLS) function and construct a QLS test for allelic association. This approach can be used even when the pedigree structure is far too complex to use an exact-likelihood calculation. We also present an alternative approach to this test, in which we use the known genealogy to derive a correction factor for the case-control association chi2 test. We perform analytical power calculations for each of the two tests by deriving their respective noncentrality parameters. The QLS test is more powerful than the corrected chi2 test in every situation considered. Indeed, under certain regularity conditions, the QLS test is asymptotically the locally most powerful test in a general class of linear tests that includes the corrected chi2 test. The two methods are used to test for associations between three asthma-associated phenotypes and 48 SNPs in 35 candidate genes in the Hutterites. We report a highly significant novel association (P=2.10-6) between atopy and an amino acid polymorphism in the P-selectin gene, detected with the QLS test and also, but less significantly (P=.0014), with the transmission/disequilibrium test.  相似文献   

2.
Mao Y  Xu S 《Heredity》2005,94(3):305-315
Identity-By-Descent (IBD) is a general measurement of the relationship between two groups of genes. If the two groups consist of two homologous genes, one from each individual, the IBD is called the coancestry between the two individuals. Coancestry is an important concept in both population and quantitative genetics. It is the probability that both genes are copies of the same gene in the genealogy. The average coancestry value at a random locus in a population reflects the level of population diversity, effective population size, the level of inbreeding and other attributes. Coancestry is also the building block for the covariance structure used to estimate the additive genetic variance component for a quantitative trait. There are many other types of IBD matrices, depending on the natures of the genes included in each group, and these IBD matrices vary from locus to locus. Molecular markers distributed along the genome provide information that can be used to infer these locus-specific IBD matrices. As a result, we can estimate and test the variance components of a quantitative trait contributed by these loci using the inferred IBD matrices. In this study, we develop the concept of locus-specific epistatic IBD matrices and a Monte Carlo method to infer these IBD matrices. The method is suitable for large pedigrees with arbitrary complexity and various levels of missing marker information. With these locus-specific IBD matrices, we are ready to search for quantitative trait loci along the genome in complicated pedigrees.  相似文献   

3.
DeGiorgio M  Jankovic I  Rosenberg NA 《Genetics》2010,186(4):1367-1387
Gene diversity, a commonly used measure of genetic variation, evaluates the proportion of heterozygous individuals expected at a locus in a population, under the assumption of Hardy-Weinberg equilibrium. When using the standard estimator of gene diversity, the inclusion of related or inbred individuals in a sample produces a downward bias. Here, we extend a recently developed estimator shown to be unbiased in a diploid autosomal sample that includes known related or inbred individuals to the general case of arbitrary ploidy. We derive an exact formula for the variance of the new estimator, H, and present an approximation to facilitate evaluation of the variance when each individual is related to at most one other individual in a sample. When examining samples from the human X chromosome, which represent a mixture of haploid and diploid individuals, we find that H performs favorably compared to the standard estimator, both in theoretical computations of mean squared error and in data analysis. We thus propose that H is a useful tool in characterizing gene diversity in samples of arbitrary ploidy that contain related or inbred individuals.  相似文献   

4.
An issue often encountered in statistical genetics is whether, or to what extent, it is possible to estimate the degree to which individuals sampled from a background population are related to each other, on the basis of the available genotype data and some information on the demography of the population. In this article, we consider this question using explicit modelling of the pedigrees and gene flows at unlinked marker loci, but then restricting ourselves to a relatively recent history of the population, that is, considering the genealogy at most some tens of generations backwards in time. As a computational tool we use a Markov chain Monte Carlo numerical integration on the state space of genealogies of the sampled individuals. As illustrations of the method, we consider the question of relatedness at the level of genes/genomes (IBD estimation), using both simulated and real data.  相似文献   

5.
Family-based tests of linkage disequilibrium typically are based on nuclear-family data including affected individuals and their parents or their unaffected siblings. A limitation of such tests is that they generally are not valid tests of association when data from related nuclear families from larger pedigrees are used. Standard methods require selection of a single nuclear family from any extended pedigrees when testing for linkage disequilibrium. Often data are available for larger pedigrees, and it would be desirable to have a valid test of linkage disequilibrium that can use all potentially informative data. In this study, we present the pedigree disequilibrium test (PDT) for analysis of linkage disequilibrium in general pedigrees. The PDT can use data from related nuclear families from extended pedigrees and is valid even when there is population substructure. Using computer simulations, we demonstrated validity of the test when the asymptotic distribution is used to assess the significance, and examined statistical power. Power simulations demonstrate that, when extended pedigree data are available, substantial gains in power can be attained by use of the PDT rather than existing methods that use only a subset of the data. Furthermore, the PDT remains more powerful even when there is misclassification of unaffected individuals. Our simulations suggest that there may be advantages to using the PDT even if the data consist of independent families without extended family information. Thus, the PDT provides a general test of linkage disequilibrium that can be widely applied to different data structures.  相似文献   

6.
Fertility inheritance, a phenomenon in which an individual's number of offspring is positively correlated with his or her number of siblings, is a cultural process that can have a strong impact on genetic diversity. Until now, fertility inheritance has been detected primarily using genealogical databases. In this study, we develop a new method to infer fertility inheritance from genetic data in human populations. The method is based on the reconstruction of the gene genealogy of a sample of sequences from a given population and on the computation of the degree of imbalance in this genealogy. We show indeed that this level of imbalance increases with the level of fertility inheritance, and that other phenomena such as hidden population structure are unlikely to generate a signal of imbalance in the genealogy that would be confounded with fertility inheritance. By applying our method to mtDNA samples from 37 human populations, we show that matrilineal fertility inheritance is more frequent in hunter-gatherer populations than in food-producer populations. One possible explanation for this result is that in hunter-gatherer populations, individuals belonging to large kin networks may benefit from stronger social support and may be more likely to have a large number of offspring.  相似文献   

7.
In contrast to large GWA studies based on thousands of individuals and large meta-analyses combining GWAS results, we analyzed a small case/control sample for uric acid nephrolithiasis. Our cohort of closely related individuals is derived from a small, genetically isolated village in Sardinia, with well-characterized genealogical data linking the extant population up to the 16(th) century. It is expected that the number of risk alleles involved in complex disorders is smaller in isolated founder populations than in more diverse populations, and the power to detect association with complex traits may be increased when related, homogeneous affected individuals are selected, as they are more likely to be enriched with and share specific risk variants than are unrelated, affected individuals from the general population. When related individuals are included in an association study, correlations among relatives must be accurately taken into account to ensure validity of the results. A recently proposed association method uses an empirical genotypic covariance matrix estimated from genome-screen data to allow for additional population structure and cryptic relatedness that may not be captured by the genealogical data. We apply the method to our data, and we also investigate the properties of the method, as well as other association methods, in our highly inbred population, as previous applications were to outbred samples. The more promising regions identified in our initial study in the genetic isolate were then further investigated in an independent sample collected from the Italian population. Among the loci that showed association in this study, we observed evidence of a possible involvement of the region encompassing the gene LRRC16A, already associated to serum uric acid levels in a large meta-analysis of 14 GWAS, suggesting that this locus might lead a pathway for uric acid metabolism that may be involved in gout as well as in nephrolithiasis.  相似文献   

8.
Genome-wide association studies are routinely conducted to identify genetic variants that influence complex disorders. It is well known that failure to properly account for population or pedigree structure can lead to spurious association as well as reduced power. We propose a method, ROADTRIPS, for case-control association testing in samples with partially or completely unknown population and pedigree structure. ROADTRIPS uses a covariance matrix estimated from genome-screen data to correct for unknown population and pedigree structure while maintaining high power by taking advantage of known pedigree information when it is available. ROADTRIPS can incorporate data on arbitrary combinations of related and unrelated individuals and is computationally feasible for the analysis of genetic studies with millions of markers. In simulations with related individuals and population structure, including admixture, we demonstrate that ROADTRIPS provides a substantial improvement over existing methods in terms of power and type 1 error. The ROADTRIPS method can be used across a variety of study designs, ranging from studies that have a combination of unrelated individuals and small pedigrees to studies of isolated founder populations with partially known or completely unknown pedigrees. We apply the method to analyze two data sets: a study of rheumatoid arthritis in small UK pedigrees, from Genetic Analysis Workshop 15, and data from the Collaborative Study of the Genetics of Alcoholism on alcohol dependence in a sample of moderate-size pedigrees of European descent, from Genetic Analysis Workshop 14. We detect genome-wide significant association, after Bonferroni correction, in both studies.  相似文献   

9.
Fertility inheritance, a phenomenon in which an individual's number of offspring is positively correlated with his or her number of siblings, is a cultural process that can have a strong impact on genetic diversity. Until now, fertility inheritance has been detected primarily using genealogical databases. In this study, we develop a new method to infer fertility inheritance from genetic data in human populations. The method is based on the reconstruction of the gene genealogy of a sample of sequences from a given population and on the computation of the degree of imbalance in this genealogy. We show indeed that this level of imbalance increases with the level of fertility inheritance, and that other phenomena such as hidden population structure are unlikely to generate a signal of imbalance in the genealogy that would be confounded with fertility inheritance. By applying our method to mtDNA samples from 37 human populations, we show that matrilineal fertility inheritance is more frequent in hunter–gatherer populations than in food-producer populations. One possible explanation for this result is that in hunter–gatherer populations, individuals belonging to large kin networks may benefit from stronger social support and may be more likely to have a large number of offspring.  相似文献   

10.
The mutation rate in the human mtDNA control region   总被引:18,自引:0,他引:18       下载免费PDF全文
The mutation rate of the mitochondrial control region has been widely used to calibrate human population history. However, estimates of the mutation rate in this region have spanned two orders of magnitude. To readdress this rate, we sequenced the mtDNA control region in 272 individuals, who were related by a total of 705 mtDNA transmission events, from 26 large Icelandic pedigrees. Three base substitutions were observed, and the mutation rate across the two hypervariable regions was estimated to be 3/705 =.0043 per generation (95% confidence interval [CI].00088-.013), or.32/site/1 million years (95% CI.065-.97). This study is substantially larger than others published, which have directly assessed mtDNA mutation rates on the basis of pedigrees, and the estimated mutation rate is intermediate among those derived from pedigree-based studies. Our estimated rate remains higher than those based on phylogenetic comparisons. We discuss possible reasons for-and consequences of-this discrepancy. The present study also provides information on rates of insertion/deletion mutations, rates of heteroplasmy, and the reliability of maternal links in the Icelandic genealogy database.  相似文献   

11.
It is an assumption of large, population-based datasets that samples are annotated accurately whether they correspond to known relationships or unrelated individuals. These annotations are key for a broad range of genetics applications. While many methods are available to assess relatedness that involve estimates of identity-by-descent (IBD) and/or identity-by-state (IBS) allele-sharing proportions, we developed a novel approach that estimates IBD0, 1, and 2 based on observed IBS within windows. When combined with genome-wide IBS information, it provides an intuitive and practical graphical approach with the capacity to analyze datasets with thousands of samples without prior information about relatedness between individuals or haplotypes. We applied the method to a commonly used Human Variation Panel consisting of 400 nominally unrelated individuals. Surprisingly, we identified identical, parent-child, and full-sibling relationships and reconstructed pedigrees. In two instances non-sibling pairs of individuals in these pedigrees had unexpected IBD2 levels, as well as multiple regions of homozygosity, implying inbreeding. This combined method allowed us to distinguish related individuals from those having atypical heterozygosity rates and determine which individuals were outliers with respect to their designated population. Additionally, it becomes increasingly difficult to identify distant relatedness using genome-wide IBS methods alone. However, our IBD method further identified distant relatedness between individuals within populations, supported by the presence of megabase-scale regions lacking IBS0 across individual chromosomes. We benchmarked our approach against the hidden Markov model of a leading software package (PLINK), showing improved calling of distantly related individuals, and we validated it using a known pedigree from a clinical study. The application of this approach could improve genome-wide association, linkage, heterozygosity, and other population genomics studies that rely on SNP genotype data.  相似文献   

12.
Can we find the family trees, or pedigrees, that relate the haplotypes of a group of individuals? Collecting the genealogical information for how individuals are related is a very time-consuming and expensive process. Methods for automating the construction of pedigrees could stream-line this process. While constructing single-generation families is relatively easy given whole genome data, reconstructing multi-generational, possibly inbred, pedigrees is much more challenging. This article addresses the important question of reconstructing monogamous, regular pedigrees, where pedigrees are regular when individuals mate only with other individuals at the same generation. This article introduces two multi-generational pedigree reconstruction methods: one for inbreeding relationships and one for outbreeding relationships. In contrast to previous methods that focused on the independent estimation of relationship distances between every pair of typed individuals, here we present methods that aim at the reconstruction of the entire pedigree. We show that both our methods out-perform the state-of-the-art and that the outbreeding method is capable of reconstructing pedigrees at least six generations back in time with high accuracy. The two programs are available at http://cop.icsi.berkeley.edu/cop/.  相似文献   

13.

We study an extension of the standard framework for pedigree analysis, in which we allow pedigree founders to be inbred. This solves a number of practical challenges in calculating coefficients of relatedness, including condensed identity coefficients. As a consequence we expand considerably the class of pedigrees for which such coefficients may be efficiently computed. An application of this is the modelling of background inbreeding as a continuous effect. We also use inbred founders to shed new light on constructibility of relatedness coefficients, i.e., the problem of finding a genealogy yielding a given set of coefficients. In particular, we show that any theoretically admissible coefficients for a pair of noninbred individuals can be produced by a finite pedigree with inbred founders. Coupled with our computational methods, implemented in the R package ribd, this allows for the first time computer analysis of general constructibility solutions, thus making them accessible for practical use.

  相似文献   

14.
Because of the need for fine mapping of disease loci and the availability of dense single-nucleotide-polymorphism markers, many forms of association tests have been developed. Most of them are applicable only to triads, whereas some are amenable to nuclear families (sibships). Although there are a number of methods that can deal with extended families (e.g., the pedigree disequilibrium test [PDT]), most of them cannot accommodate incomplete data. Furthermore, despite a large body of literature on association mapping, only a very limited number of publications are applicable to X-chromosomal markers. In this report, we first extend the PDT to markers on the X chromosome for testing linkage disequilibrium in the presence of linkage. This method is applicable to any pedigree structure and is termed "X-chromosomal pedigree disequilibrium test" (XPDT). We then further extend the XPDT to accommodate pedigrees with missing genotypes in some of the individuals, especially founders. Monte Carlo (MC) samples of the missing genotypes are generated and used to calculate the XMCPDT (X-chromosomal MC PDT) statistic, which is defined as the conditional expectation of the XPDT statistic given the incomplete (observed) data. This MC version of the XPDT remains a valid test for association under linkage with the assumption that the pedigrees and their associated affection patterns are drawn randomly from a population of pedigrees with at least one affected offspring. This set of methods was compared with existing approaches through simulation, and substantial power gains were observed in all settings considered, with type I error rates closely tracking their nominal values.  相似文献   

15.
16.
Nuzhdin SV  Harshman LG  Zhou M  Harmon K 《Heredity》2007,99(3):313-321
Identification of genes underlying complex traits is an important problem. Quantitative trait loci (QTL) are mapped using marker-trait co-segregation in large panels of recombinant genotypes. Most frequently, recombinant inbred lines derived from two isogenic parents are used. Segregation patterns are also studied in pedigrees from multiple families. Great advances have been made through creative use of these techniques, but narrow sampling and inadequate power represent strong limitations. Here, we propose an approach combining the strengths of both techniques. We established a mapping population from a sample of natural genotypes, and applied artificial selection for a complex character. Selection changed the frequencies of alleles in QTLs contributing to the selection response. We infer QTLs with dense genotyping microarrays by identifying blocks of linked markers undergoing selective changes in allele frequency. We demonstrated this approach with an experimental population composed from 20 isogenic strains. Selection for starvation survival was executed in three replicated populations with three control non-selected populations. Three individuals per population were genotyped using Affymetrix GeneChips. Two regions of the genome, one each on the left arms of the second and third chromosomes, showed significant divergence between control and selected populations. For the former region, we inferred allele frequencies in selected and control populations by pyrosequencing. We conclude that the allele frequency difference, averaging approximately 40% between selected and control lines, contributed to selection response. Our approach can contribute to the fine scale decomposition of the genetics of direct and indirect selection responses, and genotype by environment interactions.  相似文献   

17.
Yan Li  Barry I. Graubard 《Biometrics》2009,65(4):1096-1104
Summary For studies on population genetics, the use of representative random samples of the target population can avoid ascertainment bias. Genetic variation data from over a hundred genes were collected in a U.S. nationally representative sample in the Third National Health and Nutrition Examination Survey (NHANES III). Surveys such as the NHANES have complex stratified multistage cluster sample designs with sample weighting that can inflate variances and alter the expectations of test statistics. Thus, classical statistical tests of Hardy–Weinberg equilibrium (HWE) and homogeneity of HW disequilibrium (HHWD) for simple random samples are not suitable for data from complex samples. We propose using Wald tests for HWE and generalized score tests for HHWD that have been modified for complex samples. Monte Carlo simulation studies are used to investigate the finite sample properties of the proposed tests. Rao–Scott corrections applied to the tests were found to improve their type I error properties. Our methods are applied to the NHANES III genetic data for three loci involved in metabolizing lead in the body.  相似文献   

18.
We propose a method, the maximum identity length contrast (MILC) statistic, to locate genetic risk factors for complex diseases in founder populations. The MILC approach compares the identity length of parental haplotypes that are transmitted to affected offspring with the identity length of those that are not transmitted to affected offspring. Initially, the statistical properties of the method were assessed using randomly selected affected individuals with unknown relationship. Because both nuclear families with multiple affected sibs and large pedigrees are often available in founder populations, we performed simulations to investigate the properties of the MILC statistic in the presence of closely related affected individuals. The simulation showed that the use of closely related affected individuals greatly enhances the power of the statistic. For a given sample size and type I error, the use of affected sib pairs, instead of affected individuals randomly selected from the population, could increase the power by a factor of two. This increase was related to an increase of kinship-coefficient contrast between haplotype groups when closely related individuals were considered. The MILC approach allows the simultaneous use of affected individuals from a founder population and affected individuals with any kind of relationship, close or remote. We used the MILC approach to analyze the role of HLA in celiac disease and showed that the effect of HLA may be detected with the MILC approach by typing only 11 affected individuals, who were part of a single large Finnish pedigree.  相似文献   

19.
An elevated level of erythrocyte sodium-lithium (Na-Li) countertransport has been suggested as a predictor of predisposition to essential hypertension. In order to evaluate whether a single genetic or environmental factor with large effects explains the mixture of distributions in Na-Li countertransport in the general population, complex segregation analyses were conducted by using 1,273 individuals more than age 20 years from 276 pedigrees selected without respect to disease risk factors or health status. Either a single genetic locus or a single environmental factor with large gender-specific effects explained the mixture of distributions for Na-Li countertransport in this sample equally well. In the subsample of pedigrees supporting a single-locus etiology, the single genetic locus explained 29.0% of the variability in adjusted Na-Li countertransport in males and 16.6% of that in females. In a subsample of pedigrees supporting an environmental factor etiology, the environmental factor explained 35.2% of the adjusted Na-Li countertransport in males and 20.5% of that in females. These results suggest that there are at least two different explanations for the mixture of distributions in Na-Li countertransport in the general population. Attempts to relate genetic variation in Na-Li countertransport to risk of essential hypertension must consider that the factor with large phenotypic effects on this trait is gender specific and may not be a single major locus in all pedigrees.  相似文献   

20.
We propose an analytical approximation method for the estimation of multipoint identity by descent (IBD) probabilities in pedigrees containing a moderate number of distantly related individuals. We show that in large pedigrees where cases are related through untyped ancestors only, it is possible to formulate the hidden Markov model of the Lander-Green algorithm in terms of the IBD configurations of the cases. We use a first-order Markov approximation to model the changes in this IBD-configuration variable along the chromosome. In simulated and real data sets, we demonstrate that estimates of parametric and nonparametric linkage statistics based on the first-order Markov approximation are accurate. The computation time is exponential in the number of cases instead of in the number of meioses separating the cases. We have implemented our approach in the computer program ALADIN (accurate linkage analysis of distantly related individuals). ALADIN can be applied to general pedigrees and marker types and has the ability to model marker-marker linkage disequilibrium with a clustered-markers approach. Using ALADIN is straightforward: It requires no parameters to be specified and accepts standard input files.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号