Similar articles
 20 similar articles found (search time: 15 ms)
1.
2.
Haplotype-based association analysis has been recognized as a tool with high resolution and potentially great power for identifying modest etiological effects of genes. In practice, however, its efficacy has not been reproduced as successfully as theory predicts. One primary cause is that such analysis tends to require a large number of parameters to capture the abundant haplotype varieties, and many of those are expended on rare haplotypes for which studies would have insufficient power to detect association even if it existed. To concentrate statistical power on more-relevant inferences, in this study, we developed a regression-based approach using clustered haplotypes to assess haplotype-phenotype association. Specifically, we generalized the probabilistic clustering methods of Tzeng to the generalized linear model (GLM) framework established by Schaid et al. The proposed method uses unphased genotypes and incorporates both phase uncertainty and clustering uncertainty. Its GLM framework allows adjustment for covariates and can model qualitative and quantitative traits. It can also evaluate the overall haplotype association or the individual haplotype effects. We applied the proposed approach to study the association between hypertriglyceridemia and the apolipoprotein A5 gene. Through simulation studies, we assessed the performance of the proposed approach and demonstrated its validity and power in testing for haplotype-trait association.

3.
We recently described a method for linkage disequilibrium (LD) mapping, using cladistic analysis of phased single-nucleotide polymorphism (SNP) haplotypes in a logistic regression framework. However, haplotypes are often not available and cannot be deduced with certainty from the unphased genotypes. One possible two-stage approach is to infer the phase of multilocus genotype data and analyze the resulting haplotypes as if known. Here, haplotypes are inferred using the expectation-maximization (EM) algorithm and the best-guess phase assignment for each individual is analyzed. However, inferring haplotypes from phase-unknown data is prone to error, and this should be taken into account in the subsequent analysis. An alternative approach is to analyze the phase-unknown multilocus genotypes themselves. Here we present a generalization of the method for phase-known haplotype data to the case of unphased SNP genotypes. Our approach is designed for high-density SNP data, so we opted to analyze the simulated dataset. The marker spacing in the initial screen was too large for our method to be effective, so we used the answers provided to request further data in regions around the disease loci and in null regions. Power to detect the disease loci, accuracy in localizing the true site of the locus, and false-positive error rates are reported for the inferred-haplotype and unphased genotype methods. For these data, analyzing inferred haplotypes outperforms analysis of genotypes. As expected, our results suggest that when there is little or no LD between a disease locus and the flanking region, there will be no chance of detecting it unless the disease variant itself is genotyped.

4.
Knowing the origin of invasive taxa, whether multiple introductions have occurred, and levels of genetic variation relative to the native range is vital to conducting rigorous tests of hypotheses to explain biological invasions. We explore phylogeographical relationships of two Eurasian knapweed taxa that are invasive in North America, Centaurea diffusa and C. stoebe micranthos (Asteraceae), using chloroplast DNA intron sequences. We also gathered data from C. stoebe stoebe, hybrids between C. diffusa and C. stoebe stoebe (C. × psammogena), and three other species in the genus. We sequenced 213 individuals from Eurasia and North America, and found 22 haplotypes. Centaurea diffusa has lower haplotype diversity and allelic richness in the introduced range relative to the native range. Even with reduced variation, the data suggest at least two introductions of C. diffusa. There is a trend towards reduced variation in C. stoebe micranthos, but it is not significant. One of the haplotypes found in North American C. stoebe micranthos matches a haplotype from a taxon other than C. stoebe micranthos in Europe. This suggests introgression of the chloroplast between taxa, or possibly the invasion of another Centaurea taxon into North America. Additionally, C. diffusa, C. stoebe micranthos, and C. stoebe stoebe share several haplotypes, including their most common haplotype. This suggests ongoing hybridization between the species or incomplete segregation. These data can guide further exploration for the origins of these species, and point out locations within the introduced range with unique and diverse genetic makeup.

5.
The genetic differentiation among populations of the leaf beetle Chrysolina virgata living in wetlands of Japan was studied based on sequence data of the mitochondrial cytochrome oxidase subunit I gene region (750 bp). Two distinct lineages of mitochondrial haplotypes were found: one (clade A) consisted of 26 haplotypes distributed over the range of C. virgata between north‐east Honshu and Kyushu, whereas the other (clade B) was monotypic and confined to a small region in north‐east Honshu where it coexisted with clade A. Nested clade analysis for these haplotypes suggested that range expansion and subsequent differentiation due to isolation by distance might have resulted in the present distribution pattern of the haplotypes in clade A. We discuss the evolutionary process leading to the occurrence of two distinct haplotype clades in Japan in terms of repeated colonization from the continent and range expansion and contraction during climatic changes.

6.
7.
The gene pool structure of the native population of the Sakha Republic (Yakutia) was studied: we determined the composition and frequencies of Y-chromosome haplogroups in Yakuts. Six haplogroups, C3 x M77, C3c, N*, N2, N3a and R1a1, were found in the Yakut gene pool. Most Y chromosomes in the Yakut population belong to haplogroup N3a (89%). All Yakut population samples investigated have low gene diversity, calculated from haplogroup frequencies. Genetic differentiation of the samples, estimated by analysis of molecular variance (AMOVA) with two marker systems (haplogroup frequencies and Y-chromosome microsatellite haplotypes), revealed interpopulation differences amounting to 0.24% and 2.85%, respectively. Frequencies and the molecular phylogeny of Y-STR haplotypes were determined for Y-chromosome haplogroup N3a. Altogether, forty haplotypes were found in Yakuts. Evenks and Yakuts are characterized by an overlapping and very specific spectrum of N3a haplotypes that is not typical of other Siberian ethnic groups. Cluster analysis of populations by N3a Y-STR haplotypes shows the isolation of Yakuts from the Turkic-speaking populations of South Siberia. The time of generation of genetic diversity for the specific spectrum of Yakut haplotypes was estimated at 4.45 ± 1.96 thousand years. In contrast to the mtDNA data, the results provide evidence for a significant contribution of a local Palaeolithic component to the Yakut Y-chromosomal gene pool. An ethnogenetic reconstruction of the present pattern of genetic diversity in haplogroup N3a across Siberia is considered.

8.
There have been increasing efforts to relate drug efficacy and disease predisposition to genetic polymorphisms. We present statistical tests for association of haplotype frequencies with discrete and continuous traits in samples of unrelated individuals. Haplotype frequencies are estimated through the expectation-maximization algorithm, and each individual in the sample is expanded into all possible haplotype configurations with corresponding probabilities, conditional on their genotype. A regression-based approach is then used to relate inferred haplotype probabilities to the response. The relationship of this technique to commonly used approaches developed for case-control data is discussed. We confirm the proper size of the test under H0 and find an increase in power under the alternative by comparing test results using inferred haplotypes with single-marker tests using simulated data. More importantly, analysis of real data comprising a dense map of single nucleotide polymorphisms spaced along a 12-cM chromosomal region allows us to confirm the utility of the haplotype approach as well as the validity and usefulness of the proposed statistical technique. The method appears to be successful in relating data from multiple, correlated markers to response.
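
The expansion step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes two biallelic SNPs, genotypes coded as counts (0, 1, 2) of one allele at each locus, and Hardy-Weinberg proportions. The names `HAPLOTYPES`, `compatible_pairs` and `em_haplotype_frequencies` are hypothetical and chosen only for illustration.

```python
# Illustrative EM sketch for two biallelic SNPs (assumed setup, not the paper's code).
# A haplotype is a tuple of alleles (0 or 1) at the two loci.
HAPLOTYPES = [(0, 0), (0, 1), (1, 0), (1, 1)]


def compatible_pairs(genotype):
    """Return every unordered haplotype pair whose allele counts sum to the genotype."""
    pairs = []
    for i, h1 in enumerate(HAPLOTYPES):
        for h2 in HAPLOTYPES[i:]:
            if tuple(a + b for a, b in zip(h1, h2)) == tuple(genotype):
                pairs.append((h1, h2))
    return pairs


def em_haplotype_frequencies(genotypes, n_iter=200, tol=1e-10):
    """Estimate haplotype frequencies from unphased genotypes by EM."""
    freqs = {h: 1.0 / len(HAPLOTYPES) for h in HAPLOTYPES}  # uniform start
    for _ in range(n_iter):
        counts = {h: 0.0 for h in HAPLOTYPES}
        for g in genotypes:
            pairs = compatible_pairs(g)
            # E-step: probability of each configuration given the genotype,
            # assuming Hardy-Weinberg proportions (factor 2 for distinct haplotypes).
            weights = [
                freqs[h1] * freqs[h2] * (1.0 if h1 == h2 else 2.0)
                for h1, h2 in pairs
            ]
            total = sum(weights)
            for (h1, h2), w in zip(pairs, weights):
                posterior = w / total
                counts[h1] += posterior
                counts[h2] += posterior
        # M-step: expected haplotype counts divided by the number of chromosomes.
        new_freqs = {h: c / (2 * len(genotypes)) for h, c in counts.items()}
        delta = max(abs(new_freqs[h] - freqs[h]) for h in HAPLOTYPES)
        freqs = new_freqs
        if delta < tol:
            break
    return freqs


if __name__ == "__main__":
    # Only the double heterozygote (1, 1) has an ambiguous phase here.
    sample = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
    print(em_haplotype_frequencies(sample))
```

In a regression step of the kind the abstract describes, the per-individual posterior weights computed in the E-step would serve as the haplotype probabilities related to the response; that step is omitted from this sketch.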

9.
Because some genes have been cloned that have a known biochemical or physiological function, genetic variation can be measured in a population at loci that may directly influence a phenotype of interest. With this measured genotype approach, specific alleles or haplotypes in the probed DNA region can be assigned phenotypic effects. In this paper we address several problems encountered in implementing the measured genotype approach with restriction site data. A number of analytical problems arise in part as a consequence of the linkage disequilibrium that is commonly encountered when dealing with small DNA regions: 1) different restriction site polymorphisms are not statistically independent, 2) the sites being measured are not likely to be the direct cause of the associated phenotypic effects, 3) haplotype classes may be phenotypically heterogeneous, and 4) the sites that are most strongly associated with phenotypic effects are not necessarily the most closely linked to the actual genetic cause of the effects. When recombination and gene conversion are rare, the primary cause of linkage disequilibrium is history (mutational origin, genetic drift, hitchhiking, etc.). We deal with historical association directly by producing a cladogram that partially reconstructs the evolutionary history of the present-day haplotype variability. The cladogram defines a nested analysis of variance that simultaneously detects phenotypic effects, localizes the effects within the cladogram, and identifies haplotypes that are potentially heterogeneous in their phenotypic associations. The power of this approach is illustrated by an analysis of the associations between alcohol dehydrogenase (ADH) activity and restriction site variability in a 13-kb fragment surrounding the ADH locus in Drosophila melanogaster.

10.
Mutations in BRCA1 and BRCA2 account for about 40% of families with an inherited susceptibility to breast and/or ovarian cancer. Mutational analysis of these two genes has become the standard of care for families with a strong suggestion of inherited susceptibility. Methodologies for screening vary, but one of the more popular techniques is dHPLC, due to its combination of high sensitivity and low cost. The presence of a large number of polymorphisms in the two BRCA genes complicates dHPLC analysis, often leading to complex elution profiles. There are concerns that a pattern produced by a sample heterozygous for a polymorphism may be very similar to that produced by a sample heterozygous for a unique mutation within the same amplicon. Further molecular analysis is often required to resolve whether any given shift is due to a polymorphism or a disease-causing mutation. The use of ancestral haplotypes was explored as a means to minimize the need for further analysis. Groups of 86 patients were genotyped for 12 BRCA1 polymorphisms or 20 BRCA2 polymorphisms. For BRCA1, eight distinct haplotypes were identified, which are largely derivatives of two main lineages. For BRCA2, 17 distinct haplotypes were identified, leading to a much more complex polymorphic pattern. With this knowledge, we have defined a system to determine which patients, if any, require further investigations. This method could be used to supplement any comprehensive screening methodology for other large genes that lie within strong regions of linkage disequilibrium such as NF1, CFTR, MLH1, or MSH2.

11.
Summary: Out of a population of 138598 infants born in southern Poland between 1987 and 1989, and screened for phenylketonuria (PKU), 28 cases were ascertained, giving an incidence of 1 in 5000. DNA from 22 of these probands and their parents was isolated and eight polymorphic restriction sites were analyzed within the phenylalanine hydroxylase gene region. Twenty-one different haplotypes (HT) were revealed, five of them representing new categories. The most common haplotypes among those carrying normal alleles were HT1 (27.3%) and HT4 (11.4%). Within the group of haplotypes with mutant alleles the most frequent was HT2 (56.8%), whereas the frequency of this haplotype in other European populations, such as French, Danish and German, ranged from 12% to 24%. HT3, the most common haplotype in the Danish population (38%) and relatively frequent in the other western European populations (13–14%), appeared to be very rare in our sample (2.3%). The mutation of codon 408 (exon 12, C→T, Arg→Trp), which has been described as tightly linked to HT2, was tested on amplified DNA by dot-blot hybridization. This mutation was found in 25 out of 44 proband chromosomes. In one case it was linked to HT5, in the remaining 24 to HT2. Our results confirm the molecular heterogeneity of PKU haplotypes, as well as their significant interpopulation variation.
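
The figures quoted in this abstract can be cross-checked with a little arithmetic. The sketch below uses only numbers stated in the abstract. Treating the percentages as fractions of the 44 proband chromosomes is an assumption made for illustration, not something the abstract states explicitly.

```latex
% Incidence: 28 cases among 138598 screened infants
\[
  \frac{28}{138598} \approx 2.02 \times 10^{-4} \approx \frac{1}{4950}
  \quad \text{(reported, rounded, as 1 in 5000)}
\]
% Chromosomes: 22 probands, two chromosomes each
\[
  22 \times 2 = 44
\]
% Codon 408 mutation: 25 of 44 proband chromosomes, 24 of them on HT2
\[
  \frac{25}{44} \approx 56.8\%, \qquad
  \frac{24}{44} \approx 54.5\%, \qquad
  \frac{1}{44} \approx 2.3\%
\]
```

Under that assumption, the reported HT2 frequency of 56.8% would correspond to 25 of 44 mutant chromosomes, and the 2.3% reported for HT3 to a single chromosome.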

12.
Because of its remoteness and mid-latitude position, New Zealand lacks access to the tropical climates that might have ensured straightforward survival for frost-sensitive species during glacial times. Nevertheless, the New Zealand lowland flora retains a substantial complement of plants sourced in the tropics. While there have been extinction pulses for elements of the frost-sensitive flora under glacial/stadial regimes, the surviving remnants have been able to recolonize large areas of habitat during successive warm climate periods. Refugia for such species in stadial New Zealand are likely to have been localized and ecologically suboptimal. To examine these relationships we have applied chloroplast DNA sequence data to the investigation of phylogeographical pattern for five endemic species of Metrosideros subg. Metrosideros, a wide-ranging group of mostly frost-sensitive woody plants in New Zealand. The results of this research verify the location of two generally mooted stadial refugia for the country and provide support for the existence of a third. A simple pattern of chloroplast haplotype diversity was recorded in extra-refugial areas, compared with a greater complexity in the vicinity of the identified refugia. This pattern was independently repeated in both main islands. The proposed refugia correspond to contemporary localities of high average winter temperatures. The sharing of chloroplast haplotypes between the different species of Metrosideros examined suggests that there has been a history of repeated hybridization and introgression for these plants, possibly initiated by periods of refugial confinement. © 2004 The Linnean Society of London, Biological Journal of the Linnean Society, 2004, 83, 399–412.

13.
We report on the use of rare mutations to tackle biosocial questions such as kinship and differential burial practices from past human populations. To do this, we have inferred nucleotide position 73 of HVS-II in individuals classified as belonging to haplogroup H from 76 human dental samples from the necropolis of Aldaieta (Basque Country, Spain, 6th-7th century) by means of PCR and restriction enzyme tests. The same analysis has been performed for 146 extant individuals from the northern Iberian peninsula. A combination of haplotype H and 73G in HVS-II, rare in extant populations (0.5-3%), has been found at a frequency of 20% in the ancient population of Aldaieta. These data can be explained in terms of the existence of different burial practices associated with a variety of factors, mainly social status and kinship. This hypothesis is also supported by archeological data. These results indicate that caution should be taken when making phylogenetic inferences from extinct populations, because an uncharacterized kinship can significantly bias allele frequencies.

14.
15.
Family-based association studies offer robustness to population stratification and can provide insight into maternally mediated and parent-of-origin effects. Usually, such studies investigate multiple markers covering a gene or chromosomal region of interest. We propose a simple and general method to test the association of a disease trait with multiple, possibly linked SNP markers and, subsequently, to nominate a set of “risk-haplotype-tagging alleles.” Our test, the max_Z² test, uses only the genotypes of affected individuals and their parents without requiring the user to either know or assign haplotypes and their phases. It also accommodates sporadically missing SNP data. In the spirit of the pedigree disequilibrium test, our procedure requires only a vector of differences with expected value 0 under the null hypothesis. To enhance power against a range of alternatives when genotype data are complete, we also consider a method for combining multiple tests; here, we combine max_Z² and Hotelling’s T². To facilitate discovery of risk-related haplotypes, we develop a simple procedure for nominating risk-haplotype-tagging alleles. Our procedures can also be used to study maternally mediated genetic effects and to explore imprinting. We compare the statistical power of several competing testing procedures through simulation studies of case-parents triads, whose diplotypes are simulated on the basis of draws from the HapMap-based known haplotypes of four genes. In our simulations, the max_Z² test and the max_TDT (transmission/disequilibrium test) proposed by McIntyre et al. perform almost identically, but max_Z², unlike max_TDT, extends directly to the investigation of maternal effects. As an illustration, we reanalyze data from a previously reported orofacial cleft study, to now investigate both fetal and maternal effects of the IRF6 gene.

16.
Summary: The segregation of factor B (BF) F subtypes was analyzed in conjunction with other MHC markers in 15 families with 89 offspring. Informative data for BF F subtypes were obtained from 11 families, 6 of them with known recombinant individuals for the HLA-B/DR/GLO region. The subtypes did not contribute further to the localization of the cross-overs, but followed the known segregation of conventional BF allotypes. In 2 families of one kinship, the recognition of heterozygous BF*FAFB individuals could be established following the inclusion of three generations. The rarer of the two BF F subtype alleles, BF*FA, is positively associated with the HLA haplotypes BW62, CW3, C4A*3 and A29, CWX, B44, C4A*3, B*1, DR7. BF F subtypes are regarded as a very useful additional tool for studies of MHC organization and disease association.

17.
18.
Experimental Mycology, 1990, 14(3): 197-206
We have investigated genetic relationships among isolates of the genus Basidiobolus by examining restriction site polymorphisms in the ribosomal RNA genes. Approximately 10 kb of rDNA was cloned and employed as a probe of genomic DNA from 12 isolates, of human and saprobic origin, digested with nine restriction enzymes. While we found B. microsporus Benjamin to be distinct, 9 human isolates designated B. haptosporus Drechsler, B. meristosporus Drechsler, and B. ranarum Eidam were indistinguishable from one another. In contrast, two unidentified saprobic isolates could be readily differentiated from the human ones. We conclude that all human isolates examined fall within a single monophyletic group, but that significant genetic differentiation has taken place between human and saprobic isolates that is not accounted for in existing taxonomies of the genus.

19.
Randomly amplified polymorphic DNA markers (RAPDs) were applied in a cephalopod population study. Samples of the squid Moroteuthis ingens taken from around the Falkland Islands and Macquarie Island were used to test a null hypothesis that M. ingens forms a single, panmictic population in the Southern Ocean. Six of the 8 arbitrary RAPD primers screened produced a total of 30 reproducible polymorphic bands. Analysis of RAPD allele frequencies demonstrated high levels of variation between individuals but little variation between two sample sites. Although the differentiation between the two sites was low, subtle population structure was detected and the null hypothesis was rejected. The implications of low genetic differentiation between the two sites are briefly discussed in terms of possible egg and paralarval drift facilitated via the circumpolar current.

20.
Large-scale association studies are being undertaken with the hope of uncovering the genetic determinants of complex disease. We describe a computationally efficient method for inferring genealogies from population genotype data and show how these genealogies can be used to fine map disease loci and interpret association signals. These genealogies take the form of the ancestral recombination graph (ARG). The ARG defines a genealogical tree for each locus, and, as one moves along the chromosome, the topologies of consecutive trees shift according to the impact of historical recombination events. There are two stages to our analysis. First, we infer plausible ARGs, using a heuristic algorithm, which can handle unphased and missing data and is fast enough to be applied to large-scale studies. Second, we test the genealogical tree at each locus for a clustering of the disease cases beneath a branch, suggesting that a causative mutation occurred on that branch. Since the true ARG is unknown, we average this analysis over an ensemble of inferred ARGs. We have characterized the performance of our method across a wide range of simulated disease models. Compared with simpler tests, our method gives increased accuracy in positioning untyped causative loci and can also be used to estimate the frequencies of untyped causative alleles. We have applied our method to Ueda et al.'s association study of CTLA4 and Graves disease, showing how it can be used to dissect the association signal, giving potentially interesting results of allelic heterogeneity and interaction. Similar approaches analyzing an ensemble of ARGs inferred using our method may be applicable to many other problems of inference from population genotype data.

