首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A genomewide admixture map for Latino populations   总被引:5,自引:1,他引:4       下载免费PDF全文
Admixture mapping is an economical and powerful approach for localizing disease genes in populations of recently mixed ancestry and has proven successful in African Americans. The method holds equal promise for Latinos, who typically inherit a mix of European, Native American, and African ancestry. However, admixture mapping in Latinos has not been practical because of the lack of a map of ancestry-informative markers validated in Native American and other populations. To address this, we screened multiple databases, containing millions of markers, to identify 4,186 markers that were putatively informative for determining the ancestry of chromosomal segments in Latino populations. We experimentally validated each of these markers in at least 232 new Latino, European, Native American, and African samples, and we selected a subset of 1,649 markers to form an admixture map. An advantage of our strategy is that we focused our map on markers distinguishing Native American from other ancestries and restricted it to markers with very similar frequencies in Europeans and Africans, which decreased the number of markers needed and minimized the possibility of false disease associations. We evaluated the effectiveness of our map for localizing disease genes in four Latino populations from both North and South America.  相似文献   

2.
For admixture mapping studies in Mexican Americans (MAM), we define a genomewide single-nucleotide-polymorphism (SNP) panel that can distinguish between chromosomal segments of Amerindian (AMI) or European (EUR) ancestry. These studies used genotypes for >400,000 SNPs, defined in EUR and both Pima and Mayan AMI, to define a set of ancestry-informative markers (AIMs). The use of two AMI populations was necessary to remove a subset of SNPs that distinguished genotypes of only one AMI subgroup from EUR genotypes. The AIMs set contained 8,144 SNPs separated by a minimum of 50 kb with only three intermarker intervals >1 Mb and had EUR/AMI FST values >0.30 (mean FST = 0.48) and Mayan/Pima FST values <0.05 (mean FST < 0.01). Analysis of a subset of these SNP AIMs suggested that this panel may also distinguish ancestry between EUR and other disparate AMI groups, including Quechuan from South America. We show, using realistic simulation parameters that are based on our analyses of MAM genotyping results, that this panel of SNP AIMs provides good power for detecting disease-associated chromosomal segments for genes with modest ethnicity risk ratios. A reduced set of 5,287 SNP AIMs captured almost the same admixture mapping information, but smaller SNP sets showed substantial drop-off in admixture mapping information and power. The results will enable studies of type 2 diabetes, rheumatoid arthritis, and other diseases among which epidemiological studies suggest differences in the distribution of ancestry-associated susceptibility.  相似文献   

3.
Admixture mapping requires a genomewide panel of relatively evenly spaced markers that can distinguish the ancestral origins of chromosomal segments in admixed individuals. Through use of the results of the International HapMap Project and specific selection criteria, the current study has examined the ability of selected single-nucleotide polymorphisms (SNPs) to extract continental ancestry information in African American subjects and to explore parameters for admixture mapping. Genotyping of two linguistically diverse West African populations (Bini and Kanuri Nigerians, who are Niger-Congo [Bantu] and Nilo-Saharan speakers, respectively), European Americans, and African Americans validated a genomewide set of >4,000 SNP ancestry-informative markers with mean and median F(ST) values >0.59 and mean and median Fisher's information content >2.5. This set of SNPs extracted a larger amount of ancestry information in African Americans than previously reported SNP panels and provides nearly uniform coverage of the genome. Moreover, in the current study, simulations show that this more informative panel improves power for admixture mapping in African Americans when ethnicity risk ratios are modest. This is particularly important in the application of admixture mapping in complex genetic diseases for which only modest ethnicity risk ratios of relevant susceptibility genes are expected.  相似文献   

4.
Population linkage disequilibrium occurs as a consequence of mutation, selection, genetic drift, and population substructure produced by admixture of genetically distinct ethnic populations. African American and Hispanic ethnic groups have a history of significant gene flow among parent groups, which can be of value in affecting genome scans for disease-gene discovery in the case-control and transmission/disequilibrium test designs. Disease-gene discovery using mapping by admixture linkage disequilibrium (MALD) requires a map of polymorphic markers that differentiate between the founding populations, along with differences in disease-gene allele frequencies. We describe markers appropriate for MALD mapping by assessing allele frequencies of 744 short tandem repeats (STRs) in African Americans, Hispanics, European Americans, and Asians, by choosing STR markers that have large differences in composite delta, log-likelihood ratios, and/or I*(2) for MALD. Additional markers can be added to this MALD map by utilization of the rapidly growing single-nucleotide-polymorphism databases and the literature, to achieve a 3-10-cM scanning scale. The map will be useful for studies of diseases, including prostate and breast cancer, diabetes, hypertension, and end-stage renal disease, that have large differences in incidence between the founding populations of either Hispanics or African Americans.  相似文献   

5.
The selection of productive varieties of modern Criollo cocoa, showing fine aromatic qualities in their beans, is of major interest for some producing countries, such as Venezuela. Cultivated populations of Modern Criollo or Trinitario varieties may be suitable for admixture mapping analysis, as large blocks of alleles derived from two identified divergent ancestors, recently admixed, are still preserved, after a few generations of recombination, similar to experimental mapping progenies. Two hundred and fifty-seven individuals from a cultivated population of Modern Criollo were selected and analysed with 92 microsatellite markers distributed along the genome. This population exhibited a wide range of variability for yield factors and morphological features. Population structure analysis identified two main subgroups corresponding to the admixture from the two ancestors Criollo and Forastero. Several significant associations between markers and phenotypic data (yield factors and morphological traits) were identified by a least squares general linear model (GLM) taking into account the population structure and the percentage of admixture of each individual. Results were compared with classical QTL analyses previously reported for other cacao populations. Most markers associated to quantitative traits were very close to QTLs detected formerly for the same traits. Associations were also identified between markers and several qualitative traits including the red pigmentation observed in different organs, mainly associated to common markers in linkage group 4.  相似文献   

6.
Verdu P  Rosenberg NA 《Genetics》2011,189(4):1413-1426
Admixed populations have been used for inferring migrations, detecting natural selection, and finding disease genes. These applications often use a simple statistical model of admixture rather than a modeling perspective that incorporates a more realistic history of the admixture process. Here, we develop a general model of admixture that mechanistically accounts for complex historical admixture processes. We consider two source populations contributing to the ancestry of a hybrid population, potentially with variable contributions across generations. For a random individual in the hybrid population at a given point in time, we study the fraction of genetic admixture originating from a specific one of the source populations by computing its moments as functions of time and of introgression parameters. We show that very different admixture processes can produce identical mean admixture proportions, but that such processes produce different values for the variance of the admixture proportion. When introgression parameters from each source population are constant over time, the long-term limit of the expectation of the admixture proportion depends only on the ratio of the introgression parameters. The variance of admixture decreases quickly over time after the source populations stop contributing to the hybrid population, but remains substantial when the contributions are ongoing. Our approach will facilitate the understanding of admixture mechanisms, illustrating how the moments of the distribution of admixture proportions can be informative about the historical admixture processes contributing to the genetic diversity of hybrid populations.  相似文献   

7.
Prospects for admixture mapping of complex traits   总被引:1,自引:0,他引:1       下载免费PDF全文
Admixture mapping extends to human populations the principles that underlie linkage analysis of an experimental cross. For detecting genes that contribute to ethnic variation in disease risk, admixture mapping has greater statistical power than family-linkage studies. In comparison with association studies, admixture mapping requires far fewer markers to search the genome and is less affected by allelic heterogeneity. Statistical-analysis programs for admixture mapping are now available, and a genomewide panel of markers for admixture mapping in populations formed by West African-European admixture has been assembled. Some of the remaining technical challenges include the ability to ensure that the statistical methods are robust and to develop marker panels for other admixed populations. Where admixed populations and panels of markers informative for ancestry are available, admixture mapping can be applied to localize genes that contribute to ethnic variation in any measurable trait.  相似文献   

8.
Zhu X  Zhang S  Tang H  Cooper R 《Human genetics》2006,120(3):431-445
Several disease-mapping methods have been proposed recently, which use the information generated by recent admixture of populations from historically distinct geographic origins. These methods include both classic likelihood and Bayesian approaches. In this study we directly maximize the likelihood function from the hidden Markov Model for admixture mapping using the EM algorithm, allowing for uncertainty in model parameters, such as the allele frequencies in the parental populations. We determined the robustness of the proposed method by examining the ancestral allele frequency estimate and individual marker-location specific ancestry when the data were generated by different population admixture models and no learning sample was used. The proposed method outperforms a widely used Bayesian MCMC strategy for data generated from various population admixture models. The multipoint information content for ancestry was derived based on the map provided by Smith et al. (2004) and the associated statistical power was calculated. We examined the distribution of admixture LD across the genome for both real and simulated data and established a threshold for genome wide significance applicable to admixture mapping studies. The software ADMIXPROGRAM for performing admixture mapping is available from authors.  相似文献   

9.
The genomes of nonhuman primates have recently become highly visible candidates for full genome analysis, as they provide powerful models of human disease and a better understanding of the evolution of the human genome. We describe the creation of a 5000 rad radiation hybrid (RH) mapping panel for the rhesus macaque. Duplicate genotypes of 84 microsatellite and coding gene sequence tagged sites from six macaque chromosomes produced an estimated whole genome retention frequency of 0.33. To test the mapping ability of the panel, we constructed RH maps for macaque chromosomes 7 and 9 and compared them to orthologous locus orders in existing human and baboon maps derived from different methodologies. Concordant marker order between all three species maps suggests that the current panel represents a powerful mapping resource for generating high-density comparative maps of the rhesus macaque and other species genomes.  相似文献   

10.
11.
Methods for high-density admixture mapping of disease genes   总被引:26,自引:0,他引:26       下载免费PDF全文
Admixture mapping (also known as "mapping by admixture linkage disequilibrium," or MALD) has been proposed as an efficient approach to localizing disease-causing variants that differ in frequency (because of either drift or selection) between two historically separated populations. Near a disease gene, patient populations descended from the recent mixing of two or more ethnic groups should have an increased probability of inheriting the alleles derived from the ethnic group that carries more disease-susceptibility alleles. The central attraction of admixture mapping is that, since gene flow has occurred recently in modern populations (e.g., in African and Hispanic Americans in the past 20 generations), it is expected that admixture-generated linkage disequilibrium should extend for many centimorgans. High-resolution marker sets are now becoming available to test this approach, but progress will require (a). computational methods to infer ancestral origin at each point in the genome and (b). empirical characterization of the general properties of linkage disequilibrium due to admixture. Here we describe statistical methods to estimate the ancestral origin of a locus on the basis of the composite genotypes of linked markers, and we show that this approach accurately estimates states of ancestral origin along the genome. We apply this approach to show that strong admixture linkage disequilibrium extends, on average, for 17 cM in African Americans. Finally, we present power calculations under varying models of disease risk, sample size, and proportions of ancestry. Studying approximately 2500 markers in approximately 2500 patients should provide power to detect many regions contributing to common disease. A particularly important result is that the power of an admixture mapping study to detect a locus will be nearly the same for a wide range of mixture scenarios: the mixture proportion should be 10%-90% from both ancestral populations.  相似文献   

12.
Sha Q  Zhang X  Zhu X  Zhang S 《Human heredity》2006,62(2):55-63
Admixture mapping, using unrelated individuals from the admixture populations that result from recent mating between members of each parental population, is an efficient approach to localize disease-causing variants that differ in frequency between two or more historically separated populations. Recently, several methods have been proposed to test linkage between a susceptibility gene and a disease locus by using admixture-generated linkage disequilibrium (LD) for each of the genotyped markers. In a genome scan, admixture mapping usually tests 2,000 to 3,000 markers across the genome. Currently, either a very conservative Sidak (or Bonferroni) correction or a very time consuming simulation-based method is used to correct for the multiple tests and evaluate the overall p value. In this report, we propose a computationally efficient analytical approach for correction of the multiple tests and for calculating the overall p value for an admixture genome scan. Except for the Sidak (or Bonferroni) correction, our proposed method is the first analytical approach for correction of the multiple tests and for calculating the overall p value for a genome scan. Our simulation studies show that the proposed method gives correct overall type I error rates for genome scans in all cases, and is much more computationally efficient than simulation-based methods.  相似文献   

13.
One enduring question in evolutionary biology is the extent of archaic admixture in the genomes of present-day populations. In this paper, we present a test for ancient admixture that exploits the asymmetry in the frequencies of the two nonconcordant gene trees in a three-population tree. This test was first applied to detect interbreeding between Neandertals and modern humans. We derive the analytic expectation of a test statistic, called the D statistic, which is sensitive to asymmetry under alternative demographic scenarios. We show that the D statistic is insensitive to some demographic assumptions such as ancestral population sizes and requires only the assumption that the ancestral populations were randomly mating. An important aspect of D statistics is that they can be used to detect archaic admixture even when no archaic sample is available. We explore the effect of sequencing error on the false-positive rate of the test for admixture, and we show how to estimate the proportion of archaic ancestry in the genomes of present-day populations. We also investigate a model of subdivision in ancestral populations that can result in D statistics that indicate recent admixture.  相似文献   

14.
Yi N  Banerjee S  Pomp D  Yandell BS 《Genetics》2007,176(3):1855-1864
Development of statistical methods and software for mapping interacting QTL has been the focus of much recent research. We previously developed a Bayesian model selection framework, based on the composite model space approach, for mapping multiple epistatic QTL affecting continuous traits. In this study we extend the composite model space approach to complex ordinal traits in experimental crosses. We jointly model main and epistatic effects of QTL and environmental factors on the basis of the ordinal probit model (also called threshold model) that assumes a latent continuous trait underlies the generation of the ordinal phenotypes through a set of unknown thresholds. A data augmentation approach is developed to jointly generate the latent data and the thresholds. The proposed ordinal probit model, combined with the composite model space framework for continuous traits, offers a convenient way for genomewide interacting QTL analysis of ordinal traits. We illustrate the proposed method by detecting new QTL and epistatic effects for an ordinal trait, dead fetuses, in a F(2) intercross of mice. Utility and flexibility of the method are also demonstrated using a simulated data set. Our method has been implemented in the freely available package R/qtlbim, which greatly facilitates the general usage of the Bayesian methodology for genomewide interacting QTL analysis for continuous, binary, and ordinal traits in experimental crosses.  相似文献   

15.
A mouse-human hybrid cell panel for mapping human chromosome 16   总被引:21,自引:0,他引:21  
A mouse-human hybrid cell panel for human chromosome 16 was constructed from human cell lines with breakpoints on chromosome 16 at p13.11, q13, q22 and q24. Fusions with the human fibroblast line GM3884, t(X;16)(q26;q24) allowed the isolation of clones with either the derivative X or the derivative 16 as the only human chromosome. This was a consequence of both the genes APRT and HPRT being involved in the translocation. The breakpoints of the line GM3884 were confirmed by aphidicolin induction of the common fragile site at 16q23. The results of the fusions with this line suggest a localisation of the APRT gene at 16q24 and confirm the localisation of HPRT to Xq26 to Xq27.3. These hybrid cell lines enable the localisation of genes and DNA fragments to six clearly defined regions. Further localisation within three of these regions is possible by use of the three fragile sites on chromosome 16. In situ hybridisation with the probe pBLUR confirmed that of three lines tested all contained a single human chromosome.  相似文献   

16.
17.
US Hispanic/Latino individuals are diverse in genetic ancestry, culture, and environmental exposures. Here, we characterized and controlled for this diversity in genome-wide association studies (GWASs) for the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). We simultaneously estimated population-structure principal components (PCs) robust to familial relatedness and pairwise kinship coefficients (KCs) robust to population structure, admixture, and Hardy-Weinberg departures. The PCs revealed substantial genetic differentiation within and among six self-identified background groups (Cuban, Dominican, Puerto Rican, Mexican, and Central and South American). To control for variation among groups, we developed a multi-dimensional clustering method to define a “genetic-analysis group” variable that retains many properties of self-identified background while achieving substantially greater genetic homogeneity within groups and including participants with non-specific self-identification. In GWASs of 22 biomedical traits, we used a linear mixed model (LMM) including pairwise empirical KCs to account for familial relatedness, PCs for ancestry, and genetic-analysis groups for additional group-associated effects. Including the genetic-analysis group as a covariate accounted for significant trait variation in 8 of 22 traits, even after we fit 20 PCs. Additionally, genetic-analysis groups had significant heterogeneity of residual variance for 20 of 22 traits, and modeling this heteroscedasticity within the LMM reduced genomic inflation for 19 traits. Furthermore, fitting an LMM that utilized a genetic-analysis group rather than a self-identified background group achieved higher power to detect previously reported associations. We expect that the methods applied here will be useful in other studies with multiple ethnic groups, admixture, and relatedness.  相似文献   

18.
Admixture mapping is a statistical methodology that detects genetic variants in recently admixed populations that are responsible for ethnic differences in disease risk. Three software packages are now available for admixture mapping and we provide a brief overview of the statistical methods and other principal features they implement.  相似文献   

19.
Attention deficit/hyperactivity disorder (ADHD) is a common heritable disorder with a childhood onset. Molecular genetic studies of ADHD have previously focused on examining the roles of specific candidate genes, primarily those involved in dopaminergic pathways. We have performed the first systematic genomewide linkage scan for loci influencing ADHD in 126 affected sib pairs, using a approximately 10-cM grid of microsatellite markers. Allele-sharing linkage methods enabled us to exclude any loci with a lambda(s) of > or =3 from 96% of the genome and those with a lambda(s) of > or =2.5 from 91%, indicating that there is unlikely to be a major gene involved in ADHD susceptibility in our sample. Under a strict diagnostic scheme we could exclude all screened regions of the X chromosome for a locus-specific lambda(s) of >/=2 in brother-brother pairs, demonstrating that the excess of affected males with ADHD is probably not attributable to a major X-linked effect. Qualitative trait maximum LOD score analyses pointed to a number of chromosomal sites that may contain genetic risk factors of moderate effect. None exceeded genomewide significance thresholds, but LOD scores were >1.5 for regions on 5p12, 10q26, 12q23, and 16p13. Quantitative-trait analysis of ADHD symptom counts implicated a region on 12p13 (maximum LOD 2.6) that also yielded a LOD >1 when qualitative methods were used. A survey of regions containing 36 genes that have been proposed as candidates for ADHD indicated that 29 of these genes, including DRD4 and DAT1, could be excluded for a lambda(s) of 2. Only three of the candidates-DRD5, 5HTT, and CALCYON-coincided with sites of positive linkage identified by our screen. Two of the regions highlighted in the present study, 2q24 and 16p13, coincided with the top linkage peaks reported by a recent genome-scan study of autistic sib pairs.  相似文献   

20.
We propose a novel method that uses natural admixture between divergent lineages (hybridization) to investigate the genetic architecture of reproductive isolation and adaptive introgression. Our method employs multinomial regression to estimate genomic clines and to quantify introgression for individual loci relative to the genomic background (clines in genotype frequency along a genomic admixture gradient). Loci with patterns of introgression that deviate significantly from null expectations based on the remainder of the genome are potentially subject to selection and thus of interest to understanding adaptation and the evolution of reproductive isolation. Using simulations, we show that different forms of selection modify these genomic clines in predictable ways and that our method has good power to detect moderate to strong selection for multiple forms of selection. Using individual-based simulations, we demonstrate that our method generally has a low false positive rate, except when genetic drift is particularly pronounced (e.g. low population size, low migration rates from parental populations, and substantial time since initial admixture). Additional individual-based simulations reveal that moderate selection against heterozygotes can be detected as much as 50 c m away from the focal locus directly experiencing selection, but is not detected at unlinked loci. Finally, we apply our analytical method to previously published data sets from a mouse ( Mus musculus and M. domesticus ) and two sunflower ( Helianthus petiolaris and H. annuus ) hybrid zones. This method should be applicable to numerous species that are currently the focus of research in evolution and ecology and should help bring about new insights regarding the processes underlying the origin and maintenance of biological diversity.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号