首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Markers with large differences in allele frequencies between ethnicities provide ancestry information that can be applied to genetic studies. We identified over 100 biallelic ancestry informative markers (AIMs) with large allele frequency differences between European Americans (EA) and Pima Amerindians from laboratory and database screens. For 35 of these markers, Mayan, Yavapai and Quechuan Amerindians were genotyped and compared with EA and Pima allele frequencies. Markers with large allele frequency differences between EA and one Amerindian tribe showed only small differences between the Amerindian tribes. Examination of structure in individuals demonstrated a clear separation of subjects of European from those of Amerindian ancestry, and similarity between individuals from disparate Amerindian populations. The AIMs demonstrated the variation in ancestral composition of individual Mexican Americans, providing evidence of applicability in admixture mapping and in controlling for structure in association tests. In addition, a high percentage of single-nucleotide polymorphisms (SNPs) selected on the basis of large frequency differences between EA and Asian populations had large allele frequency differences between EA and Amerindians, suggesting an efficient method for greatly expanding AIMs for use in admixture mapping/structure analysis in Mexican Americans. Together, these data provide additional support for the practical application of admixture mapping in the Mexican American population.Electronic Supplementary Material Supplementary material is available in the online version of this article at  相似文献   

2.
Admixture mapping (also known as "mapping by admixture linkage disequilibrium," or MALD) provides a way of localizing genes that cause disease, in admixed ethnic groups such as African Americans, with approximately 100 times fewer markers than are required for whole-genome haplotype scans. However, it has not been possible to perform powerful scans with admixture mapping because the method requires a dense map of validated markers known to have large frequency differences between Europeans and Africans. To create such a map, we screened through databases containing approximately 450000 single-nucleotide polymorphisms (SNPs) for which frequencies had been estimated in African and European population samples. We experimentally confirmed the frequencies of the most promising SNPs in a multiethnic panel of unrelated samples and identified 3011 as a MALD map (1.2 cM average spacing). We estimate that this map is approximately 70% informative in differentiating African versus European origins of chromosomal segments. This map provides a practical and powerful tool, which is freely available without restriction, for screening for disease genes in African American patient cohorts. The map is especially appropriate for those diseases that differ in incidence between the parental African and European populations.  相似文献   

3.
Admixture mapping (AM) is a promising method for the identification of genetic risk factors for complex traits and diseases showing prevalence differences among populations. Efficient application of this method requires the use of a genomewide panel of ancestry-informative markers (AIMs) to infer the population of origin of chromosomal regions in admixed individuals. Genomewide AM panels with markers showing high frequency differences between West African and European populations are already available for disease-gene discovery in African Americans. However, no such a map is yet available for Hispanic/Latino populations, which are the result of two-way admixture between Native American and European populations or of three-way admixture of Native American, European, and West African populations. Here, we report a genomewide AM panel with 2,120 AIMs showing high frequency differences between Native American and European populations. The average intermarker genetic distance is ~1.7 cM. The panel was identified by genotyping, with the Affymetrix GeneChip Human Mapping 500K array, a population sample with European ancestry, a Mesoamerican sample comprising Maya and Nahua from Mexico, and a South American sample comprising Aymara/Quechua from Bolivia and Quechua from Peru. The main criteria for marker selection were both high information content for Native American/European ancestry (measured as the standardized variance of the allele frequencies, also known as "f value") and small frequency differences between the Mesoamerican and South American samples. This genomewide AM panel will make it possible to apply AM approaches in many admixed populations throughout the Americas.  相似文献   

4.
Admixture occurs when individuals from parental populations that have been isolated for hundreds of generations form a new hybrid population. Currently, interest in measuring biogeographic ancestry has spread from anthropology to forensic sciences, direct-to-consumers personal genomics, and civil rights issues of minorities, and it is critical for genetic epidemiology studies of admixed populations. Markers with highly differentiated frequencies among human populations are informative of ancestry and are called ancestry informative markers (AIMs). For tri-hybrid Latin American populations, ancestry information is required for Africans, Europeans and Native Americans. We developed two multiplex panels of AIMs (for 14 SNPs) to be genotyped by two mini-sequencing reactions, suitable for investigators of medium-small laboratories to estimate admixture of Latin American populations. We tested the performance of these AIMs by comparing results obtained with our 14 AIMs with those obtained using 108 AIMs genotyped in the same individuals, for which DNA samples is available for other investigators. We emphasize that this type of comparison should be made when new admixture/population structure panels are developed. At the population level, our 14 AIMs were useful to estimate European admixture, though they overestimated African admixture and underestimated Native American admixture. Combined with more AIMs, our panel could be used to infer individual admixture. We used our panel to infer the pattern of admixture in two urban populations (Montes Claros and Manhua?u) of the State of Minas Gerais (southeastern Brazil), obtaining a snapshot of their genetic structure in the context of their demographic history.  相似文献   

5.
We studied 156 individuals of Native American descent from the city of Tlapa in the state of Guerrero in western Mexico. Most individuals' ethnicity was either Nahua, Mixtec, or Tlapanec, but self-identified Mestizos and individuals of mixed ethnicities were also included in the sample. We typed 24 autosomal, one Y-chromosome, and four mitochondrial ancestry-informative markers (AIMs) to estimate group and individual admixture proportions, and determine whether the admixture process involved directional gene flow between parental groups. When genetically defined (GD) Mestizos were excluded from the analysis, Native American ancestry represented approximately 98% of the population's gene pool, while European and West African ancestry represented approximately 1% each. Maternally inherited markers also showed an exceptionally high Native American contribution (98.5%), as did the paternally inherited marker, DYS199 (90.7%). We did not detect genetic structure in this population using these AIMs, which appears consistent with the homogeneity of the sample in terms of admixture proportions. The addition of GD Mestizos to the sample did not produce a considerable change in admixture estimates, but it had a major effect on population structure. These results show that the population of Tlapa in Guerrero, Mexico, has experienced little admixture with Europeans and/or West Africans. They also show that the impact of a small number of admixed individuals on an otherwise homogeneous population might have profound implications on subsequent ancestry/phenotype analysis and mapping strategies. We suggest that heterogeneity is a major characteristic of Mexican populations and, as a consequence, should not be disregarded when designing epidemiological studies of Mexican and Mexican American populations.  相似文献   

6.
Population linkage disequilibrium occurs as a consequence of mutation, selection, genetic drift, and population substructure produced by admixture of genetically distinct ethnic populations. African American and Hispanic ethnic groups have a history of significant gene flow among parent groups, which can be of value in affecting genome scans for disease-gene discovery in the case-control and transmission/disequilibrium test designs. Disease-gene discovery using mapping by admixture linkage disequilibrium (MALD) requires a map of polymorphic markers that differentiate between the founding populations, along with differences in disease-gene allele frequencies. We describe markers appropriate for MALD mapping by assessing allele frequencies of 744 short tandem repeats (STRs) in African Americans, Hispanics, European Americans, and Asians, by choosing STR markers that have large differences in composite delta, log-likelihood ratios, and/or I*(2) for MALD. Additional markers can be added to this MALD map by utilization of the rapidly growing single-nucleotide-polymorphism databases and the literature, to achieve a 3-10-cM scanning scale. The map will be useful for studies of diseases, including prostate and breast cancer, diabetes, hypertension, and end-stage renal disease, that have large differences in incidence between the founding populations of either Hispanics or African Americans.  相似文献   

7.
Ancestry-informative markers (AIMs) show high allele frequency divergence between different ancestral or geographically distant populations. These genetic markers are especially useful in inferring the likely ancestral origin of an individual or estimating the apportionment of ancestry components in admixed individuals or populations. The study of AIMs is of great interest in clinical genetics research, particularly to detect and correct for population substructure effects in case-control association studies, but also in population and forensic genetics studies. This work presents a set of 46 ancestry-informative insertion deletion polymorphisms selected to efficiently measure population admixture proportions of four different origins (African, European, East Asian and Native American). All markers are analyzed in short fragments (under 230 basepairs) through a single PCR followed by capillary electrophoresis (CE) allowing a very simple one tube PCR-to-CE approach. HGDP-CEPH diversity panel samples from the four groups, together with Oceanians, were genotyped to evaluate the efficiency of the assay in clustering populations from different continental origins and to establish reference databases. In addition, other populations from diverse geographic origins were tested using the HGDP-CEPH samples as reference data. The results revealed that the AIM-INDEL set developed is highly efficient at inferring the ancestry of individuals and provides good estimates of ancestry proportions at the population level. In conclusion, we have optimized the multiplexed genotyping of 46 AIM-INDELs in a simple and informative assay, enabling a more straightforward alternative to the commonly available AIM-SNP typing methods dependent on complex, multi-step protocols or implementation of large-scale genotyping technologies.  相似文献   

8.
In admixed populations, genetic contributions from males and females of specific parental populations can be of different proportions due to past directional mating during the process of genetic admixture. In this research paper, we provide evidence of such male- and female-specific differential admixture components of African, European, and American Indian origin in an admixed population from the city of Melo, in the northeastern region of Uruguay. From data on 11 autosomal markers from a sample of 41 individuals of mixed African descent, we estimated 47% African, 38% European, and 15% Amerindian contributions. In contrast, 6 mtDNA site-specific polymorphic markers showed that the mtDNA genome of these individuals was 52% African, 19% European, and 29% Amerindian, while from 3 Y-specific polymorphic sites, we estimated 30% African, 64% European, and 6% Amerindian contributions. We argue that this heterogeneity of admixture estimates results from disproportionate unions of European males with African and American Indian females from which this mixed African population was formed. Also, we argue that the asymmetry of the admixture estimates from the three sets of markers (autosomal, mtDNA, and Y-linked) is a result of the changes in the direction of mating during the history of the population. Implications of such evidence of directional mating are discussed, indicating the need of further demographic data for a quantitative assessment of the impact of directional mating on genetic structure of admixed populations.  相似文献   

9.
Most individuals throughout the Americas are admixed descendants of Native American, European, and African ancestors. Complex historical factors have resulted in varying proportions of ancestral contributions between individuals within and among ethnic groups. We developed a panel of 446 ancestry informative markers (AIMs) optimized to estimate ancestral proportions in individuals and populations throughout Latin America. We used genome-wide data from 953 individuals from diverse African, European, and Native American populations to select AIMs optimized for each of the three main continental populations that form the basis of modern Latin American populations. We selected markers on the basis of locus-specific branch length to be informative, well distributed throughout the genome, capable of being genotyped on widely available commercial platforms, and applicable throughout the Americas by minimizing within-continent heterogeneity. We then validated the panel in samples from four admixed populations by comparing ancestry estimates based on the AIMs panel to estimates based on genome-wide association study (GWAS) data. The panel provided balanced discriminatory power among the three ancestral populations and accurate estimates of individual ancestry proportions (R2 > 0.9 for ancestral components with significant between-subject variance). Finally, we genotyped samples from 18 populations from Latin America using the AIMs panel and estimated variability in ancestry within and between these populations. This panel and its reference genotype information will be useful resources to explore population history of admixture in Latin America and to correct for the potential effects of population stratification in admixed samples in the region.  相似文献   

10.
Substantial increases of linkage disequilibrium (LD) both in magnitude and in range have been observed in recently admixed populations such as African-American (AfA). On the other hand, it has also been shown that LD in AfAs was very similar to that of African. In this study, we attempted to resolve these contradicting observations by conducting a systematic examination of the LD structure in AfAs by genotyping a sample of AfA individuals at 24,341 single nucleotide polymorphisms (SNPs) spanning almost the entire chromosome 21, with an average density of 1.5 kb/SNP. The overall LD in AfAs is similar to that in African populations and much less than that in European populations. Even when the ancestry-informative markers (AIMs) were used, extended LD in AfA was found to be limited to certain magnitude range (0.2 < or = r(2) < or = 0.8) and certain distance range, that is, between-marker distance more than 200 kb. Furthermore, the inclusion of AfA individuals with predominant African ancestry was found to reduce the overall magnitude of LD. Elevation of LD in the AfA population, compared with its parental populations, can only be observed at the markers with large allele frequency differences between 2 parental populations at limited scenario. AfA individuals of wholly African ancestry contribute little to the extended LD in the AfA population, and further genotyping or association analysis conducted using only admixed individuals may lead to higher statistical power and possibly reduced cost.  相似文献   

11.
Mapping by admixture linkage disequilibrium (MALD) is a theoretically powerful, although unproven, approach to mapping genetic variants that are involved in human disease. MALD takes advantage of long-range haplotypes that are generated by gene flow among recently admixed ethnic groups, such as African-Americans and Latinos. Under ideal circumstances, MALD will have more power to detect some genetic variants than other types of genome-wide association study that are carried out among more ethnically homogeneous populations. It will also require 200-500 times fewer markers, providing a significant economic advantage. The MALD approach is now being applied, with results expected in the near future.  相似文献   

12.
We consider the properties of the F(st) measure of genetic divergence between an admixed population and its parental source populations. Among all possible populations admixed among an arbitrary set of parental populations, we show that the value of F(st) between an admixed population and a specific source population is maximized when the admixed population is simply the most distant of the other source populations. For the case with only two parental populations, as a function of the admixture fraction, we further demonstrate that this F(st) value is monotonic and convex, so that F(st) is informative about the admixture fraction. We illustrate our results using example human population-genetic data, showing how they provide a framework in which to interpret the features of F(st) in admixed populations.  相似文献   

13.
As we move forward from the current generation of genome-wide association (GWA) studies, additional cohorts of different ancestries will be studied to increase power, fine map association signals, and generalize association results to additional populations. Knowledge of genetic ancestry as well as population substructure will become increasingly important for GWA studies in populations of unknown ancestry. Here we propose genotyping pooled DNA samples using genome-wide SNP arrays as a viable option to efficiently and inexpensively estimate admixture proportion and identify ancestry informative markers (AIMs) in populations of unknown origin. We constructed DNA pools from African American, Native Hawaiian, Latina, and Jamaican samples and genotyped them using the Affymetrix 6.0 array. Aided by individual genotype data from the African American cohort, we established quality control filters to remove poorly performing SNPs and estimated allele frequencies for the remaining SNPs in each panel. We then applied a regression-based method to estimate the proportion of admixture in each cohort using the allele frequencies estimated from pooling and populations from the International HapMap Consortium as reference panels, and identified AIMs unique to each population. In this study, we demonstrated that genotyping pooled DNA samples yields estimates of admixture proportion that are both consistent with our knowledge of population history and similar to those obtained by genotyping known AIMs. Furthermore, through validation by individual genotyping, we demonstrated that pooling is quite effective for identifying SNPs with large allele frequency differences (i.e., AIMs) and that these AIMs are able to differentiate two closely related populations (HapMap JPT and CHB).  相似文献   

14.
Principal components analysis of population admixture   总被引:1,自引:0,他引:1  
J Ma  CI Amos 《PloS one》2012,7(7):e40115
With the availability of high-density genotype information, principal components analysis (PCA) is now routinely used to detect and quantify the genetic structure of populations in both population genetics and genetic epidemiology. An important issue is how to make appropriate and correct inferences about population relationships from the results of PCA, especially when admixed individuals are included in the analysis. We extend our recently developed theoretical formulation of PCA to allow for admixed populations. Because the sampled individuals are treated as features, our generalized formulation of PCA directly relates the pattern of the scatter plot of the top eigenvectors to the admixture proportions and parameters reflecting the population relationships, and thus can provide valuable guidance on how to properly interpret the results of PCA in practice. Using our formulation, we theoretically justify the diagnostic of two-way admixture. More importantly, our theoretical investigations based on the proposed formulation yield a diagnostic of multi-way admixture. For instance, we found that admixed individuals with three parental populations are distributed inside the triangle formed by their parental populations and divide the triangle into three smaller triangles whose areas have the same proportions in the big triangle as the corresponding admixture proportions. We tested and illustrated these findings using simulated data and data from HapMap III and the Human Genome Diversity Project.  相似文献   

15.
The ethnic and geographic distributions of several common chronic diseases show distinct patterns that are consistent with the distribution of genes and genetic admixture. For example, diabetes and gallbladder disease occur most frequently among Amerindians, while those genetically admixed with them (such as Mexican-Americans) have intermediate rates, and lowest rates are found among Whites and Blacks. Because there will be heterogeneity from individual to individual in ancestral affinity within an admixed population, a method is developed for estimating each person's admixture probability. Results confirm that there is substantial heterogeneity of individual admixture among Mexican-Americans in Starr County, Texas, with a mean value indicating that 65% of genes in this population are Caucasian derived and 35% Amerindian derived. The individual estimates are shown to be unrelated to the probability of being diabetic and only marginally related to gallbladder disease, with those having the most Amerindian affinity being at increased risk. These results are a consequence of the independent assortment of loci and indicate that unless the markers employed are related (including linkage) to the disease of interest, the method will have limited utility. Individual admixture estimates will be useful, however, for examining aspects of population structure and will find increased utility for predicting disease and examining disease associations as more and more of the genome is represented by markers, a very probable prospect with the abundance of DNA polymorphism being identified by restriction enzymes.  相似文献   

16.
Admixture mapping is a potentially powerful tool for mapping complex genetic diseases. For application of this method, admixed individuals must have genomes composed of large segments derived intact from each founding population. Such segments are thought to be present in African Americans (AA) and should be demonstrable by examination of linkage disequilibrium (LD). Previous studies using a variety of polymorphic markers have variably reported long-range LD or rapid decay of LD. To further define the extent and characteristics of LD caused by admixture in the AA population, the current study utilized a set of 52 diallelic markers that were selected for large standard variances between putative representatives of the founder populations. LD was examined in over 250 marker-pairs, including linked markers from four different chromosomal regions and an equal number of matched unlinked comparisons. In the representative founder populations, strong LD was not observed for markers separated by more than 10 kb. In contrast, results indicated significant LD ( P<0.001, D'>0.3) in AA over large genomic segments exceeding 10 centiMorgans (cM) and 15 megabases (Mb). Only marginally significant LD was present between unlinked markers in this population, suggesting that choosing appropriate levels of significance for admixture mapping can minimize false positive results. The ability to detect LD for extended chromosomal segments in AA decayed not only as a function of the distance between markers, but also as a function of the standard variance of the markers. This examination of several genomic segments provides strong evidence that appropriate selection of informative markers is a crucial prerequisite for the application of admixture mapping to the AA population.  相似文献   

17.
Admixture between wild and captive populations is an increasing concern in conservation biology. Understanding the extent of admixture and the processes involved requires identification of admixed and non-admixed individuals. This can be achieved by statistical methods employing Bayesian clustering, but resolution is low if genetic differentiation is weak. Here, we analyse stocked brown trout populations represented by historical (1943–1956) and contemporary (2000s) samples, where genetic differentiation between wild populations and stocked trout is weak (pairwise FST of 0.047 and 0.053). By analysing a high number of microsatellite DNA markers (50) and making use of linkage map information, we achieve clear identification of admixed and non-admixed trout. Moreover, despite strong population-level admixture by hatchery strain trout in one of the populations (70.8%), non-admixed individuals nevertheless persist (7 out of 53 individuals). These remnants of the indigenous population are characterized by later spawning time than the majority of the admixed individuals. We hypothesize that isolation by time mediated by spawning time differences between wild and hatchery strain trout is a major factor rescuing a part of the indigenous population from introgression.  相似文献   

18.
In this study we analyzed a sample of the urban population of La Plata, Argentina, using 17 mtDNA haplogroups, the DYS 199 Y-chromosome polymorphism, and 5 autosomal population-associated alleles (PAAs). The contribution of native American maternal lineages to the population of La Plata was estimated as 45.6%, whereas the paternal contribution was much lower (10.6%), clearly indicating directional mating. Regarding autosomal evidence of admixture, the relative European, native American, and West African genetic contributions to the gene pool of La Plata were estimated to be 67.55% (+/-2.7), 25.9% (+/-4.3), and 6.5% (+/-6.4), respectively. When admixture was calculated at the individual level, we found a low correlation between the ancestral contribution estimated with uniparental lineages and autosomal markers. Most of the individuals from La Plata with a native American mtDNA haplogroup or the DYS199*T native American allele show a genetic contribution at the autosomal level that can be traced primarily to Europe. The results of this study emphasize the need to use both uniparentally and biparentally inherited genetic markers to understand the history of admixed populations.  相似文献   

19.
Self-reported race/ethnicity is frequently used in epidemiological studies to assess an individual’s background origin. However, in admixed populations such as Hispanic, self-reported race/ethnicity may not accurately represent them genetically because they are admixed with European, African and Native American ancestry. We estimated the proportions of genetic admixture in an ethnically diverse population of 396 mothers and 188 of their children with 35 ancestry informative markers (AIMs) using the STRUCTURE version 2.2 program. The majority of the markers showed significant deviation from Hardy-Weinberg equilibrium in our study population. In mothers self-identified as Black and White, the imputed ancestry proportions were 77.6% African and 75.1% European respectively, while the racial composition among self-identified Hispanics was 29.2% European, 26.0% African, and 44.8% Native American. We also investigated the utility of AIMs by showing the improved fitness of models in paraoxanase-1 genotype-phenotype associations after incorporating AIMs; however, the improvement was moderate at best. In summary, a minimal set of 35 AIMs is sufficient to detect population stratification and estimate the proportion of individual genetic admixture; however, the utility of these markers remains questionable.  相似文献   

20.
Maximum-likelihood estimation of admixture proportions from genetic data   总被引:9,自引:0,他引:9  
Wang J 《Genetics》2003,164(2):747-765
For an admixed population, an important question is how much genetic contribution comes from each parental population. Several methods have been developed to estimate such admixture proportions, using data on genetic markers sampled from parental and admixed populations. In this study, I propose a likelihood method to estimate jointly the admixture proportions, the genetic drift that occurred to the admixed population and each parental population during the period between the hybridization and sampling events, and the genetic drift in each ancestral population within the interval between their split and hybridization. The results from extensive simulations using various combinations of relevant parameter values show that in general much more accurate and precise estimates of admixture proportions are obtained from the likelihood method than from previous methods. The likelihood method also yields reasonable estimates of genetic drift that occurred to each population, which translate into relative effective sizes (N(e)) or absolute average N(e)'s if the times when the relevant events (such as population split, admixture, and sampling) occurred are known. The proposed likelihood method also has features such as relatively low computational requirement compared with previous ones, flexibility for admixture models, and marker types. In particular, it allows for missing data from a contributing parental population. The method is applied to a human data set and a wolflike canids data set, and the results obtained are discussed in comparison with those from other estimators and from previous studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号