首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 875 毫秒
1.
Admixture mapping (also known as "mapping by admixture linkage disequilibrium," or MALD) provides a way of localizing genes that cause disease, in admixed ethnic groups such as African Americans, with approximately 100 times fewer markers than are required for whole-genome haplotype scans. However, it has not been possible to perform powerful scans with admixture mapping because the method requires a dense map of validated markers known to have large frequency differences between Europeans and Africans. To create such a map, we screened through databases containing approximately 450000 single-nucleotide polymorphisms (SNPs) for which frequencies had been estimated in African and European population samples. We experimentally confirmed the frequencies of the most promising SNPs in a multiethnic panel of unrelated samples and identified 3011 as a MALD map (1.2 cM average spacing). We estimate that this map is approximately 70% informative in differentiating African versus European origins of chromosomal segments. This map provides a practical and powerful tool, which is freely available without restriction, for screening for disease genes in African American patient cohorts. The map is especially appropriate for those diseases that differ in incidence between the parental African and European populations.  相似文献   

2.
Methods for high-density admixture mapping of disease genes   总被引:26,自引:0,他引:26       下载免费PDF全文
Admixture mapping (also known as "mapping by admixture linkage disequilibrium," or MALD) has been proposed as an efficient approach to localizing disease-causing variants that differ in frequency (because of either drift or selection) between two historically separated populations. Near a disease gene, patient populations descended from the recent mixing of two or more ethnic groups should have an increased probability of inheriting the alleles derived from the ethnic group that carries more disease-susceptibility alleles. The central attraction of admixture mapping is that, since gene flow has occurred recently in modern populations (e.g., in African and Hispanic Americans in the past 20 generations), it is expected that admixture-generated linkage disequilibrium should extend for many centimorgans. High-resolution marker sets are now becoming available to test this approach, but progress will require (a). computational methods to infer ancestral origin at each point in the genome and (b). empirical characterization of the general properties of linkage disequilibrium due to admixture. Here we describe statistical methods to estimate the ancestral origin of a locus on the basis of the composite genotypes of linked markers, and we show that this approach accurately estimates states of ancestral origin along the genome. We apply this approach to show that strong admixture linkage disequilibrium extends, on average, for 17 cM in African Americans. Finally, we present power calculations under varying models of disease risk, sample size, and proportions of ancestry. Studying approximately 2500 markers in approximately 2500 patients should provide power to detect many regions contributing to common disease. A particularly important result is that the power of an admixture mapping study to detect a locus will be nearly the same for a wide range of mixture scenarios: the mixture proportion should be 10%-90% from both ancestral populations.  相似文献   

3.
Polymorphic Admixture Typing in Human Ethnic Populations   总被引:5,自引:4,他引:1       下载免费PDF全文
A panel of 257 RFLP loci was selected on the basis of high heterozygosity in Caucasian DNA surveys and equivalent spacing throughout the human genome. Probes from each locus were used in a Southern blot survey of allele frequency distribution for four human ethnic groups: Caucasian, African American, Asian (Chinese), and American Indian (Cheyenne). Nearly all RFLP loci were polymorphic in each group, albeit with a broad range of differing allele frequencies (δ). The distribution of frequency differences (δ values) was used for three purposes: (1) to provide estimates for genetic distance (differentiation) among these ethnic groups, (2) to revisit with a large data set the proportion of human genetic variation attributable to differentiation within ethnic groups, and (3) to identify loci with high δ values between recently admixed populations of use in mapping by admixture linkage disequilibrium (MALD). Although most markers display significant allele frequency differences between ethnic groups, the overall genetic distances between ethnic groups were small (.066–.098), and <10% of the measured overall molecular genetic diversity in these human samples can be attributed to “racial” differentiation. The median δ values for pairwise comparisons between groups fell between .15 and .20, permitting identification of highly informative RFLP loci for MALD disease association studies.  相似文献   

4.
Mapping by admixture linkage disequilibrium (MALD) is a theoretically powerful, although unproven, approach to mapping genetic variants that are involved in human disease. MALD takes advantage of long-range haplotypes that are generated by gene flow among recently admixed ethnic groups, such as African-Americans and Latinos. Under ideal circumstances, MALD will have more power to detect some genetic variants than other types of genome-wide association study that are carried out among more ethnically homogeneous populations. It will also require 200-500 times fewer markers, providing a significant economic advantage. The MALD approach is now being applied, with results expected in the near future.  相似文献   

5.
Gene flow between genetically distinct populations creates linkage disequilibrium (admixture linkage disequilibrium [ALD]) among all loci (linked and unlinked) that have different allele frequencies in the founding populations. We have explored the distribution of ALD by using computer simulation of two extreme models of admixture: the hybrid-isolation (HI) model, in which admixture occurs in a single generation, and the continuous-gene-flow (CGF) model, in which admixture occurs at a steady rate in every generation. Linkage disequilibrium patterns in African American population samples from Jackson, MS, and from coastal South Carolina resemble patterns observed in the simulated CGF populations, in two respects. First, significant association between two loci (FY and AT3) separated by 22 cM was detected in both samples. The retention of ALD over relatively large (>10 cM) chromosomal segments is characteristic of a CGF pattern of admixture but not of an HI pattern. Second, significant associations were also detected between many pairs of unlinked loci, as observed in the CGF simulation results but not in the simulated HI populations. Such a high rate of association between unlinked markers in these populations could result in false-positive linkage signals in an admixture-mapping study. However, we demonstrate that by conditioning on parental admixture, we can distinguish between true linkage and association resulting from shared ancestry. Therefore, populations with a CGF history of admixture not only are appropriate for admixture mapping but also have greater power for detection of linkage disequilibrium over large chromosomal regions than do populations that have experienced a pattern of admixture more similar to the HI model, if methods are employed that detect and adjust for disequilibrium caused by continuous admixture.  相似文献   

6.
Mapping by admixture linkage disequilibrium (MALD) is a potentially powerful technique for the mapping of complex genetic diseases. The practical requirements of this method include (a) a set of markers spanning the genome that have large allele-frequency differences between the parental ethnicities contributing to the admixed population and (b) an understanding of the extent of admixture in the study population. To this end, a DNA-pooling technique was used to screen microsatellite and diallelic insertion/deletion markers for allele-frequency differences between putative representatives of the parental populations of the admixed Mexican American (MA) and African American (AA) populations. Markers with promising pooled differences were then confirmed by individual genotyping in both the parental and admixed populations. For the MA population, screening of >600 markers identified 151 ethnic-difference markers (EDMs) with delta>0.30 (where delta is the absolute value of each allele-frequency difference between two populations, summed over all marker alleles and divided by two) that are likely to be useful for MALD analysis. For the AA population, analysis of >400 markers identified 97 EDMs. In addition, individual genotyping of these markers in Pima Amerindians, Yavapai Amerindians, European American (EA) individuals, Africans from Zimbabwe, MA individuals, and AA individuals, as well as comparison to the CEPH genotyping set, suggests that the differences between subpopulations of an ethnicity are small for many markers with large interethnic differences. Estimates of admixture that are based on individual genotyping of these markers are consistent with a 60% EA:40% Amerindian contribution to MA populations and with a 20% EA:80% African contribution to AA populations. Taken together, these data suggest that EDMs with large interpopulation and small intrapopulation differences can be readily identified for MALD studies in both AA and MA populations.  相似文献   

7.
Admixture mapping (AM) is a promising method for the identification of genetic risk factors for complex traits and diseases showing prevalence differences among populations. Efficient application of this method requires the use of a genomewide panel of ancestry-informative markers (AIMs) to infer the population of origin of chromosomal regions in admixed individuals. Genomewide AM panels with markers showing high frequency differences between West African and European populations are already available for disease-gene discovery in African Americans. However, no such a map is yet available for Hispanic/Latino populations, which are the result of two-way admixture between Native American and European populations or of three-way admixture of Native American, European, and West African populations. Here, we report a genomewide AM panel with 2,120 AIMs showing high frequency differences between Native American and European populations. The average intermarker genetic distance is ~1.7 cM. The panel was identified by genotyping, with the Affymetrix GeneChip Human Mapping 500K array, a population sample with European ancestry, a Mesoamerican sample comprising Maya and Nahua from Mexico, and a South American sample comprising Aymara/Quechua from Bolivia and Quechua from Peru. The main criteria for marker selection were both high information content for Native American/European ancestry (measured as the standardized variance of the allele frequencies, also known as "f value") and small frequency differences between the Mesoamerican and South American samples. This genomewide AM panel will make it possible to apply AM approaches in many admixed populations throughout the Americas.  相似文献   

8.
Certain human hereditary conditions, notably those with low penetrance and those which require an environmental event such as infectious disease exposure, are difficult to localize in pedigree analysis, because of uncertainty in the phenotype of an affected patient's relatives. An approach to locating these genes in human cohort studies would be to use association analysis, which depends on linkage disequilibrium of flanking polymorphic DNA markers. In theory, a high degree of linkage disequilibrium between genes separated by 10-20 cM will be generated and persist in populations that have a history of recent (3-20 generations ago) admixture between genetically differentiated racial groups, such as has occurred in African Americans and Hispanic populations. We have conducted analytic and computer simulations to quantify the effect of genetic, genomic, and population parameters that affect the amount and ascertainment of linkage disequilibrium in populations with a history of genetic admixture. Our goal is to thoroughly explore the ranges of all relevant parameters or factors (e.g., sample size and degree of genetic differentiation between populations) that may be involved in gene localization studies, in hopes of prescribing guidelines for an efficient mapping strategy. The results provide reasonable limits on sample size (200-300 patients), marker number (200-300 in 20-cM intervals), and allele differentiation (loci with allele frequency difference of > or = .3 between admixed parent populations) to produce an efficient approach (> 95% ascertainment) for locating genes not easily tracked in human pedigrees.  相似文献   

9.
Markers informative for ancestry are necessary for admixture mapping and improving case-control association analyses. In particular, African Americans are an admixed population for which genetic studies require accurately evaluating admixture. This will require markers that can be used in African Americans to determine if a given genomic region is of European or African ancestry. This report shows that, despite studies indicating high intra-African sequence variation, markers with large inter-ethnic differences have only small variations in allele distribution among divergent African populations and should be valuable for evaluating admixture in complex disease genetic studies.  相似文献   

10.
Human genetic linkage maps are based on rates of recombination across the genome. These rates in humans vary by the sex of the parent from whom alleles are inherited, by chromosomal position, and by genomic features, such as GC content and repeat density. We have examined--for the first time, to our knowledge--racial/ethnic differences in genetic maps of humans. We constructed genetic maps based on 353 microsatellite markers in four racial/ethnic groups: whites, African Americans, Mexican Americans, and East Asians (Chinese and Japanese). These maps were generated using 9,291 subjects from 2,900 nuclear families who participated in the National Heart, Lung, and Blood Institute-funded Family Blood Pressure Program, the largest sample used for map construction to date. Although the maps for the different groups are generally similar, we did find regional and genomewide differences across ethnic groups, including a longer genomewide map for African Americans than for other populations. Some of this variation was explained by genotyping artifacts--namely, null alleles (i.e., alleles with null phenotypes) at a number of loci--and by ethnic differences in null-allele frequencies. In particular, null alleles appear to be the likely explanation for the excess map length in African Americans. We also found that nonrandom missing data biases map results. However, we found regions on chromosome 8p and telomeric segments with significant ethnic differences and a suggestive interval on chromosome 12q that were not due to genotype artifacts. The difference on chromosome 8p is likely due to a polymorphic inversion in the region. The results of our investigation have implications for inferences of possible genetic influences on human recombination as well as for future linkage studies, especially those involving populations of nonwhite ethnicity.  相似文献   

11.
Markers with large differences in allele frequencies between ethnicities provide ancestry information that can be applied to genetic studies. We identified over 100 biallelic ancestry informative markers (AIMs) with large allele frequency differences between European Americans (EA) and Pima Amerindians from laboratory and database screens. For 35 of these markers, Mayan, Yavapai and Quechuan Amerindians were genotyped and compared with EA and Pima allele frequencies. Markers with large allele frequency differences between EA and one Amerindian tribe showed only small differences between the Amerindian tribes. Examination of structure in individuals demonstrated a clear separation of subjects of European from those of Amerindian ancestry, and similarity between individuals from disparate Amerindian populations. The AIMs demonstrated the variation in ancestral composition of individual Mexican Americans, providing evidence of applicability in admixture mapping and in controlling for structure in association tests. In addition, a high percentage of single-nucleotide polymorphisms (SNPs) selected on the basis of large frequency differences between EA and Asian populations had large allele frequency differences between EA and Amerindians, suggesting an efficient method for greatly expanding AIMs for use in admixture mapping/structure analysis in Mexican Americans. Together, these data provide additional support for the practical application of admixture mapping in the Mexican American population.Electronic Supplementary Material Supplementary material is available in the online version of this article at  相似文献   

12.
A genomewide admixture map for Latino populations   总被引:5,自引:1,他引:4       下载免费PDF全文
Admixture mapping is an economical and powerful approach for localizing disease genes in populations of recently mixed ancestry and has proven successful in African Americans. The method holds equal promise for Latinos, who typically inherit a mix of European, Native American, and African ancestry. However, admixture mapping in Latinos has not been practical because of the lack of a map of ancestry-informative markers validated in Native American and other populations. To address this, we screened multiple databases, containing millions of markers, to identify 4,186 markers that were putatively informative for determining the ancestry of chromosomal segments in Latino populations. We experimentally validated each of these markers in at least 232 new Latino, European, Native American, and African samples, and we selected a subset of 1,649 markers to form an admixture map. An advantage of our strategy is that we focused our map on markers distinguishing Native American from other ancestries and restricted it to markers with very similar frequencies in Europeans and Africans, which decreased the number of markers needed and minimized the possibility of false disease associations. We evaluated the effectiveness of our map for localizing disease genes in four Latino populations from both North and South America.  相似文献   

13.
Admixture mapping is a potentially powerful tool for mapping complex genetic diseases. For application of this method, admixed individuals must have genomes composed of large segments derived intact from each founding population. Such segments are thought to be present in African Americans (AA) and should be demonstrable by examination of linkage disequilibrium (LD). Previous studies using a variety of polymorphic markers have variably reported long-range LD or rapid decay of LD. To further define the extent and characteristics of LD caused by admixture in the AA population, the current study utilized a set of 52 diallelic markers that were selected for large standard variances between putative representatives of the founder populations. LD was examined in over 250 marker-pairs, including linked markers from four different chromosomal regions and an equal number of matched unlinked comparisons. In the representative founder populations, strong LD was not observed for markers separated by more than 10 kb. In contrast, results indicated significant LD ( P<0.001, D'>0.3) in AA over large genomic segments exceeding 10 centiMorgans (cM) and 15 megabases (Mb). Only marginally significant LD was present between unlinked markers in this population, suggesting that choosing appropriate levels of significance for admixture mapping can minimize false positive results. The ability to detect LD for extended chromosomal segments in AA decayed not only as a function of the distance between markers, but also as a function of the standard variance of the markers. This examination of several genomic segments provides strong evidence that appropriate selection of informative markers is a crucial prerequisite for the application of admixture mapping to the AA population.  相似文献   

14.
郭伟  冯荣锦 《遗传学报》2006,33(1):12-18
在渐近混合模型中,混合现象发生在每一世代,通过对其混合连锁不平衡的理论分析,发现混合连锁不平衡与两个子群体间的基因频率差成正比。基于这一点,构造了一个对重组率严格单调的函数(△ker=△/(p1-p2),其中△代表连锁不平衡),进而据此推断标记基因座与疾病基因座的遗传连锁。应用人类基因组上不连锁的标记基因提供的连锁不平衡信息,基于病人组数据构造了一个准似然比统计量。模拟结果显示,此检验可用于精确的基因定位。文章亦讨论了参数对检验的影响。  相似文献   

15.
Recent admixture between genetically differentiated populations can result in high levels of association between alleles at loci that are <=10 cM apart. The transmission/disequilibrium test (TDT) proposed by Spielman et al. (1993) can be a powerful test of linkage between disease and marker loci in the presence of association and therefore could be a useful test of linkage in admixed populations. The degree of association between alleles at two loci depends on the differences in allele frequencies, at the two loci, in the founding populations; therefore, the choice of marker is important. For a multiallelic marker, one strategy that may improve the power of the TDT is to group marker alleles within a locus, on the basis of information about the founding populations and the admixed population, thereby collapsing the marker into one with fewer alleles. We have examined the consequences of collapsing a microsatellite into a two-allele marker, when two founding populations are assumed for the admixed population, and have found that if there is random mating in the admixed population, then typically there is a collapsing for which the power of the TDT is greater than that for the original microsatellite marker. A method is presented for finding the optimal collapsing that has minimal dependence on the disease and that uses estimates either of marker allele frequencies in the two founding populations or of marker allele frequencies in the current, admixed population and in one of the founding populations. Furthermore, this optimal collapsing is not always the collapsing with the largest difference in allele frequencies in the founding populations. To demonstrate this strategy, we considered a recent data set, published previously, that provides frequency estimates for 30 microsatellites in 13 populations.  相似文献   

16.
While studies have implicated alleles at the CAG and GGC trinucleotide repeats of the androgen receptor gene with high-grade, aggressive prostate cancer disease, little is known about the normal range of variation for these two loci, which are separated by about 1.1 kb. More importantly, few data exist on the extent of linkage disequilibrium (LD) between the two loci in different human populations. Here we present data on CAG and GGC allelic variation and LD in six diverse populations. Alleles at the CAG and GGC repeat loci of the androgen receptor were typed in over 1000 chromosomes from Africa, Asia, and North America. Levels of linkage disequilibrium between the two loci were compared between populations. Haplotype variation and diversity were estimated for each population. Our results reveal that populations of African descent possess significantly shorter alleles for the two loci than non-African populations (P<0.0001). Allelic diversity for both markers was higher among African Americans than any other population, including indigenous Africans from Sierra Leone and Nigeria. Analysis of molecular variance revealed that approx. 20% of CAG and GGC repeat variance could be attributed to differences between the populations. All non-African populations possessed the same common haplotype while the three populations of African descent possessed three divergent common haplotypes. Significant LD was observed in our sample of healthy African Americans. The LD observed in the African American population may be due to several reasons; recent migration of African Americans from diverse rural communities following urbanization, recurrent gene flow from diverse West African populations, and admixture with European Americans. This study represents the largest genotyping effort to be performed on the two androgen receptor trinucleotide repeat loci in diverse human populations.  相似文献   

17.
Restriction fragment length polymorphisms are good anthropological markers for discriminating geographically distinct populations at both the allele and the haplotype level. Two communities of African ancestry and ladinos, mestizos, and mulattoes living in the Esmeraldas province in northwestern Ecuador were analyzed for three RFLPs (EcoRI, RsaI, and MspI) of the COL1A2 gene. Also, the same markers were studied in a population sample from Spain to compare the allele and haplotype frequencies of the Esmeraldas populations with those of their representative European parental population. Data for the native American and sub-Saharan African founder components were available from the literature. No significant levels of differentiation between the two African Ecuadoran communities emerged from either the frequency analysis of each single marker and all three RFLP markers together or from the AMOVA. The ladinos and mestizos also showed a rather similar distribution of allele and haplotype frequencies, confirming that the two ethnic terms do not correspond to genetically different populations. The comparison with the supposed founding European, sub-Saharan African, and native American populations indicated a large presence of African genes in the gene pool of both communities, with a higher proportion of the Amerindian component in Viche than in Rio Cayapas. The present findings confirm the previous genetic admixture estimates based on nuclear and mitochondrial DNA markers and the demographic data.  相似文献   

18.
We analyzed the European genetic contribution to 10 populations of African descent in the United States (Maywood, Illinois; Detroit; New York; Philadelphia; Pittsburgh; Baltimore; Charleston, South Carolina; New Orleans; and Houston) and in Jamaica, using nine autosomal DNA markers. These markers either are population-specific or show frequency differences >45% between the parental populations and are thus especially informative for admixture. European genetic ancestry ranged from 6.8% (Jamaica) to 22.5% (New Orleans). The unique utility of these markers is reflected in the low variance associated with these admixture estimates (SEM 1.3%-2.7%). We also estimated the male and female European contribution to African Americans, on the basis of informative mtDNA (haplogroups H and L) and Y Alu polymorphic markers. Results indicate a sex-biased gene flow from Europeans, the male contribution being substantially greater than the female contribution. mtDNA haplogroups analysis shows no evidence of a significant maternal Amerindian contribution to any of the 10 populations. We detected significant nonrandom association between two markers located 22 cM apart (FY-null and AT3), most likely due to admixture linkage disequilibrium created in the interbreeding of the two parental populations. The strength of this association and the substantial genetic distance between FY and AT3 emphasize the importance of admixed populations as a useful resource for mapping traits with different prevalence in two parental populations.  相似文献   

19.
Mexican Americans are a numerous and fast growing ethnic population in the United States. Yet little is known about their genetic structure. Since they are a hybrid, it is of interest to identify their parental populations and to estimate the relative contributions of these groups. This information is relevant to historical, biomedical, and evolutionary concerns. New genetic typings on 730 Arizona Mexican Americans for the HLA-A, HLA-B, ABO, Rh, MNSs, Duffy, Kidd, and Kell loci are presented here and they are used to estimate ancestral contributions. We considered both a dihybrid model with Amerindians and Spaniards as proposed ancestors, and a trihybrid model with Amerindians, Spaniards, and Africans as proposed ancestors. A modified weighted least squares method that allows for linkage disequilibrium was used to estimate ancestral contributions for each model. The following admixture estimates were obtained: Amerindian, 0.29 +/- 0.04; Spaniard, 0.68 +/- 0.05; and African, 0.03 +/- 0.02. The interpretation of these results with respect to Amerindian and Spanish ancestry is straightforward. African ancestry is strongly supported by the presence of a marker of African descent, Fy, despite the fact that the standard error of the estimate is as large as the estimated admixture proportion. An evaluation of the sensitivity of these results to a number of variables is presented: 1) our choices of ancestral allele frequencies, 2) the possibility of selection at HLA and the blood groups, and 3) genetic drift in Mexican Americans.  相似文献   

20.
Admixture between populations originating on different continents can be exploited to detect disease susceptibility loci at which risk alleles are distributed differentially between these populations. We first examine the statistical power and mapping resolution of this approach in the limiting situation in which gamete admixture and locus ancestry are measured without uncertainty. We show that, for a rare disease, the most efficient design is to study affected individuals only. In a typical African American population (two-way admixture proportions 0.8/0.2, ancestry crossover rate 2 per 100 cM), a study of 800 affected individuals has 90% power to detect at P values <10(-5) a locus that generates a risk ratio of 2 between populations, with an expected mapping resolution (size of 95% confidence region for the position of the locus) of 4 cM. In practice, to infer locus ancestry from marker data requires Bayesian computationally intensive methods, as implemented in the program ADMIXMAP. Affected-only study designs require strong prior information on the frequencies of each allele given locus ancestry. We show how data from unadmixed and admixed populations can be combined to estimate these ancestry-specific allele frequencies within the admixed population under study, allowing for variation between allele frequencies in unadmixed and admixed populations. Using simulated data based on the genetic structure of the African American population, we show that 60% of information can be extracted in a test for linkage using markers with an ancestry information content of 36% at 3-cM spacing. As in classic linkage studies, the most efficient strategy is to use markers at a moderate density for an initial genome search and then to saturate regions of putative linkage with additional markers, to extract nearly all information about locus ancestry.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号