首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Genomewide linkage studies are tending toward the use of single-nucleotide polymorphisms (SNPs) as the markers of choice. However, linkage disequilibrium (LD) between tightly linked SNPs violates the fundamental assumption of linkage equilibrium (LE) between markers that underlies most multipoint calculation algorithms currently available, and this leads to inflated affected-relative-pair allele-sharing statistics when founders' multilocus genotypes are unknown. In this study, we investigate the impact that the degree of LD, marker allele frequency, and association type have on estimating the probabilities of sharing alleles identical by descent in multipoint calculations and hence on type I error rates of different sib-pair linkage approaches that assume LE. We show that marker-marker LD does not inflate type I error rates of affected sib pair (ASP) statistics in the whole parameter space, and that, in any case, discordant sib pairs (DSPs) can be used to control for marker-marker LD in ASPs. We advocate the ASP/DSP design with appropriate sib-pair statistics that test the difference in allele sharing between ASPs and DSPs.  相似文献   

2.
Single nucleotide polymorphisms (SNPs) are widely used when investigators try to map complex disease genes. Although biallelic SNP markers are less informative than microsatellite markers, one can increase their information content by using haplotypes. However, assigning haplotypes (i.e., assigning phase) correctly can be problematic in the presence of SNP heterozygosity. For example, a doubly heterozygous individual, with genotype 12, 12, could have haplotypes 1-1/2-2 or 1-2/2-1 with equal probability; in the absence of additional information, there is no way to determine which haplotype is correct. Thus an algorithm that assigns haplotypes to such an individual will assign the wrong one 50% of the time. We have studied the frequency of haplotype misassignments, i.e., haplotypes that are misassigned solely because of inherent marker ambiguity (not because of errors in genotyping or calculation). We examined both SNPs and microsatellite markers. We used the computer programs GENEHUNTER and SIMWALK to assign the haplotypes. We simulated (a) families with 1-5 children, (b) haplotypes involving different numbers of marker loci (3, 5, 7 and 10 loci, all in linkage equilibrium), and (c) different allele frequencies. Misassignment rates are highest (a) in small families, (b) with many SNP loci, and (c) for loci with the greatest heterozygosity (i.e., where both alleles have frequency 0.5). For example, for triads (i.e., one-child families with both parents genotyped), misassignment rates for SNPs can reach almost 50%. Family sizes of 4-5 children are required in order to ensure a misassignment frequency of < or = 5% for ten-SNP haplotypes with allele frequencies of 0.25-0.5. For microsatellites, a family size of at least 2-3 children is necessary to keep haplotyping misassignments < or = 5%. Finally, we point out that it is misleading for a computer program to yield haplotype assignments without indicating that they may have been misassigned, and we discuss the implications of these misassignments for association and linkage analysis.  相似文献   

3.
Both theoretical and applied studies have proven that the utility of single nucleotide polymorphism (SNP) markers in linkage analysis is more powerful and cost-effective than current microsatellite marker assays. Here we performed a whole-genome scan on 115 White, non-Hispanic families segregating for alcohol dependence, using one 10.3-cM microsatellite marker set and two SNP data sets (0.33-cM, 0.78-cM spacing). Two definitions of alcohol dependence (ALDX1 and ALDX2) were used. Our multipoint nonparametric linkage analysis found alcoholism was nominal linked to 12 genomic regions. The linkage peaks obtained by using the microsatellite marker set and the two SNP sets had a high degree of correspondence in general, but the microsatellite marker set was insufficient to detect some nominal linkage peaks. The presence of linkage disequilibrium between markers did not significantly affect the results. Across the entire genome, SNP datasets had a much higher average linkage information content (0.33 cM: 0.93, 0.78 cM: 0.91) than did microsatellite marker set (0.57). The linkage peaks obtained through two SNP datasets were very similar with some minor differences. We conclude that genome-wide linkage analysis by using approximately 5,000 SNP markers evenly distributed across the human genome is sufficient and might be more powerful than current 10-cM microsatellite marker assays.  相似文献   

4.
We describe two highly polymorphic microsatellite AC repeat sequences, VK23AC and VK14AC, which are closely linked to the fragile X at Xq27.3. Both VK23AC (DXS297) and VK14AC (DXS292) are proximal to the fragile site. Two-point linkage analysis in 31 fragile X families gave (a) a recombination frequency of 1% (range 0.00%-4%) with a maximum lod score of 32.04 for DXS297 and (b) a recombination frequency of 7% (range of 3%-15%) with a maximum lod score of 12.87 for DXS292. Both of these polymorphisms are applicable to diagnosis by linkage in families with fragile X syndrome. A multipoint linkage map of genetic markers at Xq27.3 was constructed from genotyping these polymorphisms in the CEPH pedigrees. The DXS292 marker is in the DXS98-DXS297 interval and in 3 cM proximal to DXS297.  相似文献   

5.
Genomewide linkage scans have traditionally employed panels of microsatellite markers spaced at intervals of approximately 10 cM across the genome. However, there is a growing realization that a map of closely spaced single-nucleotide polymorphisms (SNPs) may offer equal or superior power to detect linkage, compared with low-density microsatellite maps. We performed a series of simulations to calculate the information content associated with microsatellite and SNP maps across a range of different marker densities and heterozygosities for sib pairs (with and without parental genotypes), sib trios, and sib quads. In the case of microsatellite markers, we varied density across 11 levels (1 marker every 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 cM) and marker heterozygosity across 6 levels (2, 3, 4, 5, 10, or 20 equally frequent alleles), whereas, in the case of SNPs, we varied marker density across 4 levels (1 marker every 0.1, 0.2, 0.5, or 1 cM) and minor-allele frequency across 7 levels (0.5, 0.4, 0.3, 0.2, 0.1, 0.05, and 0.01). When parental genotypes were available, a map consisting of microsatellites spaced every 2 cM or a relatively sparse map of SNPs (i.e., at least 1 SNP/cM) was sufficient to extract most of the inheritance information from the map (>95% in most cases). However, when parental genotypes were unavailable, it was important to use as dense a map of markers as possible to extract the greatest amount of inheritance information. It is important to note that the information content associated with a traditional map of microsatellite markers (i.e., 1 marker every ~10 cM) was significantly lower than the information content associated with a dense map of SNPs or microsatellites. These results strongly suggest that previous linkage studies that employed sparse microsatellite maps could benefit substantially from reanalysis by use of a denser map of markers.  相似文献   

6.
Errors while genotyping are inevitable and can reduce the power to detect linkage. However, does genotyping error have the same impact on linkage results for single-nucleotide polymorphism (SNP) and microsatellite (MS) marker maps? To evaluate this question we detected genotyping errors that are consistent with Mendelian inheritance using large changes in multipoint identity-by-descent sharing in neighboring markers. Only a small fraction of Mendelian consistent errors were detectable (e.g., 18% of MS and 2.4% of SNP genotyping errors). More SNP genotyping errors are Mendelian consistent compared to MS genotyping errors, so genotyping error may have a greater impact on linkage results using SNP marker maps. We also evaluated the effect of genotyping error on the power and type I error rate using simulated nuclear families with missing parents under 0, 0.14, and 2.8% genotyping error rates. In the presence of genotyping error, we found that the power to detect a true linkage signal was greater for SNP (75%) than MS (67%) marker maps, although there were also slightly more false-positive signals using SNP marker maps (5 compared with 3 for MS). Finally, we evaluated the usefulness of accounting for genotyping error in the SNP data using a likelihood-based approach, which restores some of the power that is lost when genotyping error is introduced.  相似文献   

7.

Background

We analyzed 143 pedigrees (364 nuclear families) in the Collaborative Study on the Genetics of Alcoholism (COGA) data provided to the participants in the Genetic Analysis Workshop 14 (GAW14) with the goal of comparing results obtained from genome linkage analysis using microsatellite and with results obtained using SNP markers for two measures of alcoholism (maximum number of drinks -MAXDRINK and an electrophysiological measure from EEG -TTTH1). First, we constructed haplotype blocks by using the entire set of single-nucleotide polymorphisms (SNP) in chromosomes 1, 4, and 7. These chromosomes have shown linkage signals for MAXDRINK or EEG-TTTH1 in previous reports. Second, we randomly selected one, two, three, four, and five SNPs from each block (referred to as Rep1 – Rep5, respectively) to conduct linkage analysis using variance component approach. Finally, results of all SNP analyses were compared with those obtained using microsatellite markers.

Results

The LOD scores obtained from SNPs were slightly higher but the curves were not radically different from those obtained from microsatellite analyses. The peaks of linkage regions from SNP sets were slightly shifted to the left when compared to those from microsatellite markers. The reduced sets of SNPs provide signals in the same linkage regions but with a smaller LOD score suggesting a significant impact of the decrease in information content on linkage results. The widths of 1 LOD support interval of linkage regions from SNP sets were smaller when compared to those of microsatellite markers. However, two linkage regions obtained from the microsatellite linkage analysis on chromosome 7 for LOG of TTTH1 were not detected in the SNP based analyses.

Conclusion

The linkage results from SNPs showed narrower linkage regions and slightly higher LOD scores when compared to those of microsatellite markers. The different builds of the genetic maps used in microsatellite and SNPs markers or/and errors in genotyping may account for the microsatellite linkage signals on chromosome 7 that were not identified using SNPs. Also, unresolved map issues between SNPs and microsatellite markers may be partly responsible for the shifted linkage peaks when comparing the two types of markers.
  相似文献   

8.
We performed multipoint linkage analysis using 83 markers from the SNP Consortium (TSC) SNP linkage map in 3 regions covering 190 cM previously scanned with microsatellite markers and found to be linked to type 2 diabetes. Since the average linkage disequilibrium present in the TSC SNP marker clusters is relatively low, we assumed the intracluster genetic distances were a reasonable small nonzero distance (0.03 cM) and performed linkage analysis using GENEHUNTER PLUS and ASM linkage analysis software. We found that for the pedigree structures and missing data patterns in our samples the average information content in all three regions and the LOD score curves in two regions obtained from the TSC SNP markers were similar to results obtained from microsatellite marker maps with 10 cM average spacing. We also give an algorithm which extends the Lander-Green algorithm to permit multipoint linkage analysis of clusters of tightly linked markers with arbitrarily high levels of intracluster linkage disequilibrium.  相似文献   

9.
We have previously reported linkage of systemic lupus erythematosus to chromosome 2q37 in multicase families from Iceland and Sweden. This locus (SLEB2) was identified by linkage to the markers D2S125 and D2S140. In the present study we have analyzed additional microsatellite markers and SNPs covering a region of 30 cM around D2S125 in an extended set of Nordic families (Icelandic, Swedish, and Norwegian). Two-point linkage analysis in these families gave a maximum lod score at the position of markers D2S2585 and D2S2985 (Z = 4.51, PIC = 0.65), by applying a "model-free" pseudo-marker linkage analysis. Based on multipoint linkage analysis in the Nordic families, the most likely location of the SLEB2 locus is estimated to be in the interval between D2S125 and the position of markers D2S2585 and D2S2985, with a peak multipoint lod score of Z = 6.03, assuming a dominant pseudo-marker model. Linkage disequilibrium (LD) analysis was performed using the data from the multicase families and 89 single-case families of Swedish origin, using the same set of markers. The LD analysis showed evidence for association in the single-case and multicase families with locus GAAT3C11 (P < 0.0003), and weak evidence for association was obtained for several markers located telomeric to D2S125 in the multicase families. Thirteen Mexican families were analyzed separately and found not to have linkage to this region. Our results support the presence of the SLEB2 locus at 2q37.  相似文献   

10.
Cleidocranial dysplasia (CCD) is an autosomal dominant generalized bone dysplasia characterized by mild-to-moderate short stature, clavicular aplasia or hypoplasia, supernumerary and ectopic teeth, delayed eruption of secondary teeth, a characteristic craniofacial appearance, and a variety of other skeletal anomalies. We have performed linkage studies in five families with CCD, with 24 affected and 20 unaffected individuals, using microsatellite markers spanning two candidate regions on chromosomes 8q and 6. The strongest support for linkage was with chromosome 6p microsatellite marker D6S282 with a two-point lod score of 4.84 (theta = .03). Furthermore, the multipoint lod score was 5.70 in the interval between D6S282 and D6S291. These data show that the gene for autosomal dominant CCD is located within a 19-cM interval on the short arm of chromosome 6, between D6S282 and D6S291.  相似文献   

11.
Daw EW  Heath SC  Lu Y 《BMC genetics》2005,6(Z1):S32
Increasingly, single-nucleotide polymorphism (SNP) markers are being used in preference to microsatellite markers. However, methods developed for microsatellites may be problematic when applied to SNP markers. We evaluated the results of using SNPs vs. microsatellites in Monte Carlo Markov chain (MCMC) oligogenic combined segregation and linkage analysis methods. These methods were developed with microsatellite markers in mind. We selected chromosome 7 from the Collaborative Study on the Genetics of Alcoholism dataset for analysis because linkage to an electrophysiological trait had been reported there. We found linkage in the same region of chromosome 7 with the Affymetrix SNP data, the Illumina SNP data, and the microsatellite marker data. The MCMC sampler appears to mix with both types of data. The sampler implemented in this MCMC oligogenic combined segregation and linkage analysis appears to handle SNP data as well as microsatellite data and it is possible that the localizations with the SNP data are better.  相似文献   

12.
Dense SNP maps can be highly informative for linkage studies. But when parental genotypes are missing, multipoint linkage scores can be inflated in regions with substantial marker-marker linkage disequilibrium (LD). Such regions were observed in the Affymetrix SNP genotypes for the Genetic Analysis Workshop 14 (GAW14) Collaborative Study on the Genetics of Alcoholism (COGA) dataset, providing an opportunity to test a novel simulation strategy for studying this problem. First, an inheritance vector (with or without linkage present) is simulated for each replicate, i.e., locations of recombinations and transmission of parental chromosomes are determined for each meiosis. Then, two sets of founder haplotypes are superimposed onto the inheritance vector: one set that is inferred from the actual data and which contains the pattern of LD; and one set created by randomly selecting parental alleles based on the known allele frequencies, with no correlation (LD) between markers. Applying this strategy to a map of 176 SNPs (66 Mb of chromosome 7) for 100 replicates of 116 sibling pairs, significant inflation of multipoint linkage scores was observed in regions of high LD when parental genotypes were set to missing, with no linkage present. Similar inflation was observed in analyses of the COGA data for these affected sib pairs with parental genotypes set to missing, but not after reducing the marker map until r2 between any pair of markers was 相似文献   

13.
We have compared the efficiency of the lod score test which assumes heterogeneity (lod2) to the standard lod score test which assumes homogeneity (lod1) when three-point linkage analysis is used in successive map intervals. If it is assumed that a gene located midway between two linked marker loci is responsible for a proportion of disease cases, then the lod1 test loses power relative to the lod2 test, as the proportion of linked families decreases, as the flanking markers are more closely linked, and as more map intervals are tested. Moreover, when multipoint analysis is used, linkage for a disease gene is more likely to be incorrectly excluded from a complete and dense linkage map if true genetic heterogeneity is ignored. We thus conclude that, in general, the lod2 linkage test is more efficient for detecting a true linkage when a complete genetic marker map is screened for a heterogeneous disorder.  相似文献   

14.
We performed linkage and linkage disequilibrium (LD) mapping analyses to compare the power between microsatellite and single nucleotide polymorphism (SNP) markers. Chromosome-wide analyses were performed for a quantitative electrophysiological phenotype, ttth1, on chromosome 7. Multipoint analysis of microsatellite markers using the variance component (VC) method showed the highest LOD score of 4.20 at 162 cM, near D7S509 (163.7 cM). Two-point analysis of SNPs using the VC method yielded the highest LOD score of 3.98 in the Illumina SNP data and 3.45 in the Affymetrix SNP data around 152-153 cM. In family-based single SNP and SNP haplotype LD analysis, we identified seven SNPs associated with ttth1. We searched for any potential candidate genes in the location of the seven SNPs. The SNPs rs1476640 and rs768055 are located in the FLJ40852 gene (a hypothetical protein), and SNP rs1859646 is located in the TAS2R5 gene (a taste receptor). The other four SNPs are not located in any known or annotated genes. We found the high density SNP scan to be superior to microsatellites because it is effective in downstream fine mapping due to a better defined linkage region. Our study proves the utility of high density SNP in genome-wide mapping studies.  相似文献   

15.
Three prostate cancer susceptibility genes have been reported to be linked to different regions on chromosome 1: HPC1 at 1q24-25, PCAP at 1q42-43, and CAPB at 1p36. Replication studies analyzing each of these regions have yielded inconsistent results. To evaluate linkage across this chromosome systematically, we performed multipoint linkage analyses with 50 microsatellite markers spanning chromosome 1 in 159 hereditary prostate cancer families (HPC), including 79 families analyzed in the original report describing HPC1 linkage. The highest lod scores for the complete dataset of 159 families were observed at 1q24-25 at which the parametric lod score assuming heterogeneity (hlod) was 2.54 (P=0.0006) with an allele sharing lod of 2.34 (P=0.001) at marker D1S413, although only weak evidence was observed in the 80 families not previously analyzed for this region (hlod=0.44, P=0.14, and allele sharing lod=0.67, P=0.08). In the complete data set, the evidence for linkage across this region was very broad, with allele sharing lod scores greater than 0.5 extending approximately 100 cM from 1p13 to 1q32, possibly indicating the presence of multiple susceptibility genes. Elsewhere on chromosome 1, some evidence of linkage was observed at 1q42-43, with a peak allele sharing lod of 0.56 (P=0.11) and hlod of 0.24 (P=0.25) at D1S235. For analysis of the CAPB locus at 1p36, we focused on six HPC families in our collection with a history of primary brain cancer; four of these families had positive linkage results at 1p36, with a peak allele sharing lod of 0.61 (P=0.09) and hlod of 0.39 (P=0.16) at D1S407 in all six families. These results are consistent with the heterogeneous nature of hereditary prostate cancer, and the existence of multiple loci on chromosome 1 for this disease.  相似文献   

16.
Treacher Collins syndrome (TCS) is an autosomal dominant disorder of craniofacial development, the features of which include conductive hearing loss and cleft palate. We have studied 12 unrelated TCS families with multiple affected individuals for linkage to five chromosome 5 markers. There is strong evidence demonstrating linkage to three of these markers. Multipoint linkage analysis places the mutation causing TCS in the interval between the gene for the glucocorticoid receptor and the anonymous marker D5S22, with a maximum multipoint lod score of 9.1.  相似文献   

17.
Celiac disease (CD), or gluten-sensitive enteropathy, is a common multifactorial disorder resulting from intolerance to cereal prolamins. The only established genetic susceptibility factor is HLA-DQ, which appears to explain only part of the overall genetic risk. We performed a genomewide scan of CD in 60 Finnish families. In addition to strong evidence for linkage to the HLA region at 6p21.3 (Z(max)>5), suggestive evidence for linkage was found for six other chromosomal regions--1p36, 4p15, 5q31, 7q21, 9p21-23, and 16q12. We further analyzed the three most convincing regions--4p15, 5q31, and 7q21--by evaluation of dense marker arrays across each region and by analysis of an additional 38 families. Although multipoint analysis with dense markers provided supportive evidence (multipoint LOD scores 3.25 at 4p15, 1.49 at 5q31, and 1.04 at 7q21) for the initial findings, the additional 38 families did not strengthen evidence for linkage. The role that HLA-DQ plays was studied in more detail by analysis of DQB1 alleles in all 98 families. All but one patient carried one or two HLA-DQ risk alleles, and 65% of HLA-DQ2 carriers were affected. Our study indicates that the HLA region harbors a predominant CD-susceptibility locus in these Finnish families.  相似文献   

18.
Despite the theoretical evidence of the utility of single-nucleotide polymorphisms (SNPs) for linkage analysis, no whole-genome scans of a complex disease have yet been published to directly compare SNPs with microsatellites. Here, we describe a whole-genome screen of 157 families with multiple cases of rheumatoid arthritis (RA), performed using 11,245 genomewide SNPs. The results were compared with those from a 10-cM microsatellite scan in the same cohort. The SNP analysis detected HLA*DRB1, the major RA susceptibility locus (P=.00004), with a linkage interval of 31 cM, compared with a 50-cM linkage interval detected by the microsatellite scan. In addition, four loci were detected at a nominal significance level (P<.05) in the SNP linkage analysis; these were not observed in the microsatellite scan. We demonstrate that variation in information content was the main factor contributing to observed differences in the two scans, with the SNPs providing significantly higher information content than the microsatellites. Reducing the number of SNPs in the marker set to 3,300 (1-cM spacing) caused several loci to drop below nominal significance levels, suggesting that decreases in information content can have significant effects on linkage results. In contrast, differences in maps employed in the analysis, the low detectable rate of genotyping error, and the presence of moderate linkage disequilibrium between markers did not significantly affect the results. We have demonstrated the utility of a dense SNP map for performing linkage analysis in a late-age-at-onset disease, where DNA from parents is not always available. The high SNP density allows loci to be defined more precisely and provides a partial scaffold for association studies, substantially reducing the resource requirement for gene-mapping studies.  相似文献   

19.
Genotype data from the Illumina Linkage III SNP panel (n = 4,720 SNPs) and the Affymetrix 10 k mapping array (n = 11,120 SNPs) were used to test the effects of linkage disequilibrium (LD) between SNPs in a linkage analysis in the Collaborative Study on the Genetics of Alcoholism pedigree collection (143 pedigrees; 1,614 individuals). The average r2 between adjacent markers across the genetic map was 0.099 +/- 0.003 in the Illumina III panel and 0.17 +/- 0.003 in the Affymetrix 10 k array. In order to determine the effect of LD between marker loci in a nonparametric multipoint linkage analysis, markers in strong LD with another marker (r2 > 0.40) were removed (n = 471 loci in the Illumina panel; n = 1,804 loci in the Affymetrix panel) and the linkage analysis results were compared to the results using the entire marker sets. In all analyses using the ALDX1 phenotype, 8 linkage regions on 5 chromosomes (2, 7, 10, 11, X) were detected (peak markers p < 0.01), and the Illumina panel detected an additional region on chromosome 6. Analysis of the same pedigree set and ALDX1 phenotype using short tandem repeat markers (STRs) resulted in 3 linkage regions on 3 chromosomes (peak markers p < 0.01). These results suggest that in this pedigree set, LD between loci with spacing similar to the SNP panels tested may not significantly affect the overall detection of linkage regions in a genome scan. Moreover, since the data quality and information content are greatly improved in the SNP panels over STR genotyping methods, new linkage regions may be identified due to higher information content and data quality in a dense SNP linkage panel.  相似文献   

20.
We performed multipoint linkage analysis of the electrophysiological trait ECB21 on chromosome 4 in the full pedigrees provided by the Collaborative Study on the Genetics of Alcoholism (COGA). Three Markov chain Monte Carlo (MCMC)-based approaches were applied to the provided and re-estimated genetic maps and to five different marker panels consisting of microsatellite (STRP) and/or SNP markers at various densities. We found evidence of linkage near the GABRB1 STRP using all methods, maps, and marker panels. Difficulties encountered with SNP panels included convergence problems and demanding computations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号