Similar articles
Found 20 similar articles; search took 31 ms
1.

Background  

With the rapid development of high-throughput genotyping technologies, efficient methods for identifying linked regions using high-density SNP genotype data have become increasingly important. Recently, a deterministic method that works very well on SNP genotyping data has been developed (Lin et al. Bioinformatics 2008, 24(1): 86–93). However, that program can only work on a limited number of family structures. In particular, the results (if any) will be poor when the genotype data for the whole chromosome of one of the parents in a nuclear family are missing.

2.
SNP microarray analysis for genome-wide detection of crossover regions   Total citations: 4, self-citations: 0, citations by others: 4
There is a great deal of interest in understanding the non-random distribution of recombination events over the human genome, because it has important implications for using linkage disequilibrium (LD) to identify human disease genes. So far, only a few recombination hotspots in the human genome have been characterised and the identification of new crossover hotspots will contribute to a better understanding of the mechanisms that govern their formation and distribution. This study shows that high-density single nucleotide polymorphism (SNP) arrays, together with the presented analysis method, are an appropriate tool for generating a whole-genome recombination pattern and for detecting new crossover regions with enhanced recombination frequency. Based on the genotype data of 16 members of a Caucasian three-generation family, we identified 825 crossover regions. The average recombination frequency of females and males was 0.77 and 0.56 cM/Mb, respectively. We detected 24 crossover regions showing elevated recombination activity, which comprised known hotspots, like the MHC II region, confirming the non-random distribution of recombination events along the genome. Interestingly, 29.2% of the identified crossover hotspot regions overlapped with regions flanked by segmental duplications published by Bailey et al. (Science 297:1003–1007, 2002), suggesting that segmental duplications and crossover hotspot regions are mechanistically linked. By extrapolating the results of the present study, we conclude that it might be feasible, at least in part, to estimate to what extent the block-like pattern of LD exactly relies on the genome-wide crossover pattern using the next-generation high-density SNP microarrays.

3.
SNPselector: a web tool for selecting SNPs for genetic association studies   Total citations: 7, self-citations: 0, citations by others: 7
SUMMARY: Single nucleotide polymorphisms (SNPs) are commonly used for association studies to find genes responsible for complex genetic diseases. With the recent advance of SNP technology, researchers are able to assay thousands of SNPs in a single experiment. But the process of manually choosing thousands of genotyping SNPs for tens or hundreds of genes is time consuming. We have developed a web-based program, SNPselector, to automate the process. SNPselector takes a list of gene names or a list of genomic regions as input and searches the Ensembl genes or genomic regions for available SNPs. It prioritizes these SNPs on their tagging for linkage disequilibrium, SNP allele frequencies and source, function, regulatory potential and repeat status. SNPselector outputs its results as compressed Excel spreadsheet files for review by the user. AVAILABILITY: SNPselector is freely available at http://primer.duhs.duke.edu/

4.
Recent advances in technologies for high-throughput single-nucleotide polymorphism (SNP)-based genotyping have improved efficiency and cost so that it is now becoming reasonable to consider the use of SNPs for genomewide linkage analysis. However, a suitable screening set of SNPs and a corresponding linkage map have yet to be described. The SNP maps described here fill this void and provide a resource for fast genome scanning for disease genes. We have evaluated 6,297 SNPs in a diversity panel composed of European Americans, African Americans, and Asians. The markers were assessed for assay robustness, suitable allele frequencies, and informativeness of multi-SNP clusters. Individuals from 56 Centre d'Etude du Polymorphisme Humain pedigrees, with >770 potentially informative meioses altogether, were genotyped with a subset of 2,988 SNPs, for map construction. Extensive genotyping-error analysis was performed, and the resulting SNP linkage map has an average map resolution of 3.9 cM, with map positions containing either a single SNP or several tightly linked SNPs. The order of markers on this map compares favorably with several other linkage and physical maps. We compared map distances between the SNP linkage map and the interpolated SNP linkage map constructed by the deCode Genetics group. We also evaluated cM/Mb distance ratios in females and males, along each chromosome, showing broadly defined regions of increased and decreased rates of recombination. Evaluations indicate that this SNP screening set is more informative than the Marshfield Clinic's commonly used microsatellite-based screening set.

5.
Errors while genotyping are inevitable and can reduce the power to detect linkage. However, does genotyping error have the same impact on linkage results for single-nucleotide polymorphism (SNP) and microsatellite (MS) marker maps? To evaluate this question we detected genotyping errors that are consistent with Mendelian inheritance using large changes in multipoint identity-by-descent sharing in neighboring markers. Only a small fraction of Mendelian consistent errors were detectable (e.g., 18% of MS and 2.4% of SNP genotyping errors). More SNP genotyping errors are Mendelian consistent compared to MS genotyping errors, so genotyping error may have a greater impact on linkage results using SNP marker maps. We also evaluated the effect of genotyping error on the power and type I error rate using simulated nuclear families with missing parents under 0, 0.14, and 2.8% genotyping error rates. In the presence of genotyping error, we found that the power to detect a true linkage signal was greater for SNP (75%) than MS (67%) marker maps, although there were also slightly more false-positive signals using SNP marker maps (5 compared with 3 for MS). Finally, we evaluated the usefulness of accounting for genotyping error in the SNP data using a likelihood-based approach, which restores some of the power that is lost when genotyping error is introduced.
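To illustrate why some genotyping errors are Mendelian consistent, the following minimal Python sketch checks a father-mother-child trio at a single SNP. The function name and the tuple encoding of genotypes are assumptions made for this example; they are not taken from the study, which detected such errors through changes in multipoint identity-by-descent sharing rather than through a trio check.

```python
def mendelian_consistent(father, mother, child):
    """Return True if the child's genotype can be formed by taking one
    allele from each parent. Genotypes are 2-tuples of alleles."""
    a, b = child
    return (a in father and b in mother) or (b in father and a in mother)


# Both parents heterozygous: every child genotype is possible, so a
# miscalled child genotype cannot be flagged by this check alone.
parents = (("A", "G"), ("A", "G"))
for called in [("A", "A"), ("A", "G"), ("G", "G")]:
    print(called, mendelian_consistent(*parents, called))

# Both parents homozygous A/A: a child called A/G is inconsistent.
print(mendelian_consistent(("A", "A"), ("A", "A"), ("A", "G")))
```

Because a biallelic SNP has only three possible genotypes, a miscall often lands on another genotype that the parents could have transmitted, which fits the abstract's observation that SNP errors are more often Mendelian consistent than microsatellite errors.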

6.
Outside the context of hereditary deficiencies of complement and IgA, Mendelian inherited predisposition to small vessel lymphocytic vasculitis (SVLV) has rarely been documented. Here we report a large, multigenerational family segregating symmetrical cutaneous SVLV affecting the cheeks, thighs and hands. In all affected family members the disease presented in early infancy and there was no evidence for an association with systemic disease. Skin biopsy of lesions showed a lymphocytic vasculitis with red blood cell extravasation. Complementary studies, with extensive investigation focused on dysfunction of the immunological system, were negative. The pattern of inheritance of SVLV in the family was compatible with an autosomal dominantly acting disease gene with incomplete penetrance. To localize the disease-causing gene in the family, a genome-wide linkage search was conducted using a high-density SNP array. Haplotype construction and analysis of recombination events permitted the minimal interval defining the disease locus to be refined to a 4.7 Mb region on chromosome 6q26–q27. The genes CCR6 and GPR31, which map to the linked region, represent plausible candidates for the disease on the basis of their biological function. Extensive screening of both genes by mutational analysis failed to identify a deleterious mutation in the family.

7.

Background

Despite the dramatic reduction in the cost of high-density genotyping that has occurred over the last decade, it remains one of the limiting factors for obtaining the large datasets required for genomic studies of disease in the horse. In this study, we investigated the potential for low-density genotyping and subsequent imputation to address this problem.

Results

Using the haplotype phasing and imputation program, BEAGLE, it is possible to impute genotypes from low- to high-density (50K) in the Thoroughbred horse with reasonable to high accuracy. Analysis of the sources of variation in imputation accuracy revealed dependence both on the minor allele frequency of the single nucleotide polymorphisms (SNPs) being imputed and on the underlying linkage disequilibrium structure. Whereas equidistant spacing of the SNPs on the low-density panel worked well, optimising SNP selection to increase their minor allele frequency was advantageous, even when the panel was subsequently used in a population of different geographical origin. Replacing base pair position with linkage disequilibrium map distance reduced the variation in imputation accuracy across SNPs. Whereas a 1K SNP panel was generally sufficient to ensure that more than 80% of genotypes were correctly imputed, other studies suggest that a 2K to 3K panel is more efficient to minimize the subsequent loss of accuracy in genomic prediction analyses. The relationship between accuracy and genotyping costs for the different low-density panels suggests that a 2K SNP panel would represent good value for money.

Conclusions

Low-density genotyping with a 2K SNP panel followed by imputation provides a compromise between cost and accuracy that could promote more widespread genotyping, and hence the use of genomic information in horses. In addition to offering a low cost alternative to high-density genotyping, imputation provides a means to combine datasets from different genotyping platforms, which is becoming necessary since researchers are starting to use the recently developed equine 70K SNP chip. However, more work is needed to evaluate the impact of between-breed differences on imputation accuracy.

8.
A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, on chromosomes 8 and 18, respectively. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree.

9.
10.
Liu W, Zhao W, Chase GA. Human Heredity 2006, 61(1): 31-44
OBJECTIVE: Single nucleotide polymorphisms (SNPs) serve as effective markers for localizing disease susceptibility genes, but current genotyping technologies are inadequate for genotyping all available SNP markers in a typical linkage/association study. Much attention has recently been paid to methods for selecting the minimal informative subset of SNPs in identifying haplotypes, but there has been little investigation of the effect of missing or erroneous genotypes on the performance of these SNP selection algorithms and subsequent association tests using the selected tagging SNPs. The purpose of this study is to explore the effect of missing genotypes or genotyping error on tagging SNP selection and subsequent single marker and haplotype association tests using the selected tagging SNPs. METHODS: Through two sets of simulations, we evaluated the performance of three tagging SNP selection programs in the presence of missing or erroneous genotypes: Clayton's diversity based program htstep, Carlson's linkage disequilibrium (LD) based program ldSelect, and Stram's coefficient of determination based program tagsnp.exe. RESULTS: When randomly selected known loci were relabeled as 'missing', we found that the average number of tagging SNPs selected by all three algorithms changed very little and the power of subsequent single marker and haplotype association tests using the selected tagging SNPs remained close to the power of these tests in the absence of missing genotypes. When random genotyping errors were introduced, we found that the average number of tagging SNPs selected by all three algorithms increased. In data sets simulated according to the haplotype frequencies in the CYP19 region, Stram's program had a larger increase than Carlson's and Clayton's programs. In data sets simulated under the coalescent model, Carlson's program had the largest increase and Clayton's program had the smallest increase. In both sets of simulations, with the presence of genotyping errors, the power of the haplotype tests from all three programs decreased quickly, but there was not much reduction in power of the single marker tests. CONCLUSIONS: Missing genotypes do not seem to have much impact on tagging SNP selection and subsequent single marker and haplotype association tests. In contrast, genotyping errors could have severe impact on tagging SNP selection and haplotype tests, but not on single marker tests.

11.
Individual genotyping of single nucleotide polymorphisms (SNPs) remains expensive, especially for linkage disequilibrium mapping strategies involving high-throughput SNP genotyping. On one hand, current methods may suit scientific and laboratory needs in regard to accuracy, reproducibility/robustness, and large-scale application. On the other hand, a cheaper and less time-consuming alternative to individual genotyping is the use of SNP allele frequencies determined in DNA pools. We have developed an accurate and reproducible protocol for allele frequency determination using Pyrosequencing technology in large genomic DNA pools (374 individuals). The measured correlation (R²) in large DNA pools was 0.980. In the context of disease-associated SNP studies, we compared the allele frequencies between the disease (e.g., type 2 diabetes and obesity) and control groups detected by either individual genotyping or Pyrosequencing of DNA pools. In large pools, the variation between the two methods was 1.5 +/- 0.9%. It may be concluded that the allele frequency determination protocol could reliably detect over 4% differences between populations. The method is economical in regard to amounts of DNA, PCR, and primer extension reagents required. Furthermore, it allows the rapid determination of allele frequency differences in case/control groups for association studies and susceptibility gene discovery in complex diseases.

12.
Homologous meiotic recombination occurs in most sexually reproducing organisms, yet its evolutionary advantages are elusive. Previous research explored recombination in the honeybee, a eusocial hymenopteran with an exceptionally high genome-wide recombination rate. A comparable study in a non-social member of the Hymenoptera that would disentangle the impact of sociality from Hymenoptera-specific features such as haplodiploidy on the evolution of the high genome-wide recombination rate in social Hymenoptera is missing. Utilizing single-nucleotide polymorphisms (SNPs) between two Nasonia parasitoid wasp genomes, we developed a SNP genotyping microarray to infer a high-density linkage map for Nasonia. The map comprises 1,255 markers with an average distance of 0.3 cM. The mapped markers enabled us to arrange 265 scaffolds of the Nasonia genome assembly 1.0 on the linkage map, representing 63.6% of the assembled N. vitripennis genome. We estimated a genome-wide recombination rate of 1.4–1.5 cM/Mb for Nasonia, which is less than one tenth of the rate reported for the honeybee. The local recombination rate in Nasonia is positively correlated with the distance to the center of the linkage groups, GC content, and the proportion of simple repeats. In contrast to the honeybee genome, gene density in the parasitoid wasp genome is positively associated with the recombination rate; regions of low recombination are characterized by fewer genes with larger introns and by a greater distance between genes. Finally, we found that genes in regions of the genome with a low recombination frequency tend to have a higher ratio of non-synonymous to synonymous substitutions, likely due to the accumulation of slightly deleterious non-synonymous substitutions. These findings are consistent with the hypothesis that recombination reduces interference between linked sites and thereby facilitates adaptive evolution and the purging of deleterious mutations. Our results imply that the genomes of haplodiploid and of diploid higher eukaryotes do not differ systematically in their recombination rates and associated parameters.

13.
High-throughput SNP genotyping is widely used for plant genetic studies. Recently, a RICE6K SNP array has been developed based on the Illumina Bead Array platform and Infinium SNP assay technology for genome-wide evaluation of allelic variations and breeding applications. In this study, the RICE6K SNP array was used to genotype a recombinant inbred line (RIL) population derived from the cross between the indica variety, Zhenshan 97, and the japonica variety, Xizang 2. A total of 3324 SNP markers of high quality were identified and were grouped into 1495 recombination bins in the RIL population. A high-density linkage map, consisting of the 1495 bins, was developed, covering 1591.2 cM with an average length of 1.1 cM per bin. Segregation distortions were observed in 24 regions of the 11 chromosomes in the RILs. One half of the distorted regions contained fertility genes that had been previously reported. A total of 23 QTLs were identified for yield. Seven QTLs were detected for the first time in this study. The positive alleles from about half of the identified QTLs came from Zhenshan 97 and they had lower phenotypic values than Xizang 2. This indicated that favorable alleles for breeding were dispersed in both parents and pyramiding favorable alleles could develop elite lines. The size of the mapping population for QTL analysis using a high-throughput SNP genotyping platform is also discussed.

14.
Kiwifruit (Actinidia spp.) is a woody, perennial and deciduous vine. In this genus, there are multiple ploidy levels but the main cultivated cultivars are polyploid. Despite the availability of many genomic resources in kiwifruit, SNP genotyping is still a challenge given these different levels of polyploidy. Recent advances in SNP array technologies have offered a high-throughput genotyping platform for genome-wide DNA polymorphisms. In this study, we developed a high-density SNP genotyping array to facilitate genetic studies and breeding applications in kiwifruit. SNP discovery was performed by genome-wide DNA sequencing of 40 kiwifruit genotypes. The identified SNPs were stringently filtered for sequence quality, predicted conversion performance and distribution over the available Actinidia chinensis genome. A total of 134 729 unique SNPs were put on the array. The array was evaluated by genotyping 400 kiwifruit individuals. We performed a multidimensional scaling analysis to assess the diversity of kiwifruit germplasm, showing that the array was effective in distinguishing kiwifruit accessions. Using a tetraploid F1 population, we constructed an integrated linkage map covering 3060.9 cM across 29 linkage groups and performed QTL analysis for the sex locus that has been identified on Linkage Group 3 (LG3) in Actinidia arguta. Finally, our dataset presented evidence of tetrasomic inheritance with partial preferential pairing in A. arguta. In conclusion, we developed and evaluated a 135K SNP genotyping array for kiwifruit. It has the advantage of a comprehensive design that can be an effective tool in genetic studies and breeding applications in this high-value crop.

15.
The determination of relatedness between individuals in a family is crucial in analysis of common complex diseases. We present a method to infer close inter-familial relationships based on SNP genotyping data and provide the relationship coefficient of kinship in Korean families. We obtained blood samples from 43 Korean individuals in two families. SNP data were obtained using the Affymetrix Genome-wide Human SNP array 6.0 and the Illumina Human 1M-Duo chip. To measure the kinship coefficient with the SNP genotyping data, we considered all possible pairs of individuals in each family. The genetic distance between two individuals in a pair was determined using the allele sharing distance method. The results show that genetic distance is proportional to the kinship coefficient and that a close degree of kinship can be confirmed with SNP genotyping data. This study represents the first attempt to identify the genetic distance between very closely related individuals. [BMB Reports 2013; 46(6): 305-309]
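The allele sharing distance mentioned above can be sketched as follows. This is one common formulation, not necessarily the exact one used in the study: genotypes are assumed to be coded as counts of one allele (0, 1 or 2) per SNP, the per-SNP distance is the number of alleles not shared, and the result is scaled to lie between 0 and 1. The function name and the use of `None` for missing calls are choices made for this example.

```python
def allele_sharing_distance(g1, g2):
    """Mean allele sharing distance between two individuals.

    g1, g2: sequences of genotypes coded 0, 1 or 2 (copies of one allele),
    with None for a missing call. SNPs missing in either individual are
    skipped. Returns a value in [0, 1]: 0 if all alleles are shared.
    """
    pairs = [(a, b) for a, b in zip(g1, g2) if a is not None and b is not None]
    if not pairs:
        raise ValueError("no SNPs genotyped in both individuals")
    # |a - b| is 0, 1 or 2 alleles not shared; divide by 2 per SNP to scale.
    return sum(abs(a - b) for a, b in pairs) / (2 * len(pairs))


parent = [0, 1, 2, 1, 0, 2]
child = [1, 1, 1, 0, 0, 2]
print(allele_sharing_distance(parent, child))
```

Under this coding, closer relatives share more alleles on average and so obtain smaller distances, which fits the relationship between genetic distance and kinship coefficient that the abstract reports.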

16.
Genetic studies in Turkish, Native American, European American, and African American (AA) families have linked chromosome 18q21.1–23 to susceptibility for diabetes-associated nephropathy. In this study, we have carried out fine linkage mapping in the 18q region previously linked to diabetic nephropathy in AAs by genotyping both microsatellite and single nucleotide polymorphisms (SNPs) for linkage analysis in an expanded set of 223 AA families multiplexed for type 2 diabetes associated ESRD (T2DM-ESRD). Several approaches were used to evaluate evidence of linkage with the strongest evidence for linkage in ordered subset analysis with an earlier age of T2DM diagnosis compared to the remaining pedigrees (LOD 3.9 at 90.1 cM, ∆P = 0.0161, NPL P value = 0.00002). Overall, the maximum LODs and LOD-1 intervals vary in magnitude and location depending upon analysis. The linkage mapping was followed up by performing a dense SNP map, genotyping 2,814 SNPs in the refined LOD-1 region in 1,029 AA T2DM-ESRD cases and 1,027 AA controls. Of the top 25 most associated SNPs, 10 resided within genic regions. Two candidate genes stood out: NEDD4L and SERPINB7. SNP rs512099, located in intron 1 of NEDD4L, was associated under a dominant model of inheritance [P value = 0.0006; Odds ratio (95% Confidence Interval) OR (95% CI) = 0.70 (0.57–0.86)]. SNP rs1720843, located in intron 2 of SERPINB7, was associated under a recessive model of inheritance [P value = 0.0017; OR (95% CI) = 0.65 (0.50–0.85)]. Collectively, these results suggest that multiple genes in this region may influence diabetic nephropathy susceptibility in AAs.

17.
Linkage analysis identifies markers that appear to be co-inherited with a trait within pedigrees. The inheritance of a chromosomal segment may be probabilistically reconstructed, with missing data complicating inference. Inheritance patterns are further obscured in the analysis of complex traits, where variants in one or more genes may contribute to phenotypic variation within a pedigree. In this case, determining which relatives share a trait variant is not simple. We describe how to represent these patterns of inheritance for marker loci. We summarize how to sample patterns of inheritance consistent with genotypic and pedigree data using gl_auto, available in MORGAN v3.0. We describe identification of classes of equivalent inheritance patterns with the program IBDgraph. We finally provide an example of how these programs may be used to simplify interpretation of linkage analysis of complex traits in general pedigrees. We borrow information across loci in a parametric linkage analysis of a large pedigree. We explore the contribution of each equivalence class to a linkage signal, illustrate estimated patterns of identity-by-descent sharing, and identify a haplotype tagging the chromosomal segment driving the linkage signal. Haplotype carriers are more likely to share the linked trait variant, and can be prioritized for subsequent DNA sequencing.

18.
SUMMARY: The high cost of genotyping single nucleotide polymorphisms (SNPs) generally prohibits the systematic mapping of entire genetic linkage regions in order to find the polymorphisms associated with increased risk of disease. In practice, SNPs are selected at approximately equal spacing across the linkage region to try to locate a SNP lying in the haplotype block of the disease SNP. The size of the haplotype block may not be known, however, and SNPs taken from public domain sources may not in fact be polymorphic. Our program will choose a subset of the SNPs in a linkage region so as to maximize the expected proportion of the sequence that lies within a given distance of a real SNP. AVAILABILITY: The software is available, free of charge, for academic use on request from the authors. SUPPLEMENTARY INFORMATION: www.oxagen.co.uk

19.
BACKGROUND: Neural tube defects (NTDs) are considered complex, with both genetic and environmental factors implicated. To date, no major causative genes have been identified in humans despite several investigations. The first genomewide screen in NTDs demonstrated evidence of linkage to chromosomes 7 and 10. This screen included 44 multiplex families and consisted of 402 microsatellite markers spaced approximately 10 cM apart. Further investigation of the genomic screen data identified a single large multiplex family, pedigree 8776, as primarily driving the linkage results on chromosome 7. METHODS: To investigate this family more thoroughly, a high-density single nucleotide polymorphism (SNP) screen was performed. Two-point and multipoint linkage analyses were performed using both parametric and nonparametric methods. RESULTS: For both the microsatellite and SNP markers, linkage analysis suggested the involvement of a locus or loci proximal to the telomeric regions of chromosomes 2q and 7p, with both regions generating a LOD* score of 3.0 using a nonparametric identity by descent relative sharing method. CONCLUSIONS: The regions with the strongest evidence for linkage map proximal to the telomeres on these two chromosomes. In addition to mutations and/or variants in a major gene, these loci may harbor a microdeletion and/or translocation; potentially, polygenic factors may also be involved. This single family may be promising for narrowing the search for NTD susceptibility genes.

20.
OBJECTIVES: Describe the inflation in nonparametric multipoint LOD scores due to inter-marker linkage disequilibrium (LD) across many markers with varied allele frequencies. METHOD: Using simulated two-generation families with and without parents, we conducted nonparametric multipoint linkage analysis with 2 to 10 markers with minor allele frequencies (MAF) of 0.5 and 0.1. RESULTS: Misspecification of population haplotype frequencies by assuming linkage equilibrium caused inflated multipoint LOD scores due to inter-marker LD when parental genotypes were not included. Inflation increased as more markers in LD were included and decreased as markers in equilibrium were added. When marker allele frequencies were unequal, the r² measure of LD was a better predictor of inflation than D'. CONCLUSION: This observation strongly supports the evaluation of LD in multipoint linkage analyses, and further suggests that unaccounted for LD may be suspected when two-point and multipoint linkage analyses show a marked disparity in regions with elevated r² measures of LD. Given the increasing popularity of high-density genome-wide SNP screens, inter-marker LD should be a concern in future linkage studies.
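The two LD measures compared above can be computed from haplotype and allele frequencies with the standard definitions D = p_AB − p_A·p_B, D' = |D| / D_max and r² = D² / (p_A(1 − p_A)·p_B(1 − p_B)). The sketch below is illustrative only: the function name and the example frequencies are assumptions chosen to match the MAFs of 0.5 and 0.1 mentioned in the abstract, and they are not data from the study.

```python
def ld_measures(p_ab, p_a, p_b):
    """Return (D, |D'|, r^2) for two biallelic markers.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a, p_b: frequencies of allele A and allele B (strictly between 0 and 1)
    """
    if not (0 < p_a < 1 and 0 < p_b < 1):
        raise ValueError("allele frequencies must lie strictly between 0 and 1")
    d = p_ab - p_a * p_b
    # D_max is the largest |D| attainable given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2


# Equal allele frequencies, alleles always co-occurring: D' and r^2 agree.
print(ld_measures(0.5, 0.5, 0.5))
# Unequal frequencies (0.5 and 0.1), rare allele always on the A haplotype:
# D' is at its maximum while r^2 is much smaller.
print(ld_measures(0.1, 0.5, 0.1))
```

The second call shows how the two measures can diverge when allele frequencies are unequal, which is the situation in which the abstract reports r² to be the better predictor of LOD score inflation.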
