首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 599 毫秒
1.
The identification of genes for monogenic disorders has proven to be highly effective for understanding disease mechanisms, pathways and gene function in humans. Nevertheless, while thousands of Mendelian disorders have not yet been mapped there has been a trend away from studying single-gene disorders. In part, this is due to the fact that many of the remaining single-gene families are not large enough to map the disease locus to a single site in the genome. New tools and approaches are needed to allow researchers to effectively tap into this genetic gold-mine. Towards this goal, we have used haploid cell lines to experimentally validate the use of high-density single nucleotide polymorphism (SNP) arrays to define genome-wide haplotypes and candidate regions, using a small amyotrophic lateral sclerosis (ALS) family as a prototype. Specifically, we used haploid-cell lines to determine if high-density SNP arrays accurately predict haplotypes across entire chromosomes and show that haplotype information significantly enhances the genetic information in small families. Panels of haploid-cell lines were generated and a 5 centimorgan (cM) short tandem repeat polymorphism (STRP) genome scan was performed. Experimentally derived haplotypes for entire chromosomes were used to directly identify regions of the genome identical-by-descent in 5 affected individuals. Comparisons between experimentally determined and in silico haplotypes predicted from SNP arrays demonstrate that SNP analysis of diploid DNA accurately predicted chromosomal haplotypes. These methods precisely identified 12 candidate intervals, which are shared by all 5 affected individuals. Our study illustrates how genetic information can be maximized using readily available tools as a first step in mapping single-gene disorders in small families.  相似文献   

2.
Genome-wide linkage disequilibrium analysis in bread wheat and durum wheat.   总被引:3,自引:0,他引:3  
Bread wheat and durum wheat were examined for linkage disequilibrium (LD) using microsatellite markers distributed across the genome. The allele database consisted of 189 bread wheat accessions genotyped at 370 loci and 93 durum wheat accessions genotyped at 245 loci. A significance level of p < 0.001 was set for all comparisons. The bread and durum wheat collections showed that 47.9% and 14.0% of all locus pairs were in LD, respectively. LD was more prevalent between loci on the same chromosome compared with loci on independent chromosomes and was highest between adjacent loci. Only a small fraction (bread wheat, 0.9%; durum wheat, 3.2%) of the locus pairs in LD showed R2 values > 0.2. The LD between adjacent locus pairs extended (R2 > 0.2) approximately 2-3 cM, on average, but some regions of the bread and durum wheat genomes showed high levels of LD (R2 = 0.7 and 1.0, respectively) extending 41.2 and 25.5 cM, respectively. The wheat collections were clustered by similarity into subpopulations using unlinked microsatellite data and the software Structure. Analysis within subpopulations showed 14- to 16-fold fewer locus pairs in LD, higher R2 values for those pairs in LD, and LD extending further along the chromosome. The data suggest that LD mapping of wheat can be performed with simple sequence repeats to a resolution of <5 cM.  相似文献   

3.
Recent studies report a surprisingly high degree of marker-to-marker linkage disequilibrium (LD) in ruminant livestock populations. This has important implications for QTL mapping and marker-assisted selection. This study evaluated LD between microsatellite markers in a number of breeding populations of layer chickens using the standardized chi-square (chi(2')) measure. The results show appreciable LD among markers separated by up to 5 cM, decreasing rapidly with increased separation between markers. The LD within 5 cM was strongly conserved across generations and differed among chromosomal regions. Using marker-to-marker LD as an indication for marker-QTL LD, a genome scan of markers spaced 2 cM apart at moderate power would have good chances of uncovering most QTL segregating in these populations. However, of markers showing significant trait associations, only 57% are expected to be within 5 cM of the responsible QTL, and the remainder will be up to 20 cM away. Thus, high-resolution LD mapping of QTL will require dense marker genotyping across the region of interest to allow for interval mapping of the QTL.  相似文献   

4.
Haplotype diversity in a genomic region of approximately 70 kb in 1q21 between genes PKLR and GBA was characterized by typing one single nucleotide polymorphism (SNP) in PKLR, two SNPs in GBA and one short tandem repeat polymorphism (STRP) in PKLR in 1792 chromosomes from 17 worldwide populations. Two other SNPs in GBA were typed in three African populations. Most chromosomes carried one of either two phylogenetically distinct haplotypes with different alleles at each site. Allele diversity at the STRP was tightly linked to haplotype background. Linkage disequilibrium (LD) was highly significant for all SNP pairs in all populations, although it was, on average, slightly higher in non-African populations than in sub-Saharan Africans. Variation at PKLR-GBA was also tightly linked to that at the GBA pseudogene, 16 kb downstream from GBA. Thus, a 90 kb-long LD block was observed, which points to a low recombination rate in this region. Detailed haplotype phylogeny suggests that the chimpanzee GBA haplotype is not one of the two most frequent haplotypes. Based on variability at the PKLR STRP and on the geographical distribution of LD, the expansion of the two main haplotypes may have predated the "Out of Africa" expansion of anatomically modern humans. LD and STRP variability in non-Africans are approximately 87% of those in Africans, in contrast with other loci; this implies that the "out of Africa" bottleneck may have had a broad distribution of effects across loci.  相似文献   

5.

Background  

Short tandem repeat polymorphisms (STRPs) are powerful tools for gene mapping and other applications. A STRP genome scan of 10 cM is usually adequate for mapping single gene disorders. However mapping studies involving genetically complex disorders and especially association (linkage disequilibrium) often require higher STRP density.  相似文献   

6.
Drought often delays developmental events so that plant height and above-ground biomass are reduced, resulting in yield loss due to inadequate photosynthate. In this study, plant height and biomass measured by the Normalized Difference Vegetation Index (NDVI) were used as criteria for drought tolerance. A total of 305 lines representing temperate, tropical and subtropical maize germplasm were genotyped using two single nucleotide polymorphism (SNP) chips each containing 1536 markers, from which 2052 informative SNPs and 386 haplotypes each constructed with two or more SNPs were used for linkage disequilibrium (LD) or association mapping. Single SNP- and haplotype-based LD mapping identified two significant SNPs and three haplotype loci [a total of four quantitative trait loci (QTL)] for plant height under well-watered and water-stressed conditions. For biomass, 32 SNPs and 12 haplotype loci (30 QTL) were identified using NDVIs measured at seven stages under the two water regimes. Some significant SNP and haplotype loci for NDVI were shared by different stages. Comparing significant loci identified by single SNP- and haplotype-based LD mapping, we found that six out of the 14 chromosomal regions defined by haplotype loci each included at least one significant SNP for the same trait. Significant SNP haplotype loci explained much higher phenotypic variation than individual SNPs. Moreover, we found that two significant SNPs (two QTL) and one haplotype locus were shared by plant height and NDVI. The results indicate the power of comparative LD mapping using single SNPs and SNP haplotypes with QTL shared by plant height and biomass as secondary traits for drought tolerance in maize.  相似文献   

7.
Population-based mapping approaches are attractive for tracing the genetic background to phenotypic traits in wild species, given that it is often difficult to gather extensive and well-defined pedigrees needed for quantitative trait locus analysis. However, the feasibility of association or hitch-hiking mapping is dependent on the degree of linkage disequilibrium (LD) in the population, on which there is yet limited information for wild species. Here we use single nucleotide polymorphism (SNP) markers from 23 genes in a recently established linkage map of the Z chromosome of the collared flycatcher, to study the extent of LD in a natural bird population. In most but not all cases we find SNPs within the same intron (less than 500 bp) to be in perfect LD. However, LD then decays to background level at a distance 1cM or 400-500 kb. Although LD seems more extensive than in other species, if the observed pattern is representative for other regions of the genome and turns out to be a general feature of natural bird populations, dense marker maps might be needed for genome scans aimed at identifying association between marker and trait loci.  相似文献   

8.
Single-marker linkage-disequilibrium (LD) methods cannot fully describe disequilibrium in an entire chromosomal region surrounding a disease allele. With the advent of myriad tightly linked microsatellite markers, we have an opportunity to extend LD analysis from single markers to multiple-marker haplotypes. Haplotype analysis has increased statistical power to disclose the presence of a disease locus in situations where it correctly reflects the historical process involved. For maximum efficiency, evidence of LD ought to come not just from a single haplotype, which may well be rare, but in addition from many similar haplotypes that could have descended from the same ancestral founder but have been trimmed in succeeding generations. We present such an analysis, called the "trimmed-haplotype method." We focus on chromosomal regions that are small enough that disequilibrium in significant portions of them may have been preserved in some pedigrees and yet that contain enough markers to minimize coincidental occurrence of the haplotype in the absence of a disease allele: perhaps regions 1-2 cM in length. In general, we could have no idea what haplotype an ancestral founder carried generations ago, nor do we usually have a precise chromosomal location for the disease-susceptibility locus. Therefore, we must search through all possible haplotypes surrounding multiple locations. Since such repeated testing obliterates the sampling distribution of the test, we employ bootstrap methods to calculate significance levels. Trimmed-haplotype analysis is performed on family data in which genotypes have been assembled into haplotypes. It can be applied either to conventional parent-affected-offspring triads or to multiplex pedigrees. We present a method for summarizing the LD evidence, in any pedigree, that can be employed in trimmed-haplotype analysis as well as in other methods.  相似文献   

9.
Xu P  Wu X  Wang B  Liu Y  Ehlers JD  Close TJ  Roberts PA  Diop NN  Qin D  Hu T  Lu Z  Li G 《PloS one》2011,6(1):e15952
Asparagus bean (Vigna. unguiculata ssp. sesquipedialis) is a distinctive subspecies of cowpea [Vigna. unguiculata (L.) Walp.] that apparently originated in East Asia and is characterized by extremely long and thin pods and an aggressive climbing growth habit. The crop is widely cultivated throughout Asia for the production of immature pods known as 'long beans' or 'asparagus beans'. While the genome of cowpea ssp. unguiculata has been characterized recently by high-density genetic mapping and partial sequencing, little is known about the genome of asparagus bean. We report here the first genetic map of asparagus bean based on SNP and SSR markers. The current map consists of 375 loci mapped onto 11 linkage groups (LGs), with 191 loci detected by SNP markers and 184 loci by SSR markers. The overall map length is 745 cM, with an average marker distance of 1.98 cM. There are four high marker-density blocks distributed on three LGs and three regions of segregation distortion (SDRs) identified on two other LGs, two of which co-locate in chromosomal regions syntenic to SDRs in soybean. Synteny between asparagus bean and the model legume Lotus. japonica was also established. This work provides the basis for mapping and functional analysis of genes/QTLs of particular interest in asparagus bean, as well as for comparative genomics study of cowpea at the subspecies level.  相似文献   

10.
Genomic distribution of MITEs in barley determined by MITE-AFLP mapping.   总被引:1,自引:0,他引:1  
H Takahashi  H Akagi  K Mori  K Sato  K Takeda 《Génome》2006,49(12):1616-1620
Miniature inverted-repeat transposable elements (MITEs) represent a large superfamily of transposons that are moderately to highly repetitive and frequently found near or within plant genes. To elucidate the organization of MITEs in the barley genome, MITEs were integrated into the genetic map of barley. In this report, we describe the use of MITEs in amplified fragment length polymorphism (AFLP) mapping, and demonstrate their superiority over conventional AFLP mapping. Barley MITEs include members of the Stowaway, Barfly, and Pangrangja families. By amplifying the flanking sequences of these MITEs, a total of 214 loci were mapped from a population of 93 doubled-haploid segregating individuals between Hordeum vulgare ssp. vulgare and H. vulgare ssp. spontaneum. The 214 MITE-AFLP and 40 anchor simple sequence repeat (SSR) loci were distributed on 7 linkage groups, covering a total map distance of 1 165 cM. The average marker density on each chromosome ranged between 3.4 and 9.6 cM per locus. Only 1 MITE-based locus was frequently found to be associated with MITE loci from the same family, resulting in clusters in chromosomal subregions. In barley, it will be possible to cover the entire genome with a limited set of MITE-based primers and to build highly dense maps of specific regions.  相似文献   

11.
Mutations at the cystic fibrosis transmembrane conductance regulator gene (CFTR) cause cystic fibrosis, the most prevalent severe genetic disorder in individuals of European descent. We have analyzed normal allele and haplotype variation at four short tandem repeat polymorphisms (STRPs) and two single-nucleotide polymorphisms (SNPs) in CFTR in 18 worldwide population samples, comprising a total of 1,944 chromosomes. The rooted phylogeny of the SNP haplotypes was established by typing ape samples. STRP variation within SNP haplotype backgrounds was highest in most ancestral haplotypes-although, when STRP allele sizes were taken into account, differences among haplotypes became smaller. Haplotype background determines STRP diversity to a greater extent than populations do, which indicates that haplotype backgrounds are older than populations. Heterogeneity among STRPs can be understood as the outcome of differences in mutation rate and pattern. STRP sites had higher heterozygosities in Africans, although, when whole haplotypes were considered, no significant differences remained. Linkage disequilibrium (LD) shows a complex pattern not easily related to physical distance. The analysis of the fraction of possible different haplotypes not found may circumvent some of the methodological difficulties of LD measure. LD analysis showed a positive correlation with locus polymorphism, which could partly explain the unusual pattern of similar LD between Africans and non-Africans. The low values found in non-Africans may imply that the size of the modern human population that emerged "Out of Africa" may be larger than what previous LD studies suggested.  相似文献   

12.
13.
Knowledge of the extent and range of linkage disequilibrium (LD), defined as non-random association of alleles at two or more loci, in animal populations is extremely valuable in localizing genes affecting quantitative traits, identifying chromosomal regions under selection, studying population history, and characterizing/managing genetic resources and diversity. Two commonly used LD measures, r(2) and D', and their permutation based adjustments, were evaluated using genotypes of more than 6,000 pigs from six commercial lines (two terminal sire lines and four maternal lines) at ~4,500 autosomal SNPs (single nucleotide polymorphisms). The results indicated that permutation only partially removed the dependency of D' on allele frequency and that r(2) is a considerably more robust LD measure. The maximum r(2) was derived as a function of allele frequency. Using the same genotype dataset, the extent of LD in these pig populations was estimated for all possible syntenic SNP pairs using r(2) and the ratio of r(2) over its theoretical maximum. As expected, the extent of LD highest for SNP pairs was found in tightest linkage and decreased as their map distance increased. The level of LD found in these pig populations appears to be lower than previously implied in several other studies using microsatellite genotype data. For all pairs of SNPs approximately 3 centiMorgan (cM) apart, the average r(2) was equal to 0.1. Based on the average population-wise LD found in these six commercial pig lines, we recommend a spacing of 0.1 to 1 cM for a whole genome association study in pig populations.  相似文献   

14.
Meuwissen TH  Goddard ME 《Genetics》2007,176(4):2551-2560
A novel multipoint method, based on an approximate coalescence approach, to analyze multiple linked markers is presented. Unlike other approximate coalescence methods, it considers all markers simultaneously but only two haplotypes at a time. We demonstrate the use of this method for linkage disequilibrium (LD) mapping of QTL and estimation of effective population size. The method estimates identity-by-descent (IBD) probabilities between pairs of marker haplotypes. Both LD and combined linkage and LD mapping rely on such IBD probabilities. The method is approximate in that it considers only the information on a pair of haplotypes, whereas a full modeling of the coalescence process would simultaneously consider all haplotypes. However, full coalescence modeling is computationally feasible only for few linked markers. Using simulations of the coalescence process, the method is shown to give almost unbiased estimates of the effective population size. Compared to direct marker and haplotype association analyses, IBD-based QTL mapping showed clearly a higher power to detect a QTL and a more realistic confidence interval for its position. The modeling of LD could be extended to estimate other LD-related parameters such as recombination rates.  相似文献   

15.
Gordon D  Simonic I  Ott J 《Genomics》2000,66(1):87-92
We explore the extent of deviations from Hardy-Weinberg equilibrium (HWE) at a marker locus and linkage disequilibrium (LD) between pairs of marker loci in the Afrikaner population of South Africa. DNA samples were used for genotyping of 23 loci on six chromosomes. The samples were collected from 91 healthy unrelated Afrikaner adults. Exact tests were used to determine evidence for deviations from HWE at a single marker locus or LD between pairs of marker loci. At the 0.05 level of significance, evidence was found for deviation from HWE at only one of the 23 loci. At the same level of significance, LD was found among 8 of the 34 intrachromosomal pairs of loci. On chromosome 21, there was evidence for LD (P = 0.02) between a pair of loci with a genetic distance of 5.51 cM. On chromosome 2, there was evidence for LD between a pair of loci with a genetic distance of 5.28 cM (P = 0.002) and a pair of loci with a genetic distance of 3.68 cM (P = 0.0004). Detailed analysis of LD for one locus pair indicated that only a few of all alleles participated in the LD and that strong LD was most often positive. Our findings indicate that Afrikaans-speaking Afrikaners represent one of those special populations deemed particularly suitable for disequilibrium mapping.  相似文献   

16.
BLASTX alignment between 189.5 Mb of rice genomic sequence and translated Arabidopsis thaliana annotated coding sequences (CDS) identified 60 syntenic regions involving 4–22 rice orthologs covering ≤3.2 cM (centiMorgan). Most regions are <3 cM in length. A detailed and updated version of a table representing these regions is available on our web site. Thirty-five rice loci match two distinct A.thaliana loci, as expected from the duplicated nature of the A.thaliana genome. One A.thaliana locus matches two distinct rice regions, suggesting that rice chromosomal sequence duplications exist. A high level of rearrangement characterizing the 60 syntenic regions illustrates the ancient nature of the speciation between A.thaliana and rice. The apparent reduced level of microcollinearity implies the dispersion to new genomic locations, via transposon activity, of single or small clusters of genes in the rice genome, which represents a significant additional effector of plant genome evolution.  相似文献   

17.
Evaluating the patterns of linkage disequilibrium (LD) is important for association mapping study as well as for studying the genomic architecture of human genome (e.g., haplotype block structures). Commonly used bi-allelic pairwise measures for assessing LD between two loci, such as r 2 and D′, may not make full and efficient use of modern multilocus data. Though extended to multilocus scenarios, their performance is still questionable. Meanwhile, most existing measures for an entire multilocus region, such as normalized entropy difference, do not consider existence of LD heterogeneity across the region under investigation. Additionally, these existing multilocus measures cannot handle distant regions where long-range LD patterns may exist. In this study, we proposed a novel multilocus LD measure developed based on mutual information theory. Our proposed measure described LD pattern between two chromosome regions each of which may consist of multiple loci (including multi-allele loci). As such, the proposed measure can better characterize LD patterns between two arbitrary regions. As potential applications, we developed algorithms on the proposed measure for partitioning haplotype blocks and for selecting haplotype tagging SNPs (htSNPs), which were helpful for follow-up association tests. The results on both simulated and empirical data showed that our LD measure had distinct advantages over pairwise and other multilocus measures. First, our measure was more robust, and can capture comprehensively the LD information between neighboring as well as disjointed regions. Second, haplotype blocks were better described via our proposed measure. Furthermore, association tests with htSNPs from the proposed algorithm had improved power over tests on single markers and on haplotypes.  相似文献   

18.
An integrated molecular linkage map of olive (Olea europaea L.) was constructed based on randomly amplified polymorphic DNA (RAPD), sequence characterized amplified region (SCAR), and microsatellite markers using the pseudo-testcross strategy. A mapping population of 104 individuals was generated from an F1 full-sib family of a cross between 'Frantoio' and 'Kalamata'. The hybridity of the mapping population was confirmed by genetic similarity and nonmetric multidimensional scaling. Twenty-three linkage groups were mapped for 'Kalamata', covering 759 cM of the genome with 89 loci and an average distance between loci of 11.5 cM. Twenty-seven linkage groups were mapped for 'Frantoio', covering 798 cM of the genome with 92 loci and an average distance between loci of 12.3 cM. For the integrated map, 15 linkage groups covered 879 cM of the genome with 101 loci and an average distance between loci of 10.2 cM. The size of the genomic DNA was estimated to be around 3000 cM. A sequence characterized amplified region marker linked to olive peacock disease resistance was mapped to linkage group 2 of the integrated map. These maps will be the starting point for studies on the structure, evolution, and function of the olive genome. When the mapping progeny pass through their juvenile phase and assume their adult characters, mapping morphological markers and identification of quantitative trait loci for adaptive traits will be the primary targets.  相似文献   

19.
Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this idea, we measured associations between short-tandem-repeat polymorphisms (STRPs), which can mutate rapidly and recurrently, and SNPs in 721 regions across the human genome. We directly compared STRP-SNP LD with SNP-SNP LD from the same genomic regions in the human HapMap populations. The intensity of STRP-SNP LD, measured by the average of D', was reduced, consistent with the action of recurrent mutation. Nevertheless, a higher fraction of STRP-SNP pairs than SNP-SNP pairs showed significant LD, on both short (up to 50 kb) and long (cM) scales. These results reveal the substantial effects of mutational processes on LD at STRPs and provide important measures of the potential of STRPs for association mapping of disease genes.  相似文献   

20.
Genome wide association studies using high throughput technology are already being conducted despite the significant hurdles that need to be overcome (Nat Rev Genet 6:95–108, 2005; Nat Rev Genet 6:109–118, 2005). Methods for detecting haplotype association signals in genome wide haplotype datasets are as yet very limited. Much methodological research has already been devoted to linkage disequilibrium (LD) fine mapping where the focus is the identification of the disease locus rather than the detection of a disease signal. Applications of these approaches to genome wide scanning are limited by the strong model assumptions of the sharing process, which lead to computational complexity. We describe a new algorithm for the initial identification of disease susceptibility loci in genome wide haplotype association studies. Excess sharing of ancestral haplotypes, which indicates the presence of a disease locus, is detected with a simple, easy to interpret, χ 2 based statistic. The method allows genome wide scanning for qualitative traits within reasonable computational timeframes and can serve as a first pass analysis prior to the usage of likelihood based methods, providing candidate regions and inferred susceptibility haplotypes. Our method makes no assumptions regarding the population history or the pattern of background LD. Statistical significance is evaluated with permutation tests. The method is illustrated on simulated and real data where it is applied to simple (cystic fibrosis) and complex disease (multiple sclerosis) examples. The statistic has low type I error and greater power to map disease loci over conventional single marker tests for low to moderate levels of LD.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号