首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Abstract

Determining haplotype‐specific DNA sequence information is very important in a wide range of research fields. However, no simple and robust approaches are currently available for determining haplotype‐specific sequence information. We have addressed this problem by developing a very simple and robust haplotype‐specific sequencing approach. We utilise the fact that DNA sequencing polymerases are sensitive to 3′end mismatches in the sequencing primer. By using two sequencing primers with 3′end corresponding to the two alleles in a given SNP locus, we are able to obtain allele‐specific DNA sequences from both alleles.

We evaluated this direct haplotype‐specific approach by determining haplotypes within the intron 2 sequence of the fructan‐6‐fructosyltransferase (6ft) gene in Lolium perenne L. We obtained reliable haplotype‐specific sequences for all primers and genotypes evaluated. We conclude that the haplotype‐specific sequencing is robust, and that the approach has a potentially very wide application range for any diploid organism.  相似文献   

2.
Determining haplotype-specific DNA sequence information is very important in a wide range of research fields. However, no simple and robust approaches are currently available for determining haplotype-specific sequence information. We have addressed this problem by developing a very simple and robust haplotype-specific sequencing approach. We utilise the fact that DNA sequencing polymerases are sensitive to 3'end mismatches in the sequencing primer. By using two sequencing primers with 3'end corresponding to the two alleles in a given SNP locus, we are able to obtain allele-specific DNA sequences from both alleles. We evaluated this direct haplotype-specific approach by determining haplotypes within the intron 2 sequence of the fructan-6-fructosyltransferase (6-ft) gene in Lolium perenne L. We obtained reliable haplotype-specific sequences for all primers and genotypes evaluated. We conclude that the haplotype-specific sequencing is robust, and that the approach has a potentially very wide application range for any diploid organism.  相似文献   

3.
RFLPs of 68 normal and 74 mutant alleles at the phenylalanine hydroxylase (PAH) locus were determined in 37 French kindreds. A total of 23 haplotypes, including 18 normal and 16 mutant alleles, were observed. Two-thirds of all mutant alleles were confined within only four haplotypes, while the last third was accounted for by 12 haplotypes, including eight haplotypes absent from Caucasian pedigrees reported thus far. Several mutant haplotypes were present in typical phenylketonuria only, others were present in variants only, and some were present in both. In addition, a particular mutant haplotype (haplotype 2) was found to harbor different mutations in our series, resulting in either typical phenylketonuria or in mild hyperphenylalaninemias. The diploid combination of so many mutant haplotypes in PAH-deficient patients and of compound heterozygosity at the PAH locus in southern Europe might account for the broad spectrum of individual phenotypes observed in France.  相似文献   

4.
Determining the underlying haplotypes of individual human genomes is an essential, but currently difficult, step toward a complete understanding of genome function. Fosmid pool-based next-generation sequencing allows genome-wide generation of 40-kb haploid DNA segments, which can be phased into contiguous molecular haplotypes computationally by Single Individual Haplotyping (SIH). Many SIH algorithms have been proposed, but the accuracy of such methods has been difficult to assess due to the lack of real benchmark data. To address this problem, we generated whole genome fosmid sequence data from a HapMap trio child, NA12878, for which reliable haplotypes have already been produced. We assembled haplotypes using eight algorithms for SIH and carried out direct comparisons of their accuracy, completeness and efficiency. Our comparisons indicate that fosmid-based haplotyping can deliver highly accurate results even at low coverage and that our SIH algorithm, ReFHap, is able to efficiently produce high-quality haplotypes. We expanded the haplotypes for NA12878 by combining the current haplotypes with our fosmid-based haplotypes, producing near-to-complete new gold-standard haplotypes containing almost 98% of heterozygous SNPs. This improvement includes notable fractions of disease-related and GWA SNPs. Integrated with other molecular biological data sets, this phase information will advance the emerging field of diploid genomics.  相似文献   

5.
In Leghorn (laying) chickens, susceptibility to a number of infectious diseases is strongly associated with the major histocompatibility ( B ) complex. Nucleotide sequence data have been published for six class I ( B-F ) alleles and for class II ( B-Lβ ) alleles or isotypes from 17 Leghorn haplotypes. It is not known if classical B-L or B-F alleles in broilers are identical, at the sequence level, to any Leghorn alleles. This report describes molecular and immunogenetic characterization of two haplotypes from commercial broiler breeder chickens that were originally identified by serology as a single haplotype, but were differentiated serologically in the present work. The two haplotypes, designated B A4 and B A4variant, shared identical B-G restriction fragment length polymorphism patterns, but differed in one B-Lβ fragment that cosegregated with the serological B haplotype. Furthermore, the nucleotide sequences of the highly variable exons of an expressed B-LβII family gene and B-F gene from the two haplotypes were markedly different from each other. Both the B-LβII family and B-F gene sequences from the B A4 haplotype were identical to the sequences obtained from the reference B 21 haplotype in Leghorns; however, in the B A4 haplotype the B-Lβ 21 and B-F 21 alleles were in linkage with B-G alleles that were not G 21. The nucleotide sequences from B A4variant were unique among the reported chicken B-LβII family and B-F alleles.  相似文献   

6.
Many methods exist for genotyping—revealing which alleles an individual carries at different genetic loci. A harder problem is haplotyping—determining which alleles lie on each of the two homologous chromosomes in a diploid individual. Conventional approaches to haplotyping require the use of several generations to reconstruct haplotypes within a pedigree, or use statistical methods to estimate the prevalence of different haplotypes in a population. Several molecular haplotyping methods have been proposed, but have been limited to small numbers of loci, usually over short distances. Here we demonstrate a method which allows rapid molecular haplotyping of many loci over long distances. The method requires no more genotypings than pedigree methods, but requires no family material. It relies on a procedure to identify and genotype single DNA molecules, and reconstruction of long haplotypes by a ‘tiling’ approach. We demonstrate this by resolving haplotypes in two regions of the human genome, harbouring 20 and 105 single-nucleotide polymorphisms, respectively. The method can be extended to reconstruct haplotypes of arbitrary complexity and length, and can make use of a variety of genotyping platforms. We also argue that this method is applicable in situations which are intractable to conventional approaches.  相似文献   

7.

Background

Almost all genome sequencing projects neglect the fact that diploid organisms contain two genome copies and consequently what is published is a composite of the two. This means that the relationship between alternate alleles at two or more linked loci is lost. We have developed a simplified method of directly obtaining the haploid sequences of each genome copy from an individual organism.

Results

The diploid sequences of three groups of cattle samples were obtained using a simple sample preparation procedure requiring only a microscope and a haemocytometer. Samples were: 1) lymphocytes from a single Angus steer; 2) sperm cells from an Angus bull; 3) lymphocytes from East African Zebu (EAZ) cattle collected and processed in a field laboratory in Eastern Kenya. Haploid sequence from a fosmid library prepared from lymphocytes of an EAZ cow was used for comparison. Cells were serially diluted to a concentration of one cell per microlitre by counting with a haemocytometer at each dilution. One microlitre samples, each potentially containing a single cell, were lysed and divided into six aliquots (except for the sperm samples which were not divided into aliquots). Each aliquot was amplified with phi29 polymerase and sequenced. Contigs were obtained by mapping to the bovine UMD3.1 reference genome assembly and scaffolds were assembled by joining adjacent contigs that were within a threshold distance of each other. Scaffolds that appeared to contain artefacts of CNV or repeats were filtered out leaving scaffolds with an N50 length of 27–133 kb and a 88–98 % genome coverage. SNP haplotypes were assembled with the Single Individual Haplotyper program to generate an N50 size of 97–201 kb but only ~27–68 % genome coverage. This method can be used in any laboratory with no special equipment at only slightly higher costs than conventional diploid genome sequencing. A substantial body of software for analysis and workflow management was written and is available as supplementary data.

Conclusions

We have developed a set of laboratory protocols and software tools that will enable any laboratory to obtain haplotype sequences at only modestly greater cost than traditional mixed diploid sequences.  相似文献   

8.
We have explored the use of multilocus microsatellite haplotypes to study introgression from cultivated (Malus domestica) into wild apple (Malus sylvestris), and to study gene flow among remnant populations of M. sylvestris. A haplotype consisted of alleles at microsatellite loci along one chromosome. As destruction of haplotypes through recombination occurs much faster than loss of alleles due to genetic drift, the lifespan of a multilocus haplotype is much shorter than that of the underlying alleles. When different populations share the same haplotype, this may indicate recent gene flow between populations. Similarly, haplotypes shared between two species would be a strong signal for introgression. As the expected lifespan of a haplotype depends on the strength of the linkage, the length [in centiMorgans (cM)] of the haplotype shared contains information on the number of generations passed. This application of shared haplotypes is distinct from using haplotype-sharing to detect association between markers and a certain trait. We inferred haplotypes for four to eight microsatellite loci on Linkage Group 10 of apple from genotype data using the program phase, and then identified those haplotypes shared between populations and species. Compared with a Bayesian analysis of unlinked microsatellite loci using the program structure, haplotype-sharing detected a partially different set of putative hybrids. Cultivated haplotypes present in M. sylvestris were short (< 1.5 cM), indicating that introgression had taken place many generations ago, except for two Belgian plants that contained a haplotype of 47.1 cM, indicating recent introgression. In the estimation of gene flow, F(ST) based on unlinked loci indicated small (0.032-0.058) but statistically significant differentiation between some populations only. However, various M. sylvestris haplotypes were shared in nearly all pairwise comparisons of populations, and their length indicated recent gene flow. Hence, all Dutch populations should be considered as one conservation unit. The added value of using sharing of multilocus microsatellite haplotypes as a source of population genetic information is discussed.  相似文献   

9.
10.
Joan D. Ferraris 《Hydrobiologia》1993,266(1-3):255-265
Molecular biological tools currently available to us are revolutionizing the way in which we can address questions in evolutionary biology. The purpose of this article is to provide an overview of molecular techniques and applications available to biologists who are interested in evolutionary studies but who have little acquaintance with molecular biology. In evolutionary biology, techniques designed to determine degree of nucleic acid similarity are in common use and will be dealt with first. Another approach, namely gene expression studies, has strong implications for evolutionary biology but generally requires substantial familiarity with molecular biological tools. Expression studies provide powerful tools for discerning processes of speciation, as in the selection of genetic variants, as well as discerning lineages, e.g., expression of specific homeobox genes during segment formation. For investigations where either nucleic acid identity or gene expression are the ultimate goal, detailed information, protocols and appropriate controls are beyond the scope of this work but, where possible, recent review articles are cited.  相似文献   

11.
Rohde K  Fürst R 《Human heredity》2003,56(1-3):41-47
In order to find association of genetic traits to some haplotypes from closely spaced multilocus phase-unknown genotypes we use a three-stage approach. Haplotype frequencies and the most likely haplotype pair for each individual are estimated from random samples of individual or small (nuclear) family genotypes via an EM algorithm. If the most likely haplotype pair configuration of the whole sample outweighs the less likely ones, we may consider the estimated haplotypes as alleles of a multi-allelic marker and carry out the conventional statistics, TDT or ANOVA for quantitative traits. If the most likely haplotype pair configuration and the less likely ones do not differ much in their weight, we sample the TDT or ANOVA statistic over all haplotype pair configurations using the Metropolis-Hastings algorithm. Applications of our method to simulated data as well as real data are given.  相似文献   

12.
Inference of haplotypes from PCR-amplified samples of diploid populations   总被引:51,自引:0,他引:51  
Direct sequencing of genomic DNA from diploid individuals leads to ambiguities on sequencing gels whenever there is more than one mismatching site in the sequences of the two orthologous copies of a gene. While these ambiguities cannot be resolved from a single sample without resorting to other experimental methods (such as cloning in the traditional way), population samples may be useful for inferring haplotypes. For each individual in the sample that is homozygous for the amplified sequence, there are no ambiguities in the identification of the allele's sequence. The sequences of other alleles can be inferred by taking the remaining sequence after "subtracting off" the sequencing ladder of each known site. Details of the algorithm for extracting allelic sequences from such data are presented here, along with some population-genetic considerations that influence the likelihood for success of the method. The algorithm also applies to the problem of inferring haplotype frequencies of closely linked restriction-site polymorphisms.  相似文献   

13.
14.
15.
Haplotype data are especially important in the study of complex diseases since it contains more information than genotype data. However, obtaining haplotype data is technically difficult and costly. Computational methods have proved to be an effective way of inferring haplotype data from genotype data. One of these methods, the haplotype inference by pure parsimony approach (HIPP), casts the problem as an optimization problem and as such has been proved to be NP-hard. We have designed and developed a new preprocessing procedure for this problem. Our proposed algorithm works with groups of haplotypes rather than individual haplotypes. It iterates searching and deleting haplotypes that are not helpful in order to find the optimal solution. This preprocess can be coupled with any of the current solvers for the HIPP that need to preprocess the genotype data. In order to test it, we have used two state-of-the-art solvers, RTIP and GAHAP, and simulated and real HapMap data. Due to the computational time and memory reduction caused by our preprocess, problem instances that were previously unaffordable can be now efficiently solved.  相似文献   

16.
Haplotypic sequences contain significantly more information than genotypes of genetic markers and are critical for studying disease association and genome evolution. Current methods for obtaining haplotypic sequences require the physical separation of alleles before sequencing, are time consuming and are not scaleable for large surveys of genetic variation. We have developed a novel method for acquiring haplotypic sequences from long PCR products using simple, high-throughput techniques. This method applies modified shotgun sequencing protocols to sequence both alleles concurrently, with read-pair information allowing the two alleles to be separated during sequence assembly. Although the haplotypic sequences can be assembled manually from the resultant data using pre-existing sequence assembly software, we have devised a novel heuristic algorithm to automate assembly and remove human error. We validated the approach on two long PCR products amplified from the human genome and confirmed the accuracy of our sequences against full-length clones of the same alleles. This method presents a simple high-throughput means to obtain full haplotypic sequences potentially up to 20 kb in length and is suitable for surveying genetic variation even in poorly-characterized genomes as it requires no prior information on sequence variation.  相似文献   

17.
A current strategy for obtaining haplotype information from several individuals involves short-read sequencing of pooled amplicons, where fragments from each individual is identified by a unique DNA barcode. In this paper, we report a new method to recover the phylogeny of haplotypes from short-read sequences obtained using pooled amplicons from a mixture of individuals, without barcoding. The method, AFPhyloMix, accepts an alignment of the mixture of reads against a reference sequence, obtains the single-nucleotide-polymorphisms (SNP) patterns along the alignment, and constructs the phylogenetic tree according to the SNP patterns. AFPhyloMix adopts a Bayesian inference model to estimate the phylogeny of the haplotypes and their relative abundances, given that the number of haplotypes is known. In our simulations, AFPhyloMix achieved at least 80% accuracy at recovering the phylogenies and relative abundances of the constituent haplotypes, for mixtures with up to 15 haplotypes. AFPhyloMix also worked well on a real data set of kangaroo mitochondrial DNA sequences.  相似文献   

18.
While standard DNA‐sequencing approaches readily yield genotypic sequence data, haplotype information is often of greater utility for population genetic analyses. However, obtaining individual haplotype sequences can be costly and time‐consuming and sometimes requires statistical reconstruction approaches that are subject to bias and error. Advancements have recently been made in determining individual chromosomal sequences in large‐scale genomic studies, yet few options exist for obtaining this information from large numbers of highly polymorphic individuals in a cost‐effective manner. As a solution, we developed a simple PCR‐based method for obtaining sequence information from individual DNA strands using standard laboratory equipment. The method employs a water‐in‐oil emulsion to separate the PCR mixture into thousands of individual microreactors. PCR within these small vesicles results in amplification from only a single starting DNA template molecule and thus a single haplotype. We improved upon previous approaches by including SYBR Green I and a melted agarose solution in the PCR, allowing easy identification and separation of individually amplified DNA molecules. We demonstrate the use of this method on a highly polymorphic estuarine population of the copepod Eurytemora affinis for which current molecular and computational methods for haplotype determination have been inadequate.  相似文献   

19.
We investigated the frequency, origin, and molecular basis of phenylketonuria (PKU) in U.S. blacks. On the basis of 10 years of Maryland newborn-screening data, we found the frequency to be 1/50,000, or one-third that in whites. We performed haplotype analysis of the phenylalanine hydroxylase (PAH) gene of 36 U.S. blacks, 16 from individuals with classical PKU and 20 from controls. In blacks, 20% of wild-type PAH alleles have a common Caucasian haplotype (i.e., haplotype 1), whereas 80% had a variety of haplotypes, all rare in Caucasians and Asians. One of these, haplotype 15, accounted for a large fraction (30%). Among black mutant PAH alleles, 20% have a haplotype (i.e., either haplotype 1 or haplotype 4) common in Caucasians; 40% have a haplotype rare in Caucasians and Asians, and 40% have one of two previously undescribed haplotypes. Both can be derived from known haplotypes by a single event. One of these haplotypes is characterized by a new MspI restriction site, located in intron 8, which was present in five of 16 black mutant alleles but was not present in 60 U.S. black control, 20 U.S. Caucasian control, or 20 Caucasian mutant PAH alleles. Sequence analysis of DNA from a single individual, homozygous for the new MspI associated haplotype, shows homozygosity for a C----T transition at nucleotide 896 in exon 7 of the PAH cDNA, resulting in the conversion of leucine 255 to serine (L255S).  相似文献   

20.
By direct sequence analysis of 94 mutant phenylalanine hydroxylase alleles using polymerase chain reaction-based techniques, we identified a C to T transition in exon 7 of the human phenylalanine hydroxylase gene that is associated with RFLP haplotypes 1 and 4. A leucine for proline substitution at position 281 can be predicted from the nucleotide sequence of the mutant codon. Expression analysis in cultured mammalian cells after site-directed mutagenesis proved that the base substitution is a disease causing gene lesion. Dot-blot hybridization analysis using allele-specific oligonucleotides revealed that 25% of all mutant haplotype 1 alleles in the German population bear this mutation. In addition, this mutation could be detected on one mutant haplotype 4 allele. The fact that this mutation is associated with only 25% of all mutant haplotype 1 alleles suggests that multiple mutations may be associated with this haplotype. The occurrence of several different mutations would be in agreement with the clinical heterogeneity observed in the group of patients whose PKU alleles belong to haplotype 1.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号