Similar literature
1.
Physical and linkage mapping underpin efforts to sequence and characterize the genomes of eukaryotic organisms by providing a skeleton framework for whole genome assembly. Hitherto, linkage and physical “contig” maps were generated independently prior to merging. Here, we develop a new and easy method, BAC HAPPY MAPPING (BAP mapping), that utilizes BAC library pools as a HAPPY mapping panel together with an Mbp-sized DNA panel to integrate the linkage and physical mapping efforts into one pipeline. Using Arabidopsis thaliana as an exemplar, a set of 40 Sequence Tagged Site (STS) markers spanning ∼10% of chromosome 4 were simultaneously assembled onto a BAP map compiled using both a series of BAC pools each comprising 0.7x genome coverage and dilute (0.7x genome) samples of sheared genomic DNA. The resultant BAP map overcomes the need for polymorphic loci to separate genetic loci by recombination and allows physical mapping in segments of suppressed recombination that are difficult to analyze using traditional mapping techniques. Even virtual “BAC-HAPPY-mapping” to convert BAC landing data into BAC linkage contigs is possible.
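The 0.7x coverage figure in this abstract can be given a rough intuition. The sketch below assumes, purely for illustration, that the number of copies of a locus in a pool follows a Poisson distribution with mean equal to the coverage. The abstract does not state this model, and the function name `presence_probability` is hypothetical.

```python
import math


def presence_probability(coverage: float) -> float:
    """Probability that a locus is present at least once in a pool.

    Assumes (an illustrative assumption, not taken from the abstract)
    that copy number per pool is Poisson-distributed with mean `coverage`.
    """
    if coverage < 0:
        raise ValueError("coverage must be non-negative")
    # P(at least one copy) = 1 - P(zero copies) = 1 - exp(-coverage)
    return 1.0 - math.exp(-coverage)


# Coverage value quoted in the abstract (0.7x genome per pool or sample).
p_present = presence_probability(0.7)
print(f"P(marker present in a 0.7x pool) = {p_present:.3f}")
```

Under this assumption each marker is present in roughly half of the pools, which is the regime in which presence/absence patterns across pools carry the most information about whether two markers lie close together.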

2.
HAPPY mapping was designed to pursue the analysis of approximately random HAPloid DNA breakage samples using the PolYmerase chain reaction for mapping genomes. In the present study, we improved the method and integrated two other molecular techniques into the process, whole genome amplification and the Sequenom SNP (single nucleotide polymorphism) genotyping assay, in order to facilitate whole genome mapping of X. tropicalis. The former technique amplified enough DNA material to genotype a large number of markers, while the latter allowed for relatively high throughput marker genotyping with multiplex assays on the HAPPY lines. A total of 58 X. tropicalis genes were genotyped on an initial panel of 383 HAPPY lines, which contributed to formation of a working panel of 146 lines. Further genotyping of 29 markers on the working panel led to construction of a HAPPY map for the X. tropicalis genome. We believe that our improved HAPPY method described in the present study has paved the way for the community to map different genomes with a simple, but powerful approach.

3.
ABSTRACT: BACKGROUND: Eimeria is a genus of parasites in the same phylum (Apicomplexa) as human parasites such as Toxoplasma, Cryptosporidium and the malaria parasite Plasmodium. As an apicomplexan whose life-cycle involves a single host, Eimeria is a convenient model for understanding this group of organisms. Although the genomes of the Apicomplexa are diverse, that of Eimeria is unique in being composed of large alternating blocks of sequence with very different characteristics - an arrangement seen in no other organism. This arrangement has impeded efforts to fully sequence the genome of Eimeria, which remains the last of the major apicomplexans to be fully analyzed. In order to increase the value of the genome sequence data and aid in the effort to gain a better understanding of the Eimeria tenella genome, we constructed a whole genome map for the parasite. RESULTS: A total of 1245 contigs representing 70.0% of the whole genome assembly sequences (Wellcome Trust Sanger Institute) were selected and subjected to marker selection. Subsequently, 2482 HAPPY markers were developed and typed. Of these, 795 were considered as usable markers, and utilized in the construction of a HAPPY map. Markers developed from chromosomally-assigned genes were then integrated into the HAPPY map and this aided the assignment of a number of linkage groups to their respective chromosomes. BAC-end sequences and contigs from whole genome sequencing were also integrated to improve and validate the HAPPY map. This resulted in an integrated HAPPY map consisting of 60 linkage groups that covers approximately half of the estimated 60 Mb genome. Further analysis suggests that the segmental organization first seen in Chromosome 1 is present throughout the genome, with repeat-poor (P) regions alternating with repeat-rich (R) regions. Evidence of copy-number variation between strains was also uncovered. 
CONCLUSIONS: This paper describes the application of a whole genome mapping method to improve the assembly of the genome of E. tenella from shotgun data, and to help reveal its overall structure. A preliminary assessment of copy-number variation (extra or missing copies of genomic segments) between strains of E. tenella was also carried out. The emerging picture is of a very unusual genome architecture displaying inter-strain copy-number variation. We suggest that these features may be related to the known ability of this parasite to rapidly develop drug resistance.

4.
Advanced resources for genome‐assisted research in barley (Hordeum vulgare) including a whole‐genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole‐genome resequencing and in silico variant detection. However, such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA‐coding exome reduces barley genomic complexity more than 50‐fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in‐solution hybridization‐based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full‐length cDNAs and de novo assembled RNA‐Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA‐coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping‐by‐sequencing and genetic diversity analyses.

5.
A high-density genetic map, an essential tool for comparative genomic studies and quantitative trait locus fine mapping, can also facilitate genome sequence assembly. The sequence-based marker technology known as restriction site-associated DNA (RAD) enables simultaneous single nucleotide polymorphism marker discovery and genotyping using massively parallel sequencing. We constructed a high-density linkage map for carnation (Dianthus caryophyllus L.) based on simple sequence repeat (SSR) markers in combination with RAD markers developed by double-digest RAD sequencing (ddRAD-seq). A total of 2404 (285 SSR and 2119 RAD) markers could be assigned to 15 linkage groups spanning 971.5 cM, with an average marker interval of 0.4 cM. The total length of scaffolds with identified map positions was 95.6 Mb, which is equivalent to 15.4% of the estimated genome size. The generated map is the first SSR and RAD marker-based high-density linkage map reported for carnation. The ddRAD-seq pipeline developed in this study should also help accelerate genetic and genomics analyses and molecular breeding of carnation and other non-model crops.
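The summary figures in this abstract can be checked with simple arithmetic. The sketch below uses only the numbers quoted in the abstract. The variable names are illustrative, and the abstract does not say how the authors counted intervals, so both a per-marker and a per-adjacent-pair estimate are shown.

```python
# Figures quoted in the abstract.
total_map_length_cm = 971.5   # total length of the linkage map
markers = 2404                # 285 SSR + 2119 RAD markers
linkage_groups = 15
anchored_mb = 95.6            # scaffolds with identified map positions
anchored_fraction = 0.154     # 15.4% of the estimated genome size

# Two common ways to express mean marker spacing (which one the authors
# used is not stated in the abstract):
mean_interval_per_marker = total_map_length_cm / markers
mean_interval_between_adjacent = total_map_length_cm / (markers - linkage_groups)

# Genome size implied by the anchored length and its stated fraction.
implied_genome_size_mb = anchored_mb / anchored_fraction

print(f"cM per marker:        {mean_interval_per_marker:.2f}")
print(f"cM per adjacent pair: {mean_interval_between_adjacent:.2f}")
print(f"implied genome size:  {implied_genome_size_mb:.0f} Mb")
```

Both spacing estimates round to the 0.4 cM reported, and the anchored scaffold length implies an estimated genome size of a little over 600 Mb.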

6.
Direct sequencing of total plant DNA using next generation sequencing technologies generates a whole chloroplast genome sequence that has the potential to provide a barcode for use in plant and food identification. Advances in DNA sequencing platforms may make this an attractive approach for routine plant identification. The HiSeq (Illumina) and Ion Torrent (Life Technologies) sequencing platforms were used to sequence total DNA from rice to identify polymorphisms in the whole chloroplast genome sequence of a wild rice plant relative to cultivated rice (cv. Nipponbare). Consensus chloroplast sequences were produced by mapping sequence reads to the reference rice chloroplast genome or by de novo assembly and mapping of the resulting contigs to the reference sequence. A total of 122 polymorphisms (SNPs and indels) between the wild and cultivated rice chloroplasts were predicted by these different sequencing and analysis methods. Of these, a total of 102 polymorphisms including 90 SNPs were predicted by both platforms. Indels were more variable with different sequencing methods, with almost all discrepancies found in homopolymers. The Ion Torrent platform gave no apparent false SNPs but was less reliable for indels. The methods should be suitable for routine barcoding using appropriate combinations of sequencing platform and data analysis.

7.

Background  

Determining the position and order of contigs and scaffolds from a genome assembly within an organism's genome remains a technical challenge in a majority of sequencing projects. In order to exploit contemporary technologies for DNA sequencing, we developed a strategy for whole genome single nucleotide polymorphism sequencing allowing the positioning of sequence contigs onto a linkage map using the bin mapping method.

8.
Rapid advances in second- and third-generation sequencing technologies have made whole genome sequencing a routine procedure. However, the methods used to assemble the resulting sequences, and the assemblies they produce, require special consideration. Modern assemblers are based on heuristic algorithms, which yield fragmented genome assemblies composed of scaffolds and contigs of different lengths whose order along the chromosome, and whose assignment to a particular chromosome, often remain unknown. The resulting genome sequence can therefore be considered only a draft assembly. The quality and reliability of a draft assembly can be substantially improved by targeted sequencing of genome elements of different sizes, e.g., chromosomes, chromosomal regions, and DNA fragments cloned in different vectors, as well as by using a reference genome, optical mapping, and Hi-C technology. In addition to simplifying assembly of the genome draft, this approach identifies numerical and structural chromosomal variations and abnormalities in the genomes of the studied species more accurately. In this review, we discuss the key technologies for genome sequencing and de novo assembly, as well as different approaches to improving the quality of existing draft genome sequences.

9.
Transposable elements (TEs) – selfish DNA sequences that can move within the genome – comprise a large proportion of the genomes of many organisms. Although low‐coverage whole‐genome sequencing can be used to survey TE composition, it is noneconomical for species with large quantities of DNA. Here, we utilize restriction‐site associated DNA sequencing (RADSeq) as an alternative method to survey TE composition. First, we demonstrate in silico that double digest restriction‐site associated DNA sequencing (ddRADseq) markers contain the same TE compositions as whole genome assemblies across arthropods. Next, we show empirically using eight Synalpheus snapping shrimp species with large genomes that TE compositions from ddRADseq and low‐coverage whole‐genome sequencing are comparable within and across species. Finally, we develop a new bioinformatic pipeline, TERAD, to extract TE compositions from RADseq data. Our study expands the utility of RADseq to study the repeatome, making comparative studies of genome structure for species with large genomes more tractable and affordable.

10.
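Current status and prospects of RAD-seq technology in genome research   Total citations: 4, self-citations: 0, citations by others: 4
王洋坤  胡艳  张天真 《遗传》2014,36(1):41-49
Restriction-site associated DNA sequencing (RAD-seq) is a reduced-representation genome sequencing technique, built on second-generation sequencing, that targets restriction enzyme sites across the whole genome. The workflow is simple, does not depend on the availability of a reference genome, greatly reduces genome complexity, lowers experimental costs, and yields tens of thousands of polymorphic markers from a single sequencing run. RAD-seq has been successfully applied to active areas of genome research, including construction of ultra-high-density genetic maps, fine mapping of important traits, assisting genome sequence assembly, population genomics, and phylogenetics. This article describes the principle of RAD-seq, its technical development, and its broad applications in genome research. Given the distinctive features of the method, RAD-seq is expected to find wide application in the study of complex genomes.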

11.
The genomes of nonhuman primates have recently become highly visible candidates for full genome analysis, as they provide powerful models of human disease and a better understanding of the evolution of the human genome. We describe the creation of a 5000 rad radiation hybrid (RH) mapping panel for the rhesus macaque. Duplicate genotypes of 84 microsatellite and coding gene sequence tagged sites from six macaque chromosomes produced an estimated whole genome retention frequency of 0.33. To test the mapping ability of the panel, we constructed RH maps for macaque chromosomes 7 and 9 and compared them to orthologous locus orders in existing human and baboon maps derived from different methodologies. Concordant marker order between all three species maps suggests that the current panel represents a powerful mapping resource for generating high-density comparative maps of the rhesus macaque and other species genomes.

12.
HAPPY mapping is an in vitro approach for defining the order and spacing of DNA markers directly on native genomic DNA. This cloning-free technique is based on analysing the segregation of markers amplified from high molecular weight genomic DNA which has been broken randomly and 'segregated' by limiting dilution into subhaploid samples. It is a uniquely versatile tool, allowing for the construction of genome maps with flexible ranges and resolutions. Moreover, it is applicable to plant genomes, for which many of the techniques pioneered in animal genomes are inapplicable or inappropriate. We report here its demonstration in a plant genome by reconstructing the physical map of a 1.9 Mbp region around the FCA locus of Arabidopsis thaliana. The resulting map, spanning around 10% of chromosome 4, is in excellent agreement with the DNA sequence and has a mean marker spacing of 16 kbp. We argue that HAPPY maps of any required resolution can be made immediately and with relatively little effort for most plant species and, furthermore, that such maps can greatly aid the construction of regional or genome-wide physical maps.

13.
14.
The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC‐by‐BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high‐resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high‐resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome‐scale analysis of repetitive sequences and revealed a ~800‐kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone‐by‐clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC‐contig physical map and validate sequence assembly on a chromosome‐arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome‐by‐chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules.
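This abstract reports an N50 of 1.3 Mb for the 371 map contigs. For readers unfamiliar with the statistic, here is a minimal sketch of the usual definition: the length of the contig at which the running total, taken from longest to shortest, first reaches half of the combined length. The function name `n50` and the toy lengths are illustrative and are not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are taken from longest to shortest; N50 is the length of the
    contig at which the running total first reaches half of the sum.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example (made-up lengths, in arbitrary units).
toy_contigs = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_contigs))
```

In the toy example the two longest contigs already account for half of the total length, so the N50 equals the length of the second-longest contig.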

15.
Rearrangements of the genome can be detected by microarray methods and massively parallel sequencing, which identify copy-number alterations and breakpoint junctions, but these techniques are poorly suited to reconstructing the long-range organization of rearranged chromosomes, for example, to distinguish between translocations and insertions. The single-DNA-molecule technique HAPPY mapping is a method for mapping normal genomes that should be able to analyse genome rearrangements, i.e. deviations from a known genome map, to assemble rearrangements into a long-range map. We applied HAPPY mapping to cancer cell lines to show that it could identify rearrangement of genomic segments, even in the presence of normal copies of the genome. We could distinguish a simple interstitial deletion from a copy-number loss at an inversion junction, and detect a known translocation. We could determine whether junctions detected by sequencing were on the same chromosome, by measuring their linkage to each other, and hence map the rearrangement. Finally, we mapped an uncharacterized reciprocal translocation in the T-47D breast cancer cell line to about 2 kb and hence cloned the translocation junctions. We conclude that HAPPY mapping is a versatile tool for determining the structure of rearrangements in the human genome.

16.
Genome sequencing has improved fundamentally since next-generation sequencing (NGS) emerged in the 2000s. The newer technologies use massively parallel short-read DNA sequencing together with genome alignment and assembly methods to interrogate genomes digitally, rapidly, and at an unprecedented scale, making large-scale whole genome sequencing (WGS) accessible and practical for researchers. Whole genome sequencing is now increasingly used to investigate the genetics of diseases, study causative relations with cancers, perform genome-level comparative analyses, reconstruct human population history, and inform clinical practice. In this review, we first describe a typical whole genome sequencing pipeline, including laboratory template preparation, sequencing, genome assembly and quality control, and variant calling and annotation. We compare whole genome and whole exome sequencing (WES) and explore a wide range of applications of whole genome sequencing for both Mendelian and complex diseases in medical genetics. We highlight the impact of whole genome sequencing on cancer studies, regulatory variant analysis, predictive medicine, and precision medicine, and discuss the challenges of whole genome sequencing.

17.
Next-generation sequencing (NGS) technologies have enabled high-throughput and low-cost generation of sequence data; however, de novo genome assembly remains a great challenge, particularly for large genomes. NGS short reads are often insufficient to create large contigs that span repeat sequences and to facilitate unambiguous assembly. Plant genomes are notorious for containing high quantities of repetitive elements, which, combined with huge genome sizes, have made accurate assembly of these large and complex genomes intractable thus far. Using two-color genome mapping of tiling bacterial artificial chromosome (BAC) clones on nanochannel arrays, we completed high-confidence assembly of a 2.1-Mb, highly repetitive region in the large and complex genome of Aegilops tauschii, the D-genome donor of hexaploid wheat (Triticum aestivum). Genome mapping is based on direct visualization of sequence motifs on single DNA molecules hundreds of kilobases in length. With the genome map as a scaffold, we anchored unplaced sequence contigs, validated the initial draft assembly, and resolved instances of misassembly, some involving contigs <2 kb long, to dramatically improve the assembly from 75% to 95% complete.

18.
Radiation hybrid (RH) and HAPPY mapping are two technologies used in animal systems that have attracted the attention of the plant genetics community because they bridge the resolution gap between meiotic and BAC-based physical mapping that would facilitate the analysis of plant species lacking substantial genomics resources. Research has shown that the essence of these approaches can be applied and that a variety of strategies can be used to produce mapping panels. Mapping panels composed of live plants, protoplast fusion cultures, and sub-genomic DNA samples have been described. The resolution achievable by RH mapping panels involving live-plant derivatives of a monosomic maize (Zea mays) chromosome 9 addition in allohexaploid oat (Avena sativa), a monosomic chromosome 1D addition in allotetraploid durum wheat (Triticum turgidum), and interspecific hybrids between two tetraploid cotton species (G. hirsutum and G. barbadense), has been estimated to range from 0.6 to 6 Mb. On the other hand, a more comprehensive evaluation of one panel from durum wheat suggests that a higher mapping resolution (approximately 200 kb) is possible. In cases involving RH mapping panels based on barley (Hordeum vulgare)-tobacco (Nicotiana tabacum) protoplast fusions or a HAPPY mapping panel based on genomic DNA from Arabidopsis thaliana, the potential mapping resolution appears to be higher (50 to 200 kb). Despite these encouraging results, the application of either RH or HAPPY mapping in plants is still in the experimental phase and additional work is clearly needed before these methods are more routinely utilized.

19.
Hierarchical shotgun sequencing remains the method of choice for assembling high‐quality reference sequences of complex plant genomes. The efficient exploitation of current high‐throughput technologies and powerful computational facilities for large‐insert clone sequencing necessitates the sequencing and assembly of a large number of clones in parallel. We developed a multiplexed pipeline for shotgun sequencing and assembling individual bacterial artificial chromosomes (BACs) using the Illumina sequencing platform. We illustrate our approach by sequencing 668 barley BACs (Hordeum vulgare L.) in a single Illumina HiSeq 2000 lane. Using a newly designed parallelized computational pipeline, we obtained sequence assemblies of individual BACs that consist, on average, of eight sequence scaffolds and represent >98% of the genomic inserts. Our BAC assemblies are clearly superior to a whole‐genome shotgun assembly regarding contiguity, completeness and the representation of the gene space. Our methods may be employed to rapidly obtain high‐quality assemblies of a large number of clones to assemble map‐based reference sequences of plant and animal species with complex genomes by sequencing along a minimum tiling path.

20.
陆才瑞  邹长松  宋国立 《遗传》2015,37(8):765-776
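Traditional gene mapping by forward genetics is generally carried out by constructing genetic linkage maps, a process that is cumbersome, time-consuming, and labor-intensive, and that in many cases gives low mapping precision and large intervals. With the rapid development of high-throughput sequencing and the steady fall in sequencing costs, a variety of simple and fast sequencing-based gene mapping methods have been developed, including direct sequencing of mutant genomes, sequencing of pooled mutant material, and map construction by sequencing genetic segregating populations; mapping can also be done by sequencing the transcriptome or part of the genome. These methods can identify mutation sites at nucleotide resolution and have been extended to complex genetic backgrounds. Some recently reported sequencing-based mapping was even completed without relying on a reference genome sequence, genetic crosses, or linkage information, which makes forward genetics feasible for many non-model species. This article reviews these new technologies and their applications in gene mapping.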
