首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Reference sequences are sequences that are used for public consultation, and therefore must be of high quality. Using the whole‐genome shotgun/next‐generation sequencing approach, many genome sequences of complex higher plants have been generated in recent years, and are generally considered reference sequences. However, none of these sequences has been experimentally evaluated at the whole‐genome sequence assembly level. Rice has a relatively simple plant genome, and the genome sequences for its two sub‐species obtained using different sequencing approaches were published approximately 10 years ago. This provides a unique system for a case study to evaluate the qualities and utilities of published plant genome sequences. We constructed a robust BAC physical map embedding a large number of BAC end sequences forrice variety 93–11. Through BAC end sequence alignments and tri‐assembly comparisons of the 93–11 physical map and the two reference sequences, we found that the Nipponbare reference sequence generated using the clone‐by‐clone approach has a high quality but still contains small artifact inversions and missing sequences. In contrast, the 93–11 reference sequence generated using the whole‐genome shotgun approach contains many large and varied assembly errors, such as inversions, duplications and translocations, as well as missing sequences. The 93–11 physical map provides an invaluable resource for evaluation and improvements toward completion of both Nipponbare and 93–11 reference sequences.  相似文献   

2.
Chinese hamster ovary (CHO) cells have frequently been used in biotechnology as a mammalian host cell platform for expressing genes of interest. Previously, we constructed a detailed physical chromosomal map of the CHO DG44 cell line by fluorescence in situ hybridization (FISH) imaging using 303 bacterial artificial chromosome (BAC) clones as hybridization probes (BAC-FISH). BAC-FISH results revealed that the two longest chromosomes were completely paired. However, other chromosomes featured partial deletions or rearrangements. In this study, we determined the end sequences of 303 BAC clones (BAC end sequences), which were used for BAC-FISH probes. Among 606 BAC-end sequences (BESs) (forward and reverse ends), 558 could be determined. We performed a comparison between all determined BESs and mouse genome sequences using NCBI BLAST. Among these 558 BESs, 465 showed high homology to mouse chromosomal sequences. We analyzed the locations of these BACs in chromosomes of the CHO DG44 cell line using a physical chromosomal map. From the obtained results, we investigated the regional similarities among CHO chromosomes (A–T) and mouse chromosomes (1–19 and sex) about 217 BESs (46.7% of 465 high homologous BESs). Twenty-three specific narrow regions in 13 chromosomes of the CHO DG44 cell line showed high homology to mouse chromosomes, but most of other regions did not show significant correlations with the mouse genome. These results contribute to accurate alignments of chromosomes of Chinese hamster and its genome sequence, analysis of chromosomal instability in CHO cells, and the development of target locations for gene and/or genome editing techniques.  相似文献   

3.
The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC‐by‐BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high‐resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high‐resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome‐scale analysis of repetitive sequences and revealed a ~800‐kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone‐by‐clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC‐contig physical map and validate sequence assembly on a chromosome‐arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome‐by‐chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules.  相似文献   

4.
In an effort to increase the density of sequence-based markers for the horse genome we generated 9473 BAC end sequences (BESs) from the CHORI-241 BAC library with an average read length of 677 bp. BLASTN searches with the BESs revealed 4036 meaningful hits (E 相似文献   

5.
Recent advances in both clone fingerprinting and draft sequencing technology have made it increasingly common for species to have a bacterial artificial clone (BAC) fingerprint map, BAC end sequences (BESs) and draft genomic sequence. The FPC (fingerprinted contigs) software package contains three modules that maximize the value of these resources. The BSS (blast some sequence) module provides a way to easily view the results of aligning draft sequence to the BESs, and integrates the results with the following two modules. The MTP (minimal tiling path) module uses sequence and fingerprints to determine a minimal tiling path of clones. The DSI (draft sequence integration) module aligns draft sequences to FPC contigs, displays them alongside the contigs and identifies potential discrepancies; the alignment can be based on either individual BES alignments to the draft, or on the locations of BESs that have been assembled into the draft. FPC also supports high-throughput fingerprint map generation as its time-intensive functions have been parallelized for Unix-based desktops or servers with multiple CPUs. Simulation results are provided for the MTP, DSI and parallelization. These features are in the FPC V9.3 software package, which is freely available.  相似文献   

6.
Numerous QTL for a variety of phenotypic traits in dairy and beef cattle have been mapped on bovine chromosome 6 (BTA6). The complete and validated information on the molecular genome organization is an essential prerequisite for the conclusive identification of the causative sequence variation underlying the QTL. In our study we describe efforts to improve the genomic sequence map assembly of BTA6 by filling-in gaps and by suggesting sequence contig rearrangements. This is achieved by the generation and in silico mapping of BAC-end sequences (BESs) from clones containing sequences placed on our high-resolution radiation hybrid (RH) map of BTA6 onto the genome sequence map. Linking high-resolution RH mapping with in silico mapping of BESs on BTA6 enabled the detection of discrepancies in chromosomal assignments of genome sequence contigs and improved the resolution of non-conclusive assignments on the genome sequence assembly. Furthermore, 37% of BESs enabled chromosomal assignment of contigs previously unassigned. Anchoring of 66% of BESs onto HSA4 confirmed the synteny of the respective region of BTA6 including the known evolutionary breakpoints. The BESs will play an important role in the ongoing efforts to complete the sequence of the bovine genome and will also provide a source for the identification of new polymorphic sites in the genome sequence to resolve QTL-containing intervals.  相似文献   

7.
A comparative physical map of the AA genome (Oryza sativa) and the BB genome (O. punctata) was constructed by aligning a physical map of O. punctata, deduced from 63,942 BAC end sequences (BESs) and 34,224 fingerprints, onto the O. sativa genome sequence. The level of conservation of each chromosome between the two species was determined by calculating a ratio of BES alignments. The alignment result suggests more divergence of intergenic and repeat regions in comparison to gene-rich regions. Further, this characteristic enabled localization of heterochromatic and euchromatic regions for each chromosome of both species. The alignment identified 16 locations containing expansions, contractions, inversions, and transpositions. By aligning 40% of the punctata BES on the map, 87% of the punctata FPC map covered 98% of the O. sativa genome sequence. The genome size of O. punctata was estimated to be 8% larger than that of O. sativa with individual chromosome differences of 1.5-16.5%. The sum of expansions and contractions observed in regions >500 kb were similar, suggesting that most of the contractions/expansions contributing to the genome size difference between the two species are small, thus preserving the macro-collinearity between these species, which diverged approximately 2 million years ago.  相似文献   

8.
An overview of the apple genome through BAC end sequence analysis   总被引:1,自引:0,他引:1  
The apple, Malus x domestica Borkh., is one of the most important fruit trees grown worldwide. A bacterial artificial chromosome (BAC)-based physical map of the apple genome has been recently constructed. Based on this physical map, a total of approximately 2,100 clones from different contigs (overlapping BAC clones) have been selected and sequenced at both ends, generating 3,744 high-quality BAC end sequences (BESs) including 1,717 BAC end pairs. Approximately 8.5% of BESs contain simple sequence repeats (SSRs), most of which are AT/TA dimer repeats. Potential transposable elements are identified in approximately 21% of BESs, and most of these elements are retrotransposons. About 11% of BESs have homology to the Arabidopsis protein database. The matched proteins cover a broad range of categories. The average GC content of the predicted coding regions of BESs is 42.4%; while, that of the whole BESs is 39%. A small number of BES pairs were mapped to neighboring chromosome regions of A. thaliana and Populus trichocarpa; whereas, no pairs are mapped to the Oryza sativa genome. The apple has a higher degree of synteny with the closely related Populus than with the distantly related Arabidopsis. BAC end sequencing can be used to anchor a small proportion of the apple genome to the Populus and possibly to the Arabidopsis genomes.  相似文献   

9.
A set of BAC clones spanning the human genome   总被引:13,自引:0,他引:13  
Using the human bacterial artificial chromosome (BAC) fingerprint-based physical map, genome sequence assembly and BAC end sequences, we have generated a fingerprint-validated set of 32855 BAC clones spanning the human genome. The clone set provides coverage for at least 98% of the human fingerprint map, 99% of the current assembled sequence and has an effective resolving power of 79 kb. We have made the clone set publicly available, anticipating that it will generally facilitate FISH or array-CGH-based identification and characterization of chromosomal alterations relevant to disease.  相似文献   

10.
The Human BAC Ends database includes all non-redundant human BAC end sequences (BESs) generated by The Institute for Genomic Research (TIGR), the University of Washington (UW) and California Institute of Technology (CalTech). It incorporates the available BAC mapping data from different genome centers and the annotation results of each end sequence for the contents of repeats, ESTs and STS markers. For each BAC end the database integrates the sequence, the phred quality scores, the map and the annotation, and provides links to sites of the library information, the reports of GenBank, dbGSS and GDB, and other relevant data. The database is freely accessible via the web and supports sequence or clone searches and anonymous FTP. The relevant sites and resources are described at http://www.tigr.org/ tdb/humgen/bac_end_search/bac_end_intro.html  相似文献   

11.
Papaya (Carica papaya L.) is a major tree fruit crop of tropical and subtropical regions with an estimated genome size of 372 Mbp. We present the analysis of 4.7% of the papaya genome based on BAC end sequences (BESs) representing 17 million high-quality bases. Microsatellites discovered in 5,452 BESs and flanking primer sequences are available to papaya breeding programs at . Sixteen percent of BESs contain plant repeat elements, the vast majority (83.3%) of which are class I retrotransposons. Several novel papaya-specific repeats were identified. Approximately 19.1% of the BESs have homology to Arabidopsis cDNA. Increasing numbers of completely sequenced plant genomes and BES projects enable novel approaches to comparative plant genomics. Paired BESs of Carica, Arabidopsis, Populus, Brassica and Lycopersicon were mapped onto the completed genomes of Arabidopsis and Populus. In general the level of microsynteny was highest between closely related organisms. However, papaya revealed a higher degree of apparent synteny with the more distantly related poplar than with the more closely related Arabidopsis. This, as well as significant colinearity observed between peach and poplar genome sequences, support recent observations of frequent genome rearrangements in the Arabidopsis lineage and suggest that the poplar genome sequence may be more useful for elucidating the papaya and other rosid genomes. These insights will play a critical role in selecting species and sequencing strategies that will optimally represent crop genomes in sequence databases.Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users.Chun Wan J. Lai and Qingyi Yu have contributed equally to this work.  相似文献   

12.
BAC-end sequences (BESs) of hybrid sugarcane cultivar R570 are presented. A total of 66,990 informative BESs were obtained from 43,874 BAC clones. Similarity search using a variety of public databases revealed that 13.5 and 42.8 % of BESs match known gene-coding and repeat regions, respectively. That 11.7 % of BESs are still unmatched to any nucleotide sequences in the current public databases despite the fact that a close relative, sorghum, is fully sequenced, indicates that there may be many sugarcane-specific or lineage-specific sequences. We found 1,742 simple sequence repeat motifs in 1,585 BESs, spanning 27,383 bp in length. As simple sequence repeat markers derived from BESs have some advantages over randomly generated markers, these may be particularly useful for comparing BAC-based physical maps with genetic maps. BES and overgo hybridization information was used for anchoring sugarcane BAC clones to the sorghum genome sequence. While sorghum and sugarcane have extensive similarity in terms of genomic structure, only 2,789 BACs (6.4 %) could be confidently anchored to the sorghum genome at the stringent threshold of having both-end information (BESs or overgos) within 300 Kb. This relatively low rate of anchoring may have been caused in part by small- or large-scale genomic rearrangements in the Saccharum genus after two rounds of whole genome duplication since its divergence from the sorghum lineage about 7.8 million years ago. Limiting consideration to only low-copy matches, 1,245 BACs were placed to 1,503 locations, covering ~198 Mb of the sorghum genome or about 78 % of the estimated 252 Mb of euchromatin. BESs and their analyses presented here may provide an early profile of the sugarcane genome as well as a basis for BAC-by-BAC sequencing of much of the basic gene set of sugarcane.  相似文献   

13.
Hierarchical shotgun sequencing remains the method of choice for assembling high‐quality reference sequences of complex plant genomes. The efficient exploitation of current high‐throughput technologies and powerful computational facilities for large‐insert clone sequencing necessitates the sequencing and assembly of a large number of clones in parallel. We developed a multiplexed pipeline for shotgun sequencing and assembling individual bacterial artificial chromosomes (BACs) using the Illumina sequencing platform. We illustrate our approach by sequencing 668 barley BACs (Hordeum vulgare L.) in a single Illumina HiSeq 2000 lane. Using a newly designed parallelized computational pipeline, we obtained sequence assemblies of individual BACs that consist, on average, of eight sequence scaffolds and represent >98% of the genomic inserts. Our BAC assemblies are clearly superior to a whole‐genome shotgun assembly regarding contiguity, completeness and the representation of the gene space. Our methods may be employed to rapidly obtain high‐quality assemblies of a large number of clones to assemble map‐based reference sequences of plant and animal species with complex genomes by sequencing along a minimum tiling path.  相似文献   

14.
Characterizing the walnut genome through analyses of BAC end sequences   总被引:1,自引:0,他引:1  
Persian walnut (Juglans regia L.) is an economically important tree for its nut crop and timber. To gain insight into the structure and evolution of the walnut genome, we constructed two bacterial artificial chromosome (BAC) libraries, containing a total of 129,024 clones, from in vitro-grown shoots of J. regia cv. Chandler using the HindIII and MboI cloning sites. A total of 48,218 high-quality BAC end sequences (BESs) were generated, with an accumulated sequence length of 31.2?Mb, representing approximately 5.1% of the walnut genome. Analysis of repeat DNA content in BESs revealed that approximately 15.42% of the genome consists of known repetitive DNA, while walnut-unique repetitive DNA identified in this study constitutes 13.5% of the genome. Among the walnut-unique repetitive DNA, Julia SINE and JrTRIM elements represent the first identified walnut short interspersed element (SINE) and terminal-repeat retrotransposon in miniature (TRIM) element, respectively; both types of elements are abundant in the genome. As in other species, these SINEs and TRIM elements could be exploited for developing repeat DNA-based molecular markers in walnut. Simple sequence repeats (SSR) from BESs were analyzed and found to be more abundant in BESs than in expressed sequence tags. The density of SSR in the walnut genome analyzed was also slightly higher than that in poplar and papaya. Sequence analysis of BESs indicated that approximately 11.5% of the walnut genome represents a coding sequence. This study is an initial characterization of the walnut genome and provides the largest genomic resource currently available; as such, it will be a valuable tool in studies aimed at genetically improving walnut.  相似文献   

15.
Human BAC ends quality assessment and sequence analyses   总被引:8,自引:0,他引:8  
Zhao S  Malek J  Mahairas G  Fu L  Nierman W  Venter JC  Adams MD 《Genomics》2000,63(3):321-332
End sequences from bacterial artificial chromosomes (BACs) provide highly specific sequence markers in large-scale sequencing projects. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length of >460 bp for a total of 141 Mb covering approximately 4.7% of the genome. Over 60% of the clones have BAC end sequences (BESs) from both ends representing more than fivefold coverage of the human genome by the paired-end clones. Our quality assessments and sequence analyses indicate that BESs from human BAC libraries developed at The California Institute of Technology (CalTech) and Roswell Park Cancer Institute have similar properties. The analyses have highlighted differences in insert size for different segments of the CalTech library. Problems with the fidelity of tracking of sequence data back to physical clones have been observed in some subsets of the overall BES dataset. The annotation results of BESs for the contents of available genomic sequences, sequence tagged sites, expressed sequence tags, protein encoding regions, and repeats indicate that this resource will be valuable in many areas of genome research.  相似文献   

16.
The pooid subfamily of grasses includes some of the most important crop, forage and turf species, such as wheat, barley and Lolium. Developing genomic resources, such as whole-genome physical maps, for analysing the large and complex genomes of these crops and for facilitating biological research in grasses is an important goal in plant biology. We describe a bacterial artificial chromosome (BAC)-based physical map of the wild pooid grass Brachypodium distachyon and integrate this with whole genome shotgun sequence (WGS) assemblies using BAC end sequences (BES). The resulting physical map contains 26 contigs spanning the 272 Mb genome. BES from the physical map were also used to integrate a genetic map. This provides an independent validation and confirmation of the published WGS assembly. Mapped BACs were used in Fluorescence In Situ Hybridisation (FISH) experiments to align the integrated physical map and sequence assemblies to chromosomes with high resolution. The physical, genetic and cytogenetic maps, integrated with whole genome shotgun sequence assemblies, enhance the accuracy and durability of this important genome sequence and will directly facilitate gene isolation.  相似文献   

17.
A high utility integrated map of the pig genome   总被引:2,自引:1,他引:1  

Background

The domestic pig is being increasingly exploited as a system for modeling human disease. It also has substantial economic importance for meat-based protein production. Physical clone maps have underpinned large-scale genomic sequencing and enabled focused cloning efforts for many genomes. Comparative genetic maps indicate that there is more structural similarity between pig and human than, for example, mouse and human, and we have used this close relationship between human and pig as a way of facilitating map construction.

Results

Here we report the construction of the most highly continuous bacterial artificial chromosome (BAC) map of any mammalian genome, for the pig (Sus scrofa domestica) genome. The map provides a template for the generation and assembly of high-quality anchored sequence across the genome. The physical map integrates previous landmark maps with restriction fingerprints and BAC end sequences from over 260,000 BACs derived from 4 BAC libraries and takes advantage of alignments to the human genome to improve the continuity and local ordering of the clone contigs. We estimate that over 98% of the euchromatin of the 18 pig autosomes and the X chromosome along with localized coverage on Y is represented in 172 contigs, with chromosome 13 (218 Mb) represented by a single contig. The map is accessible through pre-Ensembl, where links to marker and sequence data can be found.

Conclusion

The map will enable immediate electronic positional cloning of genes, benefiting the pig research community and further facilitating use of the pig as an alternative animal model for human disease. The clone map and BAC end sequence data can also help to support the assembly of maps and genome sequences of other artiodactyls.  相似文献   

18.
Availability of the human genome sequence and high similarity between humans and pigs at the molecular level provides an opportunity to use a comparative mapping approach to piggy-BAC the human genome. In order to advance the pig genome sequencing initiative, sequence similarity between large-scale porcine BAC-end sequences (BESs) and human genome sequence was used to construct a comparatively-anchored porcine physical map that is a first step towards sequencing the pig genome. A total of 50,300 porcine BAC clones were end-sequenced, yielding 76,906 BESs after trimming with an average read length of 538 bp. To anchor the porcine BACs on the human genome, these BESs were subjected to BLAST analysis using the human draft sequence, revealing 31.5% significant hits (E < e(-5)). Both genic and non-genic regions of homology contributed to the alignments between the human and porcine genomes. Porcine BESs with unique homology matches within the human genome provided a source of markers spaced approximately 70 to 300 kb along each human chromosome. In order to evaluate the utility of piggy-BACing human genome sequences, and confirm predictions of orthology, 193 evenly spaced BESs with similarity to HSA3 and HSA21 were selected and then utilized for developing a high-resolution (1.22 Mb) comparative radiation hybrid map of SSC13 that represents a fusion of HSA3 and HSA21. Resulting RH mapping of SSC13 covers 99% and 97% of HSA3 and HSA21, respectively. Seven evolutionary conserved blocks were identified including six on HSA3 and a single syntenic block corresponding to HSA21. The strategy of piggy-BACing the human genome described in this study demonstrates that through a directed, targeted comparative genomics approach construction of a high-resolution anchored physical map of the pig genome can be achieved. This map supports the selection of BACs to construct a minimal tiling path for genome sequencing and targeted gap filling. Moreover, this approach is highly relevant to other genome sequencing projects.  相似文献   

19.
Zhang X  Zhao C  Huang C  Duan H  Huan P  Liu C  Zhang X  Zhang Y  Li F  Zhang HB  Xiang J 《PloS one》2011,6(11):e27612
Zhikong scallop (Chlamys farreri) is one of the most economically important aquaculture species in China. Physical maps are crucial tools for genome sequencing, gene mapping and cloning, genetic improvement and selective breeding. In this study, we have developed a genome-wide, BAC-based physical map for the species. A total of 81,408 clones from two BAC libraries of the scallop were fingerprinted using an ABI 3130xl Genetic Analyzer and a fingerprinting kit developed in our laboratory. After data processing, 63,641 (~5.8× genome coverage) fingerprints were validated and used in the physical map assembly. A total of 3,696 contigs were assembled for the physical map. Each contig contained an average of 10.0 clones, with an average physical size of 490 kb. The combined total physical size of all contigs was 1.81 Gb, equivalent to approximately 1.5 fold of the scallop haploid genome. A total of 10,587 BAC end sequences (BESs) and 167 markers were integrated into the physical map. We evaluated the physical map by overgo hybridization, BAC-FISH (fluorescence in situ hybridization), contig BAC pool screening and source BAC library screening. The results have provided evidence of the high reliability of the contig physical map. This is the first physical map in mollusc; therefore, it provides an important platform for advanced research of genomics and genetics, and mapping of genes and QTL of economical importance, thus facilitating the genetic improvement and selective breeding of the scallop and other marine molluscs.  相似文献   

20.
Availability of the human genome sequence and high similarity between humans and pigs at the molecular level provides an opportunity to use a comparative mapping approach to piggy-BAC the human genome. In order to advance the pig genome sequencing initiative, sequence similarity between large-scale porcine BAC-end sequences (BESs) and human genome sequence was used to construct a comparatively-anchored porcine physical map that is a first step towards sequencing the pig genome. A total of 50,300 porcine BAC clones were end-sequenced, yielding 76,906 BESs after trimming with an average read length of 538 bp. To anchor the porcine BACs on the human genome, these BESs were subjected to BLAST analysis using the human draft sequence, revealing 31.5% significant hits (E < e?5). Both genic and non-genic regions of homology contributed to the alignments between the human and porcine genomes. Porcine BESs with unique homology matches within the human genome provided a source of markers spaced approximately 70 to 300 kb along each human chromosome. In order to evaluate the utility of piggy-BACing human genome sequences, and confirm predictions of orthology, 193 evenly spaced BESs with similarity to HSA3 and HSA21 were selected and then utilized for developing a high-resolution (1.22 Mb) comparative radiation hybrid map of SSC13 that represents a fusion of HSA3 and HSA21. Resulting RH mapping of SSC13 covers 99% and 97% of HSA3 and HSA21, respectively. Seven evolutionary conserved blocks were identified including six on HSA3 and a single syntenic block corresponding to HSA21. The strategy of piggy-BACing the human genome described in this study demonstrates that through a directed, targeted comparative genomics approach construction of a high-resolution anchored physical map of the pig genome can be achieved. This map supports the selection of BACs to construct a minimal tiling path for genome sequencing and targeted gap filling. Moreover, this approach is highly relevant to other genome sequencing projects.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号