首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC‐by‐BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high‐resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high‐resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome‐scale analysis of repetitive sequences and revealed a ~800‐kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone‐by‐clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC‐contig physical map and validate sequence assembly on a chromosome‐arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome‐by‐chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules.  相似文献   

2.
Complex Triticeae genomes pose a challenge to genome sequencing efforts due to their size and repetitive nature. Genome sequencing can reveal details of conservation and rearrangements between related genomes. We have applied Illumina second generation sequencing technology to sequence and assemble the low copy and unique regions of Triticum aestivum chromosome arm 7BS, followed by the construction of a syntenic build based on gene order in Brachypodium. We have delimited the position of a previously reported translocation between 7BS and 4AL with a resolution of one or a few genes and report approximately 13% genes from 7BS having been translocated to 4AL. An additional 13 genes are found on 7BS which appear to have originated from 4AL. The gene content of the 7DS and 7BS syntenic builds indicate a total of ~77,000 genes in wheat. Within wheat syntenic regions, 7BS and 7DS share 740 genes and a common gene conservation rate of ~39% of the genes from the corresponding regions in Brachypodium, as well as a common rate of colinearity with Brachypodium of ~60%. Comparison of wheat homoeologues revealed ~84% of genes previously identified in 7DS have a homoeologue on 7BS or 4AL. The conservation rates we have identified among wheat homoeologues and with Brachypodium provide a benchmark of homoeologous gene conservation levels for future comparative genomic analysis. The syntenic build of 7BS is publicly available at http://www.wheatgenome.info.  相似文献   

3.
Bread wheat (Triticum aestivum L.) is the most important staple food crop for 35% of the world's population. International efforts are underway to facilitate an increase in wheat production, of which the International Wheat Genome Sequencing Consortium (IWGSC) plays an important role. As part of this effort, we have developed a sequence‐based physical map of wheat chromosome 6A using whole‐genome profiling (WGP?). The bacterial artificial chromosome (BAC) contig assembly tools fingerprinted contig (fpc ) and linear topological contig (ltc ) were used and their contig assemblies were compared. A detailed investigation of the contigs structure revealed that ltc created a highly robust assembly compared with those formed by fpc . The ltc assemblies contained 1217 contigs for the short arm and 1113 contigs for the long arm, with an L50 of 1 Mb. To facilitate in silico anchoring, WGP? tags underlying BAC contigs were extended by wheat and wheat progenitor genome sequence information. Sequence data were used for in silico anchoring against genetic markers with known sequences, of which almost 79% of the physical map could be anchored. Moreover, the assigned sequence information led to the ‘decoration’ of the respective physical map with 3359 anchored genes. Thus, this robust and genetically anchored physical map will serve as a framework for the sequencing of wheat chromosome 6A, and is of immediate use for map‐based isolation of agronomically important genes/quantitative trait loci located on this chromosome.  相似文献   

4.
Bread wheat (Triticum aestivum, AABBDD) is an allohexaploid species derived from two rounds of interspecific hybridizations. A high-quality genome sequence assembly of diploid Aegilops tauschii, the donor of the wheat D genome, will provide a useful platform to study polyploid wheat evolution. A combined approach of BAC pooling and next-generation sequencing technology was employed to sequence the minimum tiling path (MTP) of 3176 BAC clones from the short arm of Ae. tauschii chromosome 3 (At3DS). The final assembly of 135 super-scaffolds with an N50 of 4.2 Mb was used to build a 247-Mb pseudomolecule with a total of 2222 predicted protein-coding genes. Compared with the orthologous regions of rice, Brachypodium, and sorghum, At3DS contains 38.67% more genes. In comparison to At3DS, the short arm sequence of wheat chromosome 3B (Ta3BS) is 95-Mb large in size, which is primarily due to the expansion of the non-centromeric region, suggesting that transposable element (TE) bursts in Ta3B likely occurred there. Also, the size increase is accompanied by a proportional increase in gene number in Ta3BS. We found that in the sequence of short arm of wheat chromosome 3D (Ta3DS), there was only less than 0.27% gene loss compared to At3DS. Our study reveals divergent evolution of grass genomes and provides new insights into sequence changes in the polyploid wheat genome.  相似文献   

5.

Background

The ~17 Gb hexaploid bread wheat genome is a high priority and a major technical challenge for genomic studies. In particular, the D sub-genome is relatively lacking in genetic diversity, making it both difficult to map genetically, and a target for introgression of agriculturally useful traits. Elucidating its sequence and structure will therefore facilitate wheat breeding and crop improvement.

Results

We generated shotgun sequences from each arm of flow-sorted Triticum aestivum chromosome 5D using 454 FLX Titanium technology, giving 1.34× and 1.61× coverage of the short (5DS) and long (5DL) arms of the chromosome respectively. By a combination of sequence similarity and assembly-based methods, ~74% of the sequence reads were classified as repetitive elements, and coding sequence models of 1314 (5DS) and 2975 (5DL) genes were generated. The order of conserved genes in syntenic regions of previously sequenced grass genomes were integrated with physical and genetic map positions of 518 wheat markers to establish a virtual gene order for chromosome 5D.

Conclusions

The virtual gene order revealed a large-scale chromosomal rearrangement in the peri-centromeric region of 5DL, and a concentration of non-syntenic genes in the telomeric region of 5DS. Although our data support the large-scale conservation of Triticeae chromosome structure, they also suggest that some regions are evolving rapidly through frequent gene duplications and translocations.

Sequence accessions

EBI European Nucleotide Archive, Study no. ERP002330

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1080) contains supplementary material, which is available to authorized users.  相似文献   

6.
Mapping‐by‐sequencing analyses have largely required a complete reference sequence and employed whole genome re‐sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re‐sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early‐flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene‐rich regions of hexaploid bread wheat to design a 110‐Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo‐chromosomes derived from the capture probe target sequence, with a long‐range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval.  相似文献   

7.
8.
Bread wheat (Triticum aestivum L.) is one of the most important crops globally and a high priority for genetic improvement, but its large and complex genome has been seen as intractable to whole genome sequencing. Isolation of individual wheat chromosome arms has facilitated large-scale sequence analyses. However, so far there is no such survey of sequences from the A genome of wheat. Greater understanding of an A chromosome could facilitate wheat improvement and future sequencing of the entire genome. We have constructed BAC library from the long arm of T. aestivum chromosome 1A (1AL) and obtained BAC end sequences from 7,470 clones encompassing the arm. We obtained 13,445 (89.99%) useful sequences with a cumulative length of 7.57 Mb, representing 1.43% of 1AL and about 0.14% of the entire A genome. The GC content of the sequences was 44.7%, and 90% of the chromosome was estimated to comprise repeat sequences, while just over 1% encoded expressed genes. From the sequence data, we identified a large number of sites suitable for development of molecular markers (362 SSR and 6,948 ISBP) which will have utility for mapping this chromosome and for marker assisted breeding. From 44 putative ISBP markers tested 23 (52.3%) were found to be useful. The BAC end sequence data also enabled the identification of genes and syntenic blocks specific to chromosome 1AL, suggesting regions of particular functional interest and targets for future research.  相似文献   

9.
The evolution of five chromosomes of Brachypodium distachyon from a 12-chromosome ancestor of all grasses by dysploidy raises an interesting question about the fate of redundant centromeres. Three independent but complementary approaches were pursued to study centromeric region homologies among the chromosomes of Brachypodium, wheat, and rice. The genes present in pericentromeres of the basic set of seven chromosomes of wheat and the Triticeae, and the 80 rice centromeric genes spanning the CENH3 binding domain of centromeres 3, 4, 5, 7, and 8 were used as “anchor” markers to identify centromere locations in the B. distachyon chromosomes. A total of 53 B. distachyon bacterial artificial chromosome (BAC) clones anchored by wheat pericentromeric expressed sequence tags (ESTs) were used as probes for BAC-fluorescence in situ hybridization (FISH) analysis of B. distachyon mitotic chromosomes. Integrated sequence alignment and BAC-FISH data were used to determine the approximate positions of active and inactive centromeres in the five B. distachyon chromosomes. The following syntenic relationships of the centromeres for Brachypodium (Bd), rice (R), and wheat (W) were evident: Bd1-R6, Bd2-R5-W1, Bd3-R10, Bd4-R11-W4, and Bd5-R4. Six rice centromeres syntenic to five wheat centromeres were inactive in Brachypodium chromosomes. The conservation of centromere gene synteny among several sets of homologous centromeres of three species indicates that active genes can persist in ancient centromeres with more than 40 million years of shared evolutionary history. Annotation of a BAC contig spanning an inactive centromere in chromosome Bd3 which is syntenic to rice Cen8 and W7 pericentromeres, along with BAC FISH data from inactive centromeres revealed that the centromere inactivation was accompanied by the loss of centromeric retrotransposons and turnover of centromere-specific satellites during Bd chromosome evolution.  相似文献   

10.
Next‐generation sequencing (NGS) provides a powerful tool for the discovery of important genes and alleles in crop plants and their wild relatives. Despite great advances in NGS technologies, whole‐genome shotgun sequencing is cost‐prohibitive for species with complex genomes. An attractive option is to reduce genome complexity to a single chromosome prior to sequencing. This work describes a strategy for studying the genomes of distant wild relatives of wheat by isolating single chromosomes from addition or substitution lines, followed by chromosome sorting using flow cytometry and sequencing of chromosomal DNA by NGS technology. We flow‐sorted chromosome 5Mg from a wheat/Aegilops geniculata disomic substitution line [DS5Mg (5D)] and sequenced it using an Illumina HiSeq 2000 system at approximately 50 × coverage. Paired‐end sequences were assembled and used for structural and functional annotation. A total of 4236 genes were annotated on 5Mg, in close agreement with the predicted number of genes on wheat chromosome 5D (4286). Single‐gene FISH indicated no major chromosomal rearrangements between chromosomes 5Mg and 5D. Comparing chromosome 5Mg with model grass genomes identified synteny blocks in Brachypodium distachyon, rice (Oryza sativa), sorghum (Sorghum bicolor) and barley (Hordeum vulgare). Chromosome 5Mg‐specific SNPs and cytogenetic probe‐based resources were developed and validated. Deletion bin‐mapped and ordered 5Mg SNP markers will be useful to track 5M‐specific introgressions and translocations. This study provides a detailed sequence‐based analysis of the composition of a chromosome from a distant wild relative of bread wheat, and opens up opportunities to develop genomic resources for wild germplasm to facilitate crop improvement.  相似文献   

11.
Flow cytometric sorting of individual chromosomes and chromosome‐based sequencing reduces the complexity of large, repetitive Triticeae genomes. We flow‐sorted chromosome 5D of Aegilops tauschii, the D genome donor of bread wheat and sequenced it by Roche 454 GS FLX platform to approximately 2.2x coverage. Repetitive sequences represent 81.09% of the survey sequences of this chromosome, and Class I retroelements are the prominent type, with a particular abundance of LTR/Gypsy superfamily. Nonrepetitive sequences were assembled to cover 17.76% of the total chromosome regions. Up to 6188 nonrepetitive gene loci were predicted to be encoded by the 5D chromosome. The numbers and chromosomal distribution patterns of tRNA genes suggest abundance in tRNALys and tRNAMet species, while the nonrepetitive assembly reveals tRNAAla species as the most abundant type. A comparative analysis of the genomic sequences of bread wheat and Aegilops chromosome 5D indicates conservation of gene content. Orthologous unique genes, matching Aegilops 5D sequences, numbered 3730 in barley, 5063 in Brachypodium, 4872 in sorghum and 4209 in rice. In this study, we provide a chromosome‐specific view into the structure and organization of the 5D chromosome of Ae. tauschii, the D genome ancestor of bread wheat. This study contributes to our understanding of the chromosome‐level evolution of the wheat genome and presents a valuable resource in wheat genomics due to the recent hybridization of Ae. tauschii genome with its tetraploid ancestor.  相似文献   

12.
With the expansion of next‐generation sequencing technology and advanced bioinformatics, there has been a rapid growth of genome sequencing projects. However, while this technology enables the rapid and cost‐effective assembly of draft genomes, the quality of these assemblies usually falls short of gold standard genome assemblies produced using the more traditional BAC by BAC and Sanger sequencing approaches. Assembly validation is often performed by the physical anchoring of genetically mapped markers, but this is prone to errors and the resolution is usually low, especially towards centromeric regions where recombination is limited. New approaches are required to validate reference genome assemblies. The ability to isolate individual chromosomes combined with next‐generation sequencing permits the validation of genome assemblies at the chromosome level. We demonstrate this approach by the assessment of the recently published chickpea kabuli and desi genomes. While previous genetic analysis suggests that these genomes should be very similar, a comparison of their chromosome sizes and published assemblies highlights significant differences. Our chromosomal genomics analysis highlights short defined regions that appear to have been misassembled in the kabuli genome and identifies large‐scale misassembly in the draft desi genome. The integration of chromosomal genomics tools within genome sequencing projects has the potential to significantly improve the construction and validation of genome assemblies. The approach could be applied both for new genome assemblies as well as published assemblies, and complements currently applied genome assembly strategies.  相似文献   

13.
Cucumber, Cucumis sativus L. is the only taxon with 2n = 2x = 14 chromosomes in the genus Cucumis. It consists of two cross‐compatible botanical varieties: the cultivated C. sativus var. sativus and the wild C. sativus var. hardwickii. There is no consensus on the evolutionary relationship between the two taxa. Whole‐genome sequencing of the cucumber genome provides a new opportunity to advance our understanding of chromosome evolution and the domestication history of cucumber. In this study, a high‐density genetic map for cultivated cucumber was developed that contained 735 marker loci in seven linkage groups spanning 707.8 cM. Integration of genetic and physical maps resulted in a chromosome‐level draft genome assembly comprising 193 Mbp, or 53% of the 367 Mbp cucumber genome. Strategically selected markers from the genetic map and draft genome assembly were employed to screen for fosmid clones for use as probes in comparative fluorescence in situ hybridization analysis of pachytene chromosomes to investigate genetic differentiation between wild and cultivated cucumbers. Significant differences in the amount and distribution of heterochromatins, as well as chromosomal rearrangements, were uncovered between the two taxa. In particular, six inversions, five paracentric and one pericentric, were revealed in chromosomes 4, 5 and 7. Comparison of the order of fosmid loci on chromosome 7 of cultivated and wild cucumbers, and the syntenic melon chromosome I suggested that the paracentric inversion in this chromosome occurred during domestication of cucumber. The results support the sub‐species status of these two cucumber taxa, and suggest that C. sativus var. hardwickii is the progenitor of cultivated cucumber.  相似文献   

14.

Background

The substantially large bread wheat genome, organized into highly similar three sub-genomes, renders genomic research challenging. The construction of BAC-based physical maps of individual chromosomes reduces the complexity of this allohexaploid genome, enables elucidation of gene space and evolutionary relationships, provides tools for map-based cloning, and serves as a framework for reference sequencing efforts. In this study, we constructed the first comprehensive physical map of wheat chromosome arm 5DS, thereby exploring its gene space organization and evolution.

Results

The physical map of 5DS was comprised of 164 contigs, of which 45 were organized into 21 supercontigs, covering 176 Mb with an N50 value of 2,173 kb. Fifty-eight of the contigs were larger than 1 Mb, with the largest contig spanning 6,649 kb. A total of 1,864 molecular markers were assigned to the map at a density of 10.5 markers/Mb, anchoring 100 of the 120 contigs (>5 clones) that constitute ~95 % of the cumulative length of the map. Ordering of 80 contigs along the deletion bins of chromosome arm 5DS revealed small-scale breaks in syntenic blocks. Analysis of the gene space of 5DS suggested an increasing gradient of genes organized in islands towards the telomere, with the highest gene density of 5.17 genes/Mb in the 0.67-0.78 deletion bin, 1.4 to 1.6 times that of all other bins.

Conclusions

Here, we provide a chromosome-specific view into the organization and evolution of the D genome of bread wheat, in comparison to one of its ancestors, revealing recent genome rearrangements. The high-quality physical map constructed in this study paves the way for the assembly of a reference sequence, from which breeding efforts will greatly benefit.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1641-y) contains supplementary material, which is available to authorized users.  相似文献   

15.
Despite being a major international crop, our understanding of the wheat genome is relatively poor due to its large size and complexity. To gain a greater understanding of wheat genome diversity, we have identified single nucleotide polymorphisms between 16 Australian bread wheat varieties. Whole‐genome shotgun Illumina paired read sequence data were mapped to the draft assemblies of chromosomes 7A, 7B and 7D to identify more than 4 million intervarietal SNPs. SNP density varied between the three genomes, with much greater density observed on the A and B genomes than the D genome. This variation may be a result of substantial gene flow from the tetraploid Triticum turgidum, which possesses A and B genomes, during early co‐cultivation of tetraploid and hexaploid wheat. In addition, we examined SNP density variation along the chromosome syntenic builds and identified genes in low‐density regions which may have been selected during domestication and breeding. This study highlights the impact of evolution and breeding on the bread wheat genome and provides a substantial resource for trait association and crop improvement. All SNP data are publically available on a generic genome browser GBrowse at www.wheatgenome.info .  相似文献   

16.

Background

The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale) with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundreds of bread wheat varieties worldwide.

Methodology/Principal Findings

Multiple Displacement Amplification of 1RS DNA, obtained from flow sorted 1RS chromosomes, using 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3%) being the most abundant. More than four thousand simple sequence repeat (SSR) sites mostly located in gene related sequence reads were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice.

Conclusions

The present study revealed the gene content and potential gene functions on this chromosome arm and demonstrated numerous sequence elements like SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye.  相似文献   

17.
Wheat is one of the most important staple crops worldwide and also an excellent model species for crop evolution and polyploidization studies. The breakthrough of sequencing the bread wheat genome and progenitor genomes lays the foundation to decipher the complexity of wheat origin and evolutionary process as well as the genetic consequences of polyploidization. In this study, we sequenced 3286 BACs from chromosome 7DL of bread wheat cv. Chinese Spring and integrated the unmapped contigs from IWGSC v1 and available PacBio sequences to close gaps present in the 7DL assembly. In total, 8043 out of 12 825 gaps, representing 3 491 264 bp, were closed. We then used the improved assembly of 7DL to perform comparative genomic analysis of bread wheat (Ta7DL) and its D donor, Aegilops tauschii (At7DL), to identify domestication signatures. Results showed a strong syntenic relationship between Ta7DL and At7DL, although some small rearrangements were detected at the distal regions. A total of 53 genes appear to be lost genes during wheat polyploidization, with 23% (12 genes) as RGA (disease resistance gene analogue). Furthermore, 86 positively selected genes (PSGs) were identified, considered to be domestication‐related candidates. Finally, overlapping of QTLs obtained from GWAS analysis and PSGs indicated that TraesCS7D02G321000 may be one of the domestication genes involved in grain morphology. This study provides comparative information on the sequence, structure and organization between bread wheat and Ae. tauschii from the perspective of the 7DL chromosome, which contribute to better understanding of the evolution of wheat, and supports wheat crop improvement.  相似文献   

18.
Generating a contiguous, ordered reference sequence of a complex genome such as hexaploid wheat (2n = 6x = 42; approximately 17 GB) is a challenging task due to its large, highly repetitive, and allopolyploid genome. In wheat, ordering of whole‐genome or hierarchical shotgun sequencing contigs is primarily based on recombination and comparative genomics‐based approaches. However, comparative genomics approaches are limited to syntenic inference and recombination is suppressed within the pericentromeric regions of wheat chromosomes, thus, precise ordering of physical maps and sequenced contigs across the whole‐genome using these approaches is nearly impossible. We developed a whole‐genome radiation hybrid (WGRH) resource and tested it by genotyping a set of 115 randomly selected lines on a high‐density single nucleotide polymorphism (SNP) array. At the whole‐genome level, 26 299 SNP markers were mapped on the RH panel and provided an average mapping resolution of approximately 248 Kb/cR1500 with a total map length of 6866 cR1500. The 7296 unique mapping bins provided a five‐ to eight‐fold higher resolution than genetic maps used in similar studies. Most strikingly, the RH map had uniform bin resolution across the entire chromosome(s), including pericentromeric regions. Our research provides a valuable and low‐cost resource for anchoring and ordering sequenced BAC and next generation sequencing (NGS) contigs. The WGRH developed for reference wheat line Chinese Spring (CS‐WGRH), will be useful for anchoring and ordering sequenced BAC and NGS based contigs for assembling a high‐quality, reference sequence of hexaploid wheat. Additionally, this study provides an excellent model for developing similar resources for other polyploid species.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号