首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
Erigeron breviscapus is an important medicinal plant in Compositae and the first species to realize the whole process from the decoding of the draft genome sequence to scutellarin biosynthesis in yeast. However, the previous low‐quality genome assembly has hindered the optimization of candidate genes involved in scutellarin synthesis and the development of molecular‐assisted breeding based on the genome. Here, the E. breviscapus genome was updated using PacBio RSII sequencing data and Hi‐C data, and increased in size from 1.2 Gb to 1.43 Gb, with a scaffold N50 of 156.82 Mb and contig N50 of 140.95 kb, and a total of 43,514 protein‐coding genes were obtained and oriented onto nine pseudo‐chromosomes, thus becoming the third plant species assembled to chromosome level after sunflower and lettuce in Compositae. Fourteen genes with evidence for positive selection were identified and found to be related to leaf morphology, flowering and secondary metabolism. The number of genes in some gene families involved in flavonoid biosynthesis in E. breviscapus have been significantly expanded. In particular, additional candidate genes involved in scutellarin biosynthesis, such as flavonoid‐7‐O‐glucuronosyltransferase genes (F7GATs) were identified using updated genome. In addition, three candidate genes encoding indole‐3‐pyruvate monooxygenase YUCCA2 (YUC2), serine carboxypeptidase‐like 18 (SCPL18), and F‐box protein (FBP), respectively, were identified to be probably related to leaf development and flowering by resequencing 99 individuals. These results provided a substantial genetic basis for improving agronomic and quality traits of E. breviscapus, and provided a platform for improving other draft genome assemblies to chromosome‐level.  相似文献   

3.
Advanced resources for genome‐assisted research in barley (Hordeum vulgare) including a whole‐genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole‐genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA‐coding exome reduces barley genomic complexity more than 50‐fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in‐solution hybridization‐based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full‐length cDNAs and de novo assembled RNA‐Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA‐coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping‐by‐sequencing and genetic diversity analyzes.  相似文献   

4.
5.
Onychostoma macrolepis is an emerging commercial cyprinid fish species. It is a model system for studies of sexual dimorphism and genome evolution. Here, we report the chromosome‐level assembly of the O.macrolepis genome obtained from the integration of nanopore long‐read sequencing with physical maps produced using Bionano and Hi‐C technology. A total of 87.9 Gb of nanopore sequence provided approximately 100‐fold coverage of the genome. The preliminary genome assembly was 883.2 Mb in size with a contig N50 size of 11.2 Mb. The 969 corrected contigs obtained from Bionano optical mapping were assembled into 853 scaffolds and produced an assembly of 886.5 Mb with a scaffold N50 of 16.5 Mb. Finally, using the Hi‐C data, 881.3 Mb (99.4% of genome) in 526 scaffolds were anchored and oriented in 25 chromosomes ranging in size from 25.27 to 56.49 Mb. In total, 24,770 protein‐coding genes were predicted in the genome, and ~96.85% of the genes were functionally annotated. The annotated assembly contains 93.3% complete genes from the BUSCO reference set. In addition, we identified 409 Mb (46.23% of the genome) of repetitive sequence, and 11,213 non‐coding RNAs, in the genome. Evolutionary analysis revealed that O. macrolepis diverged from common carp approximately 24.25 million years ago. The chromosomes of O. macrolepis showed an unambiguous correspondence to the chromosomes of zebrafish. The high‐quality genome assembled in this work provides a valuable genomic resource for further biological and evolutionary studies of O. macrolepis.  相似文献   

6.
Wild barley (Hordeum spontaneum) is the progenitor of cultivated barley (Hordeum vulgare) and provides a rich source of genetic variations for barley improvement. Currently, the genome sequences of wild barley and its differences with cultivated barley remain unclear. In this study, we report a high‐quality draft assembly of wild barley accession (AWCS276; henceforth named as WB1), which consists of 4.28 Gb genome and 36 395 high‐confidence protein‐coding genes. BUSCO analysis revealed that the assembly included full lengths of 95.3% of the 956 single‐copy plant genes, illustrating that the gene‐containing regions have been well assembled. By comparing with the genome of the cultivated genotype Morex, it is inferred that the WB1 genome contains more genes involved in resistance and tolerance to biotic and abiotic stresses. The presence of the numerous WB1‐specific genes indicates that, in addition to enhance allele diversity for genes already existing in the cultigen, exploiting the wild barley taxon in breeding should also allow the incorporation of novel genes. Furthermore, high levels of genetic variation in the pericentromeric regions were detected in chromosomes 3H and 5H between the wild and cultivated genotypes, which may be the results of domestication. This H. spontaneum draft genome assembly will help to accelerate wild barley research and be an invaluable resource for barley improvement and comparative genomics research.  相似文献   

7.
Allotetraploid oilseed rape (Brassica napus L.) is an agriculturally important crop. Cultivation and breeding of B. napus by humans has resulted in numerous genetically diverse morphotypes with optimized agronomic traits and ecophysiological adaptation. To further understand the genetic basis of diversification and adaptation, we report a draft genome of an Asian semi‐winter oilseed rape cultivar ‘ZS11’ and its comprehensive genomic comparison with the genomes of the winter‐type cultivar ‘Darmor‐bzh’ as well as two progenitors. The integrated BAC‐to‐BAC and whole‐genome shotgun sequencing strategies were effective in the assembly of repetitive regions (especially young long terminal repeats) and resulted in a high‐quality genome assembly of B. napus ‘ZS11’. Within a short evolutionary period (~6700 years ago), semi‐winter‐type ‘ZS11’ and the winter‐type ‘Darmor‐bzh’ maintained highly genomic collinearity. Even so, certain genetic differences were also detected in two morphotypes. Relative to ‘Darmor‐bzh’, both two subgenomes of ‘ZS11’ are closely related to its progenitors, and the ‘ZS11’ genome harbored several specific segmental homoeologous exchanges (HEs). Furthermore, the semi‐winter‐type ‘ZS11’ underwent potential genomic introgressions with B. rapa (Ar). Some of these genetic differences were associated with key agronomic traits. A key gene of A03.FLC3 regulating vernalization‐responsive flowering time in ‘ZS11’ was first experienced HE, and then underwent genomic introgression event with Ar, which potentially has led to genetic differences in controlling vernalization in the semi‐winter types. Our observations improved our understanding of the genetic diversity of different B. napus morphotypes and the cultivation history of semi‐winter oilseed rape in Asia.  相似文献   

8.
The Tetraodontidae family are known to have relatively small and compact genomes compared to other vertebrates. The obscure puffer fish Takifugu obscurus is an anadromous species that migrates to freshwater from the sea for spawning. Thus the euryhaline characteristics of T. obscurus have been investigated to gain understanding of their survival ability, osmoregulation, and other homeostatic mechanisms in both freshwater and seawater. In this study, a high quality chromosome‐level reference genome for T. obscurus was constructed using long‐read Pacific Biosciences (PacBio) Sequel sequencing and a Hi‐C‐based chromatin contact map platform. The final genome assembly of T. obscurus is 381 Mb, with a contig N50 length of 3,296 kb and longest length of 10.7 Mb, from a total of 62 Gb of raw reads generated using single‐molecule real‐time sequencing technology from a PacBio Sequel platform. The PacBio data were further clustered into chromosome‐scale scaffolds using a Hi‐C approach, resulting in a 373 Mb genome assembly with a contig N50 length of 15.2 Mb and and longest length of 28 Mb. When we directly compared the 22 longest scaffolds of T. obscurus to the 22 chromosomes of the tiger puffer Takifugu rubripes, a clear one‐to‐one orthologous relationship was observed between the two species, supporting the chromosome‐level assembly of T. obscurus. This genome assembly can serve as a valuable genetic resource for exploring fugu‐specific compact genome characteristics, and will provide essential genomic information for understanding molecular adaptations to salinity fluctuations and the evolution of osmoregulatory mechanisms.  相似文献   

9.
Chinese liquorice/licorice (Glycyrrhiza uralensis) is a leguminous plant species whose roots and rhizomes have been widely used as a herbal medicine and natural sweetener. Whole‐genome sequencing is essential for gene discovery studies and molecular breeding in liquorice. Here, we report a draft assembly of the approximately 379‐Mb whole‐genome sequence of strain 308‐19 of G. uralensis; this assembly contains 34 445 predicted protein‐coding genes. Comparative analyses suggested well‐conserved genomic components and collinearity of gene loci (synteny) between the genome of liquorice and those of other legumes such as Medicago and chickpea. We observed that three genes involved in isoflavonoid biosynthesis, namely, 2‐hydroxyisoflavanone synthase (CYP93C), 2,7,4′‐trihydroxyisoflavanone 4′‐O‐methyltransferase/isoflavone 4′‐O‐methyltransferase (HI4OMT) and isoflavone‐7‐O‐methyltransferase (7‐IOMT) formed a cluster on the scaffold of the liquorice genome and showed conserved microsynteny with Medicago and chickpea. Based on the liquorice genome annotation, we predicted genes in the P450 and UDP‐dependent glycosyltransferase (UGT) superfamilies, some of which are involved in triterpenoid saponin biosynthesis, and characterised their gene expression with the reference genome sequence. The genome sequencing and its annotations provide an essential resource for liquorice improvement through molecular breeding and the discovery of useful genes for engineering bioactive components through synthetic biology approaches.  相似文献   

10.
Combining high‐throughput sequencing with targeted sequence capture has become an attractive tool to study specific genomic regions of interest. Most studies have so far focused on the exome using short‐read technology. These approaches are not designed to capture intergenic regions needed to reconstruct genomic organization, including regulatory regions and gene synteny. Here, we demonstrate the power of combining targeted sequence capture with long‐read sequencing technology for comparative genomic analyses of the haemoglobin (Hb) gene clusters across eight species separated by up to 70 million years. Guided by the reference genome assembly of the Atlantic cod (Gadus morhua) together with genome information from draft assemblies of selected codfishes, we designed probes covering the two Hb gene clusters. Use of custom‐made barcodes combined with PacBio RSII sequencing led to highly continuous assemblies of the LA (~100 kb) and MN (~200 kb) clusters, which include syntenic regions of coding and intergenic sequences. Our results revealed an overall conserved genomic organization of the Hb genes within this lineage, yet with several, lineage‐specific gene duplications. Moreover, for some of the species examined, we identified amino acid substitutions at two sites in the Hbb1 gene as well as length polymorphisms in its regulatory region, which has previously been linked to temperature adaptation in Atlantic cod populations. This study highlights the use of targeted long‐read capture as a versatile approach for comparative genomic studies by generation of a cross‐species genomic resource elucidating the evolutionary history of the Hb gene family across the highly divergent group of codfishes.  相似文献   

11.
Marine medaka (Oryzias melastigma) is considered to be a useful fish model for marine and estuarine ecotoxicology studies and has good potential for field‐based population genomics because of its geographical distribution in Asian estuarine and coastal areas. In this study, we present the first whole‐genome draft of O. melastigma. The genome assembly consists of 8,602 scaffolds (N50 = 23.737 Mb) and a total genome length of 779.4 Mb. A total of 23,528 genes were predicted, and 12,670 gene families shared with three teleost species (Japanese medaka, mangrove killifish and zebrafish) were identified. Genome analyses revealed that the O. melastigma genome is highly heterozygous and contains a large number of repeat sequences. This assembly represents a useful genomic resource for fish scientists.  相似文献   

12.
13.
Symbiotic relationships between vestimentiferan tubeworms and chemosynthetic Gammaproteobacteria build the foundations of many hydrothermal vent and hydrocarbon seep ecosystems in the deep sea. The association between the vent tubeworm Riftia pachyptila and its endosymbiont Candidatus Endoriftia persephone has become a model system for symbiosis research in deep‐sea vestimentiferans, while markedly fewer studies have investigated symbiotic relationships in other tubeworm species, especially at cold seeps. Here we sequenced the endosymbiont genome of the tubeworm Lamellibrachia barhami from a cold seep in the Gulf of California, using short‐ and long‐read sequencing technologies in combination with Hi‐C and Dovetail Chicago libraries. Our final assembly had a size of ~4.17 MB, a GC content of 54.54%, 137X coverage, 4153 coding sequences, and a CheckM completeness score of 97.19%. A single scaffold contained 99.51% of the genome. Comparative genomic analyses indicated that the L. barhami symbiont shares a set of core genes and many metabolic pathways with other vestimentiferan symbionts, while containing 433 unique gene clusters that comprised a variety of transposases, defence‐related genes and a lineage‐specific CRISPR/Cas3 system. This assembly represents the most contiguous tubeworm symbiont genome resource to date and will be particularly valuable for future comparative genomic studies investigating structural genome evolution, physiological adaptations and host‐symbiont communication in chemosynthetic animal‐microbe symbioses.  相似文献   

14.
Casuarina equisetifolia (C. equisetifolia), a conifer‐like angiosperm with resistance to typhoon and stress tolerance, is mainly cultivated in the coastal areas of Australasia. C. equisetifolia, making it a valuable model to study secondary growth associated genes and stress‐tolerance traits. However, the genome sequence is unavailable and therefore wood‐associated growth rate and stress resistance at the molecular level is largely unexplored. We therefore constructed a high‐quality draft genome sequence of C. equisetifolia by a combination of Illumina second‐generation sequencing reads and Pacific Biosciences single‐molecule real‐time (SMRT) long reads to advance the investigation of this species. Here, we report the genome assembly, which contains approximately 300 megabases (Mb) and scaffold size of N50 is 1.06 Mb. Additionally, gene annotation, assisted by a combination of prediction and RNA‐seq data, generated 29 827 annotated protein‐coding genes and 1983 non‐coding genes, respectively. Furthermore, we found that the total number of repetitive sequences account for one‐third of the genome assembly. Here we also construct the genome‐wide map of DNA modification, such as two novel forms N6‐adenine (6mA) and N4‐methylcytosine (4mC) at the level of single‐nucleotide resolution using single‐molecule real‐time (SMRT) sequencing. Interestingly, we found that 17% of 6mA modification genes and 15% of 4mC modification genes also included alternative splicing events. Finally, we investigated cellulose, hemicellulose, and lignin‐related genes, which were associated with secondary growth and contained different DNA modifications. The high‐quality genome sequence and annotation of C. equisetifolia in this study provide a valuable resource to strengthen our understanding of the diverse traits of trees.  相似文献   

15.
Salmonids are of particular interest to evolutionary biologists due to their incredible diversity of life‐history strategies and the speed at which many salmonid species have diversified. In Switzerland alone, over 30 species of Alpine whitefish from the subfamily Coregoninae have evolved since the last glacial maximum, with species exhibiting a diverse range of morphological and behavioural phenotypes. This, combined with the whole genome duplication which occurred in the ancestor of all salmonids, makes the Alpine whitefish radiation a particularly interesting system in which to study the genetic basis of adaptation and speciation and the impacts of ploidy changes and subsequent rediploidization on genome evolution. Although well‐curated genome assemblies exist for many species within Salmonidae, genomic resources for the subfamily Coregoninae are lacking. To assemble a whitefish reference genome, we carried out PacBio sequencing from one wild‐caught Coregonus sp. “Balchen” from Lake Thun to ~90× coverage. PacBio reads were assembled independently using three different assemblers, falcon , canu and wtdbg2 and subsequently scaffolded with additional Hi‐C data. All three assemblies were highly contiguous, had strong synteny to a previously published Coregonus linkage map, and when mapping additional short‐read data to each of the assemblies, coverage was fairly even across most chromosome‐scale scaffolds. Here, we present the first de novo genome assembly for the Salmonid subfamily Coregoninae. The final 2.2‐Gb wtdbg2 assembly included 40 scaffolds, an N50 of 51.9 Mb and was 93.3% complete for BUSCOs. The assembly consisted of ~52% transposable elements and contained 44,525 genes.  相似文献   

16.
The leopard coral grouper, Plectropomus leopardus, belonging to the family Epinephelinae, is a carnivorous coral reef fish widely distributed in tropical and subtropical waters of the Indo‐Pacific. Due to its appealing body appearance and delicious taste, P. leopardus has become a popular commercial fish for aquaculture in many countries. However, the lack of genomic and molecular resources for P. leopardus has hindered study of its biology and genomic breeding programmes. Here we report the de novo sequencing and assembly of the P. leopardus genome using a combination of 10 × Genomics, high‐throughput chromosome conformation capture (Hi‐C) and PacBio long‐read sequencing technologies. The genome assembly has a total length of 881.55 Mb with a scaffold N50 of 34.15 Mb, consisting of 24 pseudochromosome scaffolds. busco analysis showed that 97.2% of the conserved single‐copy genes were retrieved, indicating the assembly was almost entire. We predicted 25,248 protein‐coding genes, among which 96.5% were functionally annotated. Comparative genomic analyses revealed that gene family expansions in P. leopardus were associated with immune‐related pathways. In addition, we identified 5,178,453 single nucleotide polymorphisms based on genome resequencing of 54 individuals. The P. leopardus genome and genomic variation data provide valuable genomic resources for studies of its genetics, evolution and biology. In particular, it is expected to benefit the development of genomic breeding programmes in the farming industry.  相似文献   

17.
Next‐generation whole‐genome shotgun assemblies of complex genomes are highly useful, but fail to link nearby sequence contigs with each other or provide a linear order of contigs along individual chromosomes. Here, we introduce a strategy based on sequencing progeny of a segregating population that allows de novo production of a genetically anchored linear assembly of the gene space of an organism. We demonstrate the power of the approach by reconstructing the chromosomal organization of the gene space of barley, a large, complex and highly repetitive 5.1 Gb genome. We evaluate the robustness of the new assembly by comparison to a recently released physical and genetic framework of the barley genome, and to various genetically ordered sequence‐based genotypic datasets. The method is independent of the need for any prior sequence resources, and will enable rapid and cost‐efficient establishment of powerful genomic information for many species.  相似文献   

18.
The superb fairy‐wren, Malurus cyaneus, is one of the most iconic Australian passerine species. This species belongs to an endemic Australasian clade, Meliphagides, which diversified early in the evolution of the oscine passerines. Today, the oscine passerines comprise almost half of all avian species diversity. Despite the rapid increase of available bird genome assemblies, this part of the avian tree has not yet been represented by a high‐quality reference. To rectify that, we present the first high‐quality genome assembly of a Meliphagides representative: the superb fairy‐wren. We combined Illumina shotgun and mate‐pair sequences, PacBio long‐reads, and a genetic linkage map from an intensively sampled pedigree of a wild population to generate this genome assembly. Of the final assembled 1.07‐Gb genome, 975 Mb (90.4%) was anchored onto 25 pseudochromosomes resulting in a final superscaffold N50 of 68.11 Mb. This high‐quality bird genome assembly is one of only a handful which is also accompanied by a genetic map and recombination landscape. In comparison to other pedigree‐based bird genetic maps, we find that the fairy‐wren genetic map more closely resembles those of Taeniopygia guttata and Parus major maps, unlike the Ficedula albicollis map which more closely resembles that of Gallus gallus. Lastly, we also provide a predictive gene and repeat annotation of the genome assembly. This new high‐quality, annotated genome assembly will be an invaluable resource not only regarding the superb fairy‐wren species and relatives but also broadly across the avian tree by providing a novel reference point for comparative genomic analyses.  相似文献   

19.
20.
Cicer arietinum L. (chickpea) is the third most important food legume crop. We have generated the draft sequence of a desi‐type chickpea genome using next‐generation sequencing platforms, bacterial artificial chromosome end sequences and a genetic map. The 520‐Mb assembly covers 70% of the predicted 740‐Mb genome length, and more than 80% of the gene space. Genome analysis predicts the presence of 27 571 genes and 210 Mb as repeat elements. The gene expression analysis performed using 274 million RNA‐Seq reads identified several tissue‐specific and stress‐responsive genes. Although segmental duplicated blocks are observed, the chickpea genome does not exhibit any indication of recent whole‐genome duplication. Nucleotide diversity analysis provides an assessment of a narrow genetic base within the chickpea cultivars. We have developed a resource for genetic markers by comparing the genome sequences of one wild and three cultivated chickpea genotypes. The draft genome sequence is expected to facilitate genetic enhancement and breeding to develop improved chickpea varieties.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号