首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
Onychostoma macrolepis is an emerging commercial cyprinid fish species. It is a model system for studies of sexual dimorphism and genome evolution. Here, we report the chromosome‐level assembly of the O.macrolepis genome obtained from the integration of nanopore long‐read sequencing with physical maps produced using Bionano and Hi‐C technology. A total of 87.9 Gb of nanopore sequence provided approximately 100‐fold coverage of the genome. The preliminary genome assembly was 883.2 Mb in size with a contig N50 size of 11.2 Mb. The 969 corrected contigs obtained from Bionano optical mapping were assembled into 853 scaffolds and produced an assembly of 886.5 Mb with a scaffold N50 of 16.5 Mb. Finally, using the Hi‐C data, 881.3 Mb (99.4% of genome) in 526 scaffolds were anchored and oriented in 25 chromosomes ranging in size from 25.27 to 56.49 Mb. In total, 24,770 protein‐coding genes were predicted in the genome, and ~96.85% of the genes were functionally annotated. The annotated assembly contains 93.3% complete genes from the BUSCO reference set. In addition, we identified 409 Mb (46.23% of the genome) of repetitive sequence, and 11,213 non‐coding RNAs, in the genome. Evolutionary analysis revealed that O. macrolepis diverged from common carp approximately 24.25 million years ago. The chromosomes of O. macrolepis showed an unambiguous correspondence to the chromosomes of zebrafish. The high‐quality genome assembled in this work provides a valuable genomic resource for further biological and evolutionary studies of O. macrolepis.  相似文献   

3.
The ladybird beetle Propylea japonica is an important natural enemy in agro‐ecological systems. Studies on the strong tolerance of P. japonica to high temperatures and insecticides, and its population and phenotype diversity have recently increased. However, abundant genome resources for obtaining insights into stress‐resistance mechanisms and genetic intra‐species diversity for P. japonica are lacking. Here, we constructed the P. japonica genome maps using Pacific Bioscience (PacBio) and Illumina sequencing technologies. The genome size was 850.90 Mb with a contig N50 of 813.13 kb. The Hi‐C sequence data were used to upgrade draft genome assemblies; 4,777 contigs were assembled to 10 chromosomes; and the final draft genome assembly was 803.93 Mb with a contig N50 of 813.98 kb and a scaffold N50 of 100.34 Mb. Approximately 495.38 Mb of repeated sequences was annotated. The 18,018 protein‐coding genes were predicted, of which 95.78% were functionally annotated, and 1,407 genes were species‐specific. The phylogenetic analysis showed that P. japonica diverged from the ancestor of Anoplophora glabripennis and Tribolium castaneum ~ 236.21 million years ago. We detected that some important gene families involved in detoxification of pesticides and tolerance to heat stress were expanded in P. japonica, especially cytochrome P450 and Hsp70 genes. Overall, the high‐quality draft genome sequence of P. japonica will provide invaluable resource for understanding the molecular mechanisms of stress resistance and will facilitate the research on population genetics, evolution and phylogeny of Coccinellidae. This genome will also provide new avenues for conserving the diversity of predator insects.  相似文献   

4.
5.
The leopard coral grouper, Plectropomus leopardus, belonging to the family Epinephelinae, is a carnivorous coral reef fish widely distributed in tropical and subtropical waters of the Indo‐Pacific. Due to its appealing body appearance and delicious taste, P. leopardus has become a popular commercial fish for aquaculture in many countries. However, the lack of genomic and molecular resources for P. leopardus has hindered study of its biology and genomic breeding programmes. Here we report the de novo sequencing and assembly of the P. leopardus genome using a combination of 10 × Genomics, high‐throughput chromosome conformation capture (Hi‐C) and PacBio long‐read sequencing technologies. The genome assembly has a total length of 881.55 Mb with a scaffold N50 of 34.15 Mb, consisting of 24 pseudochromosome scaffolds. busco analysis showed that 97.2% of the conserved single‐copy genes were retrieved, indicating the assembly was almost entire. We predicted 25,248 protein‐coding genes, among which 96.5% were functionally annotated. Comparative genomic analyses revealed that gene family expansions in P. leopardus were associated with immune‐related pathways. In addition, we identified 5,178,453 single nucleotide polymorphisms based on genome resequencing of 54 individuals. The P. leopardus genome and genomic variation data provide valuable genomic resources for studies of its genetics, evolution and biology. In particular, it is expected to benefit the development of genomic breeding programmes in the farming industry.  相似文献   

6.
Bivalves, a highly diverse and the most evolutionarily successful class of invertebrates native to aquatic habitats, provide valuable molecular resources for understanding the evolutionary adaptation and aquatic ecology. Here, we reported a high‐quality chromosome‐level genome assembly of the razor clam Sinonovacula constricta using Pacific Bioscience single‐molecule real‐time sequencing, Illumina paired‐end sequencing, 10X Genomics linked‐reads and Hi‐C reads. The genome size was 1,220.85 Mb, containing scaffold N50 of 65.93 Mb and contig N50 of 976.94 Kb. A total of 899 complete (91.92%) and seven partial (0.72%) matches of the 978 metazoa Benchmarking Universal Single‐Copy Orthologs were determined in this genome assembly. And Hi‐C scaffolding of the genome resulted in 19 pseudochromosomes. A total of 28,594 protein‐coding genes were predicted in the S. constricta genome, of which 25,413 genes (88.88%) were functionally annotated. In addition, 39.79% of the assembled genome was composed of repetitive sequences, and 4,372 noncoding RNAs were identified. The enrichment analyses of the significantly expanded and contracted genes suggested an evolutionary adaptation of S. constricta to highly stressful living environments. In summary, the genomic resources generated in this work not only provide a valuable reference genome for investigating the molecular mechanisms of S. constricta biological functions and evolutionary adaptation, but also facilitate its genetic improvement and disease treatment. Meanwhile, the obtained genome greatly improves our understanding of the genetics of molluscs and their comparative evolution.  相似文献   

7.
Chimonanthus salicifolius, a member of the Calycanthaceae of magnoliids, is one of the most famous medicinal plants in Eastern China. Here, we report a chromosome‐level genome assembly of Csalicifolius, comprising 820.1 Mb of genomic sequence with a contig N50 of 2.3 Mb and containing 36 651 annotated protein‐coding genes. Phylogenetic analyses revealed that magnoliids were sister to the eudicots. Two rounds of ancient whole‐genome duplication were inferred in the Csalicifolious genome. One is shared by Calycanthaceae after its divergence with Lauraceae, and the other is in the ancestry of Magnoliales and Laurales. Notably, long genes with > 20 kb in length were much more prevalent in the magnoliid genomes compared with other angiosperms, which could be caused by the length expansion of introns inserted by transposon elements. Homologous genes within the flavonoid pathway for Csalicifolius were identified, and correlation of the gene expression and the contents of flavonoid metabolites revealed potential critical genes involved in flavonoids biosynthesis. This study not only provides an additional whole‐genome sequence from the magnoliids, but also opens the door to functional genomic research and molecular breeding of Csalicifolius.  相似文献   

8.
Erigeron breviscapus is an important medicinal plant in Compositae and the first species to realize the whole process from the decoding of the draft genome sequence to scutellarin biosynthesis in yeast. However, the previous low‐quality genome assembly has hindered the optimization of candidate genes involved in scutellarin synthesis and the development of molecular‐assisted breeding based on the genome. Here, the E. breviscapus genome was updated using PacBio RSII sequencing data and Hi‐C data, and increased in size from 1.2 Gb to 1.43 Gb, with a scaffold N50 of 156.82 Mb and contig N50 of 140.95 kb, and a total of 43,514 protein‐coding genes were obtained and oriented onto nine pseudo‐chromosomes, thus becoming the third plant species assembled to chromosome level after sunflower and lettuce in Compositae. Fourteen genes with evidence for positive selection were identified and found to be related to leaf morphology, flowering and secondary metabolism. The number of genes in some gene families involved in flavonoid biosynthesis in E. breviscapus have been significantly expanded. In particular, additional candidate genes involved in scutellarin biosynthesis, such as flavonoid‐7‐O‐glucuronosyltransferase genes (F7GATs) were identified using updated genome. In addition, three candidate genes encoding indole‐3‐pyruvate monooxygenase YUCCA2 (YUC2), serine carboxypeptidase‐like 18 (SCPL18), and F‐box protein (FBP), respectively, were identified to be probably related to leaf development and flowering by resequencing 99 individuals. These results provided a substantial genetic basis for improving agronomic and quality traits of E. breviscapus, and provided a platform for improving other draft genome assemblies to chromosome‐level.  相似文献   

9.
The Tetraodontidae family are known to have relatively small and compact genomes compared to other vertebrates. The obscure puffer fish Takifugu obscurus is an anadromous species that migrates to freshwater from the sea for spawning. Thus the euryhaline characteristics of T. obscurus have been investigated to gain understanding of their survival ability, osmoregulation, and other homeostatic mechanisms in both freshwater and seawater. In this study, a high quality chromosome‐level reference genome for T. obscurus was constructed using long‐read Pacific Biosciences (PacBio) Sequel sequencing and a Hi‐C‐based chromatin contact map platform. The final genome assembly of T. obscurus is 381 Mb, with a contig N50 length of 3,296 kb and longest length of 10.7 Mb, from a total of 62 Gb of raw reads generated using single‐molecule real‐time sequencing technology from a PacBio Sequel platform. The PacBio data were further clustered into chromosome‐scale scaffolds using a Hi‐C approach, resulting in a 373 Mb genome assembly with a contig N50 length of 15.2 Mb and and longest length of 28 Mb. When we directly compared the 22 longest scaffolds of T. obscurus to the 22 chromosomes of the tiger puffer Takifugu rubripes, a clear one‐to‐one orthologous relationship was observed between the two species, supporting the chromosome‐level assembly of T. obscurus. This genome assembly can serve as a valuable genetic resource for exploring fugu‐specific compact genome characteristics, and will provide essential genomic information for understanding molecular adaptations to salinity fluctuations and the evolution of osmoregulatory mechanisms.  相似文献   

10.
Parasitoid wasps represent a large proportion of hymenopteran species. They have complex evolutionary histories and are important biocontrol agents. To advance parasitoid research, a combination of Illumina short‐read, PacBio long‐read and Hi‐C scaffolding technologies was used to develop a high‐quality chromosome‐level genome assembly for Pteromalus puparum, which is an important pupal endoparasitoid of caterpillar pests. The chromosome‐level assembly has aided in studies of venom and detoxification genes. The assembled genome size is 338 Mb with a contig N50 of 38.7 kb and a scaffold N50 of 1.16 Mb. Hi‐C analysis assembled scaffolds onto five chromosomes and raised the scaffold N50 to 65.8 Mb, with more than 96% of assembled bases located on chromosomes. Gene annotation was assisted by RNA sequencing for the two sexes and four different life stages. Analysis detected 98% of the BUSCO (Benchmarking Universal Single‐Copy Orthologs) gene set, supporting a high‐quality assembly and annotation. In total, 40.1% (135.6 Mb) of the assembly is composed of repetitive sequences, and 14,946 protein‐coding genes were identified. Although venom genes play important roles in parasitoid biology, their spatial distribution on chromosomes was poorly understood. Mapping has revealed venom gene tandem arrays for serine proteases, pancreatic lipase‐related proteins and kynurenine–oxoglutarate transaminases, which have amplified in the P. puparum lineage after divergence from its common ancestor with Nasonia vitripennis. In addition, there is a large expansion of P450 genes in P. puparum. These examples illustrate how chromosome‐level genome assembly can provide a valuable resource for molecular, evolutionary and biocontrol studies of parasitoid wasps.  相似文献   

11.
Ficus erecta, a wild relative of the common fig (F. carica), is a donor of Ceratocystis canker resistance in fig breeding programmes. Interspecific hybridization followed by recurrent backcrossing is an effective method to transfer the resistance trait from wild to cultivated fig. However, this process is time consuming and labour intensive for trees, especially for gynodioecious plants such as fig. In this study, genome resources were developed for F. erecta to facilitate fig breeding programmes. The genome sequence of F. erecta was determined using single‐molecule real‐time sequencing technology. The resultant assembly spanned 331.6 Mb with 538 contigs and an N50 length of 1.9 Mb, from which 51 806 high‐confidence genes were predicted. Pseudomolecule sequences corresponding to the chromosomes of F. erecta were established with a genetic map based on single nucleotide polymorphisms from double‐digest restriction‐site‐associated DNA sequencing. Subsequent linkage analysis and whole‐genome resequencing identified a candidate gene for the Ceratocystis canker resistance trait. Genome‐wide genotyping analysis enabled the selection of female lines that possessed resistance and effective elimination of the donor genome from the progeny. The genome resources provided in this study will accelerate and enhance disease‐resistance breeding programmes in fig.  相似文献   

12.
The rice stem borer, Chilo suppressalis, is one of the most damaging insect pests to rice production worldwide. Although C. suppressalis has been the focus of numerous studies examining cold tolerance and diapause, plant–insect interactions, pesticide targets and resistance, and the development of RNAi‐mediated pest management, the absence of a high‐quality genome has limited deeper insights. To address this limitation, we generated a draft C. suppressalis genome constructed from both Illumina and PacBio sequences. The assembled genome size was 824.35 Mb with a contig N50 of 307 kb and a scaffold N50 of 1.75 Mb. Hi‐C scaffolding assigned 99.2% of the bases to one of 29 chromosomes. Based on universal single‐copy orthologues (BUSCO), the draft genome assembly was estimated to be 97% complete and is predicted to encompass 15,653 protein‐coding genes. Cold tolerance is an extreme survival strategy found in animals. However, little is known regarding the genetic basis of the winter ecology of C. suppressalis. Here, we focused our orthologous analysis on those gene families associated with animal cold tolerance. Our finding provided the first genomic evidence revealing specific cold‐tolerant strategies in C. suppressalis, including those involved in glucose‐originated glycerol biosynthesis, triacylglycerol‐originated glycerol biosynthesis, fatty acid synthesis and trehalose transport‐intermediate cold tolerance. The high‐quality C. suppressalis genome provides a valuable resource for research into a broad range of areas in molecular ecology, and subsequently benefits developing modern pest control strategies.  相似文献   

13.
Casuarina equisetifolia (C. equisetifolia), a conifer‐like angiosperm with resistance to typhoon and stress tolerance, is mainly cultivated in the coastal areas of Australasia. C. equisetifolia, making it a valuable model to study secondary growth associated genes and stress‐tolerance traits. However, the genome sequence is unavailable and therefore wood‐associated growth rate and stress resistance at the molecular level is largely unexplored. We therefore constructed a high‐quality draft genome sequence of C. equisetifolia by a combination of Illumina second‐generation sequencing reads and Pacific Biosciences single‐molecule real‐time (SMRT) long reads to advance the investigation of this species. Here, we report the genome assembly, which contains approximately 300 megabases (Mb) and scaffold size of N50 is 1.06 Mb. Additionally, gene annotation, assisted by a combination of prediction and RNA‐seq data, generated 29 827 annotated protein‐coding genes and 1983 non‐coding genes, respectively. Furthermore, we found that the total number of repetitive sequences account for one‐third of the genome assembly. Here we also construct the genome‐wide map of DNA modification, such as two novel forms N6‐adenine (6mA) and N4‐methylcytosine (4mC) at the level of single‐nucleotide resolution using single‐molecule real‐time (SMRT) sequencing. Interestingly, we found that 17% of 6mA modification genes and 15% of 4mC modification genes also included alternative splicing events. Finally, we investigated cellulose, hemicellulose, and lignin‐related genes, which were associated with secondary growth and contained different DNA modifications. The high‐quality genome sequence and annotation of C. equisetifolia in this study provide a valuable resource to strengthen our understanding of the diverse traits of trees.  相似文献   

14.
Chinese liquorice/licorice (Glycyrrhiza uralensis) is a leguminous plant species whose roots and rhizomes have been widely used as a herbal medicine and natural sweetener. Whole‐genome sequencing is essential for gene discovery studies and molecular breeding in liquorice. Here, we report a draft assembly of the approximately 379‐Mb whole‐genome sequence of strain 308‐19 of G. uralensis; this assembly contains 34 445 predicted protein‐coding genes. Comparative analyses suggested well‐conserved genomic components and collinearity of gene loci (synteny) between the genome of liquorice and those of other legumes such as Medicago and chickpea. We observed that three genes involved in isoflavonoid biosynthesis, namely, 2‐hydroxyisoflavanone synthase (CYP93C), 2,7,4′‐trihydroxyisoflavanone 4′‐O‐methyltransferase/isoflavone 4′‐O‐methyltransferase (HI4OMT) and isoflavone‐7‐O‐methyltransferase (7‐IOMT) formed a cluster on the scaffold of the liquorice genome and showed conserved microsynteny with Medicago and chickpea. Based on the liquorice genome annotation, we predicted genes in the P450 and UDP‐dependent glycosyltransferase (UGT) superfamilies, some of which are involved in triterpenoid saponin biosynthesis, and characterised their gene expression with the reference genome sequence. The genome sequencing and its annotations provide an essential resource for liquorice improvement through molecular breeding and the discovery of useful genes for engineering bioactive components through synthetic biology approaches.  相似文献   

15.
Marine medaka (Oryzias melastigma) is considered to be a useful fish model for marine and estuarine ecotoxicology studies and has good potential for field‐based population genomics because of its geographical distribution in Asian estuarine and coastal areas. In this study, we present the first whole‐genome draft of O. melastigma. The genome assembly consists of 8,602 scaffolds (N50 = 23.737 Mb) and a total genome length of 779.4 Mb. A total of 23,528 genes were predicted, and 12,670 gene families shared with three teleost species (Japanese medaka, mangrove killifish and zebrafish) were identified. Genome analyses revealed that the O. melastigma genome is highly heterozygous and contains a large number of repeat sequences. This assembly represents a useful genomic resource for fish scientists.  相似文献   

16.
The brown planthopper Nilaparvata lugens, white‐backed planthopper Sogatella furcifera, and small brown planthopper Laodelphax striatellus are three major insect pests of rice. They are genetically close; however, they differ in several ecological traits such as host range, migration capacity, and in their sex chromosomes. Though the draft genome of these three planthoppers have been previously released, the quality of genome assemblies need to be improved. The absence of chromosome‐level genome resources has hindered in‐depth research of these three species. Here, we performed a de novo genome assembly for N. lugens to increase its genome assembly quality with PacBio and Illumina platforms, increasing the contig N50 to 589.46 Kb. Then, with the new N. lugens genome and previously reported S. furcifera and L. striatellus genome assemblies, we generated chromosome‐level scaffold assemblies of these three planthopper species using HiC scaffolding technique. The scaffold N50s significantly increased to 77.63 Mb, 43.36 Mb and 29.24 Mb for N. lugens, S. furcifera and L. striatellus, respectively. To identify sex chromosomes of these three planthopper species, we carried out genome re‐sequencing of males and females and successfully determined the X and Y chromosomes for N. lugens, and X chromosome for S. furcifera and L. striatellus. The gene content of the sex chromosomes showed high diversity among these three planthoppers suggesting the rapid evolution of sex‐linked genes, and all chromosomes showed high synteny. The chromosome‐level genome assemblies of three planthoppers would provide a valuable resource for a broad range of future research in molecular ecology, and subsequently benefits development of modern pest control strategies.  相似文献   

17.
A high-quality reference genome is necessary to determine the molecular mechanisms underlying important biological phenomena; therefore, in the present study, a chromosome-level genome assembly of the Chinese shrimp Fenneropenaeus chinensis was performed. Muscle of a male shrimp was sequenced using PacBio platform, and assembled by Hi-C technology. The assembled F. chinensis genome was 1.47 Gb with contig N50 of 472.84 Kb, including 57.73% repetitive sequences, and was anchored to 43 pseudochromosomes, with scaffold N50 of 36.87 Mb. In total, 25,026 protein-coding genes were predicted. The genome size of F. chinensis showed significant contraction in comparison with that of other penaeid species, which is likely related to migration observed in this species. However, the F. chinensis genome included several expanded gene families related to cellular processes and metabolic processes, and the contracted gene families were associated with virus infection process. The findings signify the adaptation of F. chinensis to the selection pressure of migration and cold environment. Furthermore, the selection signature analysis identified genes associated with metabolism, phototransduction, and nervous system in cultured shrimps when compared with wild population, indicating targeted, artificial selection of growth, vision, and behavior during domestication. The construction of the genome of F. chinensis provided valuable information for the further genetic mechanism analysis of important biological processes, and will facilitate the research of genetic changes during evolution.  相似文献   

18.
The iconic orange clownfish, Amphiprion percula, is a model organism for studying the ecology and evolution of reef fishes, including patterns of population connectivity, sex change, social organization, habitat selection and adaptation to climate change. Notably, the orange clownfish is the only reef fish for which a complete larval dispersal kernel has been established and was the first fish species for which it was demonstrated that antipredator responses of reef fishes could be impaired by ocean acidification. Despite its importance, molecular resources for this species remain scarce and until now it lacked a reference genome assembly. Here, we present a de novo chromosome‐scale assembly of the genome of the orange clownfish Amphiprion percula. We utilized single‐molecule real‐time sequencing technology from Pacific Biosciences to produce an initial polished assembly comprised of 1,414 contigs, with a contig N50 length of 1.86 Mb. Using Hi‐C‐based chromatin contact maps, 98% of the genome assembly were placed into 24 chromosomes, resulting in a final assembly of 908.8 Mb in length with contig and scaffold N50s of 3.12 and 38.4 Mb, respectively. This makes it one of the most contiguous and complete fish genome assemblies currently available. The genome was annotated with 26,597 protein‐coding genes and contains 96% of the core set of conserved actinopterygian orthologs. The availability of this reference genome assembly as a community resource will further strengthen the role of the orange clownfish as a model species for research on the ecology and evolution of reef fishes.  相似文献   

19.
Peach (Prunus persica L. Batsch) is an economically important fruit crop worldwide. Although a high-quality peach genome has previously been published, Sanger sequencing was used for its assembly, which generated short contigs. Here, we report a chromosome-level genome assembly and sequence analysis of Chinese Cling, an important founder cultivar for peach breeding programs worldwide. The assembled genome contained 247.33 Mb with a contig N50 of 4.13 Mb and a scaffold N50 of 29.68 Mb, representing 99.8% of the estimated genome. Comparisons between this genome and the recently published one (Lovell peach) uncovered 685 407 single nucleotide polymorphisms, 162 655 insertions and deletions, and 16 248 structural variants. Gene family analysis highlighted the contraction of the gene families involved in flavone, flavonol, flavonoid, and monoterpenoid biosynthesis. Subsequently, the volatile compounds of 256 peach varieties were quantitated in mature fruits in 2015 and 2016 to perform a genome-wide association analysis. A comparison with the identified domestication genomic regions allowed us to identify 25 quantitative trait loci, associated with seven volatile compounds, in the domestication region, which is consistent with the differences in volatile compounds between wild and cultivated peaches. Finally, a gene encoding terpene synthase, located within a previously reported quantitative trait loci region, was identified to be associated with linalool synthesis. Such findings highlight the importance of this new assembly for the analysis of evolutionary mechanisms and gene identification in peach species. Furthermore, this high-quality peach genome provides valuable information for future fruit improvement.  相似文献   

20.
《Genomics》2022,114(4):110400
Endive (Cichorium endivia L.) is a leafy vegetable in the Asteraceae family. Sesquiterpene lactones (STLs) in endive leaves bring a bitter taste that varies between varieties. Despite their importance in breeding varieties with unique flavours, sesquiterpenoid biosynthesis pathways in endive are poorly understood. We assembled a chromosome-scale endive genome of 641 Mb with a contig N50 of 5.16 Mb and annotated 46,711 protein-coding genes. Several gene families, especially terpene synthases (TPS) genes, expanded significantly in the C. endivia genome. STLs biosynthesis-related genes and TPS genes in more bitter varieties have shown a higher level of expression, which could be attributed to genomic variations. Our results penetrate the origin and diversity of bitter taste and facilitate the molecular breeding of endive varieties with unique bitter tastes. The high-quality endive assembly would provide a reference genome for studying the evolution and diversity of Asteraceae.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号