首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Onychostoma macrolepis is an emerging commercial cyprinid fish species. It is a model system for studies of sexual dimorphism and genome evolution. Here, we report the chromosome‐level assembly of the O.macrolepis genome obtained from the integration of nanopore long‐read sequencing with physical maps produced using Bionano and Hi‐C technology. A total of 87.9 Gb of nanopore sequence provided approximately 100‐fold coverage of the genome. The preliminary genome assembly was 883.2 Mb in size with a contig N50 size of 11.2 Mb. The 969 corrected contigs obtained from Bionano optical mapping were assembled into 853 scaffolds and produced an assembly of 886.5 Mb with a scaffold N50 of 16.5 Mb. Finally, using the Hi‐C data, 881.3 Mb (99.4% of genome) in 526 scaffolds were anchored and oriented in 25 chromosomes ranging in size from 25.27 to 56.49 Mb. In total, 24,770 protein‐coding genes were predicted in the genome, and ~96.85% of the genes were functionally annotated. The annotated assembly contains 93.3% complete genes from the BUSCO reference set. In addition, we identified 409 Mb (46.23% of the genome) of repetitive sequence, and 11,213 non‐coding RNAs, in the genome. Evolutionary analysis revealed that O. macrolepis diverged from common carp approximately 24.25 million years ago. The chromosomes of O. macrolepis showed an unambiguous correspondence to the chromosomes of zebrafish. The high‐quality genome assembled in this work provides a valuable genomic resource for further biological and evolutionary studies of O. macrolepis.  相似文献   

2.
Parasitoid wasps represent a large proportion of hymenopteran species. They have complex evolutionary histories and are important biocontrol agents. To advance parasitoid research, a combination of Illumina short‐read, PacBio long‐read and Hi‐C scaffolding technologies was used to develop a high‐quality chromosome‐level genome assembly for Pteromalus puparum, which is an important pupal endoparasitoid of caterpillar pests. The chromosome‐level assembly has aided in studies of venom and detoxification genes. The assembled genome size is 338 Mb with a contig N50 of 38.7 kb and a scaffold N50 of 1.16 Mb. Hi‐C analysis assembled scaffolds onto five chromosomes and raised the scaffold N50 to 65.8 Mb, with more than 96% of assembled bases located on chromosomes. Gene annotation was assisted by RNA sequencing for the two sexes and four different life stages. Analysis detected 98% of the BUSCO (Benchmarking Universal Single‐Copy Orthologs) gene set, supporting a high‐quality assembly and annotation. In total, 40.1% (135.6 Mb) of the assembly is composed of repetitive sequences, and 14,946 protein‐coding genes were identified. Although venom genes play important roles in parasitoid biology, their spatial distribution on chromosomes was poorly understood. Mapping has revealed venom gene tandem arrays for serine proteases, pancreatic lipase‐related proteins and kynurenine–oxoglutarate transaminases, which have amplified in the P. puparum lineage after divergence from its common ancestor with Nasonia vitripennis. In addition, there is a large expansion of P450 genes in P. puparum. These examples illustrate how chromosome‐level genome assembly can provide a valuable resource for molecular, evolutionary and biocontrol studies of parasitoid wasps.  相似文献   

3.
4.
5.
Cultivated potato (Solanum tuberosum L.) is a highly heterozygous autotetraploid that presents challenges in genome analyses and breeding. Wild potato species serve as a resource for the introgression of important agronomic traits into cultivated potato. One key species is Solanum chacoense and the diploid, inbred clone M6, which is self‐compatible and has desirable tuber market quality and disease resistance traits. Sequencing and assembly of the genome of the M6 clone of S. chacoense generated an assembly of 825 767 562 bp in 8260 scaffolds with an N50 scaffold size of 713 602 bp. Pseudomolecule construction anchored 508 Mb of the genome assembly into 12 chromosomes. Genome annotation yielded 49 124 high‐confidence gene models representing 37 740 genes. Comparative analyses of the M6 genome with six other Solanaceae species revealed a core set of 158 367 Solanaceae genes and 1897 genes unique to three potato species. Analysis of single nucleotide polymorphisms across the M6 genome revealed enhanced residual heterozygosity on chromosomes 4, 8 and 9 relative to the other chromosomes. Access to the M6 genome provides a resource for identification of key genes for important agronomic traits and aids in genome‐enabled development of inbred diploid potatoes with the potential to accelerate potato breeding.  相似文献   

6.
Triplophysa is an endemic fish genus of the Tibetan Plateau in China. Triplophysa tibetana, which lives at a recorded altitude of ~4,000 m and plays an important role in the highland aquatic ecosystem, serves as an excellent model for investigating high‐altitude environmental adaptation. However, evolutionary and conservation studies of T. tibetana have been limited by scarce genomic resources for the genus Triplophysa. In the present study, we applied PacBio sequencing and the Hi‐C technique to assemble the T. tibetana genome. A 652‐Mb genome with 1,325 contigs with an N50 length of 3.1 Mb was obtained. The 1,137 contigs were further assembled into 25 chromosomes, representing 98.7% and 80.47% of all contigs at the base and sequence number level, respectively. Approximately 260 Mb of sequence, accounting for ~39.8% of the genome, was identified as repetitive elements. DNA transposons (16.3%), long interspersed nuclear elements (12.4%) and long terminal repeats (11.0%) were the most repetitive types. In total, 24,372 protein‐coding genes were predicted in the genome, and ~95% of the genes were functionally annotated via a search in public databases. Using whole genome sequence information, we found that T. tibetana diverged from its common ancestor with Danio rerio ~121.4 million years ago. The high‐quality genome assembled in this work not only provides a valuable genomic resource for future population and conservation studies of T. tibetana, but it also lays a solid foundation for further investigation into the mechanisms of environmental adaptation of endemic fishes in the Tibetan Plateau.  相似文献   

7.
Antheraea pernyi is a semi‐domesticated lepidopteran insect species valuable to the silk industry, human health, and ecological tourism. Owing to its economic influence and developmental properties, it serves as an ideal model for investigating divergence of the Bombycoidea super family. However, studies on the karyotype evolution and functional genomics of A. pernyi are limited by scarce genomic resource. Here, we applied PacBio sequencing and chromosome structure capture technique to assemble the first high‐quality A. pernyi genome from a single male individual. The genome is 720.67 Mb long with 49 chromosomes and a 13.77‐Mb scaffold N50. Approximately 441.75 Mb, accounting for 60.74% of the genome, was identified as repeats. The genome comprises 21,431 protein‐coding genes, 85.22% of which were functionally annotated. Comparative genomics analysis suggested that A. pernyi diverged from its common ancestor with A. yamamai ~30.3 million years ago, and that chromosome fission contributed to the increased chromosome number. The genome assembled in this work will not only facilitate future research on A. pernyi and related species but also help to progress comparative genomics analyses in Lepidoptera.  相似文献   

8.
9.
The superb fairy‐wren, Malurus cyaneus, is one of the most iconic Australian passerine species. This species belongs to an endemic Australasian clade, Meliphagides, which diversified early in the evolution of the oscine passerines. Today, the oscine passerines comprise almost half of all avian species diversity. Despite the rapid increase of available bird genome assemblies, this part of the avian tree has not yet been represented by a high‐quality reference. To rectify that, we present the first high‐quality genome assembly of a Meliphagides representative: the superb fairy‐wren. We combined Illumina shotgun and mate‐pair sequences, PacBio long‐reads, and a genetic linkage map from an intensively sampled pedigree of a wild population to generate this genome assembly. Of the final assembled 1.07‐Gb genome, 975 Mb (90.4%) was anchored onto 25 pseudochromosomes resulting in a final superscaffold N50 of 68.11 Mb. This high‐quality bird genome assembly is one of only a handful which is also accompanied by a genetic map and recombination landscape. In comparison to other pedigree‐based bird genetic maps, we find that the fairy‐wren genetic map more closely resembles those of Taeniopygia guttata and Parus major maps, unlike the Ficedula albicollis map which more closely resembles that of Gallus gallus. Lastly, we also provide a predictive gene and repeat annotation of the genome assembly. This new high‐quality, annotated genome assembly will be an invaluable resource not only regarding the superb fairy‐wren species and relatives but also broadly across the avian tree by providing a novel reference point for comparative genomic analyses.  相似文献   

10.
The ladybird beetle Propylea japonica is an important natural enemy in agro‐ecological systems. Studies on the strong tolerance of P. japonica to high temperatures and insecticides, and its population and phenotype diversity have recently increased. However, abundant genome resources for obtaining insights into stress‐resistance mechanisms and genetic intra‐species diversity for P. japonica are lacking. Here, we constructed the P. japonica genome maps using Pacific Bioscience (PacBio) and Illumina sequencing technologies. The genome size was 850.90 Mb with a contig N50 of 813.13 kb. The Hi‐C sequence data were used to upgrade draft genome assemblies; 4,777 contigs were assembled to 10 chromosomes; and the final draft genome assembly was 803.93 Mb with a contig N50 of 813.98 kb and a scaffold N50 of 100.34 Mb. Approximately 495.38 Mb of repeated sequences was annotated. The 18,018 protein‐coding genes were predicted, of which 95.78% were functionally annotated, and 1,407 genes were species‐specific. The phylogenetic analysis showed that P. japonica diverged from the ancestor of Anoplophora glabripennis and Tribolium castaneum ~ 236.21 million years ago. We detected that some important gene families involved in detoxification of pesticides and tolerance to heat stress were expanded in P. japonica, especially cytochrome P450 and Hsp70 genes. Overall, the high‐quality draft genome sequence of P. japonica will provide invaluable resource for understanding the molecular mechanisms of stress resistance and will facilitate the research on population genetics, evolution and phylogeny of Coccinellidae. This genome will also provide new avenues for conserving the diversity of predator insects.  相似文献   

11.
The Tetraodontidae family are known to have relatively small and compact genomes compared to other vertebrates. The obscure puffer fish Takifugu obscurus is an anadromous species that migrates to freshwater from the sea for spawning. Thus the euryhaline characteristics of T. obscurus have been investigated to gain understanding of their survival ability, osmoregulation, and other homeostatic mechanisms in both freshwater and seawater. In this study, a high quality chromosome‐level reference genome for T. obscurus was constructed using long‐read Pacific Biosciences (PacBio) Sequel sequencing and a Hi‐C‐based chromatin contact map platform. The final genome assembly of T. obscurus is 381 Mb, with a contig N50 length of 3,296 kb and longest length of 10.7 Mb, from a total of 62 Gb of raw reads generated using single‐molecule real‐time sequencing technology from a PacBio Sequel platform. The PacBio data were further clustered into chromosome‐scale scaffolds using a Hi‐C approach, resulting in a 373 Mb genome assembly with a contig N50 length of 15.2 Mb and and longest length of 28 Mb. When we directly compared the 22 longest scaffolds of T. obscurus to the 22 chromosomes of the tiger puffer Takifugu rubripes, a clear one‐to‐one orthologous relationship was observed between the two species, supporting the chromosome‐level assembly of T. obscurus. This genome assembly can serve as a valuable genetic resource for exploring fugu‐specific compact genome characteristics, and will provide essential genomic information for understanding molecular adaptations to salinity fluctuations and the evolution of osmoregulatory mechanisms.  相似文献   

12.
Sarcophaga peregrina is considered to be of great ecological, medical and forensic significance, and has unusual biological characteristics such as an ovoviviparous reproductive pattern and adaptation to feed on carrion. The availability of a high‐quality genome will help to further reveal the mechanisms underlying these charcateristics. Here we present a de novo‐assembled genome at chromosome scale for S. peregrina. The final assembled genome was 560.31 Mb with contig N50 of 3.84 Mb. Hi‐C scaffolding reliably anchored six pseudochromosomes, accounting for 97.76% of the assembled genome. Moreover, 45.70% of repeat elements were identified in the genome. A total of 14,476 protein‐coding genes were functionally annotated, accounting for 92.14% of all predicted genes. Phylogenetic analysis indicated that S. peregrina and S. bullata diverged ~ 7.14 million years ago. Comparative genomic analysis revealed expanded and positively selected genes related to biological features that aid in clarifying its ovoviviparous reproduction and carrion‐feeding adaptations, such as lipid metabolism, olfactory receptor activity, antioxidant enzymes, proteolysis and serine‐type endopeptidase activity. Protein‐coding genes associated with ovoviparity, such as yolk proteins, transferrin and acid sphingomyelinase, were identified. This study provides a valuable genomic resource for S. peregrina, and sheds insight into further revealing the underlying molecular mechanisms of adaptive evolution.  相似文献   

13.
The leaf resemblance of Kallima (Nymphalidae) butterflies is an important ecological adaptive mechanism that increases their survival. However, the genetic mechanism underlying ecological adaptation remains unclear owing to a dearth of genomic information. Here, we determined the karyotype (n = 31) of the dead‐leaf butterfly Kallima inachus, and generated a high‐quality, chromosome‐level assembly (568.92 Mb; contig N50: 19.20 Mb). We also identified candidate Z and W chromosomes. To our knowledge, this is the first study to report on these aspects of this species. In the assembled genome, 15,309 protein‐coding genes and 49.86% repeat elements were annotated. Phylogenetic analysis showed that K. inachus diverged from Melitaea cinxia (no leaf resemblance), both of which are in Nymphalinae, around 40 million years ago. Demographic analysis indicated that the effective population size of K. inachus decreased during the last interglacial period in the Pleistocene. The wings of adults with the pigmentary gene ebony knocked out using CRISPR/Cas9 showed phenotypes in which the orange dorsal region and entire ventral surface darkened, suggesting its vital role in the ecological adaption of dead‐leaf butterflies. Our results provide important genome resources for investigating the genetic mechanism underlying protective resemblance in dead‐leaf butterflies and insights into the molecular basis of protective coloration.  相似文献   

14.
15.
The greenhouse whitefly, Trialeurodes vaporariorum Westwood, is an agricultural pest of global importance. Here we report a 787‐Mb high‐quality draft genome sequence of T. vaporariorum assembled from PacBio long reads and Hi‐C chromatin interaction maps, which has scaffold and contig N50 lengths of 70 Mb and 500 kb, respectively, and contains 18,275 protein‐coding genes. About 98.8% of the assembled contigs were placed onto the 11 T. vaporariorum chromosomes. Comparative genomic analysis reveals significantly expanded gene families such as aspartyl proteases in T. vaporariorum compared to Bemisia tabaci Mediterranean (MED) and Middle East‐Asia Minor 1 (MEAM1). Furthermore, the cytochrome CYP6 subfamily shows significant expansion in T. vaporariorum and several genes in this subfamily display developmental stage‐specific expression patterns. The high‐quality T. vaporariorum genome provides a valuable resource for research in a broad range of areas such as fundamental molecular ecology, insect–plant/insect–microorganism or virus interactions and pest resistance management.  相似文献   

16.
Apolygus lucorum (Miridae) is an omnivorous pest that occurs worldwide and is notorious for the serious damage it causes to various crops and substantial economic losses. Although some studies have examined the biological characteristics of the mirid bug, no reference genome is available in Miridae, limiting in‐depth studies of this pest. Here, we present a chromosome‐scale reference genome of A. lucorum, the first sequenced Miridae species. The assembled genome size was 1.02 Gb with a contig N50 of 785 kb. With Hi‐C scaffolding, 1,016 Mb contig sequences were clustered, ordered and assembled into 17 large scaffolds with scaffold N50 length 68 Mb, each corresponding to a natural chromosome. Numerous transposable elements occur in this genome and contribute to the large genome size. Expansions of genes associated with omnivorousness and mesophyll feeding such as those related to digestion, chemosensory perception, and detoxification were observed in A. lucorum, suggesting that gene expansion contributed to its strong environmental adaptability and severe harm to crops. We clarified that a salivary enzyme polygalacturonase is unique in mirid bugs and has significantly expanded in A. lucorum, which may contribute to leaf damage from this pest. The reference genome of A. lucorum not only facilitates biological studies of Hemiptera as well as an understanding of the damage mechanism of mesophyll feeding, but also provides a basis on which to develop efficient control technologies for mirid bugs.  相似文献   

17.
Bivalves, a highly diverse and the most evolutionarily successful class of invertebrates native to aquatic habitats, provide valuable molecular resources for understanding the evolutionary adaptation and aquatic ecology. Here, we reported a high‐quality chromosome‐level genome assembly of the razor clam Sinonovacula constricta using Pacific Bioscience single‐molecule real‐time sequencing, Illumina paired‐end sequencing, 10X Genomics linked‐reads and Hi‐C reads. The genome size was 1,220.85 Mb, containing scaffold N50 of 65.93 Mb and contig N50 of 976.94 Kb. A total of 899 complete (91.92%) and seven partial (0.72%) matches of the 978 metazoa Benchmarking Universal Single‐Copy Orthologs were determined in this genome assembly. And Hi‐C scaffolding of the genome resulted in 19 pseudochromosomes. A total of 28,594 protein‐coding genes were predicted in the S. constricta genome, of which 25,413 genes (88.88%) were functionally annotated. In addition, 39.79% of the assembled genome was composed of repetitive sequences, and 4,372 noncoding RNAs were identified. The enrichment analyses of the significantly expanded and contracted genes suggested an evolutionary adaptation of S. constricta to highly stressful living environments. In summary, the genomic resources generated in this work not only provide a valuable reference genome for investigating the molecular mechanisms of S. constricta biological functions and evolutionary adaptation, but also facilitate its genetic improvement and disease treatment. Meanwhile, the obtained genome greatly improves our understanding of the genetics of molluscs and their comparative evolution.  相似文献   

18.
Genomes of varying sizes have been sequenced with next‐generation sequencing platforms. However, most reference sequences include draft unordered scaffolds containing chimeras caused by mis‐scaffolding. A BioNano genome (BNG) optical map was constructed to improve the previously sequenced flax genome (Linum usitatissimum L., 2n = 30, about 373 Mb), which consisted of 3852 scaffolds larger than 1 kb and totalling 300.6 Mb. The high‐resolution BNG map of cv. CDC Bethune totalled 317 Mb and consisted of 251 BNG contigs with an N50 of 2.15 Mb. A total of 622 scaffolds (286.6 Mb, 94.9%) aligned to 211 BNG contigs (298.6 Mb, 94.2%). Of those, 99 scaffolds, diagnosed to contain assembly errors, were refined into 225 new scaffolds. Using the newly refined scaffold sequences and the validated bacterial artificial chromosome‐based physical map of CDC Bethune, the 211 BNG contigs were scaffolded into 94 super‐BNG contigs (N50 of 6.64 Mb) that were further assigned to the 15 flax chromosomes using the genetic map. The pseudomolecules total about 316 Mb, with individual chromosomes of 15.6 to 29.4 Mb, and cover 97% of the annotated genes. Evidence from the chromosome‐scale pseudomolecules suggests that flax has undergone palaeopolyploidization and mesopolyploidization events, followed by rearrangements and deletions or fusion of chromosome arms from an ancient progenitor with a haploid chromosome number of eight.  相似文献   

19.
Marine medaka (Oryzias melastigma) is considered to be a useful fish model for marine and estuarine ecotoxicology studies and has good potential for field‐based population genomics because of its geographical distribution in Asian estuarine and coastal areas. In this study, we present the first whole‐genome draft of O. melastigma. The genome assembly consists of 8,602 scaffolds (N50 = 23.737 Mb) and a total genome length of 779.4 Mb. A total of 23,528 genes were predicted, and 12,670 gene families shared with three teleost species (Japanese medaka, mangrove killifish and zebrafish) were identified. Genome analyses revealed that the O. melastigma genome is highly heterozygous and contains a large number of repeat sequences. This assembly represents a useful genomic resource for fish scientists.  相似文献   

20.
Erigeron breviscapus is an important medicinal plant in Compositae and the first species to realize the whole process from the decoding of the draft genome sequence to scutellarin biosynthesis in yeast. However, the previous low‐quality genome assembly has hindered the optimization of candidate genes involved in scutellarin synthesis and the development of molecular‐assisted breeding based on the genome. Here, the E. breviscapus genome was updated using PacBio RSII sequencing data and Hi‐C data, and increased in size from 1.2 Gb to 1.43 Gb, with a scaffold N50 of 156.82 Mb and contig N50 of 140.95 kb, and a total of 43,514 protein‐coding genes were obtained and oriented onto nine pseudo‐chromosomes, thus becoming the third plant species assembled to chromosome level after sunflower and lettuce in Compositae. Fourteen genes with evidence for positive selection were identified and found to be related to leaf morphology, flowering and secondary metabolism. The number of genes in some gene families involved in flavonoid biosynthesis in E. breviscapus have been significantly expanded. In particular, additional candidate genes involved in scutellarin biosynthesis, such as flavonoid‐7‐O‐glucuronosyltransferase genes (F7GATs) were identified using updated genome. In addition, three candidate genes encoding indole‐3‐pyruvate monooxygenase YUCCA2 (YUC2), serine carboxypeptidase‐like 18 (SCPL18), and F‐box protein (FBP), respectively, were identified to be probably related to leaf development and flowering by resequencing 99 individuals. These results provided a substantial genetic basis for improving agronomic and quality traits of E. breviscapus, and provided a platform for improving other draft genome assemblies to chromosome‐level.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号