首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Onychostoma macrolepis is an emerging commercial cyprinid fish species. It is a model system for studies of sexual dimorphism and genome evolution. Here, we report the chromosome‐level assembly of the O.macrolepis genome obtained from the integration of nanopore long‐read sequencing with physical maps produced using Bionano and Hi‐C technology. A total of 87.9 Gb of nanopore sequence provided approximately 100‐fold coverage of the genome. The preliminary genome assembly was 883.2 Mb in size with a contig N50 size of 11.2 Mb. The 969 corrected contigs obtained from Bionano optical mapping were assembled into 853 scaffolds and produced an assembly of 886.5 Mb with a scaffold N50 of 16.5 Mb. Finally, using the Hi‐C data, 881.3 Mb (99.4% of genome) in 526 scaffolds were anchored and oriented in 25 chromosomes ranging in size from 25.27 to 56.49 Mb. In total, 24,770 protein‐coding genes were predicted in the genome, and ~96.85% of the genes were functionally annotated. The annotated assembly contains 93.3% complete genes from the BUSCO reference set. In addition, we identified 409 Mb (46.23% of the genome) of repetitive sequence, and 11,213 non‐coding RNAs, in the genome. Evolutionary analysis revealed that O. macrolepis diverged from common carp approximately 24.25 million years ago. The chromosomes of O. macrolepis showed an unambiguous correspondence to the chromosomes of zebrafish. The high‐quality genome assembled in this work provides a valuable genomic resource for further biological and evolutionary studies of O. macrolepis.  相似文献   

2.
Parasitoid wasps represent a large proportion of hymenopteran species. They have complex evolutionary histories and are important biocontrol agents. To advance parasitoid research, a combination of Illumina short‐read, PacBio long‐read and Hi‐C scaffolding technologies was used to develop a high‐quality chromosome‐level genome assembly for Pteromalus puparum, which is an important pupal endoparasitoid of caterpillar pests. The chromosome‐level assembly has aided in studies of venom and detoxification genes. The assembled genome size is 338 Mb with a contig N50 of 38.7 kb and a scaffold N50 of 1.16 Mb. Hi‐C analysis assembled scaffolds onto five chromosomes and raised the scaffold N50 to 65.8 Mb, with more than 96% of assembled bases located on chromosomes. Gene annotation was assisted by RNA sequencing for the two sexes and four different life stages. Analysis detected 98% of the BUSCO (Benchmarking Universal Single‐Copy Orthologs) gene set, supporting a high‐quality assembly and annotation. In total, 40.1% (135.6 Mb) of the assembly is composed of repetitive sequences, and 14,946 protein‐coding genes were identified. Although venom genes play important roles in parasitoid biology, their spatial distribution on chromosomes was poorly understood. Mapping has revealed venom gene tandem arrays for serine proteases, pancreatic lipase‐related proteins and kynurenine–oxoglutarate transaminases, which have amplified in the P. puparum lineage after divergence from its common ancestor with Nasonia vitripennis. In addition, there is a large expansion of P450 genes in P. puparum. These examples illustrate how chromosome‐level genome assembly can provide a valuable resource for molecular, evolutionary and biocontrol studies of parasitoid wasps.  相似文献   

3.
The red‐spotted grouper Epinephelus akaara (E. akaara) is one of the most economically important marine fish in China, Japan and South‐East Asia and is a threatened species. The species is also considered a good model for studies of sex inversion, development, genetic diversity and immunity. Despite its importance, molecular resources for E. akaara remain limited and no reference genome has been published to date. In this study, we constructed a chromosome‐level reference genome of E. akaara by taking advantage of long‐read single‐molecule sequencing and de novo assembly by Oxford Nanopore Technology (ONT) and Hi‐C. A red‐spotted grouper genome of 1.135 Gb was assembled from a total of 106.29 Gb polished Nanopore sequence (GridION, ONT), equivalent to 96‐fold genome coverage. The assembled genome represents 96.8% completeness (BUSCO) with a contig N50 length of 5.25 Mb and a longest contig of 25.75 Mb. The contigs were clustered and ordered onto 24 pseudochromosomes covering approximately 95.55% of the genome assembly with Hi‐C data, with a scaffold N50 length of 46.03 Mb. The genome contained 43.02% repeat sequences and 5,480 noncoding RNAs. Furthermore, combined with several RNA‐seq data sets, 23,808 (99.5%) genes were functionally annotated from a total of 23,923 predicted protein‐coding sequences. The high‐quality chromosome‐level reference genome of E. akaara was assembled for the first time and will be a valuable resource for molecular breeding and functional genomics studies of red‐spotted grouper in the future.  相似文献   

4.
The rice leaffolder Cnaphalocrocis exigua (Crambidae, Lepidoptera) is an important agricultural pest that damages rice crops and other members of related grass families. C. exigua exhibits a very similar morphological phenotype and feeding behaviour to C. medinalis, another species of rice leaffolder whose genome was recently reported. However, genomic information for C. exigua remains extremely limited. Here, we used a hybrid strategy combining different sequencing technologies, including Illumina, PacBio, 10× Genomics, and Hi – C scaffolding, to generate a high-quality chromosome-level genome assembly of C. exigua. We initially obtained a 798.8 Mb assembly with a contig N50 size of 2.9 Mb, and the N50 size was subsequently increased to 25.7 Mb using Hi – C technology to anchor 1413 scaffolds to 32 chromosomes. We detected a total of 97.7% Benchmarking Universal Single-Copy Orthologues (BUSCO) in the genome assembly, which was comprised of ~52% repetitive sequence and annotated 14,922 protein-coding genes. Of note, the Z and W sex chromosomes were assembled and identified. A comparative genomic analysis demonstrated that despite the high synteny observed between the two rice leaffolders, the species have distinct genomic features associated with expansion and contraction of gene families and selection pressure. In summary, our chromosome-level genome assembly and comparative genomic analysis of C. exigua provide novel insights into the evolution and ecology of this rice insect pests and offer useful information for pest control.  相似文献   

5.
Peach (Prunus persica L. Batsch) is an economically important fruit crop worldwide. Although a high-quality peach genome has previously been published, Sanger sequencing was used for its assembly, which generated short contigs. Here, we report a chromosome-level genome assembly and sequence analysis of Chinese Cling, an important founder cultivar for peach breeding programs worldwide. The assembled genome contained 247.33 Mb with a contig N50 of 4.13 Mb and a scaffold N50 of 29.68 Mb, representing 99.8% of the estimated genome. Comparisons between this genome and the recently published one (Lovell peach) uncovered 685 407 single nucleotide polymorphisms, 162 655 insertions and deletions, and 16 248 structural variants. Gene family analysis highlighted the contraction of the gene families involved in flavone, flavonol, flavonoid, and monoterpenoid biosynthesis. Subsequently, the volatile compounds of 256 peach varieties were quantitated in mature fruits in 2015 and 2016 to perform a genome-wide association analysis. A comparison with the identified domestication genomic regions allowed us to identify 25 quantitative trait loci, associated with seven volatile compounds, in the domestication region, which is consistent with the differences in volatile compounds between wild and cultivated peaches. Finally, a gene encoding terpene synthase, located within a previously reported quantitative trait loci region, was identified to be associated with linalool synthesis. Such findings highlight the importance of this new assembly for the analysis of evolutionary mechanisms and gene identification in peach species. Furthermore, this high-quality peach genome provides valuable information for future fruit improvement.  相似文献   

6.
Complete and highly accurate reference genomes and gene annotations are indispensable for basic biological research and trait improvement of woody tree species. In this study, we integrated single‐molecule sequencing and high‐throughput chromosome conformation capture techniques to produce a high‐quality and long‐range contiguity chromosome‐scale genome assembly of the soft‐seeded pomegranate cultivar ‘Tunisia’. The genome covers 320.31 Mb (scaffold N50 = 39.96 Mb; contig N50 = 4.49 Mb) and includes 33 594 protein‐coding genes. We also resequenced 26 pomegranate varieties that varied regarding seed hardness. Comparative genomic analyses revealed many genetic differences between soft‐ and hard‐seeded pomegranate varieties. A set of selective loci containing SUC8‐like, SUC6, FoxO and MAPK were identified by the selective sweep analysis between hard‐ and soft‐seeded populations. An exceptionally large selective region (26.2 Mb) was identified on chromosome 1. Our assembled pomegranate genome is more complete than other currently available genome assemblies. Our results indicate that genomic variations and selective genes may have contributed to the genetic divergence between soft‐ and hard‐seeded pomegranate varieties.  相似文献   

7.
8.
9.
10.
The ladybird beetle Propylea japonica is an important natural enemy in agro‐ecological systems. Studies on the strong tolerance of P. japonica to high temperatures and insecticides, and its population and phenotype diversity have recently increased. However, abundant genome resources for obtaining insights into stress‐resistance mechanisms and genetic intra‐species diversity for P. japonica are lacking. Here, we constructed the P. japonica genome maps using Pacific Bioscience (PacBio) and Illumina sequencing technologies. The genome size was 850.90 Mb with a contig N50 of 813.13 kb. The Hi‐C sequence data were used to upgrade draft genome assemblies; 4,777 contigs were assembled to 10 chromosomes; and the final draft genome assembly was 803.93 Mb with a contig N50 of 813.98 kb and a scaffold N50 of 100.34 Mb. Approximately 495.38 Mb of repeated sequences was annotated. The 18,018 protein‐coding genes were predicted, of which 95.78% were functionally annotated, and 1,407 genes were species‐specific. The phylogenetic analysis showed that P. japonica diverged from the ancestor of Anoplophora glabripennis and Tribolium castaneum ~ 236.21 million years ago. We detected that some important gene families involved in detoxification of pesticides and tolerance to heat stress were expanded in P. japonica, especially cytochrome P450 and Hsp70 genes. Overall, the high‐quality draft genome sequence of P. japonica will provide invaluable resource for understanding the molecular mechanisms of stress resistance and will facilitate the research on population genetics, evolution and phylogeny of Coccinellidae. This genome will also provide new avenues for conserving the diversity of predator insects.  相似文献   

11.
Cowpea (Vigna unguiculata [L.] Walp.) is a major crop for worldwide food and nutritional security, especially in sub‐Saharan Africa, that is resilient to hot and drought‐prone environments. An assembly of the single‐haplotype inbred genome of cowpea IT97K‐499‐35 was developed by exploiting the synergies between single‐molecule real‐time sequencing, optical and genetic mapping, and an assembly reconciliation algorithm. A total of 519 Mb is included in the assembled sequences. Nearly half of the assembled sequence is composed of repetitive elements, which are enriched within recombination‐poor pericentromeric regions. A comparative analysis of these elements suggests that genome size differences between Vigna species are mainly attributable to changes in the amount of Gypsy retrotransposons. Conversely, genes are more abundant in more distal, high‐recombination regions of the chromosomes; there appears to be more duplication of genes within the NBS‐LRR and the SAUR‐like auxin superfamilies compared with other warm‐season legumes that have been sequenced. A surprising outcome is the identification of an inversion of 4.2 Mb among landraces and cultivars, which includes a gene that has been associated in other plants with interactions with the parasitic weed Striga gesnerioides. The genome sequence facilitated the identification of a putative syntelog for multiple organ gigantism in legumes. A revised numbering system has been adopted for cowpea chromosomes based on synteny with common bean (Phaseolus vulgaris). An estimate of nuclear genome size of 640.6 Mbp based on cytometry is presented.  相似文献   

12.
Marine medaka (Oryzias melastigma) is considered to be a useful fish model for marine and estuarine ecotoxicology studies and has good potential for field‐based population genomics because of its geographical distribution in Asian estuarine and coastal areas. In this study, we present the first whole‐genome draft of O. melastigma. The genome assembly consists of 8,602 scaffolds (N50 = 23.737 Mb) and a total genome length of 779.4 Mb. A total of 23,528 genes were predicted, and 12,670 gene families shared with three teleost species (Japanese medaka, mangrove killifish and zebrafish) were identified. Genome analyses revealed that the O. melastigma genome is highly heterozygous and contains a large number of repeat sequences. This assembly represents a useful genomic resource for fish scientists.  相似文献   

13.
Powdery mildew of wheat (Triticum aestivum L.) is caused by the ascomycete fungus Blumeria graminis f.sp. tritici. Genomic approaches open new ways to study the biology of this obligate biotrophic pathogen. We started the analysis of the Bg tritici genome with the low-pass sequencing of its genome using the 454 technology and the construction of the first genomic bacterial artificial chromosome (BAC) library for this fungus. High-coverage contigs were assembled with the 454 reads. They allowed the characterization of 56 transposable elements and the establishment of the Blumeria repeat database. The BAC library contains 12,288 clones with an average insert size of 115 kb, which represents a maximum of 7.5-fold genome coverage. Sequencing of the BAC ends generated 12.6 Mb of random sequence representative of the genome. Analysis of BAC-end sequences revealed a massive invasion of transposable elements accounting for at least 85% of the genome. This explains the unusually large size of this genome which we estimate to be at least 174 Mb, based on a large-scale physical map constructed through the fingerprinting of the BAC library. Our study represents a crucial step in the perspective of the determination and study of the whole Bg tritici genome sequence.  相似文献   

14.
15.
The greenfin horse‐faced filefish, Thamnaconus septentrionalis, is a valuable commercial fish species that is widely distributed in the Indo‐West Pacific Ocean. This fish has characteristic blue–green fins, rough skin and a spine‐like first dorsal fin. Thamnaconus septentrionalis is of conservation concern because its population has declined sharply, and it is an important marine aquaculture fish species in China. Genomic resources for the filefish are lacking, and no reference genome has been released. In this study, the first chromosome‐level genome of T. septentrionalis was constructed using nanopore sequencing and Hi‐C technology. A total of 50.95 Gb polished nanopore sequences were generated and were assembled into a 474.31‐Mb genome, accounting for 96.45% of the estimated genome size of this filefish. The assembled genome contained only 242 contigs, and the achieved contig N50 was 22.46 Mb, a surprisingly high value among all sequenced fish species. Hi‐C scaffolding of the genome resulted in 20 pseudochromosomes containing 99.44% of the total assembled sequences. The genome contained 67.35 Mb of repeat sequences, accounting for 14.2% of the assembly. A total of 22,067 protein‐coding genes were predicted, 94.82% of which were successfully annotated with putative functions. Furthermore, a phylogenetic tree was constructed using 1,872 single‐copy orthologous genes, and 67 unique gene families were identified in the filefish genome. This high‐quality assembled genome will be a valuable resource for a range of future genomic, conservation and breeding studies of T. septentrionalis.  相似文献   

16.
17.
The cabbage looper, Trichoplusia ni, is a globally distributed highly polyphagous herbivore and an important agricultural pest. T. ni has evolved resistance to various chemical insecticides, and is one of the only two insect species that have evolved resistance to the biopesticide Bacillus thuringiensis (Bt) in agricultural systems and has been selected for resistance to baculovirus infections. We report a 333‐Mb high‐quality T. ni genome assembly, which has N50 lengths of scaffolds and contigs of 4.6 Mb and 140 Kb, respectively, and contains 14,384 protein‐coding genes. High‐density genetic maps were constructed to anchor 305 Mb (91.7%) of the assembly to 31 chromosomes. Comparative genomic analysis of T. ni with Bombyx mori showed enrichment of tandemly duplicated genes in T. ni in families involved in detoxification and digestion, consistent with the broad host range of T. ni. High levels of genome synteny were found between T. ni and other sequenced lepidopterans. However, genome synteny analysis of T. ni and the T. ni derived cell line High Five (Hi5) indicated extensive genome rearrangements in the cell line. These results provided the first genomic evidence revealing the high instability of chromosomes in lepidopteran cell lines known from karyotypic observations. The high‐quality T. ni genome sequence provides a valuable resource for research in a broad range of areas including fundamental insect biology, insect‐plant interactions and co‐evolution, mechanisms and evolution of insect resistance to chemical and biological pesticides, and technology development for insect pest management.  相似文献   

18.
19.
Monogononta is the most speciose class of rotifers, with more than 2,000 species. The monogonont genus Brachionus is widely distributed at a global scale, and a few of its species are commonly used as ecological and evolutionary models to address questions related to aquatic ecology, cryptic speciation, evolutionary ecology, the evolution of sex and ecotoxicology. With the importance of Brachionus species in many areas of research, it is remarkable that the genome has not been characterized. This study aims to address this lacuna by presenting, for the first time, the whole‐genome assembly of the freshwater species Brachionus calyciflorus. The total length of the assembled genome was 129.6 Mb, with 1,041 scaffolds. The N50 value was 786.6 kb, and the GC content was 24%. A total of 16,114 genes were annotated with repeat sequences, accounting for 21% of the assembled genome. This assembled genome may form a basis for future studies addressing key questions on the evolution of monogonont rotifers. It will also provide the necessary molecular resources to mechanistically investigate ecophysiological and ecotoxicological responses.  相似文献   

20.
Soybean cyst nematode (SCN, Heterodera glycines) is a major pest of soybean that is spreading across major soybean production regions worldwide. Increased SCN virulence has recently been observed in both the United States and China. However, no study has reported a genome assembly for H. glycines at the chromosome scale. Herein, the first chromosome‐level reference genome of X12, an unusual SCN race with high infection ability, is presented. Using whole‐genome shotgun (WGS) sequencing, Pacific Biosciences (PacBio) sequencing, Illumina paired‐end sequencing, 10X Genomics linked reads and high‐throughput chromatin conformation capture (Hi‐C) genome scaffolding techniques, a 141.01‐megabase (Mb) assembled genome was obtained with scaffold and contig N50 sizes of 16.27 Mb and 330.54 kilobases (kb), respectively. The assembly showed high integrity and quality, with over 90% of Illumina reads mapped to the genome. The assembly quality was evaluated using Core Eukaryotic Genes Mapping Approach and Benchmarking Universal Single‐Copy Orthologs. A total of 11,882 genes were predicted using de novo, homolog and RNAseq data generated from eggs, second‐stage juveniles (J2), third‐stage juveniles (J3) and fourth‐stage juveniles (J4) of X12, and 79.0% of homologous sequences were annotated in the genome. These high‐quality X12 genome data will provide valuable resources for research in a broad range of areas, including fundamental nematode biology, SCN–plant interactions and co‐evolution, and also contribute to the development of technology for overall SCN management.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号