首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 531 毫秒
1.
2.
Antheraea pernyi is a semi‐domesticated lepidopteran insect species valuable to the silk industry, human health, and ecological tourism. Owing to its economic influence and developmental properties, it serves as an ideal model for investigating divergence of the Bombycoidea super family. However, studies on the karyotype evolution and functional genomics of A. pernyi are limited by scarce genomic resource. Here, we applied PacBio sequencing and chromosome structure capture technique to assemble the first high‐quality A. pernyi genome from a single male individual. The genome is 720.67 Mb long with 49 chromosomes and a 13.77‐Mb scaffold N50. Approximately 441.75 Mb, accounting for 60.74% of the genome, was identified as repeats. The genome comprises 21,431 protein‐coding genes, 85.22% of which were functionally annotated. Comparative genomics analysis suggested that A. pernyi diverged from its common ancestor with A. yamamai ~30.3 million years ago, and that chromosome fission contributed to the increased chromosome number. The genome assembled in this work will not only facilitate future research on A. pernyi and related species but also help to progress comparative genomics analyses in Lepidoptera.  相似文献   

3.
Marine medaka (Oryzias melastigma) is considered to be a useful fish model for marine and estuarine ecotoxicology studies and has good potential for field‐based population genomics because of its geographical distribution in Asian estuarine and coastal areas. In this study, we present the first whole‐genome draft of O. melastigma. The genome assembly consists of 8,602 scaffolds (N50 = 23.737 Mb) and a total genome length of 779.4 Mb. A total of 23,528 genes were predicted, and 12,670 gene families shared with three teleost species (Japanese medaka, mangrove killifish and zebrafish) were identified. Genome analyses revealed that the O. melastigma genome is highly heterozygous and contains a large number of repeat sequences. This assembly represents a useful genomic resource for fish scientists.  相似文献   

4.
Onychostoma macrolepis is an emerging commercial cyprinid fish species. It is a model system for studies of sexual dimorphism and genome evolution. Here, we report the chromosome‐level assembly of the O.macrolepis genome obtained from the integration of nanopore long‐read sequencing with physical maps produced using Bionano and Hi‐C technology. A total of 87.9 Gb of nanopore sequence provided approximately 100‐fold coverage of the genome. The preliminary genome assembly was 883.2 Mb in size with a contig N50 size of 11.2 Mb. The 969 corrected contigs obtained from Bionano optical mapping were assembled into 853 scaffolds and produced an assembly of 886.5 Mb with a scaffold N50 of 16.5 Mb. Finally, using the Hi‐C data, 881.3 Mb (99.4% of genome) in 526 scaffolds were anchored and oriented in 25 chromosomes ranging in size from 25.27 to 56.49 Mb. In total, 24,770 protein‐coding genes were predicted in the genome, and ~96.85% of the genes were functionally annotated. The annotated assembly contains 93.3% complete genes from the BUSCO reference set. In addition, we identified 409 Mb (46.23% of the genome) of repetitive sequence, and 11,213 non‐coding RNAs, in the genome. Evolutionary analysis revealed that O. macrolepis diverged from common carp approximately 24.25 million years ago. The chromosomes of O. macrolepis showed an unambiguous correspondence to the chromosomes of zebrafish. The high‐quality genome assembled in this work provides a valuable genomic resource for further biological and evolutionary studies of O. macrolepis.  相似文献   

5.
Dendrolimus spp. are important destructive pests of conifer forests, and Dendrolimus punctatus Walker (Lepidoptera; Lasiocampidae) is the most widely distributed Dendrolimus species. During periodic outbreaks, this species is said to make “fire without smoke” because large areas of pine forest can be quickly and heavily damaged. Yet, little is known about the molecular mechanisms that underlie the unique ecological characteristics of this forest insect. Here, we combined Pacific Biosciences (PacBio) RSII single‐molecule long reads and high‐throughput chromosome conformation capture (Hi‐C) genomics‐linked reads to produce a high‐quality, chromosome‐level reference genome for D. punctatus. The final assembly was 614 Mb with contig and scaffold N50 values of 1.39 and 22.15 Mb, respectively, and 96.96% of the contigs anchored onto 30 chromosomes. Based on the prediction, this genome contained 17,593 protein‐coding genes and 56.16% repetitive sequences. Phylogenetic analyses indicated that D. punctatus diverged from the common ancestor of Hyphantria cunea, Spodoptera litura and Thaumetopoea pityocampa ~ 108.91 million years ago. Many gene families that were expanded in the D. punctatus genome were significantly enriched for the xenobiotic biodegradation system, especially the cytochrome P450 gene family. This high‐quality, chromosome‐level reference genome will be a valuable resource for understanding mechanisms of D. punctatus outbreak and host resistance adaption. Because this is the first Lasiocampidae insect genome to be sequenced, it also will serve as a reference for further comparative genomics.  相似文献   

6.
Glycine latifolia (Benth.) Newell & Hymowitz (2= 40), one of the 27 wild perennial relatives of soybean, possesses genetic diversity and agronomically favorable traits that are lacking in soybean. Here, we report the 939‐Mb draft genome assembly of G. latifolia (PI 559298) using exclusively linked‐reads sequenced from a single Chromium library. We organized scaffolds into 20 chromosome‐scale pseudomolecules utilizing two genetic maps and the Glycine max (L.) Merr. genome sequence. High copy numbers of putative 91‐bp centromere‐specific tandem repeats were observed in consecutive blocks within predicted pericentromeric regions on several pseudomolecules. No 92‐bp putative centromeric repeats, which are abundant in G. max, were detected in G. latifolia or Glycine tomentella. Annotation of the assembled genome and subsequent filtering yielded a high confidence gene set of 54 475 protein‐coding loci. In comparative analysis with five legume species, genes related to defense responses were significantly overrepresented in Glycine‐specific orthologous gene families. A total of 304 putative nucleotide‐binding site (NBS)‐leucine‐rich‐repeat (LRR) genes were identified in this genome assembly. Different from other legume species, we observed a scarcity of TIR‐NBS‐LRR genes in G. latifolia. The G. latifolia genome was also predicted to contain genes encoding 367 LRR‐receptor‐like kinases, a family of proteins involved in basal defense responses and responses to abiotic stress. The genome sequence and annotation of G. latifolia provides a valuable source of alternative alleles and novel genes to facilitate soybean improvement. This study also highlights the efficacy and cost‐effectiveness of the application of Chromium linked‐reads in diploid plant genome de novo assembly.  相似文献   

7.
8.
Cicer arietinum L. (chickpea) is the third most important food legume crop. We have generated the draft sequence of a desi‐type chickpea genome using next‐generation sequencing platforms, bacterial artificial chromosome end sequences and a genetic map. The 520‐Mb assembly covers 70% of the predicted 740‐Mb genome length, and more than 80% of the gene space. Genome analysis predicts the presence of 27 571 genes and 210 Mb as repeat elements. The gene expression analysis performed using 274 million RNA‐Seq reads identified several tissue‐specific and stress‐responsive genes. Although segmental duplicated blocks are observed, the chickpea genome does not exhibit any indication of recent whole‐genome duplication. Nucleotide diversity analysis provides an assessment of a narrow genetic base within the chickpea cultivars. We have developed a resource for genetic markers by comparing the genome sequences of one wild and three cultivated chickpea genotypes. The draft genome sequence is expected to facilitate genetic enhancement and breeding to develop improved chickpea varieties.  相似文献   

9.
Cowpea (Vigna unguiculata [L.] Walp.) is a major crop for worldwide food and nutritional security, especially in sub‐Saharan Africa, that is resilient to hot and drought‐prone environments. An assembly of the single‐haplotype inbred genome of cowpea IT97K‐499‐35 was developed by exploiting the synergies between single‐molecule real‐time sequencing, optical and genetic mapping, and an assembly reconciliation algorithm. A total of 519 Mb is included in the assembled sequences. Nearly half of the assembled sequence is composed of repetitive elements, which are enriched within recombination‐poor pericentromeric regions. A comparative analysis of these elements suggests that genome size differences between Vigna species are mainly attributable to changes in the amount of Gypsy retrotransposons. Conversely, genes are more abundant in more distal, high‐recombination regions of the chromosomes; there appears to be more duplication of genes within the NBS‐LRR and the SAUR‐like auxin superfamilies compared with other warm‐season legumes that have been sequenced. A surprising outcome is the identification of an inversion of 4.2 Mb among landraces and cultivars, which includes a gene that has been associated in other plants with interactions with the parasitic weed Striga gesnerioides. The genome sequence facilitated the identification of a putative syntelog for multiple organ gigantism in legumes. A revised numbering system has been adopted for cowpea chromosomes based on synteny with common bean (Phaseolus vulgaris). An estimate of nuclear genome size of 640.6 Mbp based on cytometry is presented.  相似文献   

10.
The Tetraodontidae family are known to have relatively small and compact genomes compared to other vertebrates. The obscure puffer fish Takifugu obscurus is an anadromous species that migrates to freshwater from the sea for spawning. Thus the euryhaline characteristics of T. obscurus have been investigated to gain understanding of their survival ability, osmoregulation, and other homeostatic mechanisms in both freshwater and seawater. In this study, a high quality chromosome‐level reference genome for T. obscurus was constructed using long‐read Pacific Biosciences (PacBio) Sequel sequencing and a Hi‐C‐based chromatin contact map platform. The final genome assembly of T. obscurus is 381 Mb, with a contig N50 length of 3,296 kb and longest length of 10.7 Mb, from a total of 62 Gb of raw reads generated using single‐molecule real‐time sequencing technology from a PacBio Sequel platform. The PacBio data were further clustered into chromosome‐scale scaffolds using a Hi‐C approach, resulting in a 373 Mb genome assembly with a contig N50 length of 15.2 Mb and and longest length of 28 Mb. When we directly compared the 22 longest scaffolds of T. obscurus to the 22 chromosomes of the tiger puffer Takifugu rubripes, a clear one‐to‐one orthologous relationship was observed between the two species, supporting the chromosome‐level assembly of T. obscurus. This genome assembly can serve as a valuable genetic resource for exploring fugu‐specific compact genome characteristics, and will provide essential genomic information for understanding molecular adaptations to salinity fluctuations and the evolution of osmoregulatory mechanisms.  相似文献   

11.
The ladybird beetle Propylea japonica is an important natural enemy in agro‐ecological systems. Studies on the strong tolerance of P. japonica to high temperatures and insecticides, and its population and phenotype diversity have recently increased. However, abundant genome resources for obtaining insights into stress‐resistance mechanisms and genetic intra‐species diversity for P. japonica are lacking. Here, we constructed the P. japonica genome maps using Pacific Bioscience (PacBio) and Illumina sequencing technologies. The genome size was 850.90 Mb with a contig N50 of 813.13 kb. The Hi‐C sequence data were used to upgrade draft genome assemblies; 4,777 contigs were assembled to 10 chromosomes; and the final draft genome assembly was 803.93 Mb with a contig N50 of 813.98 kb and a scaffold N50 of 100.34 Mb. Approximately 495.38 Mb of repeated sequences was annotated. The 18,018 protein‐coding genes were predicted, of which 95.78% were functionally annotated, and 1,407 genes were species‐specific. The phylogenetic analysis showed that P. japonica diverged from the ancestor of Anoplophora glabripennis and Tribolium castaneum ~ 236.21 million years ago. We detected that some important gene families involved in detoxification of pesticides and tolerance to heat stress were expanded in P. japonica, especially cytochrome P450 and Hsp70 genes. Overall, the high‐quality draft genome sequence of P. japonica will provide invaluable resource for understanding the molecular mechanisms of stress resistance and will facilitate the research on population genetics, evolution and phylogeny of Coccinellidae. This genome will also provide new avenues for conserving the diversity of predator insects.  相似文献   

12.
Casuarina equisetifolia (C. equisetifolia), a conifer‐like angiosperm with resistance to typhoon and stress tolerance, is mainly cultivated in the coastal areas of Australasia. C. equisetifolia, making it a valuable model to study secondary growth associated genes and stress‐tolerance traits. However, the genome sequence is unavailable and therefore wood‐associated growth rate and stress resistance at the molecular level is largely unexplored. We therefore constructed a high‐quality draft genome sequence of C. equisetifolia by a combination of Illumina second‐generation sequencing reads and Pacific Biosciences single‐molecule real‐time (SMRT) long reads to advance the investigation of this species. Here, we report the genome assembly, which contains approximately 300 megabases (Mb) and scaffold size of N50 is 1.06 Mb. Additionally, gene annotation, assisted by a combination of prediction and RNA‐seq data, generated 29 827 annotated protein‐coding genes and 1983 non‐coding genes, respectively. Furthermore, we found that the total number of repetitive sequences account for one‐third of the genome assembly. Here we also construct the genome‐wide map of DNA modification, such as two novel forms N6‐adenine (6mA) and N4‐methylcytosine (4mC) at the level of single‐nucleotide resolution using single‐molecule real‐time (SMRT) sequencing. Interestingly, we found that 17% of 6mA modification genes and 15% of 4mC modification genes also included alternative splicing events. Finally, we investigated cellulose, hemicellulose, and lignin‐related genes, which were associated with secondary growth and contained different DNA modifications. The high‐quality genome sequence and annotation of C. equisetifolia in this study provide a valuable resource to strengthen our understanding of the diverse traits of trees.  相似文献   

13.
14.
Rhizobium strains nodulating summer legumes cow pea [Vigna unguiculata (L.)], green gram [V. radiata (L.) (Wilczek)], black gram [V. mungo (L.) (Hepper)] and cluster bean [Cyamopsis tetragonoloba (L.) (Taub)] and a winter legume chick pea [Cicer arietinum (L.)] were surveyed in the Northern Plains of India and screened for hydrogenase activity to determine distribution of Hup character in the native ecosystem. It was observed that 56% of the Rhizobium strains of summer legumes were Hup+ whereas that of the winter legume, chick pea, were all Hup-. Ex planta acetylene reduction activity was observed in most of the Hup+ but not in the Hup- strains of any of the host species. In summer legume, mixed inoculation of Hup+ and Hup- strains, under sterilized as well as unsterilized soil conditions, showed that the host species were predominantly nodulated with Hup+ strain.  相似文献   

15.
The self‐incompatible species Arabidopsis halleri is a close relative of the self‐compatible model plant Arabidopsis thaliana. The broad European and Asian distribution and heavy metal hyperaccumulation ability make A. halleri a useful model for ecological genomics studies. We used long‐insert mate‐pair libraries to improve the genome assembly of the A. halleri ssp. gemmifera Tada mine genotype (W302) collected from a site with high contamination by heavy metals in Japan. After five rounds of forced selfing, heterozygosity was reduced to 0.04%, which facilitated subsequent genome assembly. Our assembly now covers 196 Mb or 78% of the estimated genome size and achieved scaffold N50 length of 712 kb. To validate assembly and annotation, we used synteny of A. halleri Tada mine with a previously published high‐quality reference assembly of a closely related species, Arabidopsis lyrata. Further validation of the assembly quality comes from synteny and phylogenetic analysis of the HEAVY METAL ATPASE4 (HMA4) and METAL TOLERANCE PROTEIN1 (MTP1) regions using published sequences from European A. halleri for comparison. Three tandemly duplicated copies of HMA4, key gene involved in cadmium and zinc hyperaccumulation, were assembled on a single scaffold. The assembly will enhance the genomewide studies of A. halleri as well as the allopolyploid Arabidopsis kamchatica derived from A. lyrata and A. halleri.  相似文献   

16.
17.
The iconic orange clownfish, Amphiprion percula, is a model organism for studying the ecology and evolution of reef fishes, including patterns of population connectivity, sex change, social organization, habitat selection and adaptation to climate change. Notably, the orange clownfish is the only reef fish for which a complete larval dispersal kernel has been established and was the first fish species for which it was demonstrated that antipredator responses of reef fishes could be impaired by ocean acidification. Despite its importance, molecular resources for this species remain scarce and until now it lacked a reference genome assembly. Here, we present a de novo chromosome‐scale assembly of the genome of the orange clownfish Amphiprion percula. We utilized single‐molecule real‐time sequencing technology from Pacific Biosciences to produce an initial polished assembly comprised of 1,414 contigs, with a contig N50 length of 1.86 Mb. Using Hi‐C‐based chromatin contact maps, 98% of the genome assembly were placed into 24 chromosomes, resulting in a final assembly of 908.8 Mb in length with contig and scaffold N50s of 3.12 and 38.4 Mb, respectively. This makes it one of the most contiguous and complete fish genome assemblies currently available. The genome was annotated with 26,597 protein‐coding genes and contains 96% of the core set of conserved actinopterygian orthologs. The availability of this reference genome assembly as a community resource will further strengthen the role of the orange clownfish as a model species for research on the ecology and evolution of reef fishes.  相似文献   

18.
China is the origin and evolutionary centre of Oriental pears. Pyrus betuleafolia is a wild species native to China and distributed in the northern region, and it is widely used as rootstock. Here, we report the de novo assembly of the genome of P. betuleafolia‐Shanxi Duli using an integrated strategy that combines PacBio sequencing, BioNano mapping and chromosome conformation capture (Hi‐C) sequencing. The genome assembly size was 532.7 Mb, with a contig N50 of 1.57 Mb. A total of 59 552 protein‐coding genes and 247.4 Mb of repetitive sequences were annotated for this genome. The expansion genes in P. betuleafolia were significantly enriched in secondary metabolism, which may account for the organism's considerable environmental adaptability. An alignment analysis of orthologous genes showed that fruit size, sugar metabolism and transport, and photosynthetic efficiency were positively selected in Oriental pear during domestication. A total of 573 nucleotide‐binding site (NBS)‐type resistance gene analogues (RGAs) were identified in the P. betuleafolia genome, 150 of which are TIR‐NBS‐LRR (TNL)‐type genes, which represented the greatest number of TNL‐type genes among the published Rosaceae genomes and explained the strong disease resistance of this wild species. The study of flavour metabolism‐related genes showed that the anthocyanidin reductase (ANR) metabolic pathway affected the astringency of pear fruit and that sorbitol transporter (SOT) transmembrane transport may be the main factor affecting the accumulation of soluble organic matter. This high‐quality P. betuleafolia genome provides a valuable resource for the utilization of wild pear in fundamental pear studies and breeding.  相似文献   

19.
Parasitoid wasps represent a large proportion of hymenopteran species. They have complex evolutionary histories and are important biocontrol agents. To advance parasitoid research, a combination of Illumina short‐read, PacBio long‐read and Hi‐C scaffolding technologies was used to develop a high‐quality chromosome‐level genome assembly for Pteromalus puparum, which is an important pupal endoparasitoid of caterpillar pests. The chromosome‐level assembly has aided in studies of venom and detoxification genes. The assembled genome size is 338 Mb with a contig N50 of 38.7 kb and a scaffold N50 of 1.16 Mb. Hi‐C analysis assembled scaffolds onto five chromosomes and raised the scaffold N50 to 65.8 Mb, with more than 96% of assembled bases located on chromosomes. Gene annotation was assisted by RNA sequencing for the two sexes and four different life stages. Analysis detected 98% of the BUSCO (Benchmarking Universal Single‐Copy Orthologs) gene set, supporting a high‐quality assembly and annotation. In total, 40.1% (135.6 Mb) of the assembly is composed of repetitive sequences, and 14,946 protein‐coding genes were identified. Although venom genes play important roles in parasitoid biology, their spatial distribution on chromosomes was poorly understood. Mapping has revealed venom gene tandem arrays for serine proteases, pancreatic lipase‐related proteins and kynurenine–oxoglutarate transaminases, which have amplified in the P. puparum lineage after divergence from its common ancestor with Nasonia vitripennis. In addition, there is a large expansion of P450 genes in P. puparum. These examples illustrate how chromosome‐level genome assembly can provide a valuable resource for molecular, evolutionary and biocontrol studies of parasitoid wasps.  相似文献   

20.
The red‐spotted grouper Epinephelus akaara (E. akaara) is one of the most economically important marine fish in China, Japan and South‐East Asia and is a threatened species. The species is also considered a good model for studies of sex inversion, development, genetic diversity and immunity. Despite its importance, molecular resources for E. akaara remain limited and no reference genome has been published to date. In this study, we constructed a chromosome‐level reference genome of E. akaara by taking advantage of long‐read single‐molecule sequencing and de novo assembly by Oxford Nanopore Technology (ONT) and Hi‐C. A red‐spotted grouper genome of 1.135 Gb was assembled from a total of 106.29 Gb polished Nanopore sequence (GridION, ONT), equivalent to 96‐fold genome coverage. The assembled genome represents 96.8% completeness (BUSCO) with a contig N50 length of 5.25 Mb and a longest contig of 25.75 Mb. The contigs were clustered and ordered onto 24 pseudochromosomes covering approximately 95.55% of the genome assembly with Hi‐C data, with a scaffold N50 length of 46.03 Mb. The genome contained 43.02% repeat sequences and 5,480 noncoding RNAs. Furthermore, combined with several RNA‐seq data sets, 23,808 (99.5%) genes were functionally annotated from a total of 23,923 predicted protein‐coding sequences. The high‐quality chromosome‐level reference genome of E. akaara was assembled for the first time and will be a valuable resource for molecular breeding and functional genomics studies of red‐spotted grouper in the future.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号