Similar literature
 Found 20 similar articles (search time: 31 ms)
1.
Soybean cyst nematode (SCN, Heterodera glycines) is a major pest of soybean that is spreading across major soybean production regions worldwide. Increased SCN virulence has recently been observed in both the United States and China. However, no study has reported a genome assembly for H. glycines at the chromosome scale. Herein, the first chromosome‐level reference genome of X12, an unusual SCN race with high infection ability, is presented. Using whole‐genome shotgun (WGS) sequencing, Pacific Biosciences (PacBio) sequencing, Illumina paired‐end sequencing, 10X Genomics linked reads and high‐throughput chromatin conformation capture (Hi‐C) genome scaffolding techniques, a 141.01‐megabase (Mb) assembled genome was obtained with scaffold and contig N50 sizes of 16.27 Mb and 330.54 kilobases (kb), respectively. The assembly showed high integrity and quality, with over 90% of Illumina reads mapped to the genome. The assembly quality was evaluated using the Core Eukaryotic Genes Mapping Approach and Benchmarking Universal Single‐Copy Orthologs. A total of 11,882 genes were predicted using de novo, homolog and RNA‐seq data generated from eggs, second‐stage juveniles (J2), third‐stage juveniles (J3) and fourth‐stage juveniles (J4) of X12, and 79.0% of homologous sequences were annotated in the genome. These high‐quality X12 genome data will provide valuable resources for research in a broad range of areas, including fundamental nematode biology, SCN–plant interactions and co‐evolution, and will also contribute to the development of technology for overall SCN management.

2.
Bivalves, a highly diverse and the most evolutionarily successful class of invertebrates native to aquatic habitats, provide valuable molecular resources for understanding evolutionary adaptation and aquatic ecology. Here, we report a high‐quality chromosome‐level genome assembly of the razor clam Sinonovacula constricta using Pacific Biosciences single‐molecule real‐time sequencing, Illumina paired‐end sequencing, 10X Genomics linked reads and Hi‐C reads. The genome size was 1,220.85 Mb, with a scaffold N50 of 65.93 Mb and a contig N50 of 976.94 kb. A total of 899 complete (91.92%) and seven partial (0.72%) matches of the 978 metazoan Benchmarking Universal Single‐Copy Orthologs were found in this genome assembly. Hi‐C scaffolding of the genome resulted in 19 pseudochromosomes. A total of 28,594 protein‐coding genes were predicted in the S. constricta genome, of which 25,413 genes (88.88%) were functionally annotated. In addition, 39.79% of the assembled genome was composed of repetitive sequences, and 4,372 noncoding RNAs were identified. The enrichment analyses of the significantly expanded and contracted genes suggested an evolutionary adaptation of S. constricta to highly stressful living environments. In summary, the genomic resources generated in this work not only provide a valuable reference genome for investigating the molecular mechanisms of S. constricta biological functions and evolutionary adaptation, but also facilitate its genetic improvement and disease treatment. Meanwhile, the obtained genome greatly improves our understanding of the genetics of molluscs and their comparative evolution.
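The BUSCO percentages quoted in this abstract follow directly from the reported counts. The sketch below reproduces that arithmetic as a sanity check; the function name `busco_percentages` and the derived "missing" category are illustrative assumptions, not part of the BUSCO software or of the authors' pipeline.

```python
def busco_percentages(complete: int, partial: int, total: int) -> dict:
    """Express BUSCO match counts as percentages of the reference set.

    Hypothetical helper for checking reported figures; it does not run BUSCO.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if complete < 0 or partial < 0 or complete + partial > total:
        raise ValueError("counts are inconsistent with total")
    # Anything that is neither complete nor partial is counted as missing.
    missing = total - complete - partial
    return {
        "complete": round(100 * complete / total, 2),
        "partial": round(100 * partial / total, 2),
        "missing": round(100 * missing / total, 2),
    }


# Counts taken from the abstract: 899 complete and 7 partial of 978 orthologs.
summary = busco_percentages(complete=899, partial=7, total=978)
print(summary)
```

With the abstract's counts, the complete and partial shares come out at 91.92% and 0.72%, matching the reported values.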

3.
Yellow perch, Perca flavescens, is an ecologically and economically important species native to a large portion of the northern United States and southern Canada and is also a promising candidate species for aquaculture. However, no yellow perch reference genome has been available to facilitate improvements in both fisheries and aquaculture management practices. By combining Oxford Nanopore Technologies long reads, 10X Genomics Illumina short linked reads and a chromosome contact map produced with Hi‐C, we generated a high‐continuity chromosome‐scale yellow perch genome assembly of 877.4 Mb. It contains, in agreement with the known yellow perch diploid chromosome count, 24 chromosome‐size scaffolds covering 98.8% of the complete assembly (N50 = 37.4 Mb, L50 = 11). We also provide a first characterization of the yellow perch sex determination locus, which contains a male‐specific duplicate of the anti‐Müllerian hormone type II receptor gene (amhr2by) inserted at the proximal end of the Y chromosome (chromosome 9). Using this sex‐specific information, we developed a simple PCR genotyping assay which accurately differentiates XY genetic males (amhr2by+) from XX genetic females (amhr2by−). Our high‐quality genome assembly is an important genomic resource for future studies on yellow perch ecology, toxicology, fisheries and aquaculture research. In addition, characterization of the amhr2by gene as a candidate sex‐determining gene in yellow perch provides a new example of the recurrent implication of the transforming growth factor beta pathway in fish sex determination, and highlights gene duplication as an important genomic mechanism for the emergence of new master sex determination genes.
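Most abstracts in this list report N50, and this one also reports L50. As a reminder of what the two statistics mean, here is a minimal sketch using their standard definitions: sort sequence lengths from longest to shortest and accumulate until half the total assembly length is reached. The function name `n50_l50` and the toy lengths are illustrative assumptions, not data from any of the studies.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of contig or scaffold lengths.

    N50 is the length of the sequence at which the running total, taken from
    longest to shortest, first reaches half of the assembly length.
    L50 is the number of sequences needed to reach that point.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    raise ValueError("lengths must contain at least one sequence")


# Toy scaffold lengths in Mb (made up for illustration).
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_lengths)
print(f"N50 = {n50} Mb, L50 = {l50}")
```

Read this way, the yellow perch figures (N50 = 37.4 Mb, L50 = 11) say that the 11 longest scaffolds together hold at least half of the 877.4 Mb assembly, and the shortest of those 11 is 37.4 Mb long.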

4.
Genomics, 2021, 113(4): 2656-2674
Here we report the 409.5 Mb chromosome-level assembly of the first bred semi-dwarf rice, Taichung Native 1 (TN1), which served as the template for the development of the Green Revolution (GR) cultivar IR8 “miracle rice”. We sequenced the TN1 genome utilizing multiple platforms and produced PacBio long reads, Illumina paired-end reads, Illumina mate-pair reads and 10x Genomics linked reads. We used a hybrid strategy, combining de novo and reference-guided approaches, to assemble the 226× coverage of sequences. The assembled TN1 genome has an N50 scaffold size of 33.1 Mb, with the longest scaffold measuring 45.5 Mb. We annotated 37,526 genes, of which 24,102 (64.23%) were assigned Blast2GO annotations. The genome has 4672 (95.4%) complete BUSCOs and a repeat content of 51.52%. We developed our own method of creating a GR pangenome using the orthologous relationships of the proteins of TN1, IR8, MH63 and IR64, identifying 16,999 core orthologue groups of the Green Revolution. From the pangenome, we identified a set of shared and unique gene ontology terms for the accessory clusters, characterizing TN1, IR8, MH63 and IR64. This TN1 genome assembly and GR pangenome will be a resource for new genomic discoveries about the Green Revolution, and for improving the disease and insect resistance and the yield of rice.

5.
Ark shells are commercially important clam species that inhabit muddy sediments of shallow coasts in East Asia. For a long time, the lack of genome resources has hindered scientific research on ark shells. Here, we report a high-quality chromosome-level genome assembly of Scapharca kagoshimensis, with the aim of unravelling the molecular basis of heme biosynthesis and developing genomic resources for genetic breeding and population genetics in ark shells. Nineteen scaffolds corresponding to 19 chromosomes were constructed from 938 contigs (contig N50 = 2.01 Mb) to produce a final high-quality assembly with a total length of 1.11 Gb and a scaffold N50 of around 60.64 Mb. The genome assembly shows 93.4% completeness based on matching 303 eukaryota core conserved genes. A total of 24,908 protein-coding genes were predicted, of which 24,551 (98.56%) were functionally annotated. The enrichment analyses suggested that genes in heme biosynthesis pathways were expanded, and positive selection of the haemoglobin genes was also found in the genome of S. kagoshimensis, which gives important insights into the molecular mechanisms and evolution of heme biosynthesis in Mollusca. The genome assembly of S. kagoshimensis provides a solid foundation for investigating the molecular mechanisms that underlie the diverse biological functions and evolutionary adaptations of S. kagoshimensis.

6.
Members of the family Tetraodontidae are known to have relatively small and compact genomes compared to other vertebrates. The obscure puffer fish Takifugu obscurus is an anadromous species that migrates to freshwater from the sea for spawning. Thus the euryhaline characteristics of T. obscurus have been investigated to gain understanding of their survival ability, osmoregulation, and other homeostatic mechanisms in both freshwater and seawater. In this study, a high‐quality chromosome‐level reference genome for T. obscurus was constructed using long‐read Pacific Biosciences (PacBio) Sequel sequencing and a Hi‐C‐based chromatin contact map platform. The final genome assembly of T. obscurus is 381 Mb, with a contig N50 length of 3,296 kb and longest length of 10.7 Mb, from a total of 62 Gb of raw reads generated using single‐molecule real‐time sequencing technology from a PacBio Sequel platform. The PacBio data were further clustered into chromosome‐scale scaffolds using a Hi‐C approach, resulting in a 373 Mb genome assembly with a contig N50 length of 15.2 Mb and longest length of 28 Mb. When we directly compared the 22 longest scaffolds of T. obscurus to the 22 chromosomes of the tiger puffer Takifugu rubripes, a clear one‐to‐one orthologous relationship was observed between the two species, supporting the chromosome‐level assembly of T. obscurus. This genome assembly can serve as a valuable genetic resource for exploring fugu‐specific compact genome characteristics, and will provide essential genomic information for understanding molecular adaptations to salinity fluctuations and the evolution of osmoregulatory mechanisms.

7.
Dendrolimus spp. are important destructive pests of conifer forests, and Dendrolimus punctatus Walker (Lepidoptera; Lasiocampidae) is the most widely distributed Dendrolimus species. During periodic outbreaks, this species is said to make “fire without smoke” because large areas of pine forest can be quickly and heavily damaged. Yet, little is known about the molecular mechanisms that underlie the unique ecological characteristics of this forest insect. Here, we combined Pacific Biosciences (PacBio) RSII single‐molecule long reads and high‐throughput chromosome conformation capture (Hi‐C) genomics‐linked reads to produce a high‐quality, chromosome‐level reference genome for D. punctatus. The final assembly was 614 Mb with contig and scaffold N50 values of 1.39 and 22.15 Mb, respectively, and 96.96% of the contigs anchored onto 30 chromosomes. Based on the prediction, this genome contained 17,593 protein‐coding genes and 56.16% repetitive sequences. Phylogenetic analyses indicated that D. punctatus diverged from the common ancestor of Hyphantria cunea, Spodoptera litura and Thaumetopoea pityocampa ~108.91 million years ago. Many gene families that were expanded in the D. punctatus genome were significantly enriched for the xenobiotic biodegradation system, especially the cytochrome P450 gene family. This high‐quality, chromosome‐level reference genome will be a valuable resource for understanding mechanisms of D. punctatus outbreak and host resistance adaptation. Because this is the first Lasiocampidae insect genome to be sequenced, it will also serve as a reference for further comparative genomics.

8.
The European rabbit (Oryctolagus cuniculus) is a domesticated species with one of the broadest ranges of economic and scientific applications and fields of investigation. Rabbit genome information and an assembly are available (oryCun2.0), but so far few studies have investigated its variability, and large‐scale discovery of polymorphisms has not yet been published for this species. Here, we sequenced two reduced representation libraries (RRLs) to identify single nucleotide polymorphisms (SNPs) in the rabbit genome. Genomic DNA of 10 rabbits belonging to different breeds was pooled and digested with two restriction enzymes (HaeIII and RsaI) to create two RRLs, which were sequenced using the Ion Torrent Personal Genome Machine. The two RRLs produced 2 917 879 and 4 046 871 reads, for a total of 280.51 Mb (248.49 Mb with quality >20) and 417.28 Mb (360.89 Mb with quality >20) respectively of sequenced DNA. About 90% and 91% respectively of the obtained reads were mapped on the rabbit genome, covering a total of 15.82% of the oryCun2.0 genome version. The mapping and ad hoc filtering procedures allowed 62 491 SNPs to be called reliably. SNPs in a few genomic regions were validated by Sanger sequencing. The Variant Effect Predictor Web tool was used to map SNPs on the current version of the rabbit genome. The obtained results will be useful for many applied and basic research programs for this species and will contribute to the development of cost‐effective solutions for high‐throughput SNP genotyping in the rabbit.

9.
Heterozygosity is an important feature of many plant genomes, and is related to heterosis. Sweet orange, a highly heterozygous species, is thought to have originated from an inter‐species hybrid between pummelo and mandarin. To investigate the heterozygosity of the sweet orange genome and examine how this heterozygosity affects gene expression, we characterized the genome of Valencia orange for single nucleotide variations (SNVs), small insertions and deletions (InDels) and structural variations (SVs), and determined their functional effects on protein‐coding genes and non‐coding sequences. Almost half of the genes containing large‐effect SNVs and InDels were expressed in a tissue‐specific manner. We identified 3542 large SVs (>50 bp), including deletions, insertions and inversions. Most of the 296 genes located in large‐deletion regions showed low expression levels. RNA‐Seq reads and DNA sequencing reads revealed that the alleles of 1062 genes were differentially expressed. In addition, we detected approximately 42 Mb of contigs that were not found in the reference genome of a haploid sweet orange by de novo assembly of unmapped reads, and annotated 134 protein‐coding genes within these contigs. We discuss how this heterozygosity affects the quality of genome assembly. This study advances our understanding of the genome architecture of sweet orange, and provides a global view of gene expression at heterozygous loci.

10.
We offer a guide to de novo genome assembly using sequence data generated by the Illumina platform for biologists working with fungi or other organisms whose genomes are less than 100 Mb in size. The guide requires no familiarity with sequencing assembly technology or associated computer programs. It defines commonly used terms in genome sequencing and assembly; provides examples of assembling short-read genome sequence data for four strains of the fungus Grosmannia clavigera using four assembly programs; gives examples of protocols and software; and presents a commented flowchart that extends from DNA preparation for submission to a sequencing center, through to processing and assembly of the raw sequence reads using freely available operating systems and software.

11.
Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. Here, we present SeXY, a novel sex‐identification pipeline, for very low‐coverage shotgun sequencing data from a single individual. SeXY was designed to utilize low‐effort screening data for sex identification and does not require a conspecific sex‐chromosome assembly as reference. We assess the accuracy of our pipeline with respect to data quantity, by downsampling sequencing data from 100,000 to 1000 mapped reads, and with respect to reference genome selection, by mapping to a variety of reference genomes of various qualities and phylogenetic distances. We show that our method is 100% accurate when mapping to a high‐quality (highly contiguous, N50 > 30 Mb) conspecific genome, even down to 1000 mapped reads. For lower‐quality reference assemblies (N50 < 30 Mb), our method is 100% accurate with 50,000 mapped reads, regardless of reference assembly quality or phylogenetic distance. The SeXY pipeline provides several advantages over previously implemented methods; SeXY (i) requires sequencing data from only a single individual, (ii) does not require assembled conspecific sex chromosomes, or even a conspecific reference assembly, (iii) takes into account variation in coverage across the genome, and (iv) is accurate with only 1000 mapped reads in many cases.

12.
We present the development of a genomic library using a RADseq (restriction site associated DNA sequencing) protocol for marker discovery that can be applied to evolutionary studies of the sugarcane borer Diatraea saccharalis, an important South American insect pest. A RADtag protocol combined with Illumina paired‐end sequencing allowed de novo discovery of 12 811 SNPs and a high‐quality assembly of 122.8M paired‐end reads from six individuals, representing 40 Gb of sequencing data. Approximately 1.7 Mb of the sugarcane borer genome distributed over 5289 minicontigs were obtained upon assembly of second reads from first reads RADtag loci where at least one SNP was discovered and genotyped. Minicontig lengths ranged from 200 to 611 bp and were used for functional annotation and microsatellite discovery. These markers will be used in future studies to understand gene flow and adaptation to host plants and control tactics.

13.
14.
Cicer arietinum L. (chickpea) is the third most important food legume crop. We have generated the draft sequence of a desi‐type chickpea genome using next‐generation sequencing platforms, bacterial artificial chromosome end sequences and a genetic map. The 520‐Mb assembly covers 70% of the predicted 740‐Mb genome length, and more than 80% of the gene space. Genome analysis predicts the presence of 27 571 genes and 210 Mb as repeat elements. The gene expression analysis performed using 274 million RNA‐Seq reads identified several tissue‐specific and stress‐responsive genes. Although segmental duplicated blocks are observed, the chickpea genome does not exhibit any indication of recent whole‐genome duplication. Nucleotide diversity analysis provides an assessment of a narrow genetic base within the chickpea cultivars. We have developed a resource for genetic markers by comparing the genome sequences of one wild and three cultivated chickpea genotypes. The draft genome sequence is expected to facilitate genetic enhancement and breeding to develop improved chickpea varieties.

15.
16.
The rice leaffolder Cnaphalocrocis exigua (Crambidae, Lepidoptera) is an important agricultural pest that damages rice crops and other members of related grass families. C. exigua exhibits a very similar morphological phenotype and feeding behaviour to C. medinalis, another species of rice leaffolder whose genome was recently reported. However, genomic information for C. exigua remains extremely limited. Here, we used a hybrid strategy combining different sequencing technologies, including Illumina, PacBio, 10× Genomics, and Hi-C scaffolding, to generate a high-quality chromosome-level genome assembly of C. exigua. We initially obtained a 798.8 Mb assembly with a contig N50 size of 2.9 Mb, and the N50 size was subsequently increased to 25.7 Mb using Hi-C technology to anchor 1413 scaffolds to 32 chromosomes. We detected a total of 97.7% Benchmarking Universal Single-Copy Orthologues (BUSCO) in the genome assembly, which comprised ~52% repetitive sequence, and annotated 14,922 protein-coding genes. Of note, the Z and W sex chromosomes were assembled and identified. A comparative genomic analysis demonstrated that despite the high synteny observed between the two rice leaffolders, the species have distinct genomic features associated with expansion and contraction of gene families and selection pressure. In summary, our chromosome-level genome assembly and comparative genomic analysis of C. exigua provide novel insights into the evolution and ecology of this rice insect pest and offer useful information for pest control.

17.
Yellow drum (Nibea albiflora) is an important fish species in capture fishery and aquaculture in East Asia. We herein report the first and near‐complete genome assembly of an ultra‐homologous gynogenic female yellow drum using Illumina short sequencing reads. In summary, a total of 154.2 Gb of raw reads were generated via whole‐genome sequencing and were assembled into a 565.3 Mb genome with a contig N50 size of 50.3 kb and a scaffold N50 size of 2.2 Mb (BUSCO completeness of 97.7%), accounting for 97.3%–98.6% of the estimated genome size of this fish. We further identified 22,448 genes using combined methods of ab initio prediction, RNA‐seq annotation, and protein homology searching, of which 21,614 (96.3%) were functionally annotated in the NCBI nr, trEMBL, SwissProt, and KOG databases. We also investigated the nucleotide diversity (around 1/390) of aquacultured individuals and found that the genetic diversity of the aquacultured population had decreased due to inbreeding. Evolutionary analyses identified significantly expanded and extracted gene families, such as myosin and sodium:neurotransmitter symporter (SNF), that could help explain the swimming motility of yellow drum. The presented genome will be an important resource for future studies on population genetics, conservation, understanding of evolutionary history and genetic breeding of the yellow drum and other Nibea species.
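The abstract gives the assembly size and the fraction of the estimated genome it covers, but not the estimate itself. The sketch below back-calculates the implied range; this is a reader's derivation under the assumption that the percentages are simply assembly size divided by estimated genome size, and the function name `implied_genome_size` is hypothetical.

```python
def implied_genome_size(assembly_mb: float, fraction_low: float, fraction_high: float):
    """Return (smallest, largest) genome size in Mb implied by a coverage range.

    Assumes fraction = assembly size / estimated genome size, so a higher
    fraction corresponds to a smaller estimated genome.
    """
    if not 0 < fraction_low <= fraction_high <= 1:
        raise ValueError("fractions must satisfy 0 < low <= high <= 1")
    return assembly_mb / fraction_high, assembly_mb / fraction_low


# Figures from the abstract: 565.3 Mb assembly covering 97.3%-98.6% of the estimate.
smallest, largest = implied_genome_size(565.3, 0.973, 0.986)
print(f"Implied estimated genome size: {smallest:.1f}-{largest:.1f} Mb")
```

Under that assumption the estimated genome size falls between roughly 573 Mb and 581 Mb; the abstract itself does not state these values.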

18.
Genomics, 2022, 114(6): 110472
Toxoptera aurantii Boyer de Fonscolombe (Hemiptera: Aphididae) can attack many plant hosts, including tea (Camellia sinensis L.), citrus (Citrus spp.), lychee (Litchi chinensis Sonn.), banana (Musa spp.), and pineapple (Ananas comosus L.), among others. It is a widely distributed hexapod and one of the most destructive pests in tea plantations, causing enormous economic losses in tea production each year. A high-quality reference genome is important to study the phylogenetics and evolution of T. aurantii because its genome is highly heterozygous and repetitive. We obtained a de novo genome assembly of T. aurantii at the chromosome level using a combination of long Nanopore reads and high-throughput chromosome conformation capture technology. The final assembled genome was 318.95 Mb on four chromosomes, with a scaffold N50 of 15.19 Mb. A total of 12,162 genes encoded proteins, while repetitive sequences accounted for 22.01% of the genome and totaled 67.73 Mb. Phylogenetic analyses revealed that T. aurantii and Aphis gossypii diverged approximately 7.6 million years ago (Mya). The combination of long-read single-molecule sequencing with Hi-C–based chromatin interaction maps resulted in a high-quality chromosome-level reference genome of T. aurantii. Our results will enable the exploration of the genetics behind the special biological features of T. aurantii and also provide a source of data that should be useful for comparing genomes among the Hemiptera.

19.
Onychostoma macrolepis is an emerging commercial cyprinid fish species. It is a model system for studies of sexual dimorphism and genome evolution. Here, we report the chromosome‐level assembly of the O. macrolepis genome obtained from the integration of nanopore long‐read sequencing with physical maps produced using Bionano and Hi‐C technology. A total of 87.9 Gb of nanopore sequence provided approximately 100‐fold coverage of the genome. The preliminary genome assembly was 883.2 Mb in size with a contig N50 size of 11.2 Mb. The 969 corrected contigs obtained from Bionano optical mapping were assembled into 853 scaffolds and produced an assembly of 886.5 Mb with a scaffold N50 of 16.5 Mb. Finally, using the Hi‐C data, 881.3 Mb (99.4% of genome) in 526 scaffolds were anchored and oriented in 25 chromosomes ranging in size from 25.27 to 56.49 Mb. In total, 24,770 protein‐coding genes were predicted in the genome, and ~96.85% of the genes were functionally annotated. The annotated assembly contains 93.3% complete genes from the BUSCO reference set. In addition, we identified 409 Mb (46.23% of the genome) of repetitive sequence, and 11,213 non‐coding RNAs, in the genome. Evolutionary analysis revealed that O. macrolepis diverged from common carp approximately 24.25 million years ago. The chromosomes of O. macrolepis showed an unambiguous correspondence to the chromosomes of zebrafish. The high‐quality genome assembled in this work provides a valuable genomic resource for further biological and evolutionary studies of O. macrolepis.
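The "approximately 100‐fold coverage" figure can be checked with the usual depth estimate: total sequenced bases divided by genome size. The sketch below uses the preliminary assembly size as the denominator, which is an assumption on our part, since the abstract does not say which genome size the authors used; `fold_coverage` is a hypothetical helper name.

```python
GB = 1e9  # bases per gigabase
MB = 1e6  # bases per megabase


def fold_coverage(sequenced_bases: float, genome_size_bases: float) -> float:
    """Average sequencing depth: sequenced bases divided by genome size."""
    if genome_size_bases <= 0:
        raise ValueError("genome size must be positive")
    return sequenced_bases / genome_size_bases


# 87.9 Gb of nanopore sequence against the 883.2 Mb preliminary assembly
# (using the assembly size as the genome size is an assumption).
depth = fold_coverage(87.9 * GB, 883.2 * MB)
print(f"Estimated depth: {depth:.1f}x")
```

Dividing by the preliminary assembly size gives a depth of about 99.5×, in line with the abstract's rounded figure.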

20.
The red‐spotted grouper Epinephelus akaara (E. akaara) is one of the most economically important marine fish in China, Japan and South‐East Asia and is a threatened species. The species is also considered a good model for studies of sex inversion, development, genetic diversity and immunity. Despite its importance, molecular resources for E. akaara remain limited and no reference genome has been published to date. In this study, we constructed a chromosome‐level reference genome of E. akaara by taking advantage of long‐read single‐molecule sequencing and de novo assembly by Oxford Nanopore Technology (ONT) and Hi‐C. A red‐spotted grouper genome of 1.135 Gb was assembled from a total of 106.29 Gb polished Nanopore sequence (GridION, ONT), equivalent to 96‐fold genome coverage. The assembled genome represents 96.8% completeness (BUSCO) with a contig N50 length of 5.25 Mb and a longest contig of 25.75 Mb. The contigs were clustered and ordered onto 24 pseudochromosomes covering approximately 95.55% of the genome assembly with Hi‐C data, with a scaffold N50 length of 46.03 Mb. The genome contained 43.02% repeat sequences and 5,480 noncoding RNAs. Furthermore, combined with several RNA‐seq data sets, 23,808 (99.5%) genes were functionally annotated from a total of 23,923 predicted protein‐coding sequences. The high‐quality chromosome‐level reference genome of E. akaara was assembled for the first time and will be a valuable resource for molecular breeding and functional genomics studies of red‐spotted grouper in the future.
