首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 828 毫秒
1.
The large genome size of many species hinders the development and application of genomic tools to study them. For instance, loblolly pine (Pinus taeda L.), an ecologically and economically important conifer, has a large and yet uncharacterized genome of 21.7 Gbp. To characterize the pine genome, we performed exome capture and sequencing of 14 729 genes derived from an assembly of expressed sequence tags. Efficiency of sequence capture was evaluated and shown to be similar across samples with increasing levels of complexity, including haploid cDNA, haploid genomic DNA and diploid genomic DNA. However, this efficiency was severely reduced for probes that overlapped multiple exons, presumably because intron sequences hindered probe:exon hybridizations. Such regions could not be entirely avoided during probe design, because of the lack of a reference sequence. To improve the throughput and reduce the cost of sequence capture, a method to multiplex the analysis of up to eight samples was developed. Sequence data showed that multiplexed capture was reproducible among 24 haploid samples, and can be applied for high‐throughput analysis of targeted genes in large populations. Captured sequences were de novo assembled, resulting in 11 396 expanded and annotated gene models, significantly improving the knowledge about the pine gene space. Interspecific capture was also evaluated with over 98% of all probes designed from P. taeda that were efficient in sequence capture, were also suitable for analysis of the related species Pinus elliottii Engelm.  相似文献   

2.
The ladybird beetle Propylea japonica is an important natural enemy in agro‐ecological systems. Studies on the strong tolerance of P. japonica to high temperatures and insecticides, and its population and phenotype diversity have recently increased. However, abundant genome resources for obtaining insights into stress‐resistance mechanisms and genetic intra‐species diversity for P. japonica are lacking. Here, we constructed the P. japonica genome maps using Pacific Bioscience (PacBio) and Illumina sequencing technologies. The genome size was 850.90 Mb with a contig N50 of 813.13 kb. The Hi‐C sequence data were used to upgrade draft genome assemblies; 4,777 contigs were assembled to 10 chromosomes; and the final draft genome assembly was 803.93 Mb with a contig N50 of 813.98 kb and a scaffold N50 of 100.34 Mb. Approximately 495.38 Mb of repeated sequences was annotated. The 18,018 protein‐coding genes were predicted, of which 95.78% were functionally annotated, and 1,407 genes were species‐specific. The phylogenetic analysis showed that P. japonica diverged from the ancestor of Anoplophora glabripennis and Tribolium castaneum ~ 236.21 million years ago. We detected that some important gene families involved in detoxification of pesticides and tolerance to heat stress were expanded in P. japonica, especially cytochrome P450 and Hsp70 genes. Overall, the high‐quality draft genome sequence of P. japonica will provide invaluable resource for understanding the molecular mechanisms of stress resistance and will facilitate the research on population genetics, evolution and phylogeny of Coccinellidae. This genome will also provide new avenues for conserving the diversity of predator insects.  相似文献   

3.
Populus euphratica is well adapted to extreme desert environments and is an important model species for elucidating the mechanisms of abiotic stress resistance in trees. The current assembly of P. euphratica genome is highly fragmented with many gaps and errors, thereby impeding downstream applications. Here, we report an improved chromosome‐level reference genome of P. euphratica (v2.0) using single‐molecule sequencing and chromosome conformation capture (Hi‐C) technologies. Relative to the previous reference genome, our assembly represents a nearly 60‐fold improvement in contiguity, with a scaffold N50 size of 28.59 Mb. Using this genome, we have found that extensive expansion of Gypsy elements in P. euphratica led to its rapid increase in genome size compared to any other Salicaceae species studied to date, and potentially contributed to adaptive divergence driven by insertions near genes involved in stress tolerance. We also detected a wide range of unique structural rearrangements in P. euphratica, including 2,549 translocations, 454 inversions, 121 tandem and 14 segmental duplications. Several key genes likely to be involved in tolerance to abiotic stress were identified within these regions. This high‐quality genome represents a valuable resource for poplar breeding and genetic improvement in the future, as well as comparative genomic analysis with other Salicaceae species.  相似文献   

4.
Advanced resources for genome‐assisted research in barley (Hordeum vulgare) including a whole‐genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole‐genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA‐coding exome reduces barley genomic complexity more than 50‐fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in‐solution hybridization‐based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full‐length cDNAs and de novo assembled RNA‐Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA‐coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping‐by‐sequencing and genetic diversity analyzes.  相似文献   

5.
Erigeron breviscapus is an important medicinal plant in Compositae and the first species to realize the whole process from the decoding of the draft genome sequence to scutellarin biosynthesis in yeast. However, the previous low‐quality genome assembly has hindered the optimization of candidate genes involved in scutellarin synthesis and the development of molecular‐assisted breeding based on the genome. Here, the E. breviscapus genome was updated using PacBio RSII sequencing data and Hi‐C data, and increased in size from 1.2 Gb to 1.43 Gb, with a scaffold N50 of 156.82 Mb and contig N50 of 140.95 kb, and a total of 43,514 protein‐coding genes were obtained and oriented onto nine pseudo‐chromosomes, thus becoming the third plant species assembled to chromosome level after sunflower and lettuce in Compositae. Fourteen genes with evidence for positive selection were identified and found to be related to leaf morphology, flowering and secondary metabolism. The number of genes in some gene families involved in flavonoid biosynthesis in E. breviscapus have been significantly expanded. In particular, additional candidate genes involved in scutellarin biosynthesis, such as flavonoid‐7‐O‐glucuronosyltransferase genes (F7GATs) were identified using updated genome. In addition, three candidate genes encoding indole‐3‐pyruvate monooxygenase YUCCA2 (YUC2), serine carboxypeptidase‐like 18 (SCPL18), and F‐box protein (FBP), respectively, were identified to be probably related to leaf development and flowering by resequencing 99 individuals. These results provided a substantial genetic basis for improving agronomic and quality traits of E. breviscapus, and provided a platform for improving other draft genome assemblies to chromosome‐level.  相似文献   

6.
Samia ricini, a gigantic saturniid moth, has the potential to be a novel lepidopteran model species. Samia ricini is far more resistant to diseases than the current model species Bombyx mori, and therefore can be more easily reared. In addition, genetic resources available for S. ricini rival those for B. mori: at least 26 ecoraces of S. ricini are reported and S. ricini can hybridize with wild Samia species, which are distributed throughout Asian countries, and produce fertile progenies. Physiological traits such as food preference, integument colour and larval spot pattern differ among S. ricini strains and wild Samia species so that those traits can be targeted in forward genetic analyses. To facilitate genetic research in S. ricini, we determined its whole genome sequence. The assembled genome of S. ricini was 458 Mb with 155 scaffolds, and the scaffold N50 length of the assembly was ~ 21 Mb. In total, 16,702 protein coding genes were predicted. While the S. ricini genome was mostly collinear with that of B. mori with some rearrangements and few S. ricini‐specific genes were discovered, chorion genes and fibroin genes seemed to have expanded in the S. ricini lineage. As the first step of genetic analyses, causal genes for “Blue,” “Yellow,” “Spot,” and “Red cocoon” phenotypes were mapped to chromosomes.  相似文献   

7.
Glycine latifolia (Benth.) Newell & Hymowitz (2= 40), one of the 27 wild perennial relatives of soybean, possesses genetic diversity and agronomically favorable traits that are lacking in soybean. Here, we report the 939‐Mb draft genome assembly of G. latifolia (PI 559298) using exclusively linked‐reads sequenced from a single Chromium library. We organized scaffolds into 20 chromosome‐scale pseudomolecules utilizing two genetic maps and the Glycine max (L.) Merr. genome sequence. High copy numbers of putative 91‐bp centromere‐specific tandem repeats were observed in consecutive blocks within predicted pericentromeric regions on several pseudomolecules. No 92‐bp putative centromeric repeats, which are abundant in G. max, were detected in G. latifolia or Glycine tomentella. Annotation of the assembled genome and subsequent filtering yielded a high confidence gene set of 54 475 protein‐coding loci. In comparative analysis with five legume species, genes related to defense responses were significantly overrepresented in Glycine‐specific orthologous gene families. A total of 304 putative nucleotide‐binding site (NBS)‐leucine‐rich‐repeat (LRR) genes were identified in this genome assembly. Different from other legume species, we observed a scarcity of TIR‐NBS‐LRR genes in G. latifolia. The G. latifolia genome was also predicted to contain genes encoding 367 LRR‐receptor‐like kinases, a family of proteins involved in basal defense responses and responses to abiotic stress. The genome sequence and annotation of G. latifolia provides a valuable source of alternative alleles and novel genes to facilitate soybean improvement. This study also highlights the efficacy and cost‐effectiveness of the application of Chromium linked‐reads in diploid plant genome de novo assembly.  相似文献   

8.
A high-quality reference genome is necessary to determine the molecular mechanisms underlying important biological phenomena; therefore, in the present study, a chromosome-level genome assembly of the Chinese shrimp Fenneropenaeus chinensis was performed. Muscle of a male shrimp was sequenced using PacBio platform, and assembled by Hi-C technology. The assembled F. chinensis genome was 1.47 Gb with contig N50 of 472.84 Kb, including 57.73% repetitive sequences, and was anchored to 43 pseudochromosomes, with scaffold N50 of 36.87 Mb. In total, 25,026 protein-coding genes were predicted. The genome size of F. chinensis showed significant contraction in comparison with that of other penaeid species, which is likely related to migration observed in this species. However, the F. chinensis genome included several expanded gene families related to cellular processes and metabolic processes, and the contracted gene families were associated with virus infection process. The findings signify the adaptation of F. chinensis to the selection pressure of migration and cold environment. Furthermore, the selection signature analysis identified genes associated with metabolism, phototransduction, and nervous system in cultured shrimps when compared with wild population, indicating targeted, artificial selection of growth, vision, and behavior during domestication. The construction of the genome of F. chinensis provided valuable information for the further genetic mechanism analysis of important biological processes, and will facilitate the research of genetic changes during evolution.  相似文献   

9.
10.
Antheraea pernyi is a semi‐domesticated lepidopteran insect species valuable to the silk industry, human health, and ecological tourism. Owing to its economic influence and developmental properties, it serves as an ideal model for investigating divergence of the Bombycoidea super family. However, studies on the karyotype evolution and functional genomics of A. pernyi are limited by scarce genomic resource. Here, we applied PacBio sequencing and chromosome structure capture technique to assemble the first high‐quality A. pernyi genome from a single male individual. The genome is 720.67 Mb long with 49 chromosomes and a 13.77‐Mb scaffold N50. Approximately 441.75 Mb, accounting for 60.74% of the genome, was identified as repeats. The genome comprises 21,431 protein‐coding genes, 85.22% of which were functionally annotated. Comparative genomics analysis suggested that A. pernyi diverged from its common ancestor with A. yamamai ~30.3 million years ago, and that chromosome fission contributed to the increased chromosome number. The genome assembled in this work will not only facilitate future research on A. pernyi and related species but also help to progress comparative genomics analyses in Lepidoptera.  相似文献   

11.
Members of the Nanorana genus (family Dicroglossidae) are often referred to as excellent model species with which to study amphibian adaptations to extreme environments and also as excellent keystone taxa for providing insights into the evolution of the Dicroglossidae. However, a complete mitochondrial genome is currently only available for Nanorana pleskei. Thus, we analyzed the complete mitochondrial genomes of Nanorana parkeri and Nanorana ventripunctata to investigate their evolutionary relationships within Nanorana and their phylogenetic position in the family Dicroglossidae. Our results showed that the genomes of N. parkeri (17,837 bp) and N. ventripunctata (18,373 bp) encode 13 protein‐coding genes (PCGs), two ribosomal RNA genes, 23 transfer RNA (tRNA) genes, and a noncoding control region. Overall sequences and genome structure of the two species showed high degree of similarity with N. pleskei, although the motif structures and repeat sequences of the putative control region showed clear differences among these three Nanorana species. In addition, a tandem repeat of the tRNA‐Met gene was found located between the tRNA‐Gln and ND2 genes. On both the 5′ and 3′‐sides, the control region possessed distinct repeat regions; however, the CSB‐2 motif was not found in N. pleskei. Based on the nucleotide sequences of 13 PCGs, our phylogenetic analyses, using Bayesian inference and maximum‐likelihood methods, illustrate the taxonomic status of Nanorana with robust support showing that N. ventripunctata and N. pleskei are more closely related than they are to N. parkeri. In conclusion, our analyses provide a more robust and reliable perspective on the evolutionary history of Dicroglossidae than earlier analyses, which used only a single species (N. pleskei).  相似文献   

12.
The genus Pseudo‐nitzschia contains potentially toxic species of problematic taxonomy, making it one of the most intensively studied diatom genera. The study of 35 clonal strains isolated from the Bilbao estuary, an area that experiences recurrent blooms of Pseudo‐nitzschia, revealed the presence of two new species, P. abrensis and P. plurisecta, differing from their congeners in both morphology and gene sequence. The morphological features were analyzed by LM and EM, whereas molecular analyses were based on the internal transcribed spacer (ITS) and large subunit (LSU) regions of the rDNA. P. plurisecta appears closely related to P. cuspidata/P. pseudodelicatissima in the phylogenetic tree, whereas P. abrensis forms a moderately supported clade with P. heimii/P. subpacifica and P. caciantha/P. circumpora. Comparison of the secondary structure of ITS2 regions reveals marked differences in the most highly conserved regions among related taxa. Morphologically, the new species differ from their closest congeners in the arrangement of the poroid sectors and the density of valve striae and fibulae. The two species share similar pigment composition, and belong to the group of Pseudo‐nitzschia species containing only chlorophyll c2 and c3.  相似文献   

13.
The related A genome species of the Oryza genus are the effective gene pool for rice. Here, we report draft genomes for two Australian wild A genome taxa: O. rufipogon‐like population, referred to as Taxon A, and O. meridionalis‐like population, referred to as Taxon B. These two taxa were sequenced and assembled by integration of short‐ and long‐read next‐generation sequencing (NGS) data to create a genomic platform for a wider rice gene pool. Here, we report that, despite the distinct chloroplast genome, the nuclear genome of the Australian Taxon A has a sequence that is much closer to that of domesticated rice (O. sativa) than to the other Australian wild populations. Analysis of 4643 genes in the A genome clade showed that the Australian annual, O. meridionalis, and related perennial taxa have the most divergent (around 3 million years) genome sequences relative to domesticated rice. A test for admixture showed possible introgression into the Australian Taxon A (diverged around 1.6 million years ago) especially from the wild indica/O. nivara clade in Asia. These results demonstrate that northern Australia may be the centre of diversity of the A genome Oryza and suggest the possibility that this might also be the centre of origin of this group and represent an important resource for rice improvement.  相似文献   

14.
The nautilus, commonly known as a “living fossil,” is endangered and may be at risk of extinction. The lack of genomic information hinders a thorough understanding of its biology and evolution, which can shed light on the conservation of this endangered species. Here, we report the first high-quality chromosome-level genome assembly of Nautilus pompilius. The assembled genome size comprised 785.15 Mb. Comparative genomic analyses indicated that transposable elements (TEs) and large-scale genome reorganizations may have driven lineage-specific evolution in the cephalopods. Remarkably, evolving conserved genes and recent TE insertion activities were identified in N. pompilius, and we speculate that these findings reflect the strong adaptability and long-term survival of the nautilus. We also identified gene families that are potentially responsible for specific adaptation and evolution events. Our study provides unprecedented insights into the specialized biology and evolution of N. pompilius, and the results serve as an important resource for future conservation genomics of the nautilus and closely related species.  相似文献   

15.
Parasitoid wasps represent a large proportion of hymenopteran species. They have complex evolutionary histories and are important biocontrol agents. To advance parasitoid research, a combination of Illumina short‐read, PacBio long‐read and Hi‐C scaffolding technologies was used to develop a high‐quality chromosome‐level genome assembly for Pteromalus puparum, which is an important pupal endoparasitoid of caterpillar pests. The chromosome‐level assembly has aided in studies of venom and detoxification genes. The assembled genome size is 338 Mb with a contig N50 of 38.7 kb and a scaffold N50 of 1.16 Mb. Hi‐C analysis assembled scaffolds onto five chromosomes and raised the scaffold N50 to 65.8 Mb, with more than 96% of assembled bases located on chromosomes. Gene annotation was assisted by RNA sequencing for the two sexes and four different life stages. Analysis detected 98% of the BUSCO (Benchmarking Universal Single‐Copy Orthologs) gene set, supporting a high‐quality assembly and annotation. In total, 40.1% (135.6 Mb) of the assembly is composed of repetitive sequences, and 14,946 protein‐coding genes were identified. Although venom genes play important roles in parasitoid biology, their spatial distribution on chromosomes was poorly understood. Mapping has revealed venom gene tandem arrays for serine proteases, pancreatic lipase‐related proteins and kynurenine–oxoglutarate transaminases, which have amplified in the P. puparum lineage after divergence from its common ancestor with Nasonia vitripennis. In addition, there is a large expansion of P450 genes in P. puparum. These examples illustrate how chromosome‐level genome assembly can provide a valuable resource for molecular, evolutionary and biocontrol studies of parasitoid wasps.  相似文献   

16.
17.
Many recent studies have provided significant insights into polyploid breeding, but limited research has been carried out on trees. The genomic information needed to understand growth and response to abiotic stress in polyploidy trees is largely unknown, but has become critical due to the threats to forests imposed by climate change. Populus albaBerolinensis,’ also known “Yinzhong poplar,” is a triploid poplar from northeast China. This hybrid triploid poplar is widely used as a landscape ornamental and in urban forestry owing to its adaptation to adverse environments and faster growth than its parental diploid. It is an artificially synthesized male allotriploid hybrid, with three haploid genomes of P. albaBerolinensis’ originating from different poplar species, so it is attractive for studying polyploidy genomic mechanisms in heterosis. In this study, we focused on the allelic genomic interactions in P. albaBerolinensis,’ and generated a high-quality chromosome-level genome assembly consisting of 19 allelic chromosomes. Its three haploid chromosome sets are polymorphic with an average of 25.73 nucleotide polymorphism sites per kilobase. We found that some stress-related genes such as RD22 and LEA7 exhibited sequence differences between different haploid genomes. The genome assembly has been deposited in our polyploid genome online analysis website TreeGenomes ( https://www.treegenomes.com ). These polyploid genome-related resources will provide a critical foundation for the molecular breeding of P. albaBerolinensis’ and help us uncover the allopolyploidization effects of heterosis and abiotic stress resistance and traits of polyploidy species in the future.  相似文献   

18.
Apolygus lucorum (Miridae) is an omnivorous pest that occurs worldwide and is notorious for the serious damage it causes to various crops and substantial economic losses. Although some studies have examined the biological characteristics of the mirid bug, no reference genome is available in Miridae, limiting in‐depth studies of this pest. Here, we present a chromosome‐scale reference genome of A. lucorum, the first sequenced Miridae species. The assembled genome size was 1.02 Gb with a contig N50 of 785 kb. With Hi‐C scaffolding, 1,016 Mb contig sequences were clustered, ordered and assembled into 17 large scaffolds with scaffold N50 length 68 Mb, each corresponding to a natural chromosome. Numerous transposable elements occur in this genome and contribute to the large genome size. Expansions of genes associated with omnivorousness and mesophyll feeding such as those related to digestion, chemosensory perception, and detoxification were observed in A. lucorum, suggesting that gene expansion contributed to its strong environmental adaptability and severe harm to crops. We clarified that a salivary enzyme polygalacturonase is unique in mirid bugs and has significantly expanded in A. lucorum, which may contribute to leaf damage from this pest. The reference genome of A. lucorum not only facilitates biological studies of Hemiptera as well as an understanding of the damage mechanism of mesophyll feeding, but also provides a basis on which to develop efficient control technologies for mirid bugs.  相似文献   

19.
Salt tolerance genes constitute an important class of loci in plant genomes. Little is known about the extent to which natural selection in saline environments has acted upon these loci, and what types of nucleotide diversity such selection has given rise to. Here, we surveyed genetic diversity in three types of Na+/H+ antiporter gene (SOS, NhaD, and NHX, belonging to the cation/proton antiporter 1 family), which have well ‐ characterized essential roles in plant salt tolerance. Ten Na+/H+ antiporter genes and 16 neutral loci randomly selected as controls were sequenced from 17 accessions of two closely related members of the genus Populus, Populus euphratica and Populus pruinosa, section Turanga, which are native to northwest China. The results show that salt tolerance genes are common targets of natural selection in P. euphratica and P. pruinosa. Moreover, the patterns of nucleotide variation across the three types of Na+/H+ antiporter gene are distinctly different in these two closely related Populus species, and gene flow from P. pruinosa to P. euphratica is highly restricted. Our results suggest that natural selection played an important role in shaping the current distinct patterns of Na+/H+ antiporter genes, resulting in adaptive evolution in P. euphratica and P. pruinosa.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号