首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 453 毫秒
1.
2.
Bottle gourd (Lagenaria siceraria) is an important vegetable crop as well as a rootstock for other cucurbit crops. In this study, we report a high‐quality 313.4‐Mb genome sequence of a bottle gourd inbred line, USVL1VR‐Ls, with a scaffold N50 of 8.7 Mb and the longest of 19.0 Mb. About 98.3% of the assembled scaffolds are anchored to the 11 pseudomolecules. Our comparative genomic analysis identifies chromosome‐level syntenic relationships between bottle gourd and other cucurbits, as well as lineage‐specific gene family expansions in bottle gourd. We reconstructed the genome of the most recent common ancestor of Cucurbitaceae, which revealed that the ancestral Cucurbitaceae karyotypes consisted of 12 protochromosomes with 18 534 protogenes. The 12 protochromosomes are largely retained in the modern melon genome, while have undergone different degrees of shuffling events in other investigated cucurbit genomes. The 11 bottle gourd chromosomes derive from the ancestral Cucurbitaceae karyotypes followed by 19 chromosomal fissions and 20 fusions. The bottle gourd genome sequence has facilitated the mapping of a dominant monogenic locus, Prs, conferring Papaya ring‐spot virus (PRSV) resistance in bottle gourd, to a 317.8‐kb region on chromosome 1. We have developed a cleaved amplified polymorphic sequence (CAPS) marker tightly linked to the Prs locus and demonstrated its potential application in marker‐assisted selection of PRSV resistance in bottle gourd. This study provides insights into the paleohistory of Cucurbitaceae genome evolution, and the high‐quality genome sequence of bottle gourd provides a useful resource for plant comparative genomics studies and cucurbit improvement.  相似文献   

3.
4.
5.
The leopard coral grouper, Plectropomus leopardus, belonging to the family Epinephelinae, is a carnivorous coral reef fish widely distributed in tropical and subtropical waters of the Indo‐Pacific. Due to its appealing body appearance and delicious taste, P. leopardus has become a popular commercial fish for aquaculture in many countries. However, the lack of genomic and molecular resources for P. leopardus has hindered study of its biology and genomic breeding programmes. Here we report the de novo sequencing and assembly of the P. leopardus genome using a combination of 10 × Genomics, high‐throughput chromosome conformation capture (Hi‐C) and PacBio long‐read sequencing technologies. The genome assembly has a total length of 881.55 Mb with a scaffold N50 of 34.15 Mb, consisting of 24 pseudochromosome scaffolds. busco analysis showed that 97.2% of the conserved single‐copy genes were retrieved, indicating the assembly was almost entire. We predicted 25,248 protein‐coding genes, among which 96.5% were functionally annotated. Comparative genomic analyses revealed that gene family expansions in P. leopardus were associated with immune‐related pathways. In addition, we identified 5,178,453 single nucleotide polymorphisms based on genome resequencing of 54 individuals. The P. leopardus genome and genomic variation data provide valuable genomic resources for studies of its genetics, evolution and biology. In particular, it is expected to benefit the development of genomic breeding programmes in the farming industry.  相似文献   

6.
Sesame (Sesamum indicum L.) is an important oil crop renowned for its high oil content and quality. Recently, genome assemblies for five sesame varieties including two landraces (S. indicum cv. Baizhima and Mishuozhima) and three modern cultivars (S. indicum var. Zhongzhi13, Yuzhi11 and Swetha), have become available providing a rich resource for comparative genomic analyses and gene discovery. Here, we employed a reference‐assisted assembly approach to improve the draft assemblies of four of the sesame varieties. We then constructed a sesame pan‐genome of 554.05 Mb. The pan‐genome contained 26 472 orthologous gene clusters; 15 409 (58.21%) of them were core (present across all five sesame genomes), whereas the remaining 41.79% (11 063) clusters and the 15 890 variety‐specific genes were dispensable. Comparisons between varieties suggest that modern cultivars from China and India display significant genomic variation. The gene families unique to the sesame modern cultivars contain genes mainly related to yield and quality, while those unique to the landraces contain genes involved in environmental adaptation. Comparative evolutionary analysis indicates that several genes involved in plant‐pathogen interaction and lipid metabolism are under positive selection, which may be associated with sesame environmental adaption and selection for high seed oil content. This study of the sesame pan‐genome provides insights into the evolution and genomic characteristics of this important oilseed and constitutes a resource for further sesame crop improvement.  相似文献   

7.
Complete and highly accurate reference genomes and gene annotations are indispensable for basic biological research and trait improvement of woody tree species. In this study, we integrated single‐molecule sequencing and high‐throughput chromosome conformation capture techniques to produce a high‐quality and long‐range contiguity chromosome‐scale genome assembly of the soft‐seeded pomegranate cultivar ‘Tunisia’. The genome covers 320.31 Mb (scaffold N50 = 39.96 Mb; contig N50 = 4.49 Mb) and includes 33 594 protein‐coding genes. We also resequenced 26 pomegranate varieties that varied regarding seed hardness. Comparative genomic analyses revealed many genetic differences between soft‐ and hard‐seeded pomegranate varieties. A set of selective loci containing SUC8‐like, SUC6, FoxO and MAPK were identified by the selective sweep analysis between hard‐ and soft‐seeded populations. An exceptionally large selective region (26.2 Mb) was identified on chromosome 1. Our assembled pomegranate genome is more complete than other currently available genome assemblies. Our results indicate that genomic variations and selective genes may have contributed to the genetic divergence between soft‐ and hard‐seeded pomegranate varieties.  相似文献   

8.
9.
Marine medaka (Oryzias melastigma) is considered to be a useful fish model for marine and estuarine ecotoxicology studies and has good potential for field‐based population genomics because of its geographical distribution in Asian estuarine and coastal areas. In this study, we present the first whole‐genome draft of O. melastigma. The genome assembly consists of 8,602 scaffolds (N50 = 23.737 Mb) and a total genome length of 779.4 Mb. A total of 23,528 genes were predicted, and 12,670 gene families shared with three teleost species (Japanese medaka, mangrove killifish and zebrafish) were identified. Genome analyses revealed that the O. melastigma genome is highly heterozygous and contains a large number of repeat sequences. This assembly represents a useful genomic resource for fish scientists.  相似文献   

10.
Antheraea pernyi is a semi‐domesticated lepidopteran insect species valuable to the silk industry, human health, and ecological tourism. Owing to its economic influence and developmental properties, it serves as an ideal model for investigating divergence of the Bombycoidea super family. However, studies on the karyotype evolution and functional genomics of A. pernyi are limited by scarce genomic resource. Here, we applied PacBio sequencing and chromosome structure capture technique to assemble the first high‐quality A. pernyi genome from a single male individual. The genome is 720.67 Mb long with 49 chromosomes and a 13.77‐Mb scaffold N50. Approximately 441.75 Mb, accounting for 60.74% of the genome, was identified as repeats. The genome comprises 21,431 protein‐coding genes, 85.22% of which were functionally annotated. Comparative genomics analysis suggested that A. pernyi diverged from its common ancestor with A. yamamai ~30.3 million years ago, and that chromosome fission contributed to the increased chromosome number. The genome assembled in this work will not only facilitate future research on A. pernyi and related species but also help to progress comparative genomics analyses in Lepidoptera.  相似文献   

11.
Triplophysa is an endemic fish genus of the Tibetan Plateau in China. Triplophysa tibetana, which lives at a recorded altitude of ~4,000 m and plays an important role in the highland aquatic ecosystem, serves as an excellent model for investigating high‐altitude environmental adaptation. However, evolutionary and conservation studies of T. tibetana have been limited by scarce genomic resources for the genus Triplophysa. In the present study, we applied PacBio sequencing and the Hi‐C technique to assemble the T. tibetana genome. A 652‐Mb genome with 1,325 contigs with an N50 length of 3.1 Mb was obtained. The 1,137 contigs were further assembled into 25 chromosomes, representing 98.7% and 80.47% of all contigs at the base and sequence number level, respectively. Approximately 260 Mb of sequence, accounting for ~39.8% of the genome, was identified as repetitive elements. DNA transposons (16.3%), long interspersed nuclear elements (12.4%) and long terminal repeats (11.0%) were the most repetitive types. In total, 24,372 protein‐coding genes were predicted in the genome, and ~95% of the genes were functionally annotated via a search in public databases. Using whole genome sequence information, we found that T. tibetana diverged from its common ancestor with Danio rerio ~121.4 million years ago. The high‐quality genome assembled in this work not only provides a valuable genomic resource for future population and conservation studies of T. tibetana, but it also lays a solid foundation for further investigation into the mechanisms of environmental adaptation of endemic fishes in the Tibetan Plateau.  相似文献   

12.
13.
Peach (Prunus persica L. Batsch) is an economically important fruit crop worldwide. Although a high-quality peach genome has previously been published, Sanger sequencing was used for its assembly, which generated short contigs. Here, we report a chromosome-level genome assembly and sequence analysis of Chinese Cling, an important founder cultivar for peach breeding programs worldwide. The assembled genome contained 247.33 Mb with a contig N50 of 4.13 Mb and a scaffold N50 of 29.68 Mb, representing 99.8% of the estimated genome. Comparisons between this genome and the recently published one (Lovell peach) uncovered 685 407 single nucleotide polymorphisms, 162 655 insertions and deletions, and 16 248 structural variants. Gene family analysis highlighted the contraction of the gene families involved in flavone, flavonol, flavonoid, and monoterpenoid biosynthesis. Subsequently, the volatile compounds of 256 peach varieties were quantitated in mature fruits in 2015 and 2016 to perform a genome-wide association analysis. A comparison with the identified domestication genomic regions allowed us to identify 25 quantitative trait loci, associated with seven volatile compounds, in the domestication region, which is consistent with the differences in volatile compounds between wild and cultivated peaches. Finally, a gene encoding terpene synthase, located within a previously reported quantitative trait loci region, was identified to be associated with linalool synthesis. Such findings highlight the importance of this new assembly for the analysis of evolutionary mechanisms and gene identification in peach species. Furthermore, this high-quality peach genome provides valuable information for future fruit improvement.  相似文献   

14.
Ramie, Boehmeria nivea (L.) Gaudich, family Urticaceae, is a plant native to eastern Asia, and one of the world's oldest fibre crops. It is also used as animal feed and for the phytoremediation of heavy metal‐contaminated farmlands. Thus, the genome sequence of ramie was determined to explore the molecular basis of its fibre quality, protein content and phytoremediation. For further understanding ramie genome, different paired‐end and mate‐pair libraries were combined to generate 134.31 Gb of raw DNA sequences using the Illumina whole‐genome shotgun sequencing approach. The highly heterozygous B. nivea genome was assembled using the Platanus Genome Assembler, which is an effective tool for the assembly of highly heterozygous genome sequences. The final length of the draft genome of this species was approximately 341.9 Mb (contig N50 = 22.62 kb, scaffold N50 = 1,126.36 kb). Based on ramie genome annotations, 30,237 protein‐coding genes were predicted, and the repetitive element content was 46.3%. The completeness of the final assembly was evaluated by benchmarking universal single‐copy orthologous genes (BUSCO); 90.5% of the 1,440 expected embryophytic genes were identified as complete, and 4.9% were identified as fragmented. Phylogenetic analysis based on single‐copy gene families and one‐to‐one orthologous genes placed ramie with mulberry and cannabis, within the clade of urticalean rosids. Genome information of ramie will be a valuable resource for the conservation of endangered Boehmeria species and for future studies on the biogeography and characteristic evolution of members of Urticaceae.  相似文献   

15.
The Tetraodontidae family are known to have relatively small and compact genomes compared to other vertebrates. The obscure puffer fish Takifugu obscurus is an anadromous species that migrates to freshwater from the sea for spawning. Thus the euryhaline characteristics of T. obscurus have been investigated to gain understanding of their survival ability, osmoregulation, and other homeostatic mechanisms in both freshwater and seawater. In this study, a high quality chromosome‐level reference genome for T. obscurus was constructed using long‐read Pacific Biosciences (PacBio) Sequel sequencing and a Hi‐C‐based chromatin contact map platform. The final genome assembly of T. obscurus is 381 Mb, with a contig N50 length of 3,296 kb and longest length of 10.7 Mb, from a total of 62 Gb of raw reads generated using single‐molecule real‐time sequencing technology from a PacBio Sequel platform. The PacBio data were further clustered into chromosome‐scale scaffolds using a Hi‐C approach, resulting in a 373 Mb genome assembly with a contig N50 length of 15.2 Mb and and longest length of 28 Mb. When we directly compared the 22 longest scaffolds of T. obscurus to the 22 chromosomes of the tiger puffer Takifugu rubripes, a clear one‐to‐one orthologous relationship was observed between the two species, supporting the chromosome‐level assembly of T. obscurus. This genome assembly can serve as a valuable genetic resource for exploring fugu‐specific compact genome characteristics, and will provide essential genomic information for understanding molecular adaptations to salinity fluctuations and the evolution of osmoregulatory mechanisms.  相似文献   

16.
Cicer arietinum L. (chickpea) is the third most important food legume crop. We have generated the draft sequence of a desi‐type chickpea genome using next‐generation sequencing platforms, bacterial artificial chromosome end sequences and a genetic map. The 520‐Mb assembly covers 70% of the predicted 740‐Mb genome length, and more than 80% of the gene space. Genome analysis predicts the presence of 27 571 genes and 210 Mb as repeat elements. The gene expression analysis performed using 274 million RNA‐Seq reads identified several tissue‐specific and stress‐responsive genes. Although segmental duplicated blocks are observed, the chickpea genome does not exhibit any indication of recent whole‐genome duplication. Nucleotide diversity analysis provides an assessment of a narrow genetic base within the chickpea cultivars. We have developed a resource for genetic markers by comparing the genome sequences of one wild and three cultivated chickpea genotypes. The draft genome sequence is expected to facilitate genetic enhancement and breeding to develop improved chickpea varieties.  相似文献   

17.
The family Euphorbiaceae includes some of the most efficient biomass accumulators. Whole genome sequencing and the development of genetic maps of these species are important components in molecular breeding and genetic improvement. Here we report the draft genome of physic nut (Jatropha curcas L.), a biodiesel plant. The assembled genome has a total length of 320.5 Mbp and contains 27 172 putative protein‐coding genes. We established a linkage map containing 1208 markers and anchored the genome assembly (81.7%) to this map to produce 11 pseudochromosomes. After gene family clustering, 15 268 families were identified, of which 13 887 existed in the castor bean genome. Analysis of the genome highlighted specific expansion and contraction of a number of gene families during the evolution of this species, including the ribosome‐inactivating proteins and oil biosynthesis pathway enzymes. The genomic sequence and linkage map provide a valuable resource not only for fundamental and applied research on physic nut but also for evolutionary and comparative genomics analysis, particularly in the Euphorbiaceae.  相似文献   

18.
Taro (Colocasia esculenta (L.), Schott), from the Araceae family, is one of the oldest crops with important edible, medicinal, nutritional and economic value. Taro is a highly polymorphic species including diverse genotypes adapted to a broad range of environments, but the taro genome has rarely been investigated. Here, a high‐quality chromosome‐level genome of C. esculenta was assembled using data sequenced by Illumina, PacBio and Nanopore platforms. The assembled genome size was 2,405 Mb with a contig N50 of 400.0 kb and a scaffold N50 of 159.4 Mb. In total, 2,311 Mb (96.09%) of the contig sequences was anchored onto 14 chromosomes to form pseudomolecules, and 2,126 Mb (88.43%) was annotated as repetitive sequences. Of the 28,695 predicted protein‐coding genes, 26,215 genes (91.4%) could be functionally annotated. On the basis of phylogenetic analysis using 769 genes, C. esculenta and Spirodela polyrhiza were placed on one branch of the tree that diverged approximately 73.23 million years ago. The synteny analyses showed that there have been two whole‐genome duplication events in C. esculenta separated by a relatively short gap. According to comparative genome analysis, a larger number (1,189) of distinct gene families and long terminal repeats were enriched in C. esculenta. Our high‐quality taro genome will provide valuable resources for further genetic, ecological and evolutionary analyses of taro or other species in the Araceae.  相似文献   

19.
Chimonanthus salicifolius, a member of the Calycanthaceae of magnoliids, is one of the most famous medicinal plants in Eastern China. Here, we report a chromosome‐level genome assembly of Csalicifolius, comprising 820.1 Mb of genomic sequence with a contig N50 of 2.3 Mb and containing 36 651 annotated protein‐coding genes. Phylogenetic analyses revealed that magnoliids were sister to the eudicots. Two rounds of ancient whole‐genome duplication were inferred in the Csalicifolious genome. One is shared by Calycanthaceae after its divergence with Lauraceae, and the other is in the ancestry of Magnoliales and Laurales. Notably, long genes with > 20 kb in length were much more prevalent in the magnoliid genomes compared with other angiosperms, which could be caused by the length expansion of introns inserted by transposon elements. Homologous genes within the flavonoid pathway for Csalicifolius were identified, and correlation of the gene expression and the contents of flavonoid metabolites revealed potential critical genes involved in flavonoids biosynthesis. This study not only provides an additional whole‐genome sequence from the magnoliids, but also opens the door to functional genomic research and molecular breeding of Csalicifolius.  相似文献   

20.
Rubus chingii Hu (Fu-Pen-Zi), a perennial woody plant in the Rosaceae family, is a characteristic traditional Chinese medicinal plant because of its unique pharmacological effects. There are abundant hydrolyzable tannin (HT) components in R. chingii that provide health benefits. Here, an R. chingii chromosome-scale genome and related functional analysis provide insights into the biosynthetic pathway of HTs. In total, sequence data of 231.21 Mb (155 scaffolds with an N50 of 8.2 Mb) were assembled into seven chromosomes with an average length of 31.4 Mb, and 33 130 protein-coding genes were predicted, 89.28% of which were functionally annotated. Evolutionary analysis showed that R. chingii was most closely related to Rubus occidentalis, from which it was predicted to have diverged 22.46 million years ago (Table S8). Comparative genomic analysis showed that there was a tandem gene cluster of UGT, carboxylesterase (CXE) and SCPL genes on chromosome 02 of R. chingii, including 11 CXE, eight UGT, and six SCPL genes, which may be critical for the synthesis of HTs. In vitro enzyme assays indicated that the proteins encoded by the CXE (LG02.4273) and UGT (LG02.4102) genes have tannin hydrolase and gallic acid glycosyltransferase functions, respectively. The genomic sequence of R. chingii will be a valuable resource for comparative genomic analysis within the Rosaceae family and will be useful for understanding the biosynthesis of HTs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号