Similar literature
 Found 20 similar articles; search took 31 ms
1.
Here we use whole-genome de novo assembly of second-generation sequencing reads to map structural variation (SV) in an Asian genome and an African genome. Our approach identifies small- and intermediate-size homozygous variants (1-50 kb) including insertions, deletions, inversions and their precise breakpoints, and in contrast to other methods, can resolve complex rearrangements. In total, we identified 277,243 SVs ranging in length from 1-23 kb. Validation using computational and experimental methods suggests that we achieve overall <6% false-positive rate and <10% false-negative rate in genomic regions that can be assembled, which outperforms other methods. Analysis of the SVs in the genomes of 106 individuals sequenced as part of the 1000 Genomes Project suggests that SVs account for a greater fraction of the diversity between individuals than do single-nucleotide polymorphisms (SNPs). These findings demonstrate that whole-genome de novo assembly is a feasible approach to deriving more comprehensive maps of genetic variation.

2.
Riemerella anatipestifer (RA) is a gram-negative bacterium that has a high potential to infect waterfowl. Although more and more RA genomes have been generated, comparative genomic analysis of RA still remains at the level of individual species. In this study, we analysed the pan-genome of 27 virulent RA isolates to reveal the intraspecies genomic diversity from various aspects. Multi-locus sequence typing (MLST) analysis suggests that the geographic origin of R. anatipestifer is Guangdong province, China. Pan-genome analysis revealed an open pan-genome of 2967 genes across all 27 isolates. Among 555 unique genes, we identified 387 that originated by horizontal gene transfer (HGT). Further analysis predicted that 204 strain-specific HGT genes encode virulence proteins. Screening the 1113 RA core genes with a subtractive genomics approach predicted 70 putative vaccine targets among 125 non-cytoplasmic proteins. Further analysis of these proteins, which have no homologs in A. platyrhynchos, predicted 56 essential proteins as drug targets; these have more interaction partners and are involved in metabolic pathways unique to RA. In conclusion, the present study indicates the essence and the diversity of RA and provides useful information for identifying vaccine and drug candidates in the future.

3.
Temperate japonica/geng (GJ) rice yield has significantly improved due to intensive breeding efforts, dramatically enhancing global food security. However, little is known about the underlying genomic structural variations (SVs) responsible for this improvement. In the present study, we compared 58 long-read assemblies comprising cultivated and wild rice species, revealing 156 319 SVs. Phylogenomic analysis based on the SV dataset detected the putatively selected regions of GJ sub-populations. A significant portion of the detected SVs overlapping genic regions were found to influence the expression of the genes involved in GJ assemblies. Integrating the SVs and causal genetic variants underlying agronomic traits into the analysis enables the precise identification of breeding signatures resulting from complex breeding histories aimed at stress tolerance, yield potential and quality improvement. Further, the results provided genomic and genetic evidence that the SV in the promoter of LTG1 accounts for chilling sensitivity, and that increased copy numbers of GNP1 were associated with positive effects on grain number. In summary, the current study provides genomic resources for retracing how SVs shaped agronomic traits during previous breeding, which will assist future genetic, genomic and breeding research on rice.

4.
《Genomics》2021,113(5):3083-3091
Revealing genomic variation of representative and diverse germplasm is the cornerstone of deploying genomics information into genetic improvement programs of species of agricultural importance. Here we report the re-sequencing of 239 japonica rice elites representing the genetic diversity of japonica germplasm in China, Japan and Korea. A total of 4.8 million SNPs and PAV of 35,634 genes were identified. The elites from Japan and Korea are closely related and relatively less diverse than those from China. A japonica rice pan-genome was constructed, and 35 Mb non-redundant novel sequences were identified, from which 1131 novel genes were predicted. Strong selection signals of genomic regions were detected on most of the chromosomes. The heading date genes Hd1 and Hd3a have been artificially selected during the breeding process. The results from this study lay the foundation for future whole genome sequences-enabled breeding in rice and provide a paradigm for other species.

5.
SARS-CoV-2 belongs to the coronavirus family. Comparing the genomic features of viral genomes across the coronavirus family can improve our understanding of SARS-CoV-2. Here we present the first pan-genome analysis of 3,932 whole genomes of 101 species out of 4 genera from the coronavirus family. We found a total of 181 genes in the pan-genome of the coronavirus family, among which only 3 genes, the S gene, M gene and N gene, are highly conserved. We also constructed a pan-genome from 23,539 whole genomes of SARS-CoV-2. There are 13 genes in total in the SARS-CoV-2 pan-genome, and all 13 are core genes for SARS-CoV-2. The pan-genome of coronaviruses shows a lower level of diversity than the pan-genomes of other RNA viruses, which contain no core gene. The three highly conserved genes in the coronavirus family, which are also core genes in the SARS-CoV-2 pan-genome, could be potential targets in developing nucleic acid diagnostic reagents with a decreased possibility of cross-reaction with other coronavirus species.

6.
《Trends in plant science》2023,28(8):857-860
A better understanding of crop genomes reveals that structural variations (SVs) are crucial for genetic improvement. A graph-based pan-genome by Yan et al. uncovered 424 085 genomic SVs and provided novel insights into heat tolerance of pearl millet. We discuss how these SVs can fast-track pearl millet breeding under harsh environments.

7.
Li S  Wang S  Deng Q  Zheng A  Zhu J  Liu H  Wang L  Gao F  Zou T  Huang B  Cao X  Xu L  Yu C  Ai P  Li P 《PloS one》2012,7(2):e30952
Rice restorer lines play an important role in three-line hybrid rice production. Previous research based on molecular tagging has suggested that the restorer lines used widely today have narrow genetic backgrounds. However, patterns of genetic variation at a genome-wide scale in these restorer lines remain largely unknown. The present study performed re-sequencing and genome-wide variation analysis of three important representative restorer lines, namely, IR24, MH63, and SH527, using the Solexa sequencing technology. With the genomic sequence of the Indica cultivar 9311 as the reference, the following genetic features were identified: 267,383 single-nucleotide polymorphisms (SNPs), 52,847 insertion/deletion polymorphisms (InDels), and 3,286 structural variations (SVs) in the genome of IR24; 288,764 SNPs, 59,658 InDels, and 3,226 SVs in MH63; and 259,862 SNPs, 55,500 InDels, and 3,127 SVs in SH527. Variations between samples were also determined by comparative analysis of authentic collections of SNPs, InDels, and SVs, and were functionally annotated. Furthermore, variations in several important genes were also surveyed by alignment analysis in these lines. Our results suggest that genetic variations among these lines, although far lower than those reported in the landrace population, are greater than expected, indicating a complicated genetic basis for the phenotypic diversity of the restorer lines. Identification of genome-wide variation and pattern analysis among the restorer lines will facilitate future genetic studies and the molecular improvement of hybrid rice.

8.
Here we present the genomic sequence of the African cultivated rice, Oryza glaberrima, and compare these data with the genome sequence of Asian cultivated rice, Oryza sativa. We obtained gene‐enriched sequences of O. glaberrima that correspond to about 25% of the gene regions of the O. sativa (japonica) genome by methylation filtration and subtractive hybridization of repetitive sequences. While patterns of amino acid changes did not differ between the two species in terms of the biochemical properties, genes of O. glaberrima generally showed a larger synonymous–nonsynonymous substitution ratio, suggesting that O. glaberrima has undergone a genome‐wide relaxation of purifying selection. We further investigated nucleotide substitutions around splice sites and found that eight genes of O. sativa experienced changes at splice sites after the divergence from O. glaberrima. These changes produced novel introns that partially truncated functional domains, suggesting that these newly emerged introns affect gene function. We also identified 2451 simple sequence repeats (SSRs) from the genomes of O. glaberrima and O. sativa. Although tri‐nucleotide repeats were most common among the SSRs and were overrepresented in the protein‐coding sequences, we found that selection against indels of tri‐nucleotide repeats was relatively weak in both African and Asian rice. Our genome‐wide sequencing of O. glaberrima and in‐depth analyses provide rice researchers not only with useful genomic resources for future breeding but also with new insights into the genomic evolution of the African and Asian rice species.

9.
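Ou Shujun  Wang Hongru  Chu Chengcai  Zhang Shuai 《遗传》 (Hereditas (Beijing)) 2012,34(11):1389-1389
Crop domestication is the process by which humans selected favorable traits from the wild plants they had begun to cultivate and store, so that plant morphology evolved to suit agricultural production. As a result, most seed crops show similar changes after domestication in seed shattering, seed dormancy, and plant architecture. Rice is a good model organism for studying the domestication of cereal crops. The genus Oryza includes two cultivated species, Asian cultivated rice (Oryza sativa L.) and African cultivated rice (O. glaberrima Steud.). Asian cultivated rice is grown worldwide and comprises two major subspecies, japonica (O. sativa L. ssp. japonica) and indica (O. sativa L. ssp. indica). The many close relatives within Oryza and their wide geographic distribution are a great help in determining where modern cultivated rice was domesticated. In addition, the small rice genome, its high-quality fine map, and progress in functional gene research provide a foundation for in-depth study of the rice domestication process. See pages XX-XX of this issue for "Research progress on the major domestication traits of Asian cultivated rice" by Ou Shujun, Wang Hongru, and Chu Chengcai, a fairly comprehensive review of research on the key domestication traits of rice. The center of the cover image shows a phylogenetic tree of 23 accessions of AA-genome Asian cultivated rice and its close wild relatives, built from SNP sites in the 50 kb upstream and 50 kb downstream of Bh4, a hull-color gene that was under selection during rice domestication. Around the tree, clockwise from lower left to lower right, are the trends in hull color, grain shape, and panicle type during rice domestication. Ou Shujun, Wang Hongru, Chu Chengcai (illustration: Ou Shujun)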

10.
Genomic structural variations (SVs) are pervasive in many types of cancers. Characterizing their underlying mechanisms and potential molecular consequences is crucial for understanding the basic biology of tumorigenesis. Here, we engineered a local assembly-based algorithm (laSV) that detects SVs with high accuracy from paired-end high-throughput genomic sequencing data and pinpoints their breakpoints at single base-pair resolution. By applying laSV to 97 tumor-normal paired genomic sequencing datasets across six cancer types produced by The Cancer Genome Atlas Research Network, we discovered that non-allelic homologous recombination is the primary mechanism for generating somatic SVs in acute myeloid leukemia. This finding contrasts with results for the other five types of solid tumors, in which non-homologous end joining and microhomology end joining are the predominant mechanisms. We also found that the genes recurrently mutated by single nucleotide alterations differed from the genes recurrently mutated by SVs, suggesting that these two types of genetic alterations play different roles during cancer progression. We further characterized how the gene structures of the oncogene JAK1 and the tumor suppressors KDM6A and RB1 are affected by somatic SVs and discussed the potential functional implications of intergenic SVs.

11.
Domestication and breeding have reshaped the genomic architecture of chicken, but the retention and loss of genomic elements during these evolutionary processes remain unclear. We present the first chicken pan-genome, constructed using 664 individuals, which identified approximately 66.5 Mb of additional sequence that is absent from the reference genome (GRCg6a). The constructed pan-genome encoded 20,491 predicted protein-coding genes, with higher expression levels observed in conserved genes than in dispensable genes. Presence/absence variation (PAV) analyses demonstrated that gene PAV in chicken was shaped by selection, genetic drift, and hybridization. PAV-based genome-wide association studies identified numerous candidate mutations related to growth, carcass composition, meat quality, or physiological traits. Among them, a deletion in the promoter region of IGF2BP1 affecting chicken body size is reported, which is supported by functional studies and extra samples. This is the first report of the causal variant for the repeatedly reported chicken body-size quantitative trait locus on chromosome 27. The chicken pan-genome is therefore a useful resource for biological discovery and breeding. It improves our understanding of chicken genome diversity and provides materials to unveil the evolutionary history of chicken domestication.

12.
Structural variants (SVs) are a largely unstudied feature of plant genome evolution, despite the fact that SVs contribute substantially to phenotypes. In this study, we discovered SVs across a population sample of 347 high-coverage, resequenced genomes of Asian rice (Oryza sativa) and its wild ancestor (O. rufipogon). In addition to this short-read data set, we also inferred SVs from whole-genome assemblies and long-read data. Comparisons among data sets revealed different features of genome variability. For example, genome alignment identified a large (∼4.3 Mb) inversion in indica rice varieties relative to japonica varieties, and long-read analyses suggest that ∼9% of genes from the outgroup (O. longistaminata) are hemizygous. We focused, however, on the resequencing sample to investigate the population genomics of SVs. Clustering analyses with SVs recapitulated the rice cultivar groups that were also inferred from SNPs. However, the site-frequency spectrum of each SV type (which included inversions, duplications, deletions, translocations, and mobile element insertions) was skewed toward lower frequency variants than synonymous SNPs, suggesting that SVs may be predominantly deleterious. Among transposable elements, SINE and mariner insertions were found at especially low frequency. We also used SVs to study domestication by contrasting between rice and O. rufipogon. Cultivated genomes contained ∼25% more derived SVs and mobile element insertions than O. rufipogon, indicating that SVs contribute to the cost of domestication in rice. Peaks of SV divergence were enriched for known domestication genes, but we also detected hundreds of genes gained and lost during domestication, some of which were enriched for traits of agronomic interest.

13.
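With the rapid development of sequencing technology and bioinformatics, the reference genomes of hundreds of plant species have been sequenced, greatly advancing plant functional genomics, evolutionary genetics, and molecular breeding. However, as research has deepened, growing evidence shows that a reference genome from a single individual falls far short of representing the genetic diversity of an entire species. This gave rise to the concept of the pan-genome, which has been applied successfully to more than 20 plant species, revealing abundant genetic variation, uncovering large numbers of novel genes, and deepening our understanding of the genetic diversity of those species. This article briefly describes the concept of the pan-genome, the methods used to construct it, and its current applications in plant research, and closes with an outlook on its future development.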
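
The pan-genome concept described in this entry rests on separating genes shared by every genome from genes missing in at least one. A minimal sketch of that split is given here, assuming gene presence has already been called per genome; the function name `classify_pan_genome`, the accession labels, and the gene IDs are all hypothetical and are not taken from any of the listed studies.

```python
from typing import Dict, Set, Tuple


def classify_pan_genome(
    presence: Dict[str, Set[str]]
) -> Tuple[Set[str], Set[str], Set[str]]:
    """Return (pan, core, dispensable) gene sets.

    presence maps a genome label to the set of gene IDs called present in it.
    pan         = genes present in at least one genome
    core        = genes present in every genome
    dispensable = genes in the pan-genome that are absent from at least one genome
    """
    if not presence:
        return set(), set(), set()
    gene_sets = list(presence.values())
    pan = set().union(*gene_sets)
    core = set.intersection(*gene_sets)
    dispensable = pan - core
    return pan, core, dispensable


# Hypothetical toy input: three genomes with made-up gene IDs.
genomes = {
    "accession_A": {"g1", "g2", "g3"},
    "accession_B": {"g1", "g2", "g4"},
    "accession_C": {"g1", "g3", "g5"},
}

pan, core, dispensable = classify_pan_genome(genomes)
print("pan-genome size:", len(pan))
print("core genes:", sorted(core))
print("dispensable genes:", sorted(dispensable))
```

Real analyses usually relax the "every genome" rule (for example, treating genes present in nearly all genomes as core) to tolerate assembly and annotation gaps; the strict intersection is used here only to keep the sketch short.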

14.
Heterozygosity is an important feature of many plant genomes, and is related to heterosis. Sweet orange, a highly heterozygous species, is thought to have originated from an inter‐species hybrid between pummelo and mandarin. To investigate the heterozygosity of the sweet orange genome and examine how this heterozygosity affects gene expression, we characterized the genome of Valencia orange for single nucleotide variations (SNVs), small insertions and deletions (InDels) and structural variations (SVs), and determined their functional effects on protein‐coding genes and non‐coding sequences. Almost half of the genes containing large‐effect SNVs and InDels were expressed in a tissue‐specific manner. We identified 3542 large SVs (>50 bp), including deletions, insertions and inversions. Most of the 296 genes located in large‐deletion regions showed low expression levels. RNA‐Seq reads and DNA sequencing reads revealed that the alleles of 1062 genes were differentially expressed. In addition, we detected approximately 42 Mb of contigs that were not found in the reference genome of a haploid sweet orange by de novo assembly of unmapped reads, and annotated 134 protein‐coding genes within these contigs. We discuss how this heterozygosity affects the quality of genome assembly. This study advances our understanding of the genome architecture of sweet orange, and provides a global view of gene expression at heterozygous loci.

15.
Monocotyledons and dicotyledons are distinct, not only in their body plans and developmental patterns, but also in the structural features of their cell walls. The recent completion of the rice (Oryza sativa) genomic sequence and publication of the sequence data, together with the completed database of the Arabidopsis thaliana genome, provide the first opportunity to compare the full complement of cell-wall-related genes from the two distinct classes of flowering plants. We made this comparison by exploiting the fact that Arabidopsis and rice have type I and type II walls, respectively, and therefore represent the two extremes in terms of the structural features of plant cell walls. In this review article, we classify all cell-wall-related genes into 32 gene families, and generate their phylogenetic trees. Using these data, we can phylogenetically compare individual genes of particular interest between Arabidopsis and rice. This comparative genome approach shows that the differences in wall architecture in the two plant groups actually mirror the diversity of the individual gene families involved in the cell-wall dynamics of the respective plant species. This study also identifies putative rice orthologs of genes with well-defined functions in Arabidopsis and other plant species.

16.
A cytochrome c gene, OsCc-1, from rice (Oryza sativa) has been isolated and analyzed. The OsCc-1 gene encodes a cytochrome c protein that is typical of higher-plant cytochrome c proteins. OsCc-1 consists of three exons separated by two introns that are 817 and 747 bp in length, respectively. From genomic DNA hybridization analysis, OsCc-1 appears to be one of possibly two cytochrome c genes in several Asian, American, and Indian rice species and varieties surveyed. A single, unique cytochrome c gene appears to be present in one African cultivated rice species. We performed comparative molecular evolutionary analyses of OsCc-1 and other cytochrome c genes. We calculated a unit evolutionary period of 19.4 Myr for cytochrome c DNA sequences, which agrees closely with previous estimates based on protein sequence comparisons.

17.
Oryza sativa or Asian cultivated rice is one of the major cereal grass species domesticated for human food use during the Neolithic. Domestication of this species from the wild grass Oryza rufipogon was accompanied by changes in several traits, including seed shattering, percent seed set, tillering, grain weight, and flowering time. Quantitative trait locus (QTL) mapping has identified three genomic regions in chromosome 3 that appear to be associated with these traits. We would like to study whether these regions show signatures of selection and whether the same genetic basis underlies the domestication of different rice varieties. Fragments of 88 genes spanning these three genomic regions were sequenced from multiple accessions of two major varietal groups in O. sativa--indica and tropical japonica--as well as the ancestral wild rice species O. rufipogon. In tropical japonica, the levels of nucleotide variation in these three QTL regions are significantly lower compared to genome-wide levels, and coalescent simulations based on a complex demographic model of rice domestication indicate that these patterns are consistent with selection. In contrast, there is no significant reduction in nucleotide diversity in the homologous regions in indica rice. These results suggest that there are differences in the genetic and selective basis for domestication between these two Asian rice varietal groups.

18.
Thermococcales has a strong adaptability to extreme environments, which is of profound interest in explaining how complex life forms emerged on earth. However, their gene composition, thermal stability and evolution in hyperthermal environments remain poorly understood. Here, we characterized the pan-genome architecture of 30 Thermococcales species to gain insight into their genetic properties, evolutionary patterns and specific metabolisms adapted to niches. We revealed an open pan-genome of Thermococcales comprising 6070 gene families, a number that tends to increase as additional genomes become available. The genome contents of Thermococcales were flexible, with a series of genes that experienced gene duplication, progressive divergence, or gene gain and loss events exhibiting distinct functional features. These archaea had concise types of heat shock proteins, such as HSP20, HSP60 and prefoldin, which were constrained by strong purifying selection that governed their conservative evolution. Furthermore, purifying selection caused genes involved in enzymes, motility, secretion systems, defence systems and chaperones to differ in functional constraints, and their disparity in the rate of evolution may be related to adaptation to specific niches. These results deepen our understanding of the genetic diversity and adaptation patterns of Thermococcales, and provide valuable research models for studying the metabolic traits of early life forms.

19.
Disease susceptibility and resistance are important factors for the conservation of endangered species, including elephants. We analyzed pathology data from 26 zoos and report that Asian elephants have increased neoplasia and malignancy prevalence compared with African bush elephants. This is consistent with observed higher susceptibility to tuberculosis and elephant endotheliotropic herpesvirus (EEHV) in Asian elephants. To investigate genetic mechanisms underlying disease resistance, including differential responses between species, among other elephant traits, we sequenced multiple elephant genomes. We report a draft assembly for an Asian elephant, and defined 862 and 1,017 conserved potential regulatory elements in Asian and African bush elephants, respectively. In the genomes of both elephant species, conserved elements were significantly enriched with genes differentially expressed between the species. In Asian elephants, these putative regulatory regions were involved in immunity pathways including tumor-necrosis factor, which plays an important role in EEHV response. Genomic sequences of African bush, forest, and Asian elephant genomes revealed extensive sequence conservation at TP53 retrogene loci across three species, which may be related to TP53 functionality in elephant cancer resistance. Positive selection scans revealed outlier genes related to additional elephant traits. Our study suggests that gene regulation plays an important role in the differential inflammatory response of Asian and African elephants, leading to increased infectious disease and cancer susceptibility in Asian elephants. These genomic discoveries can inform future functional and translational studies aimed at identifying effective treatment approaches for ill elephants, which may improve conservation.

20.
The species in the family Planctomycetaceae are ideal groups for investigating the origin of eukaryotes. Their cells are divided by a lipidic intracytoplasmic membrane and they share a number of eukaryote-like molecular characteristics. However, their genomic structures, potential abilities, and evolutionary status are still unknown. In this study, we searched for common protein families and a core genome/pan-genome based on 11 sequenced species in the family Planctomycetaceae. We then constructed a phylogenetic tree based on their 832 common protein families. We also annotated the 11 genomes using the Clusters of Orthologous Groups database. Moreover, we predicted and reconstructed their core/pan metabolic pathways using the KEGG (Kyoto Encyclopedia of Genes and Genomes) orthology system. Subsequently, we identified genomic islands (GIs) and structural variations (SVs) among the five complete genomes, and we specifically investigated the integration of two Planctomycetaceae plasmids in all 11 genomes. The results indicate that Planctomycetaceae species share diverse genomic variations and unique genomic characteristics, and have great potential for human applications.
