首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Aldehyde dehydrogenases (ALDHs) are members of the NAD(P)+-dependent protein superfamily which catalyzes aliphatic and aromatic aldehyde oxidation to non-toxic carboxylic acids. ALDH genes may offer promise for improving plant adaptation to environmental stress. Recently, elucidated genome sequences of Gossypium raimondii provide a foundation for systematic identification and analysis of ALDH genes. To date, this has been accomplished for many plant species except G. raimondii.

Results

In this study, thirty unique ALDH sequences that code for 10 ALDH families were identified in the G. raimondii genome. Phylogenetic analysis revealed that ALDHs were split into six clades in G. raimondii, and ALDH proteins from the same families were clustered together. Phylogenetic relationships of ALDHs from 11 plant species suggest that ALDHs in G. raimondii shared the highest protein homology with ALDHs from poplar. Members within ALDH families possessed homologous exon–intron structures. Chromosomal distribution of ALDH did not occur evenly in the G. raimondii genome and many ALDH genes were involved in the syntenic region as documented by identification of physical locations among single chromosomes. In addition, syntenic analysis revealed that homologues of many G. raimondii ALDHs appeared in corresponding Arabidopsis and poplar syntenic blocks, indicating that these genes arose prior to G. raimondii, Arabidopsis and poplar speciation. Finally, based on gene expression analysis of microarray and RNA-seq, we can speculate that some G. raimondii ALDH genes might respond to drought or waterlogging stresses.

Conclusion

Genome-wide identification and analysis of the evolution and expression of ALDH genes in G. raimondii laid a foundation for studying this gene superfamily and offers new insights into the evolution history and speculated roles in Gossypium. These data can be used to inform functional genomic studies and molecular breeding in cotton.  相似文献   

2.
3.
《Genomics》2021,113(5):3274-3284
Red swamp crayfish Procambarus clarkii is an ecologically and economically important crustacean species. Here, based on a de novo assembly strategy combining PacBio with Hi-C sequencing, we presented a high quality chromosome-level P. clarkii genome. The assembled genome is 2.75 Gb in size with a contig N50 of 216.75 kb. Transposable elements (TEs) make up the largest fraction of the genome (~79.61%), and LINEs comprise the majority of the TEs. Frequent molting and rapid growth of the red swamp crayfish may be explained by the expansion of multiple gene families regarding growth or development. Phylogenetic analysis revealed that P. clarkii diverged from Portunus trituberculatus at 278–407 million years ago (Mya). PSMC analysis identified multiple bottleneck events of the P. clarkii population between 2 kaBP to 14 kaBP. The obtained P. clarkii genome should not only facilitate us understanding the development and evolution of the crayfish species, but also contribute to the genetic improvement in future breeding selections.  相似文献   

4.
Gene family size variation is an important mechanism that shapes the natural variation for adaptation in various species. Despite its importance, the pattern of gene family size variation in green plants is still not well understood. In particular, the evolutionary pattern of genes and gene families remains unknown in the model plant Arabidopsis thaliana in the context of green plants. In this study, eight representative genomes of green plants are sampled to study gene family evolution and characterize the origination of A. thaliana genes, respectively. Four important insights gained are that: (i) the rate of gene gains and losses is about 0.001359 per gene every million years, similar to the rate in yeast, Drosophila, and mammals; (ii) some gene families evolved rapidly with extreme expansions or contractions, and 2745 gene families present in all the eight species represent the ‘core’ proteome of green plants; (iii) 70% of A. thaliana genes could be traced back to 450 million years ago; and (iv) intriguingly, A. thaliana genes with early origination are under stronger purifying selection and more conserved. In summary, the present study provides genome‐wide insights into evolutionary history and mechanisms of genes and gene families in green plants and especially in A. thaliana.  相似文献   

5.
Oryza (23 species; 10 genome types) contains the world's most important food crop — rice. Although the rice genome serves as an essential tool for biological research, little is known about the evolution of the other Oryza genome types. They contain a historical record of genomic changes that led to diversification of this genus around the world as well as an untapped reservoir of agriculturally important traits. To investigate the evolution of the collective Oryza genome, we sequenced and compared nine orthologous genomic regions encompassing the Adh1-Adh2 genes (from six diploid genome types) with the rice reference sequence. Our analysis revealed the architectural complexities and dynamic evolution of this region that have occurred over the past ~15 million years. Of the 46 intact genes and four pseudogenes in the japonica genome, 38 (76%) fell into eight multigene families. Analysis of the evolutionary history of each family revealed independent and lineage-specific gain and loss of gene family members as frequent causes of synteny disruption. Transposable elements were shown to mediate massive replacement of intergenic space (>95%), gene disruption, and gene/gene fragment movement. Three cases of long-range structural variation (inversions/deletions) spanning several hundred kilobases were identified that contributed significantly to genome diversification.  相似文献   

6.
The Brassicaceae include several major crop plants and numerous important model species in comparative evolutionary research such as Arabidopsis, Brassica, Boechera, Thellungiella, and Arabis species. As any evolutionary hypothesis needs to be placed in a temporal context, reliably dated major splits within the evolution of Brassicaceae are essential. We present a comprehensive time-calibrated framework with important divergence time estimates based on whole-chloroplast sequence data for 29 Brassicaceae species. Diversification of the Brassicaceae crown group started at the Eocene-to-Oligocene transition. Subsequent major evolutionary splits are dated to ∼20 million years ago, coinciding with the Oligocene-to-Miocene transition, with increasing drought and aridity and transient glaciation events. The age of the Arabidopsis thaliana crown group is 6 million years ago, at the Miocene and Pliocene border. The overall species richness of the family is well explained by high levels of neopolyploidy (43% in total), but this trend is neither directly associated with an increase in genome size nor is there a general lineage-specific constraint. Our results highlight polyploidization as an important source for generating new evolutionary lineages adapted to changing environments. We conclude that species radiation, paralleled by high levels of neopolyploidization, follows genome size decrease, stabilization, and genetic diploidization.  相似文献   

7.
Cytochrome P450 monooxygenases (P450s) in the sacred lotus (Nelumbo nucifera) genome have been identified and named according to systematic P450 nomenclatures. Comparisons of these sequences with those in the papaya and grape CYPomes have indicated that gene blooms exist in the CYP89, CYP94, CYP96 and CYP714 families and that less dramatic expansions exist in the CYP71 and CYP72 families. Expansions in the CYP94 and CYP96 families may be associated with generation of the extremely hydrophobic leaf surfaces associated with the “lotus effect” in this water-adapted species, since these families are known to hydroxylate fatty acids and alkanes in the wax biosynthetic pathways of other plant species. Evolution of the CYP719 and CYP80 families may be associated with production of a number of benzylisoquinoline and aporphine alkaloids. Structures for anonaine and roemerine, two of the most abundant aporphine alkaloids in lotus leaves and seeds, contain methylenedioxy bridges that are known to be generated by members of the CYP719 family. With only one CYP719A22 gene existing in the lotus genome, it is likely that it is involved in making aporphine alkaloids. The fact that CYP719 has not previously been seen in angiosperm phylogeny below the order of Ranunculales suggests that its presence in lotus (in the Proteales) presents an evolutionary terminus prior to its loss in more recent eudicot species. With several CYP80 family genes existing in the lotus genome, there are multiple candidates for those involved in conducting benzylisoquinoline alkaloid synthesis.  相似文献   

8.
Changes in the physical interaction between cis-regulatory DNA sequences and proteins drive the evolution of gene expression. However, it has proven difficult to accurately quantify evolutionary rates of such binding change or to estimate the relative effects of selection and drift in shaping the binding evolution. Here we examine the genome-wide binding of CTCF in four species of Drosophila separated by between ∼2.5 and 25 million years. CTCF is a highly conserved protein known to be associated with insulator sequences in the genomes of human and Drosophila. Although the binding preference for CTCF is highly conserved, we find that CTCF binding itself is highly evolutionarily dynamic and has adaptively evolved. Between species, binding divergence increased linearly with evolutionary distance, and CTCF binding profiles are diverging rapidly at the rate of 2.22% per million years (Myr). At least 89 new CTCF binding sites have originated in the Drosophila melanogaster genome since the most recent common ancestor with Drosophila simulans. Comparing these data to genome sequence data from 37 different strains of Drosophila melanogaster, we detected signatures of selection in both newly gained and evolutionarily conserved binding sites. Newly evolved CTCF binding sites show a significantly stronger signature for positive selection than older sites. Comparative gene expression profiling revealed that expression divergence of genes adjacent to CTCF binding site is significantly associated with the gain and loss of CTCF binding. Further, the birth of new genes is associated with the birth of new CTCF binding sites. Our data indicate that binding of Drosophila CTCF protein has evolved under natural selection, and CTCF binding evolution has shaped both the evolution of gene expression and genome evolution during the birth of new genes.  相似文献   

9.
Plants depend on cytochrome P450 (CYP) enzymes for nearly every aspect of their biology. In several sequenced angiosperms, CYP genes constitute up to 1% of the protein coding genes. The angiosperm sequence diversity is encapsulated by 59 CYP families, of which 52 families form a widely distributed core set. In the 20 years since the first plant P450 was sequenced, 3,387 P450 sequences have been identified and annotated in plant databases. As no new angiosperm CYP families have been discovered since 2004, it is now apparent that the sampling of CYP diversity is beginning to plateau. This review presents a comparison of 1,415 cytochrome P450 sequences from the six sequenced genomes of Vitis vinifera (grape), Carica papaya (papaya), Populus trichocarpa (poplar), Oryza sativa (rice), Arabidopsis thaliana (Arabidopsis or mouse ear’s cress) and Physcomitrella patens (moss). An evolutionary analysis is presented that tracks land plant P450 innovation over time from the most ancient and conserved sequences to the newest dicot-specific families. The earliest or oldest P450 families are devoted to the essential biochemistries of sterol and carotenoid synthesis. The next evolutionary radiation of P450 families appears to mediate crucial adaptations to a land environment. And, the newest CYP families appear to have driven the diversity of angiosperms in mediating the synthesis of pigments, odorants, flavors and order-/genus-specific secondary metabolites. Family-by-family comparisons allow the visualization of plant genome plasticity by whole genome duplications and massive gene family expansions via tandem duplications. Molecular evidence of human domestication is quite apparent in the repeated P450 gene duplications occurring in the grape genome.  相似文献   

10.
Analysis of the patterns and levels of diversity in duplicate gene not only traces evolutionary history of polyploids, but also provides insight into how the evolutionary process differs between lineages and between homoeologous loci within lineages. Elymus sensu lato is a group of allopolyploid species, which share a common St genome and with the different combinations of H, Y, P, and W genomes. To estimate the evolutionary process of the rbcL gene in species of Elymus s. l. and its putative dioploid relatives, 74 sequences were obtained from 21 species of Elymus s. l. together with 24 diploid taxa representing 19 basic genomes in Triticeae. Phylogeny and sequence diversity pattern analysis suggested that (1) species of Pseudoroegneria (Nevski) Á. Löve might serve as the maternal donor of the species of Elymus s. l; (2) differentiation of St genome were shown in the species of Elymus s. l. following polyploidy event; (3) divergences within the species might associate with geographic diversity and morphological variability; (4) differences in the levels and patterns of nucleotide diversity of the rbcL gene implied that the St genome lineages in the species of Elymus s. l. have differently evolutionary potentials.  相似文献   

11.
《Genomics》2021,113(4):1705-1718
BackgroundUnderstanding sex determination (SD) across taxa is a major challenge for evolutionary biology. The new genomic tools are paving the way to identify genomic features underlying SD in fish, a group frequently showing limited sex chromosome differentiation and high SD evolutionary turnover. Turbot (Scophthalmus maximus) is a commercially important flatfish with an undifferentiated ZW/ZZ SD system and remarkable sexual dimorphism. Here we describe a new long-read turbot genome assembly used to disentangle the genetic architecture of turbot SD by combining genomics and classical genetics approaches.ResultsThe new turbot genome assembly consists of 145 contigs (N50 = 22.9 Mb), 27 of them representing >95% of its estimated genome size. A genome wide association study (GWAS) identified a ~ 6.8 Mb region on chromosome 12 associated with sex in 69.4% of the 36 families analyzed. The highest associated markers flanked sox2, the only gene in the region showing differential expression between sexes before gonad differentiation. A single SNP showed consistent differences between Z and W chromosomes. The analysis of a broad sample of families suggested the presence of additional genetic and/or environmental factors on turbot SD.ConclusionsThe new chromosome-level turbot genome assembly, one of the most contiguous fish assemblies to date, facilitated the identification of sox2 as a consistent candidate gene putatively driving SD in this species. This chromosome SD system barely showed any signs of differentiation, and other factors beyond the main QTL seem to control SD in a certain proportion of families.  相似文献   

12.
The availability of nearly complete moso bamboo genome sequences permits the detailed discovery and cross-species comparison of transposable elements (TEs) between Bambusoideae and other Poaceae species at the whole genome level. Long terminal repeat retroelements (LTR-retroelements) are the single largest components of most plant genomes and can substantially impact the genome in various ways. Through a combination of structure- and homology-based approaches, we initially investigated 982 LTR-retroelement families comprising 2,004,644 LTR-retroelement sequences, which accounted for more than 40% of the moso bamboo genome. Further analysis revealed that the ratio of solo LTRs to intact elements (S/I) in moso bamboo is significantly low (approximately 0.28:1), indicating that bamboo LTR-retroelements might have undergone relatively low frequencies of unequal recombination and illegitimate recombination. Phylogenetic analysis revealed four Ty1-copia and five Ty3-gypsy evolutionary lineages that were present before the divergence of eudicot and monocot species, but the scales and timeframes within which they proliferated significantly varied across families and lineages. Insertion time estimates showed that LTR-retroelements were amplified for approximately 0~3 million years and had longer periods of activity than those of rice and Arabidopsis. These findings suggest that the expansion of LTR-retroelements might be responsible for host large genome size during moso bamboo evolution.  相似文献   

13.
A genome triplication took place in the ancestor of Brassiceae species after the split of the Arabidopsis lineage. The postfragmentation and shuffling of the genome turned the ancestral hexaploid back to diploids and caused the radiation of Brassiceae species. The course of speciation was accompanied by the loss of duplicate genes and also influenced the evolution of retained genes. Of all the genes, those encoding NBS domains are typical R genes that confer resistance to invading pathogens. In this study, using the genome of Arabidopsis thaliana as a reference, we examined the loss/retention of orthologous NBS-encoding loci in the tripled Brassica rapa genome and discovered differential loss/retention frequencies. Further analysis indicated that loci of different retention ratios showed different evolutionary patterns. The loci of classesII and III (maintaining two and three syntenic loci, respectively, multi-loci) show sharper expansions by tandem duplications, have faster evolutionary rates and have more potential to be associated with novel gene functions. On the other hand, the loci that are retained at the minimal rate (keeping only one locus, class I, single locus) showed opposite patterns. Phylogenetic analysis indicated that recombination and translocation events were common among multi-loci in B. rapa, and differential evolutionary patterns between multi- and single-loci are likely the consequence of recombination. Investigations towards other gene families demonstrated different evolutionary characteristics between different gene families. The evolution of genes is more likely determined by the property of each gene family, and the whole genome triplication provided only a specific condition.  相似文献   

14.
15.
Plant evolution is characterized by frequent genome duplication events. Expansion of habitat resulted in the origin of many novel genes and genome duplication events which in turn resulted in the expansion of many regulatory gene families. The plant-specific FCS-Like Zinc finger (FLZ) gene family is characterized by the presence of a FCS-Like Zinc finger (FLZ) domain which mediates the protein-protein interaction. In this study, we identified that the expansion of FLZ gene family size in different species is correlated with ancestral and lineage-specific whole genome duplication events. The subsequent gene loss found to have a greater role in determining the size of this gene family in many species. However, genomic block duplications played the significant role in the expansion of FLZ gene family in some species. Comparison of Arabidopsis thaliana and Oryza sativa FLZ gene family revealed monocot and dicot specific evolutionary trends. The FLZ genes were found to be under high purifying selection. The spatiotemporal expression analyses of Arabidopsis thaliana FLZ gene family revealed that majority of the members are highly expressed in reproductive organs. FLZ genes were also found to be highly expressed during vegetative-to-reproductive phase transition which is correlated with the proposed role of this gene family in sugar signaling. The comparison of sequence, structural and expression features of duplicated genes identified lineage-specific redundancy and divergence. This extensive evolutionary analysis and expression analysis of Arabidopsis thaliana FLZ genes will pave the way for further functional analysis of FLZ genes.  相似文献   

16.
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.  相似文献   

17.
18.
Nocardia is a complex and diverse genus of aerobic actinomycetes that cause complex clinical presentations, which are difficult to diagnose due to being misunderstood. To date, the genetic diversity, evolution, and taxonomic structure of the genus Nocardia are still unclear. In this study, we investigated the pan-genome of 86 Nocardia type strains to clarify their genetic diversity. Our study revealed an open pan-genome for Nocardia containing 265,836 gene families, with about 99.7% of the pan-genome being variable. Horizontal gene transfer appears to have been an important evolutionary driver of genetic diversity shaping the Nocardia genome and may have caused historical taxonomic confusion from other taxa (primarily Rhodococcus, Skermania, Aldersonia, and Mycobacterium). Based on single-copy gene families, we established a high-accuracy phylogenomic approach for Nocardia using 229 genome sequences. Furthermore, we found 28 potentially new species and reclassified 16 strains. Finally, by comparing the topology between a phylogenomic tree and 384 phylogenetic trees (from 384 single-copy genes from the core genome), we identified a novel locus for inferring the phylogeny of this genus. The dapb1 gene, which encodes dipeptidyl aminopeptidase BI, was far superior to commonly used markers for Nocardia and yielded a topology almost identical to that of genome-based phylogeny. In conclusion, the present study provides insights into the genetic diversity, contributes a robust framework for the taxonomic classification, and elucidates the evolutionary relationships of Nocardia. This framework should facilitate the development of rapid tests for the species identification of highly variable species and has given new insight into the behavior of this genus.  相似文献   

19.
In this study, we characterized the miR482 family in cotton using existing small RNA datasets and the recently released draft genome sequence of Gossypium raimondii, a diploid cotton species whose progenitor is the putative contributor of the Dt (representing the D genome of tetraploid) genome of the cultivated tetraploid cotton species G. hirsutum and G. barbadense. Of the three ghr-miR482 members reported in G. hirsutum, ghr-miR482a has no homolog in G. raimondii, ghr-miR482b and ghr-miR482c each has a single homolog in G. raimondii. Gra-miR482d has five homologous loci (gra-miR482d, f-i) in G. raimondii and also exists in G. hirsutum (ghr-miR482d). A variant, miR482.2 that is a homolog of miR2118 in other species, is produced from several GHR-MIR482 loci in G. hirsutum. Approximately 12% of the G. raimondii NBS-LRR genes were predicted targets of various members of the gra-miR482 family. Based on the rationale that the regulatory relationship between miR482 and NBS-LRR genes will be conserved in G. raimondii and G. hirsutum, we investigated this relationship using G. hirsutum miR482 and G. raimondii NBS-LRR genes, which are not currently available in G. hirsutum. Ghr-miR482/miR482.2-mediated cleavage was confirmed for three of the four NBS-LRR genes analysed. As in tomato, miR482-mediated cleavage of NBS-LRR genes triggered production of phased secondary small RNAs in cotton. In seedlings of the susceptible cultivar Sicot71 (G. hirsutum) infected with the fungal pathogen Verticillium dahliae, the expression levels of ghr-miR482b/miR482b.2, ghr-miR482c and ghr-miR482d.2 were down-regulated, and several NBS-LRR targets of ghr-miR482c and ghr-miR482d were up-regulated. These results imply that, like tomato plants infected with viruses or bacteria, cotton plants are able to induce expression of NBS-LRR defence genes by suppression of the miRNA-mediated gene silencing pathway upon fungal pathogen attack.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号