Similar articles
20 similar articles found (search time: 31 ms)
1.
Large numbers of expressed sequence tags (ESTs) have now been generated from a variety of model organisms. In plants, substantial collections of ESTs are available for Arabidopsis and rice, in each case representing significant proportions of the estimated total numbers of genes. Large-scale comparisons of Arabidopsis and rice sequences are especially interesting because these two species represent the two subclasses of the flowering plants (Dicotyledonae and Monocotyledonae, respectively). Here we present the results of a systematic analysis of the Arabidopsis and rice EST sets. Non-redundant sets of sequences from Arabidopsis and rice were first derived separately and then combined so that gene families common to the two species could be identified. Our results show that 58% of non-singleton ESTs are derived from genes in gene families common to the two species. These gene families constitute the basis of a core set of higher plant genes.

2.
The generation of large numbers of partial cDNA sequences, or expressed sequence tags (ESTs), has provided a method with which to sample a large number of genes from an organism. More than 25,000 Arabidopsis thaliana ESTs have been deposited in public databases, producing the largest collection of ESTs for any plant species. We describe here the application of a method of reducing redundancy and increasing information content in this collection by grouping overlapping ESTs representing the same gene into a "contig" or assembly. The increased information content of these assemblies allows more putative identifications to be assigned based on the results of similarity searches with nucleotide and protein databases. The results of this analysis indicate that sequence information is available for approximately 12,600 nonoverlapping ESTs from Arabidopsis. Comparison of the assemblies with 953 Arabidopsis coding sequences indicates that up to 57% of all Arabidopsis genes are represented by an EST. Clustering analysis of these sequences suggests that between 300 and 700 gene families are represented by between 700 and 2000 sequences in the EST database. A database of the assembled sequences, their putative identifications, and cellular roles is available through the World Wide Web.
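The idea of grouping overlapping ESTs into a contig can be illustrated with a minimal Python sketch. It is not the assembly method used in the abstract above: the function names `overlap_length` and `merge_pair`, the exact-match requirement, and the `min_overlap` default are all assumptions made for illustration, and real assemblers tolerate sequencing errors and handle both strands.

```python
from typing import Optional


def overlap_length(left: str, right: str, min_overlap: int = 3) -> int:
    """Return the length of the longest suffix of `left` that exactly
    matches a prefix of `right`, or 0 if it is shorter than `min_overlap`."""
    max_len = min(len(left), len(right))
    # Try the longest possible overlap first and shrink until one matches.
    for size in range(max_len, min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return size
    return 0


def merge_pair(left: str, right: str, min_overlap: int = 3) -> Optional[str]:
    """Merge two overlapping reads into one contig, or return None
    when no sufficiently long exact overlap exists."""
    size = overlap_length(left, right, min_overlap)
    if size == 0:
        return None
    # Keep `left` whole and append only the non-overlapping tail of `right`.
    return left + right[size:]
```

With the toy reads `ATGGCGTAC` and `GTACCTTGA`, the shared `GTAC` is counted once, so the merged contig is `ATGGCGTACCTTGA`.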

3.
Identification and characterization of new plant microRNAs using EST analysis   Cited by: 50 (self-citations: 0, citations by others: 50)
Seventy-five previously known plant microRNAs (miRNAs) were classified into 14 families according to their gene sequence identity. A total of 18,694 plant expressed sequence tags (ESTs) were found in the GenBank EST databases by comparing all previously known Arabidopsis miRNAs to GenBank's plant EST databases with BLAST algorithms. After removing the EST sequences with high numbers (more than 2) of mismatched nucleotides, a total of 812 EST contigs were identified. After predicting and scoring the RNA secondary structure of the 812 EST sequences using mFold software, 338 new potential miRNAs were identified in 60 plant species. miRNAs are widespread. Some microRNAs may be highly conserved in the plant kingdom, and they may have the same ancestor in very early evolution. There is no nucleotide substitution in most miRNAs among many plant species. Some of the newly identified potential miRNAs may be induced and regulated by environmental biotic and abiotic stresses. Some may be preferentially expressed in specific tissues and regulated by developmental switching. These findings suggest that EST analysis is a good alternative strategy for identifying new miRNA candidates, their targets, and other genes. A large number of miRNAs exist in different plant species and play important roles in plant developmental switching and plant responses to environmental abiotic and biotic stresses as well as signal transduction. Environmental stresses and developmental switching may be the signals for synthesis and regulation of miRNAs in plants. A model for miRNA induction and expression, and gene regulation by miRNA is hypothesized.
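The mismatch filter described in this abstract (discarding ESTs with more than 2 mismatched nucleotides) can be sketched in Python as follows. This is only an illustration under stated assumptions: the authors worked from BLAST results, whereas this sketch uses a plain ungapped sliding-window comparison, and the names `min_mismatches` and `filter_hits` as well as the toy sequences are hypothetical.

```python
from typing import Dict, List


def min_mismatches(query: str, target: str) -> int:
    """Fewest mismatches between `query` and any same-length,
    ungapped window of `target`."""
    q = len(query)
    if q == 0 or len(target) < q:
        raise ValueError("target must be at least as long as a non-empty query")
    best = q
    for start in range(len(target) - q + 1):
        window = target[start:start + q]
        # Count the positions where the two aligned bases differ.
        mismatches = sum(a != b for a, b in zip(query, window))
        best = min(best, mismatches)
    return best


def filter_hits(mirna: str, ests: Dict[str, str], max_mismatches: int = 2) -> List[str]:
    """Names of ESTs that match `mirna` with at most `max_mismatches`
    mismatches; ESTs shorter than the miRNA are skipped."""
    return [
        name
        for name, seq in ests.items()
        if len(seq) >= len(mirna) and min_mismatches(mirna, seq) <= max_mismatches
    ]


# Toy data, invented for illustration only.
toy_mirna = "TGACAGAAGA"
toy_ests = {
    "est_exact": "CCTGACAGAAGAGG",  # contains the query exactly
    "est_two": "CCTGACTGAAGTGG",    # best window has two mismatches
    "est_many": "CCCCCCCCCCCCCC",   # no close match
}
kept = filter_hits(toy_mirna, toy_ests)
```

Here `kept` holds `est_exact` and `est_two`; `est_many` is dropped because its best window still differs at nine positions.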

4.
5.
Monocotyledons and dicotyledons are distinct, not only in their body plans and developmental patterns, but also in the structural features of their cell walls. The recent completion of the rice (Oryza sativa) genomic sequence and publication of the sequence data, together with the completed database of the Arabidopsis thaliana genome, provide the first opportunity to compare the full complement of cell-wall-related genes from the two distinct classes of flowering plants. We made this comparison by exploiting the fact that Arabidopsis and rice have type I and type II walls, respectively, and therefore represent the two extremes in terms of the structural features of plant cell walls. In this review article, we classify all cell-wall-related genes into 32 gene families, and generate their phylogenetic trees. Using these data, we can phylogenetically compare individual genes of particular interest between Arabidopsis and rice. This comparative genome approach shows that the differences in wall architecture in the two plant groups actually mirror the diversity of the individual gene families involved in the cell-wall dynamics of the respective plant species. This study also identifies putative rice orthologs of genes with well-defined functions in Arabidopsis and other plant species.

6.
7.
Using a strategy requiring only modest computational resources, wheat expressed sequence tag (EST) sequences from various sources were assembled into contigs and compared with a nonredundant barley sequence assembly, with ESTs, with complete draft genome sequences of rice and Arabidopsis thaliana, and with ESTs from other plant species. These comparisons indicate that (i) wheat sequences available from public sources represent a substantial proportion of the diversity of wheat coding sequences, (ii) prediction of open reading frames in the whole genome sequence improves when supplemented with EST information from other species, (iii) a substantial number of candidates for novel genes that are unique to wheat or related species can be identified, and (iv) a smaller number of genes can be identified that are common to monocots and dicots but absent from Arabidopsis. The sequences in the last group may have been lost from Arabidopsis after descent from a common ancestor. Examples of potential novel wheat genes and Triticeae-specific genes are presented.

8.
The first sequenced plant genome, from the small mustard plant Arabidopsis thaliana, was published at the end of 2000. The sequencing of the rice genome is well under way. The sizes of plant genomes vary by a factor of up to 1000, and many important crop plants have genomes that are several times larger than the human genome. To gain insight into the gene toolbox of plant species, numerous large-scale EST sequencing projects have been launched successfully, and analysis procedures are constantly being refined to add maximum value to the sequence data. In addition, an alternative approach to exclude repetitive noncoding DNA and to enrich sequence libraries for gene-containing genomic regions has been developed. This strategy has the potential to deliver information about both genes and regulatory regions outside the transcribed regions.

9.
We used RT-PCR to sequence approximately 3 kb of the gene coding for the largest subunit of RNA polymerase II (rpb1) from nine land plants. Our results show that plant rpb1 genes all have a similar GC-content and that their amino acid sequences evolve at a similar rate in most species we examined, except for the Arabidopsis thaliana and rice sequences which evolve faster. This gene also exists as a single copy in most species and contains enough phylogenetically informative sites to resolve the evolutionary relationships among seed plants. Protein maximum parsimony, as well as neighbor-joining and maximum likelihood analyses of DNA and protein sequences, all generated identical tree topologies with similar strong support values at each node. The angiosperms are a clade comprising Amborella as a sister group to all other angiosperms, followed by Nymphaea, Magnolia, Arabidopsis, and a monocot clade containing maize and rice. The gymnosperms also form a monophyletic clade with Welwitschia and pine grouped together and sister to a Cycas and Zamia clade. These findings concur with recent studies that refute the Anthophyte Hypothesis and place Amborella at the base of the angiosperm tree. These rpb1 sequences also give a more consistent picture of seed plant relationships than similar analyses performed on data sets made of 18S rDNA, atpB, and rbcL sequences from the same species. These sequences therefore show great promise to help further resolve the phylogenetic relationships of seed plants.

10.
11.
Canopy light and plant health   Cited by: 1 (self-citations: 0, citations by others: 1)

12.
The Fabaceae, the third largest family of plants and the source of many crops, has been the target of many genomic studies. Currently, only the grasses surpass the legumes for the number of publicly available expressed sequence tags (ESTs). The quantity of sequences from diverse plants enables the use of computational approaches to identify novel genes in specific taxa. We used BLAST algorithms to compare unigene sets from Medicago truncatula, Lotus japonicus, and soybean (Glycine max and Glycine soja) to nonlegume unigene sets, to GenBank's nonredundant and EST databases, and to the genomic sequences of rice (Oryza sativa) and Arabidopsis. As a working definition, putatively legume-specific genes had no sequence homology, below a specified threshold, to publicly available sequences of nonlegumes. Using this approach, 2,525 legume-specific EST contigs were identified, of which less than three percent had clear homology to previously characterized legume genes. As a first step toward predicting function, related sequences were clustered to build motifs that could be searched against protein databases. Three families of interest were more deeply characterized: F-box related proteins, Pro-rich proteins, and Cys cluster proteins (CCPs). Of particular interest were the >300 CCPs, primarily from nodules or seeds, with predicted similarity to defensins. Motif searching also identified several previously unknown CCP-like open reading frames in Arabidopsis. Evolutionary analyses of the genomic sequences of several CCPs in M. truncatula suggest that this family has evolved by local duplications and divergent selection.

13.
Characterization of the rice (Oryza sativa) actin gene family   Cited by: 11 (self-citations: 0, citations by others: 11)

14.
Wang D, Pei K, Fu Y, Sun Z, Li S, Liu H, Tang K, Han B, Tao Y. Gene 2007, 394(1-2): 13-24

15.
Plant microRNA: a small regulatory molecule with big impact   Cited by: 20 (self-citations: 0, citations by others: 20)

16.
Passardi F, Longet D, Penel C, Dunand C. Phytochemistry 2004, 65(13): 1879-1893
Plant peroxidases (class III peroxidases, E.C. 1.11.1.7) are secreted glycoproteins known to be involved in the mechanism of cell elongation, in cell wall construction and differentiation, and in the defense against pathogens. They usually form large multigenic families in angiosperms. The recent completion of rice (Oryza sativa japonica cv. Nipponbare) genome sequencing made it possible to draw up the full inventory of the genes encoding class III peroxidases in this plant. We found 138 peroxidase genes distributed among the 12 rice chromosomes. In contrast to several other gene families studied so far, peroxidase genes are twice as numerous in rice as in Arabidopsis. This large number of genes results from various duplication events that were tentatively traced back using a phylogenetic tree based on the alignment of conserved amino acid sequences. We also searched for peroxidase-encoding genes in the major phyla of the plant kingdom. In addition to gymnosperms and angiosperms, sequences were found in liverworts, mosses and ferns, but not in unicellular green algae. Two rice peroxidase genes and one Arabidopsis peroxidase gene appeared to be rather close to the only known sequence from the liverwort Marchantia polymorpha. The possible relationship of these peroxidases with the putative ancestor of peroxidase genes is discussed, as well as the connection between the development of the class III peroxidase multigenic family and the emergence of the first land plants.

17.
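A new EST clustering method   Cited by: 11 (self-citations: 0, citations by others: 11)
This study developed an EST (expressed sequence tag) clustering method, ESTClustering, for analyzing the large volumes of data produced by large-scale EST sequencing and obtaining high-quality, non-redundant expressed sequences. During clustering, the method uses the MEGABLAST tool to compare consensus sequences for sequence homology and uses the phrap program to check the assembly of each EST cluster. This clustering strategy reduces the impact of sequencing errors, identifies gene family members effectively, and avoids interference from alternative splicing. Compared with the UniGene clustering method of NCBI (National Center for Biotechnology Information), the clusters produced by ESTClustering better reflect the diversity of expressed sequences. In a test on 112,256 Arabidopsis ESTs, ESTClustering produced 23,581 EST clusters, of which 13,597 corresponded to coding sequences in the Arabidopsis genome, a number close to the count of predicted genes in that genome that are supported by ESTs. Applying the method to a collected set of 147,191 rice EST sequences yielded 33,896 EST clusters.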
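The general idea of grouping ESTs into clusters from pairwise similarity hits can be sketched in Python with a union-find structure. This is a generic single-linkage illustration, not the ESTClustering algorithm itself: it assumes the similar pairs have already been produced by a tool such as MEGABLAST, it omits the consensus-sequence comparison and the phrap assembly check described above, and the function name `cluster_by_links` and the EST identifiers are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def cluster_by_links(ids: Iterable[str], links: Iterable[Tuple[str, str]]) -> List[List[str]]:
    """Group identifiers into clusters so that any two identifiers joined
    by a chain of links end up in the same cluster (single linkage)."""
    parent: Dict[str, str] = {i: i for i in ids}

    def find(x: str) -> str:
        # Follow parent pointers to the cluster root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two clusters

    clusters: Dict[str, List[str]] = {}
    for i in parent:
        clusters.setdefault(find(i), []).append(i)
    # Sort members and clusters so the result is deterministic.
    return sorted((sorted(members) for members in clusters.values()), key=lambda m: m[0])


# Toy input, invented for illustration: est6 has no hits and stays a singleton.
toy_ids = ["est1", "est2", "est3", "est4", "est5", "est6"]
toy_links = [("est1", "est2"), ("est2", "est3"), ("est4", "est5")]
toy_clusters = cluster_by_links(toy_ids, toy_links)
```

Single linkage is the simplest choice, but it can chain unrelated sequences together through one spurious hit, which is one reason a per-cluster verification step like the one in the abstract is useful.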

18.
Jiang D, Yin C, Yu A, Zhou X, Liang W, Yuan Z, Xu Y, Yu Q, Wen T, Zhang D. Cell Research 2006, 16(5): 507-518
To understand the expansion of multicopy microRNA (miRNA) families in plants, we localized the reported miRNA genes from Arabidopsis and rice to their respective chromosomes and observed that 37% of 117 miRNA genes from Arabidopsis and 35% of 173 miRNA genes from rice were segmental duplications in the genome. To determine whether expression diversification has occurred among plant multicopy miRNA family members, we designed PCR primers targeting 48 predicted miRNA precursors from 10 families in Arabidopsis and rice. RT-PCR results suggest that the transcribed precursors of members within the same miRNA family were present at different expression levels. In addition, although the miR160 and miR162 sequences were conserved in Arabidopsis and rice, we found that the expression patterns of these genes differed between the two species. These data suggest that expression diversification has occurred in multicopy miRNA families, increasing our understanding of the expression regulation of miRNAs in plants.

19.
Ancient signals: comparative genomics of plant MAPK and MAPKK gene families   Cited by: 10 (self-citations: 0, citations by others: 10)
MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, and their components are encoded by highly conserved genes. The recent availability of genome sequences for rice and poplar now makes it possible to examine how well the previously described Arabidopsis MAPK and MAPKK gene family structures represent the broader evolutionary situation in plants, and analysis of gene expression data for MPK and MKK genes in all three species allows further refinement of those families, based on functionality. The Arabidopsis MAPK nomenclature appears sufficiently robust to allow it to be usefully extended to other well-characterized plant systems.

20.