首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 156 毫秒
1.
2.
3.
Expressed sequence tags (ESTs) currently encompass more entries in the public databases than any other form of sequence data. Thus, EST data sets provide a vast resource for gene identification and expression profiling. We have mapped the complete set of 176,915 publicly available Arabidopsis EST sequences onto the Arabidopsis genome using GeneSeqer, a spliced alignment program incorporating sequence similarity and splice site scoring. About 96% of the available ESTs could be properly aligned with a genomic locus, with the remaining ESTs deriving from organelle genomes and non-Arabidopsis sources or displaying insufficient sequence quality for alignment. The mapping provides verified sets of EST clusters for evaluation of EST clustering programs. Analysis of the spliced alignments suggests corrections to current gene structure annotation and provides examples of alternative and non-canonical pre-mRNA splicing. All results of this study were parsed into a database and are accessible via a flexible Web interface at http://www.plantgdb.org/AtGDB/.  相似文献   

4.
Exploring the plant transcriptome through phylogenetic profiling   总被引:5,自引:0,他引:5       下载免费PDF全文
Publicly available protein sequences represent only a small fraction of the full catalog of genes encoded by the genomes of different plants, such as green algae, mosses, gymnosperms, and angiosperms. By contrast, an enormous amount of expressed sequence tags (ESTs) exists for a wide variety of plant species, representing a substantial part of all transcribed plant genes. Integrating protein and EST sequences in comparative and evolutionary analyses is not straightforward because of the heterogeneous nature of both types of sequence data. By combining information from publicly available EST and protein sequences for 32 different plant species, we identified more than 250,000 plant proteins organized in more than 12,000 gene families. Approximately 60% of the proteins are absent from current sequence databases but provide important new information about plant gene families. Analysis of the distribution of gene families over different plant species through phylogenetic profiling reveals interesting insights into plant gene evolution, and identifies species- and lineage-specific gene families, orphan genes, and conserved core genes across the green plant lineage. We counted a similar number of approximately 9,500 gene families in monocotyledonous and eudicotyledonous plants and found strong evidence for the existence of at least 33,700 genes in rice (Oryza sativa). Interestingly, the larger number of genes in rice compared to Arabidopsis (Arabidopsis thaliana) can partially be explained by a larger amount of species-specific single-copy genes and species-specific gene families. In addition, a majority of large gene families, typically containing more than 50 genes, are bigger in rice than Arabidopsis, whereas the opposite seems true for small gene families.  相似文献   

5.
6.
Conservation of Arabidopsis flowering genes in model legumes   总被引:14,自引:0,他引:14       下载免费PDF全文
The model plants Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) have provided a wealth of information about genes and genetic pathways controlling the flowering process, but little is known about the corresponding pathways in legumes. The garden pea (Pisum sativum) has been used for several decades as a model system for physiological genetics of flowering, but the lack of molecular information about pea flowering genes has prevented direct comparison with other systems. To address this problem, we have searched expressed sequence tag and genome sequence databases to identify flowering-gene-related sequences from Medicago truncatula, soybean (Glycine max), and Lotus japonicus, and isolated corresponding sequences from pea by degenerate-primer polymerase chain reaction and library screening. We found that the majority of Arabidopsis flowering genes are represented in pea and in legume sequence databases, although several gene families, including the MADS-box, CONSTANS, and FLOWERING LOCUS T/TERMINAL FLOWER1 families, appear to have undergone differential expansion, and several important Arabidopsis genes, including FRIGIDA and members of the FLOWERING LOCUS C clade, are conspicuously absent. In several cases, pea and Medicago orthologs are shown to map to conserved map positions, emphasizing the closely syntenic relationship between these two species. These results demonstrate the potential benefit of parallel model systems for an understanding of flowering phenology in crop and model legume species.  相似文献   

7.
The genome of Arabidopsis thaliana is exceedingly small, in part because it lacks the large middle repetitive DNA component characteristic of other plants. In this paper we have characterized a member of the low copy DNA component: the gene family for the light-harvesting chlorophyll a/b-protein. This gene family is unusual in that it contains far fewer members than the 7-16 coding sequences for this protein found in other plants. We used cross-hybridization with a Lemna gene encoding a light-harvesting chlorophyll a/b-protein to isolate 3 genes from Arabidopsis, all of which are clustered on an 11-kb genomic clone. Southern blot analysis suggests that there is a fourth related gene in Arabidopsis. Sequence analysis of the three genes demonstrates that within the translated region the nucleic acid sequence homology is 96%, the deduced amino acid sequence of the mature proteins is identical for the three genes, and two of the genes have a high degree of sequence homology in both their 5' and 3' immediate flanking regions. The genes have regulatory sequences typical of eukaryotic genes upstream of the translation start sites. However, not all of these genes are equally expressed in plants grown under normal light-dark conditions.  相似文献   

8.
Ethylene governs a range of developmental and response processes in plants. In Arabidopsis thaliana, the Raf-like kinase CTR1 acts as a key negative regulator of ethylene responses. While only one gene with CTR1 function apparently exists in Arabidopsis, we have isolated a family of CTR1- like genes in tomato ( Lycopersicon esculentum ). Based on amino acid alignments and phylogenetic analysis, these tomato CTR1- like genes are more similar to Arabidopsis CTR1 than any other sequences in the Arabidopsis genome. Structural analysis reveals considerable conservation in the size and position of the exons between Arabidopsis and tomato CTR1 genomic sequences. Complementation of the Arabidopsis ctr1-8 mutant with each of the tomato CTR genes indicates that they are all capable of functioning as negative regulators of the ethylene pathway. We previously reported that LeCTR1 expression is up-regulated in response to ethylene. Here, quantitative real-time PCR was carried out to detail expression for LeCTR1 and the additional CTR1 -like genes of tomato. Our results indicate that the tomato CTR1 gene family is differentially regulated at the mRNA level by ethylene and during stages of development marked by increased ethylene biosynthesis, including fruit ripening. The possibility of a multi-gene family of CTR1 -like genes in other species besides tomato was examined through mining of EST and genomic sequence databases.  相似文献   

9.
Characterization of the Brassinosteroid Insensitive 1 Genes of Cotton   总被引:11,自引:0,他引:11  
Suppression of brassinosteroid (BR) biosynthesis in cotton ovules by treatment with brassinazole inhibits fiber formation, indicating that BR plays an important role in cotton fiber development. Plant responses to brassinosteroids (BR) are mediated through a plasma membrane-bound leucine-rich repeat (LRR) receptor-like protein kinase known as BRI1. Mutations in the BRI1 genes of several species result in dwarfed plants with reduced sensitivity to BR. A single expressed sequence tag (EST) from cotton with strong sequence similarity to Arabidopsis BRI1 ( AtBRI1 ) was identified in a search of publicly available databases. With this EST as a starting point, full-length cDNAs and genomic coding sequences from upland cotton ( Gossypium hirsutum ) BRI1 ( GhBRI1 ) were obtained and characterized. Ectopic expression of this coding sequence in BR-insensitive Arabidopsis plants resulted in recovery of normal growth indicating that GhBRI1 is a functional homologue of AtBRI1. G. hirsutum is an allotetraploid (AADD) derived from diploid ancestors. Analysis of several GhBRI1 cDNAs showed two distinct sequences indicating the presence of two GhBRI1 genes, denoted GhBRI1-1 and GhBRI1-2. Sequence comparisons between these GhBRI1 coding sequences and those from related A and D genome diploid Gossypium species ( G. arboreum and G. thurberi ) indicated that GhBRI1-1 is likely to the A sub-genome orthologue while GhBRI1-2 is from the D sub-genome.  相似文献   

10.
Small GTP-binding genes play an essential regulatory role in a multitude of cellular processes such as vesicle-mediated intracellular trafficking, signal transduction, cytoskeletal organization, and cell division in plants and animals. Medicago truncatula and Lotus japonicus are important model plants for studying legume-specific biological processes such as nodulation. The publicly available online resources for these plants from websites such as http://www.ncbi.nih.gov, http://www.medicago.org, http://www.tigr.org, and related sites were searched to collect nucleotide sequences that encode GTP-binding protein homologues. A total of 460 small GTPase sequences from several legume species including Medicago and Lotus, Arabidopsis, human, and yeast were phyletically analysed to shed light on the evolution and functional characteristics of legume-specific homologues. One of the main emphases of this study was the elucidation of the possible involvement of some members of small GTPase homologues in the establishment and maintenance of symbiotic associations in root nodules of legumes. A high frequency of vesicle-mediated trafficking in nodules led to the idea of a probable subfunctionalization of some members of this family in legumes. As a result of the analyses, a group of 10 small GTPases that are likely to be mainly expressed in nodules was determined. The sequences determined as a result of this study could be used in more detailed molecular genetic analyses such as creation of RNA interference silencing mutants for further clarification of the role of GTPases in nodulation. This study will also assist in furthering our understanding of the evolutionary history of small GTPases in legume species.  相似文献   

11.
One restriction to the cultivation of common bean, Phaseolus vulgaris L., is its limited tolerance to low temperatures. In the present study, subtraction suppression hybridization was employed to enrich for stress responsive genes in both a chilling-susceptible common bean and a relatively more chilling-tolerant wild bean species, Phaseolus angustissimus. For each species, approximately 11 000 expressed sequence tags were generated. Comparative sequence analysis of the EST collection with the available annotated genome sequences of the model Fabaceae species Medicago truncatula and Glycine max identified protein homologues for approximately 65% and 80% of the Phaseolus sequences, respectively. This difference reflects the closer phylogenetic relationship between the genera Phaseolus and Glycine compared with Medicago. Annotation of the Phaseolus sequences was facilitated through this comparative analysis and indicated that several heat shock proteins, cytochrome P450s, and DNA binding factors were uniquely found among the sequences from the wild species P. angustissimus. The Phaseolus sequences have been made available on a GBrowse implementation using M. truncatula as the reference genome, providing rapid access to the sequence data and associated comparative genome data.  相似文献   

12.
13.
Ride  J.P.  Davies  E.M.  Franklin  F.C.H.  Marshall  D.F. 《Plant molecular biology》1999,39(5):927-932
A detailed analysis of the currently available Arabidopsis thaliana genomic sequence has revealed the presence of a large number of open reading frames with homology to the stigmatic self-incompatibility (S) genes of Papaver rhoeas. The products of these potential genes are all predicted to be relatively small, basic, secreted proteins with similar predicted secondary structures. We have named these potential genes SPH (S-protein homologues). Their presence appears to have been largely missed by the prediction methods currently used on the genomic sequence. Equivalent homologues could not be detected in the human, microbial, Drosophila or C. elegans genomic databases, suggesting a function specific to plants. Preliminary RT-PCR analysis indicates that at least two members of the family (SPH1, SPH8) are expressed, with expression being greatest in floral tissues. The gene family may total more than 100 members, and its discovery not only illustrates the importance of the genome sequencing efforts, but also indicates the extent of information which remains hidden after the initial trawl for potential genes.  相似文献   

14.
Large volumes of genomic data have been generated for several plant species over the past decade, including structural sequence data and functional annotation at the genome level. Various technologies such as expressed sequence tags (ESTs), massively parallel signature sequencing (MPSS) and microarrays have been used to study gene expression and to provide functional data for many genes simultaneously. This review focuses on recent advances in the application of microarrays in plant genomic research and in gene expression databases available for plants. Large sets of Arabidopsis microarray data are publicly available. Recently developed array platforms are currently being used to generate genome-wide expression profiles for several crop species. Coupled to these platforms are public databases that provide access to these large-scale expression data, which can be used to aid the functional discovery of gene function.  相似文献   

15.
Non-redundant expressed sequence tags (ESTs) were generated from six different organs at various developmental stages of Chinese cabbage, Brassica rapa L. ssp. pekinensis. Of the 1,295 ESTs, 915 (71%) showed significantly high homology in nucleotide or deduced amino acid sequences with other sequences deposited in databases, while 380 did not show similarity to any sequences. Briefly, 598 ESTs matched with proteins of identified biological function, 177 with hypothetical proteins or non-annotated Arabidopsis genome sequences, and 140 with other ESTs. About 82% of the top-scored matching sequences were from Arabidopsis or Brassica, but overall 558 (43%) ESTs matched with Arabidopsis ESTs at the nucleotide sequence level. This observation strongly supports the idea that gene-expression profiles of Chinese cabbage differ from that of Arabidopsis, despite their genome structures being similar to each other. Moreover, sequence analyses of 21 Brassica ESTs revealed that their primary structure is different from those of corresponding annotated sequences of Arabidopsis genes. Our data suggest that direct prediction of Brassica gene expression pattern based on the information from Arabidopsis genome research has some limitations. Thus, information obtained from the Brassica EST study is useful not only for understanding of unique developmental processes of the plant, but also for the study of Arabidopsis genome structure.  相似文献   

16.
17.
18.
Genetic structure and evolution of RAC-GTPases in Arabidopsis thaliana   总被引:10,自引:0,他引:10  
Winge P  Brembu T  Kristensen R  Bones AM 《Genetics》2000,156(4):1959-1971
Rho GTPases regulate a number of important cellular functions in eukaryotes, such as organization of the cytoskeleton, stress-induced signal transduction, cell death, cell growth, and differentiation. We have conducted an extensive screening, characterization, and analysis of genes belonging to the Ras superfamily of GTPases in land plants (embryophyta) and found that the Rho family is composed mainly of proteins with homology to RAC-like proteins in terrestrial plants. Here we present the genomic and cDNA sequences of the RAC gene family from the plant Arabidopsis thaliana. On the basis of amino acid alignments and genomic structure comparison of the corresponding genes, the 11 encoded AtRAC proteins can be divided into two distinct groups of which one group apparently has evolved only in vascular plants. Our phylogenetic analysis suggests that the plant RAC genes underwent a rapid evolution and diversification prior to the emergence of the embryophyta, creating a group that is distinct from rac/cdc42 genes in other eukaryotes. In embryophyta, RAC genes have later undergone an expansion through numerous large gene duplications. Five of these RAC duplications in Arabidopsis thaliana are reported here. We also present an hypothesis suggesting that the characteristic RAC proteins in higher plants have evolved to compensate the loss of RAS proteins.  相似文献   

19.
A porcine BAC clone harboring the tightly linked IFNAR1 and IFNGR2 genes was identified by comparative analysis of the publicly available porcine BAC end sequences. The complete 168,835 bp insert sequence of this clone was determined. Sequence comparisons of the genomic sequence with EST sequences from public databases were performed and allowed a detailed annotation of the IFNAR1 and IFNGR2 genes. The analyzed genes showed a conserved genomic organization with their known mammalian orthologs, however the sequence conservation of these genes across species was relatively low. In addition to the IFNAR1 and IFNGR2 genes, which were completely sequenced, the analyzed BAC clone also contained parts of an orphan gene encoding a putative transmembrane protein (TMEM50B). In contrast to the IFNAR1 and IFNGR2 genes the sequence conservation of the TMEM50B gene across different mammalian species was extremely high.  相似文献   

20.
Development and annotation of perennial Triticeae ESTs and SSR markers   总被引:2,自引:0,他引:2  
Triticeae contains hundreds of species of both annual and perennial types. Although substantial genomic tools are available for annual Triticeae cereals such as wheat and barley, the perennial Triticeae lack sufficient genomic resources for genetic mapping or diversity research. To increase the amount of sequence information available in the perennial Triticeae, three expressed sequence tag (EST) libraries were developed and annotated for Pseudoroegneria spicata, a mixture of both Elymus wawawaiensis and E. lanceolatus, and a Leymus cinereus x L. triticoides interspecific hybrid. The ESTs were combined into unigene sets of 8 780 unigenes for P. spicata, 11 281 unigenes for Leymus, and 7 212 unigenes for Elymus. Unigenes were annotated based on putative orthology to genes from rice, wheat, barley, other Poaceae, Arabidopsis, and the non-redundant database of the NCBI. Simple sequence repeat (SSR) markers were developed, tested for amplification and polymorphism, and aligned to the rice genome. Leymus EST markers homologous to rice chromosome 2 genes were syntenous on Leymus homeologous groups 6a and 6b (previously 1b), demonstrating promise for in silico comparative mapping. All ESTs and SSR markers are available on an EST information management and annotation database (http://titan.biotec.uiuc.edu/triticeae/).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号