Similar articles
 Found 20 similar articles (search time: 31 ms)
1.
Over 28,000 expressed sequence tags (ESTs) were produced from cDNA libraries representing a variety of growth conditions and cell types. Several Magnaporthe grisea strains were used to produce the libraries, including a nonpathogenic strain bearing a mutation in the PMK1 mitogen-activated protein kinase. Approximately 23,000 of the ESTs could be clustered into 3,050 contigs, leaving 5,127 singleton sequences. The estimate of 8,177 unique sequences indicates that over half of the genes of the fungus are represented in the ESTs. Analysis of EST frequency reveals growth and cell type-specific patterns of gene expression. This analysis establishes criteria for identification of fungal genes involved in pathogenesis. A large fraction of the genes represented by ESTs have no known function or described homologs. Manual annotation of the most abundant cDNAs with no known homologs allowed us to identify a family of metallothionein proteins present in M. grisea, Neurospora crassa, and Fusarium graminearum. In addition, multiply represented ESTs permitted the identification of alternatively spliced mRNA species. Alternative splicing was rare, and in most cases, the alternate mRNA forms were unspliced, although alternative 5' splice sites were also observed.
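The clustering step described in this abstract (ESTs grouped into contigs, with ungrouped reads left as singletons) can be illustrated with a toy sketch. The abstract does not say which assembler was used, so the approach below, single-linkage grouping of reads that share an exact k-mer, is an assumption chosen for brevity; the names `cluster_ests` and `unique_sequence_count` are hypothetical, and real pipelines also handle reverse complements and sequencing errors.

```python
def kmers(seq, k):
    """Return the set of all substrings of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def cluster_ests(ests, k=8):
    """Group ESTs that share at least one exact k-mer (toy single-linkage clustering).

    Returns (clusters, singletons): clusters is a list of lists of EST indices
    with two or more members; singletons is a list of indices left ungrouped.
    """
    parent = list(range(len(ests)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    first_seen = {}  # k-mer -> index of the first EST that contained it
    for idx, seq in enumerate(ests):
        for km in kmers(seq.upper(), k):
            if km in first_seen:
                parent[find(idx)] = find(first_seen[km])
            else:
                first_seen[km] = idx

    groups = {}
    for idx in range(len(ests)):
        groups.setdefault(find(idx), []).append(idx)
    clusters = [g for g in groups.values() if len(g) > 1]
    singletons = [g[0] for g in groups.values() if len(g) == 1]
    return clusters, singletons


def unique_sequence_count(n_contigs, n_singletons):
    """Unique-sequence estimate: each contig and each singleton counts once."""
    return n_contigs + n_singletons
```

With the figures reported in the abstract, `unique_sequence_count(3050, 5127)` reproduces the stated estimate of 8,177 unique sequences.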

2.
There is a general lack of genomic information available for chlorophyte seaweed genera such as Ulva, and in particular there is no information concerning the genes that contribute to adhesion and cell wall biosynthesis for this organism. Partial sequencing of cDNA libraries to generate expressed sequence tags (ESTs) is an effective means of gene discovery and characterization of expression patterns. In this study, a cDNA library was created from sporulating tissue of Ulva linza L. Initially, 650 ESTs were randomly selected from a cDNA library and sequenced from their 5′ ends to obtain an indication of the level of redundancy of the library (21%). The library was normalized to enrich for rarer sequences, and a further 1920 ESTs were sequenced. These sequences were subjected to contig assembly that resulted in a unigene set of approximately 1104 ESTs. Forty-eight percent of these sequences exhibited significant similarity to sequences in the databases. Phylogenetic comparisons are made between selected sequences with similarity in the databases to proteins involved in aspects of extracellular matrix/cell wall assembly and adhesion.

3.
Analysis of genes expressed during rice-Magnaporthe grisea interactions.   Cited by: 4 (self-citations: 0; citations by others: 4)
Expressed sequence tag (EST) analysis was applied to identify rice genes involved in defense responses against infection by the blast fungus Magnaporthe grisea and fungal genes involved in growth within the host during a compatible interaction. A total of 511 clones was sequenced from a cDNA library constructed from rice leaves (Oryza sativa cv. Nipponbare) infected with M. grisea strain 70-15 to generate 296 nonredundant ESTs. The sequences of 293 clones (57.3%) significantly matched National Center for Biotechnology Information database entries; 221 showed homologies with previously identified plant genes and 72 with fungal genes. Among the genes with assigned functions, 32.8% were associated with metabolism, 29.4% with cell/organism defense or pathogenicity, and 18.4% with gene/protein expression. cDNAs encoding a type I metallothionein (MTs-1) of rice and a homolog of glucose-repressible gene 1 (GRG1) of Neurospora crassa were the most abundant representatives of plant and fungal genes, comprising 2.9 and 1.6% of the total clones, respectively. The expression patterns of 10 ESTs, five each from rice and M. grisea, were analyzed. Five defense-related genes in rice, including four pathogenesis-related genes and MTs-1, were highly expressed during M. grisea infection. Expression of five stress-inducible or pathogenicity-related genes of the fungus, including two hydrophobin genes, was also induced during growth within the host. Further characterization of the genes represented in this study would be an aid in unraveling the mechanisms of pathogenicity of M. grisea and the defense responses of rice.

4.
Chlamydomonas reinhardtii is a unicellular green alga that has been used as a model organism for the study of flagella and basal bodies as well as photosynthesis. This report analyzes finished genomic DNA sequence for 0.5% of the nuclear genome. We have used three gene prediction programs as well as EST and protein homology data to estimate the total number of genes in Chlamydomonas to be between 12,000 and 16,400. Chlamydomonas appears to have many more genes than any other unicellular organism sequenced to date. Twenty-seven percent of the predicted genes have significant identity to both ESTs and known proteins in other organisms, 32% of the predicted genes have significant identity to ESTs alone, and 14% have significant similarity to known proteins in other organisms. For gene prediction in Chlamydomonas, GreenGenie appeared to have the highest sensitivity and specificity at the exon level, scoring 71% and 82%, respectively. Two new alternative splicing events were predicted by aligning Chlamydomonas ESTs to the genomic sequence. Finally, recombination differs between the two sequenced contigs. The 350-kb linkage group III contig is devoid of recombination, while the linkage group I contig is 30 map units long over 33 kb.
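For readers unfamiliar with exon-level accuracy, the following minimal sketch shows how such figures are commonly computed. The abstract does not define its metrics, so this assumes the usual gene-prediction convention: an exon counts as correct only if both boundaries match exactly, sensitivity is the fraction of annotated exons recovered, and "specificity" is the fraction of predicted exons that are correct. The function name and the example coordinates are hypothetical.

```python
def exon_level_accuracy(annotated, predicted):
    """Return (sensitivity, specificity) at the exon level.

    Exons are (start, end) tuples; a predicted exon is a true positive only
    when both coordinates match an annotated exon exactly (assumed convention).
    """
    annotated, predicted = set(annotated), set(predicted)
    if not annotated or not predicted:
        raise ValueError("both exon sets must be non-empty")
    true_positives = len(annotated & predicted)
    sensitivity = true_positives / len(annotated)   # correct / annotated
    specificity = true_positives / len(predicted)   # correct / predicted
    return sensitivity, specificity


# Hypothetical example: 4 annotated exons, 3 predicted, 2 exact matches.
annotated_exons = [(100, 200), (300, 400), (500, 650), (800, 900)]
predicted_exons = [(100, 200), (300, 400), (500, 640)]
sn, sp = exon_level_accuracy(annotated_exons, predicted_exons)
```

Under this convention, a predictor can raise sensitivity by calling more exons, at the cost of specificity, which is why the two values are reported together.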

5.
6.
To identify new vaccine candidates, Eimeria tenella expressed sequence tags (ESTs) from public databases were analysed for secretory molecules with a specially developed automated in silico strategy termed DNAsignalP. A total of 12,187 ESTs were clustered into 2881 contigs followed by a blastx search, which resulted in a significant number of E. tenella contigs with homologies to entries in public databases. Amino acid sequences of appropriate homologous proteins were analysed for the occurrence of an N-terminal signal sequence using the algorithm signalP. The resulting list of 84 entries comprised 51 contigs whose deduced proteins showed homologies to proteins of apicomplexan parasites. Based on function or localisation, we selected candidate proteins classified as (i) secreted proteins of Apicomplexa parasites, (ii) secreted enzymes, and (iii) transport and signalling proteins. To verify our strategy experimentally, we used a functional complementation system in yeast. For five selected candidate proteins we found that these were indeed secreted. Our approach thus represents an efficient method to identify secretory and surface proteins from EST databases.

7.
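To isolate and identify genes differentially expressed during the sporulation stages of Eimeria tenella, suppression subtractive hybridization (SSH) was used to construct two sporozoite subtractive cDNA libraries and one sporulated-oocyst subtractive cDNA library. For the sporozoite libraries, sporozoites served as the tester and either unsporulated oocysts or sporulated oocysts as the driver; for the sporulated-oocyst library, sporulated oocysts served as the tester and unsporulated oocysts as the driver. Fifty clones were randomly picked from each of the three subtractive cDNA libraries; PCR screening showed a recombination rate of 96% for both sporozoite libraries and 98% for the sporulated-oocyst library. Fifty randomly picked clones from each library were sequenced and subjected to homology analysis. The sporulated-oocyst library yielded 13 unique valid sequences, of which 8 ESTs were highly homologous to known proteins; the two sporozoite libraries together yielded 40 unique valid sequences, of which 9 ESTs were homologous to known proteins, and the remainder may represent novel E. tenella genes. These results provide a theoretical basis for isolating new functional genes of E. tenella and for further exploring methods of controlling coccidiosis.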

8.
Coccidiosis in chickens is caused by 7 species of Eimeria. Even though coccidiosis is a complex disease that can be caused by any combination of these species, most of the molecular research concerning chicken coccidiosis has been limited to Eimeria tenella. The present study describes the first large-scale analysis of expressed sequence tags (ESTs) generated primarily from second-stage merozoites (and schizonts) of E. acervulina. In total, 1,847 ESTs were sequenced; these represent 1,026 unique sequences. Approximately half of the ESTs encode proteins of unknown function, or hypothetical proteins. Twenty-nine percent of the E. acervulina ESTs share significant sequence identity with sequences in the E. tenella genome. Additionally, the EST hits appear to differ markedly from those of E. tenella. One of the differences is the very low number of ESTs that encode putative microneme proteins. This study underlines the potential differences in the molecular aspects of 2 Eimeria species that in the past were thought to be highly similar in nature.

9.
10.
11.
The generation of large numbers of partial cDNA sequences, or expressed sequence tags (ESTs), has provided a method with which to sample a large number of genes from an organism. More than 25,000 Arabidopsis thaliana ESTs have been deposited in public databases, producing the largest collection of ESTs for any plant species. We describe here the application of a method of reducing redundancy and increasing information content in this collection by grouping overlapping ESTs representing the same gene into a "contig" or assembly. The increased information content of these assemblies allows more putative identifications to be assigned based on the results of similarity searches with nucleotide and protein databases. The results of this analysis indicate that sequence information is available for approximately 12,600 nonoverlapping ESTs from Arabidopsis. Comparison of the assemblies with 953 Arabidopsis coding sequences indicates that up to 57% of all Arabidopsis genes are represented by an EST. Clustering analysis of these sequences suggests that between 300 and 700 gene families are represented by between 700 and 2000 sequences in the EST database. A database of the assembled sequences, their putative identifications, and cellular roles is available through the World Wide Web.

12.
A longissimus dorsi muscle cDNA library of Xiang pig was constructed, and 131 randomly isolated clones were sequenced in this study. Bioinformatics analysis showed that the 131 ESTs represented 109 unique clone sequences, of which 99 showed homology to previously identified genes in humans or other mammals, 3 matched other uncharacterized expressed sequence tags (ESTs), and 7 showed no significant matches to sequences already present in DNA databases. No protein matches were found for 10 ESTs. Functional analysis of the ESTs showed that a considerable proportion of them encoded proteins involved in gene/protein expression (45.46%). Other classes included genes involved in metabolism (10.10%), cell structure/motility (10.10%), cell/organism defense (5.05%), cell signaling/communication (2.02%), and cell division (0.0%). Unclassified genes constituted the remaining 27.27%. This study reports the first gene expression profile analysis of skeletal muscle cells of the Chinese native Xiang pig, which should facilitate the functional study of candidate genes involved in muscle growth and in the improvement of meat quality in domestic pigs.

13.
14.
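A cDNA library was constructed from Xiang pig muscle tissue, and clones randomly picked from the library were sequenced, yielding 131 Xiang pig muscle ESTs. Of the 109 unique clones represented by these 131 ESTs, 99 were homologous to sequences from humans and other species, 3 matched known pig ESTs, and 7 were unknown ESTs. Open reading frame prediction and BLAST analysis of these 10 known and unknown ESTs found no highly homologous amino acid sequences. Functional analysis of the genes corresponding to the ESTs showed that, apart from the 27.27% of ESTs that could not be classified, most of the cloned ESTs came from genes involved in the regulation of gene/protein expression (45.46%). ESTs from genes with other functions were, in order, cell metabolism (10.10%), cell structure/motility (10.10%), cell/organism defense (5.05%), and cell signaling/communication (2.02%). No known functional genes related to cell division were found. These results provide the first skeletal muscle gene expression profile for the Chinese native Xiang pig and lay a foundation for identifying candidate genes for muscle growth and meat quality in pigs.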

15.
Uniformly repeated DNA sequences in genomes, known as tandem repeats, are one of the most interesting features of many organisms analyzed so far. Among the tandem repeats, microsatellites have attracted many researchers because of their association with several human diseases. The discovery of tandem repeats in expressed sequence tags (ESTs) and in cDNA libraries has contributed new ideas and tools for evolutionary studies. With the advent of new biotechnological tools, the number of ESTs deposited in databases is rapidly increasing. Therefore, new informative bioinformatics tools are needed to assist the analysis and interpretation of these tandem repeats in ESTs and in other types of DNA. In the present study we report two new utility tools: Organism Miner and Keyword Finder. Organism Miner collects, sorts, and splices DNA data files and provides a statistical overview of them. Keyword Finder analyses all the sequences in the input folder, extracts and collects keywords for a specific organism or for all organisms that have DNA sequences, and generates a statistical overview. We are currently generating cotton and pepper cDNA libraries and often use GenBank DNA sequences. Therefore, in this study we used cDNAs and ESTs of cotton and pepper to demonstrate the use of these two tools. With the help of these two utilities we observed that most ESTs are useful for downstream applications such as mining microsatellites specific to an organ, tissue, or developmental stage. The analyses of ESTs indicated not only that tandem repeats exist in ESTs but also that tandem repeats are differentially present in different organ- or tissue-specific ESTs within and between species. Utilities and the sample data sets are self-extracting files and freely available from or can be obtained upon request from the corresponding author.

16.
Aplysia californica is a marine opisthobranch mollusc used as a model organism in neurobiology for cellular analyses of learning and behavior because it possesses a comparatively small number of neurons of large size. The mollusca comprise the second largest animal phylum, yet detailed genetic and genomic information is only recently beginning to accrue. Thus developmental and comparative evolutionary biology as well as biomedical research would benefit from additional information on DNA sequences of Aplysia. Therefore, we have constructed a series of unidirectional cDNA libraries from different life stages of Aplysia. These include whole organisms from the egg, veliger, metamorphic, and juvenile stages as well as adult neural tissue for reference. Individual clones were randomly picked, and high-throughput, single pass sequence analysis was performed to generate 7971 sequences. Of these, there were 5507 quality-filtered ESTs that clustered into 1988 unigenes, which are annotated and deposited into GenBank. A significant number (497) of ESTs did not match existing Aplysia ESTs and are thus potentially novel sequences for Aplysia. GO and KEGG analyses of these novel sequences indicated that a large number were involved in protein binding and translation, consistent with the predominant biosynthetic role in development and the presence of stage-specific protein isoforms.

17.
Discovery of non-synonymous single nucleotide polymorphisms (nsSNP), which cause amino acid substitutions, is important because they are more likely to alter protein function than synonymous SNPs (sSNP) or those SNPs that do not result in amino acid changes. By changing the coding sequences, nsSNP may play a role in heritable differences between individual organisms. In the chicken and many other vertebrates, the main obstacle for identifying nsSNP is that there is insufficient protein and mRNA sequence information for self-species referencing and thus, determination of the correct reading frame for expressed sequence tags (ESTs) is difficult. Therefore, in order to estimate the correct reading frame at nsSNP in chicken ESTs, a double-screening approach was designed using self- or cross-species protein referencing, in addition to the ESTScan coding region estimation programme. Starting with 23 427 chicken ESTs, 1210 potential SNPs were discovered using a phred/phrap/polyphred/consed pipeline process and among these, 108 candidate nsSNP were identified with the double screening method. A searchable SNP database (chicksnps) for the candidate chicken SNPs, including both nsSNPs and sSNPs is available at http://chicksnps.afs.udel.edu. The chicken SNP data described in this paper have been submitted to the data base SNP under National Center for Biotechnology Information assay ID ss4387050-ss4388259.
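The abstract stresses that calling a SNP synonymous or non-synonymous depends on knowing the correct reading frame. The sketch below illustrates that dependence only; it is not the double-screening method or ESTScan. It assumes the standard genetic code, a single-base substitution, and a known frame offset, and the function name `classify_snp` and the example sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_snp(cds, pos, alt, frame=0):
    """Classify a substitution as 'sSNP' or 'nsSNP'.

    cds   -- coding DNA sequence (A/C/G/T only)
    pos   -- 0-based position of the substituted base
    alt   -- alternative base
    frame -- 0, 1, or 2: offset of the first complete codon (assumed known)
    Returns (kind, reference_amino_acid, alternative_amino_acid).
    """
    codon_start = pos - ((pos - frame) % 3)
    if codon_start < 0 or codon_start + 3 > len(cds):
        raise ValueError("position is not inside a complete codon")
    ref_codon = cds[codon_start:codon_start + 3].upper()
    offset = pos - codon_start
    alt_codon = ref_codon[:offset] + alt.upper() + ref_codon[offset + 1:]
    ref_aa, alt_aa = CODON_TABLE[ref_codon], CODON_TABLE[alt_codon]
    kind = "sSNP" if ref_aa == alt_aa else "nsSNP"
    return kind, ref_aa, alt_aa
```

In the hypothetical sequence `ATGGCTGAA`, the substitution T to C at position 5 is synonymous in frame 0 (GCT to GCC, both alanine) but non-synonymous in frame 1 (CTG to CCG, leucine to proline), which is why a wrong frame estimate leads to a wrong classification.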

18.
Expressed sequence tags (ESTs) are partial cDNA sequences read from both ends of random expressed gene fragments used for discovering new genes. DNA libraries from four different developmental stages of Schistosoma mansoni used in this study generated 141 ESTs representing about 2.5% of S. mansoni sequences in dbEST. Sequencing was done by the dideoxy chain termination method. The sequences were submitted to GenBank for homology searching in nonredundant databases using Basic Local Alignment Search Tool for DNA (BLASTN) alignment and for protein (BLASTX) alignment at the National Center for Biotechnology Information (NCBI). Among submitted ESTs, 29 were derived from lambdagt11 sporocyst library, 70 from lambdaZap adult worm library, 31 from lambdaZap cercarial library, and 11 from lambdaZap female B worm library. Homology search revealed that eight (5.6%) ESTs shared homology to previously identified S. mansoni genes in dbEST, 15 (10.6%) are homologous to known genes in other organisms, 116 (81.7%) showed no significant sequence homology in the databases, and the remaining sequences (2.1%) showed low homologies to rRNA or mitochondrial DNA sequences. Thus, among the 141 ESTs studied, 116 sequences are derived from novel, uncharacterized S. mansoni genes. Those 116 ESTs are important for identification of coding regions in the sequences, helping in mapping of schistosome genome, and identifying genes of immunological and pharmacological significance.

19.
To better understand the molecular basis of the defense response against the rice blast fungus (Magnaporthe grisea), a large-scale expressed sequence tag (EST) sequencing approach was used to identify genes involved in the early infection stages in rice (Oryza sativa). Six cDNA libraries were constructed using infected leaf tissues harvested from 6 conditions: resistant, partially resistant, and susceptible reactions at both 6 and 24 h after inoculation. Two additional libraries were constructed using uninoculated leaves and leaves from the lesion mimic mutant spl11. A total of 68,920 ESTs were generated from 8 libraries. Clustering and assembly analyses resulted in 13,570 unique sequences from 10,934 contigs and 2,636 singletons. Gene function classification showed that 42% of the ESTs were predicted to have putative gene function. Comparison of the pathogen-challenged libraries with the uninoculated control library revealed an increase in the percentage of genes in the functional categories of defense and signal transduction mechanisms and cell cycle control, cell division, and chromosome partitioning. In addition, hierarchical clustering analysis grouped the eight libraries based on their disease reactions. A total of 7,748 new and unique ESTs were identified from our collection compared with the KOME full-length cDNA collection. Interestingly, we found that rice ESTs are more closely related to sorghum (Sorghum bicolor) ESTs than to barley (Hordeum vulgare), wheat (Triticum aestivum), and maize (Zea mays) ESTs. The large cataloged collection of rice ESTs in this study provides a solid foundation for further characterization of the rice defense response and is a useful public genomic resource for rice functional genomics studies.

20.
Xin D, Sun J, Wang J, Jiang H, Hu G, Liu C, Chen Q. Molecular Biology Reports 2012, 39(9): 9047-9057
Microsatellites, or simple sequence repeats (SSRs), are very useful molecular markers for a number of plant species. We used a new publicly available module (TROLL) to extract microsatellites from the public database of soybean expressed sequence tag (EST) sequences. A total of 12,833 sequences containing di- to penta-type SSRs were identified from 200,516 non-redundant soybean ESTs. On average, one SSR was found per 7.25 kb of EST sequences, with the tri-nucleotide motifs being the most abundant. Primer sequences flanking the SSR motifs were successfully designed for 9,638 soybean ESTs using the software primer3.0 and only 59 pairs of them were found in earlier studies. We synthesized 124 pairs of the primers to determine the polymorphism and heterozygosity among eight genotypes of soybean cultivars, which represented a wide range of the cultivated soybean cultivars. PCR amplification products with anticipated SSRs were obtained with 81 pairs of primers; 36 PCR products appeared to be homozygous and the remaining 45 PCR products appeared to be heterozygous and displayed polymorphism among the eight cultivars. We further analysed the EST sequences containing 45 polymorphic EST-SSR markers using the programs BLASTN and BLASTX. Sequence alignment showed that 29 ESTs have homologous sequences and 15 ESTs could be classified into a Uni-gene cluster with comparatively convincing protein products. Among these 15 ESTs belonging to a Uni-gene cluster, 9 SSRs were located in 3'-UTR, 4 SSRs were located in the intron region and 2 SSRs were located in the CDS region. None of these SSRs was located in the 5'-UTR. These novel SSRs identified in the ESTs of soybean provide useful information for gene mapping and cloning in future studies.
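The SSR extraction step described in this abstract can be sketched with a regular expression that looks for perfect tandem repeats of di- to penta-nucleotide motifs. This is an illustration only and does not reproduce TROLL; the minimum repeat counts in `MIN_REPEATS` are assumed thresholds, not those used in the study, and `find_ssrs` is a hypothetical name.

```python
import re

# Minimum number of tandem copies per motif length (illustrative assumptions).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4}


def find_ssrs(seq):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs.

    Covers di- to penta-nucleotide motifs; start is a 0-based position.
    """
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in MIN_REPEATS.items():
        # Group 1 captures one motif; \1{n,} demands at least n further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (for example AAA or ATAT), so each SSR is reported once.
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)
```

A sequence containing seven copies of AT followed later by five copies of GAA would yield one dinucleotide and one trinucleotide hit; flanking primers would then be designed around each reported start position and repeat length.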
