Similar literature
 20 similar articles found (search time: 15 ms)
1.
To identify new vaccine candidates, Eimeria tenella expressed sequence tags (ESTs) from public databases were analysed for secretory molecules with a specially developed automated in silico strategy termed DNAsignalP. A total of 12,187 ESTs were clustered into 2881 contigs, followed by a blastx search, which yielded a significant number of E. tenella contigs with homologies to entries in public databases. Amino acid sequences of appropriate homologous proteins were analysed for the occurrence of an N-terminal signal sequence using the signalP algorithm. The resulting list of 84 entries comprised 51 contigs whose deduced proteins showed homologies to proteins of apicomplexan parasites. Based on function or localisation, we selected candidate proteins classified as (i) secreted proteins of Apicomplexa parasites, (ii) secreted enzymes, and (iii) transport and signalling proteins. To verify our strategy experimentally, we used a functional complementation system in yeast. Five selected candidate proteins were indeed found to be secreted. Our approach thus represents an efficient method to identify secretory and surface proteins from EST databases.

2.
Mining single-nucleotide polymorphisms from hexaploid wheat ESTs.   Cited by: 20 (self-citations: 0; citations by others: 20)
Single-nucleotide polymorphisms (SNPs) represent a new form of functional marker, particularly when they are derived from expressed sequence tags (ESTs). A bioinformatics strategy was developed to discover SNPs within a large wheat EST database and to demonstrate the utility of SNPs in genetic mapping and genetic diversity applications. A collection of more than 90,000 wheat ESTs was assembled into contiguous sequences (contigs), and 45 random contigs were then visually inspected to identify primer pairs capable of amplifying specific alleles. We estimate that homoeologue sequence variants occurred once every 24 bp and that the frequency of SNPs between wheat genotypes was 1 SNP/540 bp (theta = 0.0069). Furthermore, we estimate that one diagnostic SNP test can be developed from every contig with 10-60 EST members. Thus, EST databases are an abundant source of SNP markers. Polymorphism information content for SNPs ranged from 0.04 to 0.50, and ESTs could be mapped into a framework of microsatellite markers using segregating populations. The results showed that SNPs in wheat can be discovered in ESTs, validated, and applied to conventional genetic studies.
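The polymorphism information content range quoted above (0.04 to 0.50) is consistent with the gene-diversity form PIC = 1 − Σ p_i², which peaks at 0.50 for a biallelic marker with equal allele frequencies. The abstract does not say which formula was used, so the Python sketch below assumes that form; the function name `pic` and the example frequencies are illustrative only.

```python
def pic(allele_freqs):
    """Return 1 - sum(p_i^2) for allele frequencies that sum to 1.

    Assumption: this is the gene-diversity form of PIC; the abstract
    does not state which definition the authors applied.
    """
    total = sum(allele_freqs)
    if abs(total - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in allele_freqs)


# Hypothetical allele frequencies for two biallelic SNPs.
print(round(pic([0.5, 0.5]), 2))    # balanced alleles
print(round(pic([0.98, 0.02]), 2))  # rare minor allele
```

Under this assumed definition, a SNP with a rare minor allele falls near the lower end of the reported range and a balanced SNP sits at the upper bound.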

3.
Expressed sequence tags (ESTs) from the Antarctic green alga Pyramimonas gelidicola were analyzed to obtain molecular information on cold acclimation of psychrophilic microorganisms. A total of 2,112 EST clones were sequenced, generating 222 contigs and 219 singletons from control conditions (4 °C) and 200 contigs and 391 singletons from cold-shock conditions (-2 °C). The complete EST sequences were deposited in the DDBJ EST database (http://www.ddbj.nig.ac.jp/index-e.html), and the nucleotide sequences reported in this study are available in DDBJ/EMBL/GenBank. These EST databases of Antarctic green algae can be used in a wide range of studies on psychrophilic genes expressed by polar microorganisms.

4.
5.
EST sequencing of Onychophora and phylogenomic analysis of Metazoa   Cited by: 4 (self-citations: 0; citations by others: 4)
Onychophora (velvet worms) represent a small animal taxon considered to be related to Euarthropoda. We have obtained 1873 5' cDNA sequences (expressed sequence tags, ESTs) from the velvet worm Epiperipatus sp., which were assembled into 833 contigs. BLAST similarity searches revealed that 51.9% of the contigs had matches in the protein databases with expectation values lower than 10^-4. Most ESTs had their best hit with proteins from either Chordata or Arthropoda (approximately 40% each). The ESTs included sequences of 27 ribosomal proteins. The orthologous sequences from 28 other species of a broad range of phyla were obtained from the databases, including other EST projects. A concatenated amino acid alignment comprising 5021 positions was constructed, which covered 4259 positions after problematic regions were removed. Bayesian and maximum likelihood methods place Epiperipatus within the monophyletic Ecdysozoa (Onychophora, Arthropoda, Tardigrada and Nematoda), but its exact relation to the Euarthropoda remained unresolved. The "Articulata" concept was not supported. Tardigrada and Nematoda formed a well-supported monophylum, suggesting that Tardigrada are actually Cycloneuralia. In agreement with previous studies, we have demonstrated that random sequencing of cDNAs results in sequence information suitable for phylogenomic approaches to resolve metazoan relationships.
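The share of contigs with a database match below an expectation-value cutoff, as reported above, can be tallied from BLAST results. The Python sketch below assumes the hits are available as tab-separated tabular output with the query ID in the first column and the E-value in the eleventh; the abstract does not describe the output format used, and the function name and sample rows are hypothetical.

```python
def contigs_with_hits(tabular_lines, max_evalue=1e-4):
    """Return the set of query IDs having at least one hit below max_evalue.

    Assumes 12-column tab-separated BLAST tabular rows:
    query ID in column 1, E-value in column 11.
    """
    hits = set()
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        if float(fields[10]) < max_evalue:
            hits.add(fields[0])
    return hits


# Hypothetical rows for three contigs; contig3 has no sufficiently strong hit.
rows = [
    "contig1\tP1\t80.0\t100\t20\t0\t1\t100\t1\t100\t2e-30\t120",
    "contig2\tP2\t40.0\t50\t30\t0\t1\t50\t1\t50\t5e-05\t45",
    "contig3\tP3\t30.0\t40\t28\t0\t1\t40\t1\t40\t0.5\t20",
]
matched = contigs_with_hits(rows)
print(sorted(matched), f"{100 * len(matched) / 3:.1f}%")
```

The percentage is taken over all assembled contigs, so contigs without any row in the results still count in the denominator.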

6.
A large-scale genomic resource for kelampayan was generated from a developing xylem cDNA library. A total of 6,622 high-quality expressed sequence tags (ESTs) were generated through high-throughput 5′ EST sequencing of cDNA clones. The ESTs were analyzed and assembled to generate 4,728 xylogenesis unigenes distributed in 2,100 contigs and 2,628 singletons. About 59.3% of the ESTs were assigned putative identifications, whereas 40.7% of the sequences showed no significant similarity to any sequences in GenBank. Interestingly, most genes involved in lignin biosynthesis and several other cell wall biosynthesis genes were identified in the kelampayan EST database. The genes identified in this study will be candidates for functional genomics and association genetic studies in kelampayan aimed at the production of high-value forests.

7.
Simple sequence repeats (SSRs) developed from expressed sequence tags (ESTs), known as EST-SSRs, are a widely used and potentially valuable source of gene-based markers because of their high cross-taxon portability and their rapid, less expensive development. The EST sequence information in publicly available databases is increasing rapidly. Emerging computational approaches offer a better way to develop SSR markers from ESTs than conventional methods. In the present study, 12,851 EST sequences of Camellia sinensis, downloaded from the National Center for Biotechnology Information (NCBI), were mined for the development of microsatellites. After preprocessing and assembly with various computational tools, 6148 non-redundant EST sequences (4779 singletons and 1369 contigs) were obtained. Of the total 3822.68 kb of sequence examined, 1636 (26.61%) EST sequences containing 2371 SSRs were detected, a density of 1 SSR/1.61 kb, leading to the development of 245 primer pairs. These mined EST-SSR markers will support further studies of variability, mapping, and evolutionary relationships in Camellia sinensis. In addition, the developed SSRs can also be applied in studies across species.
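The kind of computational SSR mining described above can be illustrated with a small regular-expression scan. This Python sketch is not the tool used in the study, which the abstract does not name; the minimum repeat counts in `MIN_REPEATS`, the function names, and the test sequence are all assumptions chosen for illustration. Only the density figures (3822.68 kb, 2371 SSRs) come from the abstract.

```python
import re

# Assumed minimum repeat counts per motif length (di- to hexanucleotide);
# the abstract does not give the thresholds actually used.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def is_compound(motif):
    """True if the motif is itself a repeat of a shorter unit (e.g. AGAG, AAA)."""
    n = len(motif)
    return any(n % d == 0 and motif == motif[:d] * (n // d) for d in range(1, n))


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # A motif of `size` bases followed by at least min_rep - 1 exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_compound(motif):
                continue  # already reported under the shorter motif, or a homopolymer
            hits.append((m.start(), motif, len(m.group(0)) // size))
    return sorted(hits)


def ssr_density_kb(total_kb, n_ssrs):
    """Average sequence length (kb) per SSR."""
    return total_kb / n_ssrs


# Hypothetical EST fragment with one dinucleotide and one trinucleotide SSR.
est = "TTGC" + "AG" * 8 + "CCTA" + "CTT" * 5 + "GA"
print(find_ssrs(est))
print(round(ssr_density_kb(3822.68, 2371), 2))  # figures from the abstract
```

Skipping compound motifs keeps a dinucleotide run such as (AG)n from being counted a second time as a tetranucleotide (AGAG)n repeat.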

8.
9.
10.
11.
The increasing availability of expressed sequence tags (ESTs) in wheat (Triticum aestivum) and related cereals provides a valuable resource of non-anonymous DNA molecular markers. We examined 170,746 wheat ESTs from the public (International Triticeae EST Cooperative) and Génoplante databases, previously clustered in contigs, for the presence of di- to hexanucleotide simple sequence repeats (SSRs). Analysis of 46,510 contigs identified 3,530 SSRs, which represented 7.5% of the total number of contigs. Only 74% of the sequences allowed primer pairs to be designed, 70% led to an amplification product, mainly of a high quality (68%), and 53% exhibited polymorphism for at least one cultivar among the eight tested. Even though dinucleotide SSRs were less represented than trinucleotide SSRs (15.5% versus 66.5%, respectively), the former showed a much higher polymorphism level (83% versus 46%). The effect of the number and type of repeats is also discussed. The development of new EST-SSR markers will have important implications for the genetic analysis and exploitation of the genetic resources of wheat and related species and will provide a more direct estimate of functional diversity.

12.
13.
This report describes the rationale, approaches, organization, and resource development leading to a large-scale deletion bin map of the hexaploid (2n = 6x = 42) wheat genome (Triticum aestivum L.). Accompanying reports in this issue detail results from chromosome bin-mapping of expressed sequence tags (ESTs) representing genes onto the seven homoeologous chromosome groups and a global analysis of the entire mapped wheat EST data set. Among the resources developed was the first extensive public wheat EST collection (113,220 ESTs). Described are protocols for sequencing, sequence processing, EST nomenclature, and the assembly of ESTs into contigs. These contigs plus singletons (unassembled ESTs) were used for selection of distinct sequence motif unigenes. Selected ESTs were rearrayed, validated by 5′ and 3′ sequencing, and amplified for probing a series of wheat aneuploid and deletion stocks. Images and data for all Southern hybridizations were deposited in databases and were used by the coordinators for each of the seven homoeologous chromosome groups to validate the mapping results. Results from this project have established the foundation for future developments in wheat genomics.

14.
The sequencing of expressed sequence tags (ESTs) from Xenopus laevis has lagged behind efforts on many other common experimental organisms and man, partly because of the pseudotetraploid nature of the Xenopus genome. Nonetheless, large collections of Xenopus ESTs would be useful in gene discovery, oligonucleotide-based knockout studies, gene chip analyses of normal and perturbed development, mapping studies in the related diploid frog X. tropicalis, and for other reasons. We have created a normalized library of cDNAs from unfertilized Xenopus eggs. These cells contain all of the information necessary for the first several cell divisions in the early embryo, as well as much of the information needed for embryonic pattern formation and cell fate determination. To date, we have successfully sequenced 13,879 ESTs out of 16,607 attempts (83.6% success rate), with an average sequence read length of 508 bp. Using a fragment assembly program, these ESTs were assembled into 8,985 'contigs' composed of up to 11 ESTs each. When these contigs were used to search publicly available databases, 46.2% bore no relationship to protein or DNA sequences in the database at the significance level of 1e-6. Examination of a sample of 100 of the assembled contigs revealed that most (approximately 87%) were composed of two apparent allelic variants. Expression profiles of 16 of the most prominent contigs showed that 12 exhibited some degree of zygotic expression. These findings have implications for sequence-specific applications for Xenopus ESTs, particularly the use of allele-specific oligonucleotides for knockout studies, differential hybridization techniques such as gene chip analysis, and the establishment of accurate nomenclature and databases for this species.

15.
16.
Rattans serve as an important source of raw non-wood materials for furniture and handicraft industries worldwide. However, their genomic sequence information in public databases is very limited. In this study, a set of 2,528 good-quality expressed sequence tags (ESTs) were generated from a full-length cDNA library constructed previously with root, stem and male inflorescence tissues of Calamus simplicifolius C. F. Wei, a rattan species native to Hainan Island, China. The ESTs were assembled into 1,588 unigenes, including 1,221 singletons and 367 contigs. BlastX searches against the GenBank non-redundant protein database revealed that 1,248 (78.6%) unigenes had at least one significant match (E ≤ 10^-5). The gene ontology functional classification assigned 991, 669 and 977 of the unigenes to the cellular component, molecular function and biological process categories, respectively. A total of 71 simple sequence repeat (SSR) loci were developed among these ESTs, including 65 polymorphic across 19 rattan species representing three genera. High levels of cross-species/genus transferability were observed for the EST-SSRs. For the polymorphic EST-SSR markers, the number of alleles per locus and polymorphic information content ranged from 2 to 25 (mean 11.1) and from 0.135 to 0.949 (mean 0.695), respectively. The EST sequences and the EST-SSR primers have been deposited in GenBank databases of EST (IDs JK838364–40891) and Probe (IDs Pr16718978–9048, to be assigned).

17.
To study gene expression in the water flea Daphnia magna, we constructed a cDNA library and characterized the expressed sequence tags (ESTs) of 7210 clones. The EST sequences clustered into 2958 nonredundant groups. BLAST analyses of both protein and DNA databases showed that 1218 (41%) of the unique sequences shared significant similarities to known nucleotide or amino acid sequences, whereas the remaining 1740 (59%) showed no significant similarities to other genes. Clustering analysis revealed particularly high expression of genes related to ATP synthesis, structural proteins, and proteases. The cDNA clones and EST sequence information should be useful for future functional analysis of daphnid biology and investigation of the links between ecology and genomics.

18.
Rhizoctonia solani is a ubiquitous basidiomycetous soilborne fungal pathogen causing damping-off of seedlings, aerial blights and postharvest diseases. To gain insight into the molecular mechanisms of pathogenesis, a global approach based on analysis of expressed sequence tags (ESTs) was undertaken. To get broad gene-expression coverage, two normalized EST libraries were developed from mycelia grown under high nitrogen-induced virulent and low nitrogen/methylglucose-induced hypovirulent conditions. A pilot-scale assessment of gene diversity was made from the sequence analyses of the two libraries. A total of 2280 cDNA clones were sequenced, corresponding to 220 unique sequence sets or clusters (contigs) and 805 singlets, making up a total of 1025 unique genes identified from the two virulence-differentiated cDNA libraries. From the total sequences, 295 genes (38.7%) exhibited strong similarities with genes in public databases and were categorized into 11 functional groups. Approximately 61.3% of the R. solani ESTs have no apparent homologs in publicly available fungal genome databases and are considered unique genes. We have identified several cDNAs with potential roles in fungal pathogenicity, virulence, signal transduction, vegetative incompatibility and mating, drug resistance, lignin degradation, bioremediation and morphological differentiation. A codon-usage table has been formulated based on 14694 R. solani EST codons. Further analysis of ESTs might provide insights into virulence mechanisms of R. solani AG 4 as well as roles of these genes in development, saprophytic colonization and ecological adaptation of this important fungal plant pathogen.
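A codon-usage table of the kind mentioned above is, at its simplest, a tally of in-frame codons across coding sequences. The Python sketch below shows that tally under the assumption that each input sequence is already in reading frame; the abstract does not describe how the authors built their table, and the function name and example sequences are hypothetical.

```python
from collections import Counter


def codon_usage(cds_list):
    """Count codons across in-frame coding sequences.

    Returns (counts, freqs): raw codon counts and their relative frequencies.
    Codons containing ambiguous bases (e.g. N) and trailing partial codons
    are ignored.
    """
    counts = Counter()
    for cds in cds_list:
        cds = cds.upper()
        usable = len(cds) - len(cds) % 3  # drop an incomplete final codon
        for i in range(0, usable, 3):
            codon = cds[i:i + 3]
            if set(codon) <= set("ACGT"):
                counts[codon] += 1
    total = sum(counts.values())
    freqs = {c: n / total for c, n in counts.items()} if total else {}
    return counts, freqs


# Hypothetical in-frame fragments; the second contains an ambiguous base.
counts, freqs = codon_usage(["ATGGCTGCTTAA", "atggcnTGA"])
for codon in sorted(counts):
    print(codon, counts[codon], round(freqs[codon], 3))
```

A fuller table would usually group codons by the amino acid they encode and report usage relative to synonymous codons, which needs a genetic-code lookup that is omitted here.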

19.
20.
Using a strategy requiring only modest computational resources, wheat expressed sequence tag (EST) sequences from various sources were assembled into contigs and compared with a nonredundant barley sequence assembly, with ESTs, with complete draft genome sequences of rice and Arabidopsis thaliana, and with ESTs from other plant species. These comparisons indicate that (i) wheat sequences available from public sources represent a substantial proportion of the diversity of wheat coding sequences, (ii) prediction of open reading frames in the whole genome sequence improves when supplemented with EST information from other species, (iii) a substantial number of candidates for novel genes that are unique to wheat or related species can be identified, and (iv) a smaller number of genes can be identified that are common to monocots and dicots but absent from Arabidopsis. The sequences in the last group may have been lost from Arabidopsis after descent from a common ancestor. Examples of potential novel wheat genes and Triticeae-specific genes are presented.
