Similar literature
 20 similar documents found (search time: 15 ms)
1.
2.
In an effort to determine genes that are expressed in mycelial cultures of Neurospora crassa over the course of the circadian day, we have sequenced 13,000 cDNA clones from two time-of-day-specific libraries (morning and evening library) generating approximately 20,000 sequences. Contig analysis allowed the identification of 445 unique expressed sequence tags (ESTs) and 986 ESTs present in multiple cDNA clones. For approximately 50% of the sequences (710 of 1431), significant matches to sequences in the National Center for Biotechnology Information database (of known or unknown function) were detected. About 50% of the ESTs (721 of 1431) showed no similarity to previously identified genes. We hybridized Northern blots with probes derived from 26 clones chosen from contigs identified by multiple cDNA clones and EST sequences. Using these sequences, the representation of genes among the morning and evening sequences, respectively, in most cases does not reflect their expression patterns over the course of the day. Nevertheless, we were able to identify four new clock-controlled genes (ccgs). On the basis of these data we predict that a significant proportion of the expressed Neurospora genes may be regulated by the circadian clock. The mRNA levels of all four genes peak in the subjective morning, as is the case with previously identified ccgs.

3.
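A new EST clustering method   Total citations: 11; self-citations: 0; citations by others: 11
This study developed an EST (expressed sequence tag) clustering method, ESTClustering, for analyzing the large volumes of data produced by large-scale EST sequencing in order to obtain high-quality, non-redundant expressed sequences. During clustering, the method uses the MEGABLAST tool to compare consensus sequences for sequence homology and uses the phrap program to verify the assembly of each EST cluster. This clustering strategy reduces the impact of sequencing errors, effectively identifies members of gene families, and avoids interference from alternative splicing. Compared with the UniGene clustering method of NCBI (National Center for Biotechnology Information), the clustering results of ESTClustering better reflect the diversity of expressed sequences. In a test clustering 112,256 Arabidopsis thaliana ESTs, ESTClustering produced 23,581 EST clusters, of which 13,597 corresponded to coding sequences in the Arabidopsis genome, close to the number of predicted genes in that genome supported by EST evidence. Applying the method to 147,191 collected rice EST sequences yielded 33,896 EST clusters.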
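
The general idea of grouping ESTs that share significant similarity can be sketched as below. This is only a minimal illustration, not the ESTClustering method itself: it assumes the pairwise similarity hits have already been computed by some external tool, and it applies plain single-linkage grouping with a union-find structure. It does not perform the MEGABLAST consensus comparison or the phrap assembly check described in the abstract, and the function name `cluster_ests` and the sample identifiers are hypothetical.

```python
from collections import defaultdict


def cluster_ests(est_ids, similar_pairs):
    """Group EST identifiers into clusters by single linkage (union-find).

    est_ids: iterable of EST identifiers (hypothetical names).
    similar_pairs: pairs of identifiers assumed to have passed some
    similarity threshold in an external comparison step.
    """
    parent = {est: est for est in est_ids}

    def find(est):
        # Walk up to the cluster representative, halving the path as we go.
        while parent[est] != est:
            parent[est] = parent[parent[est]]
            est = parent[est]
        return est

    for a, b in similar_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two clusters

    clusters = defaultdict(list)
    for est in est_ids:
        clusters[find(est)].append(est)
    # Sort members and clusters so the result is deterministic.
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical input: five ESTs and three similarity hits.
ests = ["est1", "est2", "est3", "est4", "est5"]
pairs = [("est1", "est2"), ("est2", "est3"), ("est4", "est5")]
clusters = cluster_ests(ests, pairs)
print(clusters)
```

Single linkage merges any two ESTs connected by a chain of hits, so on its own it would tend to merge gene family members and chimeric sequences; a per-cluster assembly check of the kind the abstract describes is presumably what guards against that.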

4.
Chicken expressed sequence tags (ESTs) were analyzed to identify genes associated with myogenesis during embryonic development. A total of 6,184 ESTs were generated from three cDNA libraries constructed from whole embryos (Stage 26), somites associated with neural tube (Stage 15), and limb buds (Stages 21, 24 and 26). Clustering and assembly of 4,998 valid ESTs resulted in 2,329 unique sequences with 902 clusters (38.7%) and 1,427 singletons (61.3%). There are more than 400,000 chicken ESTs available at GenBank and we were able to identify 143 novel sequences. From these, 45 sequences found either a human EST homolog or a match with conserved regions among proteins. Most of these sequences were found to be expressed in somites, an important tissue for muscle development and not characterized before. This study revealed the value of microdissected embryonic libraries for describing gene expression profiles associated with myogenesis and gene discovery.

5.
6.
Soybean rust is caused by the obligate fungal pathogen Phakopsora pachyrhizi Sydow. A unidirectional cDNA library was constructed using mRNA isolated from germinating P. pachyrhizi urediniospores to identify genes expressed at this physiological stage. Single pass sequence analysis of 908 clones revealed 488 unique expressed sequence tags (ESTs, unigenes) of which 107 appeared as multiple copies. BLASTX analysis identified 189 unigenes with significant similarities (E-value < 10^-5) to sequences deposited in the NCBI non-redundant protein database. A search against the NCBI dbEST using the BLASTN algorithm revealed 32 ESTs with high or moderate similarities to plant and fungal sequences. Using the Expressed Gene Anatomy Classification, 31.7% of these ESTs were involved in primary metabolism, 14.3% in gene/protein expression, 7.4% in cell structure and growth, 6.9% in cell division, 4.8% in cell signaling/cell communication, and 4.8% in cell/organism defense. Approximately 29.6% of the identities were to hypothetical proteins and proteins with unknown function.

7.
8.
Efforts to construct a genetic linkage map of channel catfish have involved identification of random genomic microsatellite markers, as well as anchored Type I loci (expressed genes) from channel catfish. To identify Type I markers we constructed a directional cDNA library from brain tissue to obtain expressed catfish sequences that could be used for single nucleotide polymorphism (SNP) marker development. These cDNA sequences surprisingly contained a high proportion of microsatellites (about 14%) in noncoding regions of expressed sequence tags (ESTs), many of which were not associated with known sequences. To further identify cDNAs with microsatellites and reduce the number of sequencing reactions needed for marker development, we enriched this library for repeat sequences and sequenced clones from both directions. A total of 1644 clones from seven repeat-enriched captures (CA, GT, CT, GA, MTT, TAG, and TAC) were sequenced from both ends, and 795 nonredundant clones were assembled. Thirty-seven percent of the clones contained microsatellites in the trimmed sequence. After assembly in the TIGR Catfish Gene Index (CfGI), 154 contigs matched known vertebrate genes and 92 contigs contained microsatellites. When BLAST-matched orthologues were available for similarity alignments, 28% of these contigs contained repeats in the 5'-UTR, 72% contained repeats in the 3'-UTR, and 8% contained repeats at both ends. Using biotinylated repeat oligonucleotides coupled with streptavidin-coated magnetic beads, and rapid, single-pass hybridization, we were able to enrich our plasmid library greater than two-fold for repeat sequences and increase the ability to link these ESTs with known sequences greater than six-fold.

10.
To help develop an understanding of the genes that govern the developmental characteristics of the potato (Solanum tuberosum), as well as the genes associated with responses to specified pathogens and storage conditions, the Canadian Potato Genome Project (CPGP) carried out 5′ end sequencing of regular, normalized and full-length cDNA libraries of the Shepody potato cultivar, generating over 66,600 expressed sequence tags (ESTs). Libraries sequenced represented tuber developmental stages, pathogen-challenged tubers, as well as leaf, floral developmental stages, suspension cultured cells and roots. All libraries analysed to date have contributed unique sequences, with the normalized libraries high on the list. In addition, a low molecular weight library has enhanced the 3′ ends of our sequence assemblies. Using the combined assembly dataset, unique tuber developmental, cold storage and pathogen-challenged sequences have been identified. A comparison of the ESTs specific to the pathogen-challenged tuber and foliar libraries revealed minimal overlap between these libraries. Mixed assemblies using over 189,000 potato EST sequences from CPGP and The Institute for Genomic Research (TIGR) have revealed common sequences, as well as CPGP- and TIGR-unique sequences. Electronic supplementary material is available for this article and accessible to authorised users.

11.
To isolate useful and interesting plant genes in large quantities, random sequencing of cDNA clones from an ethylene-treated potato leaf library was performed. Partial sequences of 210 randomly selected clones with inserts longer than 500 base pairs (bp) and a poly(A) tail were compared with sequences in the GenBank, EMBL and DDBJ nucleic acid databases, yielding 193 expressed sequence tags (ESTs). The 210 cDNA clones identified are related to various aspects of metabolic pathways such as glycolysis, amino acid synthesis, translation mechanism, ribosome synthesis, hormone response, stress response, regulation of gene expression, and signal transduction. Among the 193 ESTs, 12 ESTs (29 cDNA clones) appeared more than once, and 181 ESTs appeared once and were regarded as a solitary group. Out of 210 clones, 29 clones (13.8%) have no similarity to the known nucleotide sequences and could serve as a potentially useful resource for plant molecular biology with respect to particular genes. Nucleotide sequencing to generate more ESTs from ethylene-induced as well as non-induced potato leaf is in progress.

12.
In an effort to expand the Gossypium hirsutum L. (cotton) expressed sequence tag (EST) database, ESTs representing a variety of tissues and treatments were sequenced. Assembly of these sequences with ESTs already in the EST database (dbEST, GenBank) identified 9675 cotton sequences not present in GenBank. Statistical analysis of a subset of these ESTs identified genes likely differentially expressed in stems, cotyledons, and drought-stressed tissues. Annotation of the differentially expressed cDNAs tentatively identified genes involved in lignin metabolism, starch biosynthesis and stress response, consistent with pathways likely to be active in the tissues under investigation. Simple sequence repeats (SSRs) were identified among these ESTs, and an inexpensive method was developed to screen genomic DNA for the presence of these SSRs. At least 69 SSRs potentially useful in mapping were identified. Selected amplified SSRs were isolated and sequenced. The sequences corresponded to the EST containing the SSRs, confirming that these SSRs will potentially map the gene represented by the EST. The ESTs containing SSRs were annotated to help identify the genes that may be mapped using these markers.

13.
14.
15.
A comprehensive complementary DNA (cDNA) library is a valuable resource for functional genomics. In this study, we set up a normalized cDNA library of Mo17 (MONL) by saturation hybridization with genomic DNA, which contained expressed genes of eight tissues and organs from inbred Mo17 of maize (Zea mays L.). In this library, the insert sizes range from 0.4 kb to 4 kb and the average size is 1.18 kb. A total of 10,830 clones were spotted on nylon membrane to make a cDNA microarray. Three hundred randomly picked clones from the cDNA library were sequenced. The cDNA microarray was hybridized with pooled tissue mRNA probes or housekeeping gene cDNA probes. The results showed that the normalized cDNA library comprehensively includes tissue-specific genes, of which 71% are unique ESTs (expressed sequence tags) based on the 300 sequences analyzed. Using the BLAST program to compare the sequences against online nucleotide databases, 88% of the sequences were found in ZmDB or NCBI, and 12% were not found in existing nucleotide databases. More than 73% of the sequences are of unknown function. The library could be extensively used in developing DNA markers, sequencing ESTs, mining new genes, positional cloning and candidate-gene identification, and developing microarrays in maize genomics research.

16.
The bay scallop, Argopecten irradians irradians, introduced from North America, has become one of the most important aquaculture species in China. In an effort to identify scallop genes involved in host defense, a high-quality cDNA library was constructed from whole body tissues of the bay scallop. A total of 5828 successful sequencing reactions yielded 4995 expressed sequence tags (ESTs) longer than 100 bp. Cluster and assembly analyses of the ESTs identified 637 contigs (consisting of 2853 sequences) and 2142 singletons, totaling 2779 unique sequences. Basic Local Alignment Search Tool (BLAST) analysis showed that the majority (73%) of the unique sequences had no significant homology (E-value ≤ 0.005) to sequences in GenBank. Among the 748 sequences with significant GenBank matches, 160 (21.4%) were for genes related to metabolism, 131 (17.5%) for cell/organism defense, 124 (16.6%) for gene/protein expression, 83 (11.1%) for cell structure/motility, 70 (9.4%) for cell signaling/communication, 17 (2.3%) for cell division, and 163 (21.8%) matched to genes of unknown functions. The list of host-defense genes included many genes with known and important roles in innate defense such as lectins, defensins, proteases, protease inhibitors, heat shock proteins, antioxidants, and Toll-like receptors. The study provides a significant number of ESTs for gene discovery and candidate genes for studying host defense in scallops and other molluscs.

17.
18.
Using a strategy requiring only modest computational resources, wheat expressed sequence tag (EST) sequences from various sources were assembled into contigs and compared with a nonredundant barley sequence assembly, with ESTs, with complete draft genome sequences of rice and Arabidopsis thaliana, and with ESTs from other plant species. These comparisons indicate that (i) wheat sequences available from public sources represent a substantial proportion of the diversity of wheat coding sequences, (ii) prediction of open reading frames in the whole genome sequence improves when supplemented with EST information from other species, (iii) a substantial number of candidates for novel genes that are unique to wheat or related species can be identified, and (iv) a smaller number of genes can be identified that are common to monocots and dicots but absent from Arabidopsis. The sequences in the last group may have been lost from Arabidopsis after descent from a common ancestor. Examples of potential novel wheat genes and Triticeae-specific genes are presented.

19.
The indica subspecies of cultivated rice occupies the largest area of rice production in the world. However, a systematic analysis of cDNA sequences from the indica subspecies has not been performed. The aim of the present study was to collect and analyze the expressed sequence tags (ESTs) of indica rice on a large scale. A total of 39 208 raw sequences were generated from a normalized cDNA library prepared by use of 15 different tissues of the indica cultivar Minghui 63. After trimming, processing and analysis, 17 835 unique sequences were obtained, each of which presumably represents a unique gene. Of these sequences, 2663 were novel, and at least 70 were indica specific. Comparison of the Minghui 63 sequences with the ESTs/full-length cDNAs in GenBank revealed a large number of deletion/insertion/substitution (DIS) at both the inter- and intra-subspecific levels. The overall number of polymorphisms in the expressed sequences was higher in the inter-subspecific comparisons than in the intra-subspecific comparisons. However, the extent of DIS-based polymorphism was highly variable among different rice varieties. In total, 15 726 unique sequences, including 697 novel sequences, were assigned to regions where large numbers of quantitative trait loci (QTLs) for agronomic traits had been detected previously. These results may be useful for developing new molecular markers for genetic mapping, detecting allelic polymorphisms associated with phenotypic variations between rice varieties, and facilitating QTL cloning by providing the starting points for candidate-gene identification.

20.
Late blight, caused by Phytophthora infestans, is one of the most devastating diseases in cultivated potato. Breeding of new potato cultivars with high levels of resistance to P. infestans is considered the most durable strategy for future potato cultivation. In this study, we report the identification of a new late-blight resistance (R) locus from the wild potato species Solanum bulbocastanum. Using several different approaches, a high-resolution genetic map of the new locus was generated, delimiting Rpi-blb3 to a 0.93 cM interval on chromosome 4. One amplified fragment length polymorphism marker was identified that cosegregated with Rpi-blb3 in 1,396 progeny plants of an intraspecific mapping population. For comparative genomics purposes, markers linked to Rpi-blb3 were tested in mapping populations used to map the three other late-blight R loci Rpi-abpt, R2, and R2-like also to chromosome 4. Marker order and allelic conservation suggest that Rpi-blb3, Rpi-abpt, R2, and R2-like reside in the same R gene cluster on chromosome 4 and likely belong to the same gene family. Our findings provide novel insights into the evolution of R gene clusters conferring late-blight resistance in Solanum spp.
