首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
We report on a large-scale expressed sequence tag (EST) sequencing and analysis program aimed at characterizing the sets of genes expressed in roots of the model legume Medicago truncatula during interactions with either of two microsymbionts, the nitrogen-fixing bacterium Sinorhizobium meliloti or the arbuscular mycorrhizal fungus Glomus intraradices. We have designed specific tools for in silico analysis of EST data, in relation to chimeric cDNA detection, EST clustering, encoded protein prediction, and detection of differential expression. Our 21 473 5′- and 3′-ESTs could be grouped into 6359 EST clusters, corresponding to distinct virtual genes, along with 52 498 other M.truncatula ESTs available in the dbEST (NCBI) database that were recruited in the process. These clusters were manually annotated, using a specifically developed annotation interface. Analysis of EST cluster distribution in various M.truncatula cDNA libraries, supported by a refined R test to evaluate statistical significance and by ‘electronic northern’ representation, enabled us to identify a large number of novel genes predicted to be up- or down-regulated during either symbiotic root interaction. These in silico analyses provide a first global view of the genetic programs for root symbioses in M.truncatula. A searchable database has been built and can be accessed through a public interface.  相似文献   

3.
4.
NotI linking clones contain sequences flanking NotI recognition sites and were previously shown to be tightly associated with CpG islands and genes. To directly assess the value of NotI clones in genome research, high density grids with 50 000 NotI linking clones originating from six representative NotI linking libraries were constructed. Altogether, these libraries contained nearly 100 times the total number of NotI sites in the human genome. A total of 3437 sequences flanking NotI sites were generated. Analysis of 3265 unique sequences demonstrated that 51% of the clones displayed significant protein similarity to SWISSPROT and TREMBL database proteins based on MSPcrunch filtering with stringent parameters. Of the 3265 sequences, 1868 (57.2%) were new sequences, not present in the EMBL and EST databases (similarity  90%). Among these new sequences, 795 (24.3%) showed similarity to known proteins and 712 (21.8%) displayed an identity of >75% at the nucleotide level to sequences from EMBL or EST databases. The remaining 361 (11.1%) sequences were completely new, i.e. <75% identical. The work also showed tight, specific association of NotI sites with the first exon and suggest that the so-called 3′ ESTs can actually be generated from 5′-ends of genes that contain NotI sites in their first exon.  相似文献   

5.
6.
A molecular understanding of porcine reproduction is of biological interest and economic importance. Our Midwest Consortium has produced cDNA libraries containing the majority of genes expressed in major female reproductive tissues, and we have deposited into public databases 21,499 expressed sequence tag (EST) gene sequences from the 3 end of clones from these libraries. These sequences represent 10,574 different genes, based on sequence comparison among these data, and comparison with existing porcine ESTs and genes indicate as many as 4652 of these EST clusters are novel. In silico analysis identified sequences that are expressed in specific pig tissues or organs and confirmed the broad expression in pig for many genes ubiquitously expressed in human tissues. Furthermore, we have developed computer software to identify sequence similarity of these pig genes with their human counterparts, and to extract the mapping information of these human homologues from genome databases. We demonstrate the utility of this software for comparative mapping by localizing 61 genes on the porcine physical map for Chromosomes (Chrs) 5, 10, and 14. The following Accession numbers were assigned to our deposited sequences: BF701840 – BF704551, BF708383, BF708386 – BF713604, BG322266 – BG322271, BI398567 – BI405235, BQ597354 – BQ605166.  相似文献   

7.
《DNA research》2008,15(6):333-346
A large collection of full-length cDNAs is essential for the correct annotation of genomic sequences and for the functional analysis of genes and their products. We obtained a total of 39 936 soybean cDNA clones (GMFL01 and GMFL02 clone sets) in a full-length-enriched cDNA library which was constructed from soybean plants that were grown under various developmental and environmental conditions. Sequencing from 5′ and 3′ ends of the clones generated 68 661 expressed sequence tags (ESTs). The EST sequences were clustered into 22 674 scaffolds involving 2580 full-length sequences. In addition, we sequenced 4712 full-length cDNAs. After removing overlaps, we obtained 6570 new full-length sequences of soybean cDNAs so far. Our data indicated that 87.7% of the soybean cDNA clones contain complete coding sequences in addition to 5′- and 3′-untranslated regions. All of the obtained data confirmed that our collection of soybean full-length cDNAs covers a wide variety of genes. Comparative analysis between the derived sequences from soybean and Arabidopsis, rice or other legumes data revealed that some specific genes were involved in our collection and a large part of them could be annotated to unknown functions. A large set of soybean full-length cDNA clones reported in this study will serve as a useful resource for gene discovery from soybean and will also aid a precise annotation of the soybean genome.Key words: EST, full-length cDNA, functional annotation, legume, soybean  相似文献   

8.
9.
The cDNA expression libraries that produce correct proteins are essential in facilitating the identification of protein-protein interactions. The 5′-untranslated regions (UTRs) that are present in the majority of mammalian and non-mammalian genes are predicted to alter the expression of correct proteins from cDNA libraries. We developed a novel cDNA expression library from which 5′-UTRs were removed using a mixture of polymerase chain reaction primers that complement the Kozak sequences we refer to as an “in-frame cDNA library.” We used this library with the protein complementation assay to identify two novel binding partners for ras-related ADP-ribosylation factor-like 11 (ARL11), cellular retinoic acid binding protein 2 (CRABP2), and phosphoglycerate mutase 1 (PGAM1). Thus, the in-frame cDNA library without 5′-UTRs we describe here increases the chance of correctly identifying protein interactions and will have wide applications in both mammalian and non-mammalian detection systems.  相似文献   

10.
Gene expression and processing during mouse male germ cell maturation (spermatogenesis) is highly specialized. Previous reports have suggested that there is a high incidence of alternative 3′-processing in male germ cell mRNAs, including reduced usage of the canonical polyadenylation signal, AAUAAA. We used EST libraries generated from mouse testicular cells to identify 3′-processing sites used at various stages of spermatogenesis (spermatogonia, spermatocytes and round spermatids) and testicular somatic Sertoli cells. We assessed differences in 3′-processing characteristics in the testicular samples, compared to control sets of widely used 3′-processing sites. Using a new method for comparison of degenerate regulatory elements between sequence samples, we identified significant changes in the use of putative 3′-processing regulatory sequence elements in all spermatogenic cell types. In addition, we observed a trend towards truncated 3′-untranslated regions (3′-UTRs), with the most significant differences apparent in round spermatids. In contrast, Sertoli cells displayed a much smaller trend towards 3′-UTR truncation and no significant difference in 3′-processing regulatory sequences. Finally, we identified a number of genes encoding mRNAs that were specifically subject to alternative 3′-processing during meiosis and postmeiotic development. Our results highlight developmental differences in polyadenylation site choice and in the elements that likely control them during spermatogenesis.  相似文献   

11.
We investigated the thermodynamic stability of double-stranded DNAs with an oxidative DNA lesion, 2-hydroxyadenine (2-OH-Ade), in two different sequence contexts (5′-GA*C-3′ and 5′-TA*A-3′, A* represents 2-OH-Ade). When an A*–N pair (N, any nucleotide base) was located in the center of a duplex, the thermodynamic stabilities of the duplexes were similar for all the natural bases except A (N = T, C and G). On the other hand, for the duplexes with the A*–N pair at the end, which mimic the nucleotide incorporation step, the stabilities of the duplexes were dependent on their sequence. The order of stability is T > G > C >> A in the 5′-GA*C-3′ sequences and T > A > C > G in the 5′-TA*A-3′ sequences. Because T/G/C and T/A are nucleotides incorporated opposite to 2-OH-Ade in the 5′-GA*C-3′ and 5′-TA*A-3′ sequences, respectively, these results agree with the tendency of mutagenic misincorporation of the nucleotides opposite to 2-OH-Ade in vitro. Thus, the thermodynamic stability of the A*–N base pair may be an important factor for the mutation spectra of 2-OH-Ade.  相似文献   

12.
13.
A wealth of molecular resources have been developed for rice genomics, including dense genetic maps, expressed sequence tags (ESTs), yeast artificial chromosome maps, bacterial artificial chromosome (BAC) libraries and BAC end sequence databases. Integration of genetic and physical maps involves labor-intensive empirical experiments. To accelerate the integration of the bacterial clone resources with the genetic map for the International Rice Genome Sequencing Project, we cleaned and filtered the available EST and BAC end sequences for repetitive sequences and then searched all available rice genetic markers with our filtered databases. We identified 418 genetic markers that aligned with at least one BAC end sequence with >95% sequence identity, providing a set of large insert clones with an average separation of 1 Mb that can serve as nucleation points for the sequencing phase of the International Rice Genome Sequencing Project.  相似文献   

14.
15.
16.
Analysis of expressed sequence tags from oil palm (Elaeis guineensis)   总被引:3,自引:0,他引:3  
This is the first report of a systematic study of genes expressed by means of expressed sequence tag (EST) analysis in oil palm, a species of the Arecales order, a phylogenetically key clade of monocotyledons that is not widely represented in the sequence databases. Five different cDNA libraries were generated from male and female inflorescences, shoot apices and zygotic embryos and unidirectional systematic sequencing was performed. A total of 2411 valid EST sequences were thus obtained. Cluster analysis enabled the identification of 209 groups of related sequences and 1874 singletons. Putative functions were assigned to 1252 of the set of 2083 non-redundant ESTs obtained. The EST database described here is a first step towards gene discovery and cDNA array-based expression analysis in oil palm.  相似文献   

17.
18.
MAGEST is a database for maternal gene expression information for an ascidian, Halocynthia roretzi. The ascidian has become an animal model in developmental biological research because it shows a simple developmental process, and belongs to one of the chordate groups. Various data are deposited into the MAGEST database, e.g. the 3′- and 5′-tag sequences from the fertilized egg cDNA library, the results of similarity searches against GenBank and the expression data from whole mount in situ hybridization. Over the last 2 years, the data retrieval systems have been improved in several aspects, and the tag sequence entries have increased to over 20 000 clones. Additionally, we constructed a database, translated MAGEST, for the amino acid fragment sequences predicted from the EST data sets. Using this information comprehensively, we should obtain new information on gene functions. The MAGEST database is accessible via the Internet at http://www.genome.ad.jp/magest/.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号