Similar articles
1.
Maiti AK  Jorissen M  Bouvagnet P 《Genome biology》2001,2(7):research0026.1-research0026.9

Background

Immotile cilia syndrome (ICS) or primary ciliary dyskinesia (PCD) is an autosomal recessive disorder in humans in which the beating of cilia and sperm flagella is impaired. Ciliated epithelial cell linings are present in many tissues. To understand ciliary assembly and motility, it is important to isolate those genes involved in the process.

Results

Total RNA was isolated from cultured ciliated nasal epithelial cells after in vitro ciliogenesis, and expressed sequence tags (ESTs) were generated. The functions and locations of 63 of these ESTs were derived by BLAST from two public databases. These ESTs fall into four classes. One group has high homology not only with the mitochondrial genome but also with one or more chromosomal DNAs, suggesting that very similar genes, or genes with very similar domains, are expressed from both mitochondrial and nuclear DNA. A second class comprises genes with complete homology with part of a known gene, suggesting that they are the same genes. A third group has partial homology with domains of known genes. A fourth group, constituting 33% of the ESTs characterized, has no significant homology with any gene or EST in the database.

Conclusions

We have shown that sufficient information about the location of ESTs could be derived electronically from the recently completed human genome sequences. This strategy of EST localization should be significantly useful for mapping and identification of new genes in the forthcoming human genome sequences with the vast number of ESTs in the dbEST database.

2.
3.
4.
Wu XL  Griffin KB  Garcia MD  Michal JJ  Xiao Q  Wright RW  Jiang Z 《Gene》2004,340(2):213-225
The launch of large-scale chicken expressed sequence tag (EST) projects has placed the chicken in the lead for the number of EST sequences among agriculturally important animals. More than 451,000 chicken ESTs derived from over 158 libraries had been deposited in the NCBI dbEST database as of December 2003. But how many genes these ESTs represent and how they are expressed in different chicken tissues/organs remain undetermined. In the present research, we developed a human gene-based strategy for a census of chicken orthologous genes and identification of their expression patterns. Among 34,157 human coding genes used in the study, BLAST analysis revealed that 11,066 genes provisionally matched 248,628 chicken ESTs. Based on the average EST abundance of the orthologous genes, the current public repository of chicken ESTs could represent 20,000 provisional genes. Analysis of gene expression in 14 single tissues/organs showed that approximately 15% of genes were expressed exclusively in a single tissue/organ, whereas the remaining 85% of genes were co-expressed in two or more tissues/organs. A majority (91.15%) of genes expressed in chicken embryos were also expressed at post-hatch stages, indicating that most genes activated in chicken embryos could serve housekeeping functions. Self-organizing maps (SOM) analysis organized 8807 provisional genes in selected chicken tissues into 98 clusters, with each cluster being indicative of common regulatory factors and pathways. A total of 969 provisional orthologous genes were identified as preferentially expressed genes (PEGs) in various chicken tissues/organs (LOD>3.0). No doubt, the present study on gene expression patterns will provide insight into the dynamics of metabolic pathways and tissue/organ programming and reprogramming in chickens.
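The abstract above does not spell out how the figure of 20,000 provisional genes was reached. One plausible reading, sketched here with the numbers quoted in the abstract, is to divide the total number of deposited ESTs by the average number of ESTs per matched gene. The function name `estimate_gene_count` is illustrative, and the calculation is an assumption rather than the authors' documented procedure.

```python
# Figures quoted in the abstract (the total is stated as "more than 451,000").
TOTAL_CHICKEN_ESTS = 451_000
MATCHED_HUMAN_GENES = 11_066
MATCHED_CHICKEN_ESTS = 248_628


def estimate_gene_count(total_ests, matched_genes, matched_ests):
    """Extrapolate a gene count from the average EST abundance per matched gene."""
    ests_per_gene = matched_ests / matched_genes
    return total_ests / ests_per_gene


ests_per_gene = MATCHED_CHICKEN_ESTS / MATCHED_HUMAN_GENES
estimated_genes = estimate_gene_count(
    TOTAL_CHICKEN_ESTS, MATCHED_HUMAN_GENES, MATCHED_CHICKEN_ESTS
)

print(f"Average ESTs per matched gene: {ests_per_gene:.1f}")
print(f"Extrapolated provisional genes: {estimated_genes:.0f}")
```

Under this reading the extrapolation lands close to the 20,000 quoted in the abstract, which suggests, but does not prove, that a calculation of this kind was used.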

5.
A quality control algorithm for DNA sequencing projects.   Total citations: 2, self-citations: 0, citations by others: 2
Heterologous DNA sequences from rearrangements with the genomes of host cells, genomic fragments from hybrid cells, or impure tissue sources can threaten the purity of libraries that are derived from RNA or DNA. Hybridization methods can only detect contaminants from known or suspected heterologous sources, and whole library screening is technically very difficult. Detection of contaminating heterologous clones by sequence alignment is only possible when related sequences are present in a known database. We have developed a statistical test to identify heterologous sequences that is based on the differences in hexamer composition of DNA from different organisms. This test does not require that sequences similar to potential heterologous contaminants are present in the database, and can in principle detect contamination by previously unknown organisms. We have applied this test to the major public expressed sequence tag (EST) data sets to evaluate its utility as a quality control measure and a peer evaluation tool. There is detectable heterogeneity in most human and C. elegans EST data sets, but it is not apparently associated with cross-species contamination. However, there is direct evidence for both yeast and bacterial sequence contamination in some public database sequences annotated as human. Results obtained with the hexamer test have been confirmed with similarity searches using sequences from the relevant data sets.
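The idea of comparing hexamer composition can be illustrated with a small sketch. The abstract does not describe the actual statistic, so the log-likelihood ratio below is a stand-in chosen for illustration, not the authors' test; the function names, the pseudocount, and the toy sequences are all assumptions.

```python
import math
from collections import Counter
from itertools import product

# All 4^6 = 4096 possible DNA hexamers.
HEXAMERS = ["".join(p) for p in product("ACGT", repeat=6)]


def hexamer_frequencies(seq, pseudocount=1.0):
    """Return smoothed relative frequencies of every hexamer in seq."""
    seq = seq.upper()
    counts = Counter(seq[i:i + 6] for i in range(len(seq) - 5))
    total = sum(counts[h] for h in HEXAMERS) + pseudocount * len(HEXAMERS)
    return {h: (counts[h] + pseudocount) / total for h in HEXAMERS}


def log_likelihood_ratio(seq, freqs_a, freqs_b):
    """Sum log(P_a / P_b) over the hexamers of seq.

    Positive scores mean seq resembles source A, negative scores source B.
    Hexamers containing ambiguous bases (e.g. N) are skipped.
    """
    seq = seq.upper()
    score = 0.0
    for i in range(len(seq) - 5):
        hexamer = seq[i:i + 6]
        if hexamer in freqs_a:
            score += math.log(freqs_a[hexamer] / freqs_b[hexamer])
    return score


# Toy reference sequences standing in for two organisms (hypothetical data).
host_reference = "ATATATGCATAT" * 40
other_reference = "GCGCGGCCGCGC" * 40
host_freqs = hexamer_frequencies(host_reference)
other_freqs = hexamer_frequencies(other_reference)

host_like = log_likelihood_ratio("ATATATGCATATATATATGC", host_freqs, other_freqs)
other_like = log_likelihood_ratio("GCGCGGCCGCGCGCGC", host_freqs, other_freqs)
print(host_like > 0, other_like < 0)
```

A real application would train the frequency tables on large reference collections and calibrate a significance threshold; neither step is shown here.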

6.
Human bone marrow stromal cells (HBMSC) are pluripotent cells with the potential to differentiate into osteoblasts, chondrocytes, myelosupportive stroma, and marrow adipocytes. We used high-throughput DNA sequencing analysis to generate 4258 single-pass sequencing reactions (known as expressed sequence tags, or ESTs) obtained from the 5' (97) and 3' (4161) ends of human cDNA clones from a HBMSC cDNA library. Our goal was to obtain tag sequences from the maximum number of possible genes and to deposit them in the publicly accessible database for ESTs (dbEST of the National Center for Biotechnology Information). Comparisons of our EST sequencing data with nonredundant human mRNA and protein databases showed that the ESTs represent 1860 gene clusters. The EST sequencing data analysis showed 60 novel genes found only in this cDNA library after BLAST analysis against 3.0 million ESTs in NCBI's dbEST database. The BLAST search also showed the identified ESTs that have close homology to known genes, which suggests that these may be newly recognized members of known gene families. The gene expression profile of this cell type is revealed by analyzing both the frequency with which a message is encountered and the functional categorization of expressed sequences. Comparing an EST sequence with the human genomic sequence database enables assignment of an EST to a specific chromosomal region (a process called digital gene localization) and often enables immediate partial determination of intron/exon boundaries within the genomic structure. It is expected that high-throughput EST sequencing and data mining analysis will greatly promote our understanding of gene expression in these cells and of growth and development of the skeleton.

7.
Expressed sequence tags (ESTs) from fungal-infected plant tissues are composed of a mixture of plant and fungal sequences. Using freely available software and tools, a novel procedure is described for distinguishing plant and fungal DNA sequences. Although the GenBank non-redundant (NR) database is larger, and one would therefore presume that BLASTX analysis of it would be more accurate, superior resolution of 700 randomly selected fungal ESTs was found with Standalone TBLASTX analyses against a local matching database composed of a plant and a fungal genome. Standalone TBLASTX analyses of 3,983 ESTs from nine different fungal-infected plant EST libraries also proved to be superior to GenBank BLASTX analysis in identifying the origin of sequences as either plant or fungal. Standalone TBLASTX with a matching database composed of a single plant and a single fungal genome appears to be a faster and more accurate method than BLASTX searches of the GenBank non-redundant database for distinguishing fungal and plant sequences in mixed EST collections.
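The classification step described above can be sketched as a best-hit assignment over search results. The sketch assumes the hits are available as 12-column tab-separated lines (query, subject, percent identity, alignment length, mismatches, gap opens, query start/end, subject start/end, E-value, bit score) and that each subject identifier carries a hypothetical `plant|` or `fungus|` prefix marking which genome it came from. The prefix scheme, the E-value cutoff, and the function name are illustrative assumptions, not details from the paper.

```python
def classify_by_best_hit(tabular_lines, max_evalue=1e-5):
    """Assign each query EST to the origin of its highest-scoring hit.

    Queries with no hit passing the E-value cutoff are absent from the result.
    """
    best = {}  # query id -> (origin, bit score)
    for line in tabular_lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        query, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        if evalue > max_evalue:
            continue
        origin = subject.split("|", 1)[0]  # hypothetical "plant|..." / "fungus|..." prefix
        if query not in best or bitscore > best[query][1]:
            best[query] = (origin, bitscore)
    return {query: origin for query, (origin, _) in best.items()}


# Made-up example hits for three ESTs.
example_hits = [
    "EST001\tplant|chr1\t88.0\t120\t14\t0\t1\t360\t1000\t1359\t1e-30\t150.0",
    "EST001\tfungus|sc3\t60.0\t90\t36\t0\t10\t279\t500\t769\t1e-08\t55.0",
    "EST002\tfungus|sc7\t92.0\t110\t9\t0\t1\t330\t200\t529\t1e-40\t190.0",
    "EST003\tplant|chr4\t40.0\t50\t30\t0\t1\t150\t10\t159\t0.5\t25.0",
]
origins = classify_by_best_hit(example_hits)
print(origins)
```

In this toy input EST003 stays unclassified because its only hit fails the cutoff; how such sequences were handled in the study is not stated in the abstract.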

8.
9.
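Screening of differentially expressed genes in transformed rat embryonic fibroblast cell lines   Total citations: 9, self-citations: 5, citations by others: 4
Of two cell lines derived from transformed rat embryonic fibroblasts, A1-5 cells show much stronger radioresistance than B4 cells, accompanied by an unusually pronounced G2 delay. The two lines were subtracted against each other by PCR-select suppression subtractive hybridization, in the hope of finding the gene or genes that play a key role in the unusual phenotype of A1-5 cells. This yielded 160 subtracted transformants, each of which was sequenced and analyzed by dot blot hybridization, giving a total of 35 differentially expressed gene fragments (ESTs). BLAST homology searches against the non-redundant nucleotide database (NT), the rodent EST database, and the human EST database of the US National Center for Biotechnology Information (NCBI) showed that 21 of them represent novel genes not yet deposited, while the other 14 are each highly homologous to known genes.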

10.
11.
In an effort to expand the Gossypium hirsutum L. (cotton) expressed sequence tag (EST) database, ESTs representing a variety of tissues and treatments were sequenced. Assembly of these sequences with ESTs already in the EST database (dbEST, GenBank) identified 9675 cotton sequences not present in GenBank. Statistical analysis of a subset of these ESTs identified genes likely differentially expressed in stems, cotyledons, and drought-stressed tissues. Annotation of the differentially expressed cDNAs tentatively identified genes involved in lignin metabolism, starch biosynthesis and stress response, consistent with pathways likely to be active in the tissues under investigation. Simple sequence repeats (SSRs) were identified among these ESTs, and an inexpensive method was developed to screen genomic DNA for the presence of these SSRs. At least 69 SSRs potentially useful in mapping were identified. Selected amplified SSRs were isolated and sequenced. The sequences corresponded to the EST containing the SSRs, confirming that these SSRs will potentially map the gene represented by the EST. The ESTs containing SSRs were annotated to help identify the genes that may be mapped using these markers.
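Identifying SSRs in EST sequences can be sketched with a regular expression that looks for a short unit repeated in tandem. The abstract does not state the search criteria, so the unit lengths (2 to 6 bases) and the minimum of four repeats below are illustrative assumptions, as is the function name `find_ssrs`.

```python
import re


def find_ssrs(seq, min_unit=2, max_unit=6, min_repeats=4):
    """Return (start, unit, repeat_count) for each perfect tandem repeat in seq.

    The shortest repeating unit is preferred; runs of a single base
    (e.g. poly-A tails) are ignored.
    """
    seq = seq.upper()
    # A lazily matched unit followed by at least (min_repeats - 1) more copies.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    results = []
    for match in pattern.finditer(seq):
        unit = match.group(1)
        if len(set(unit)) == 1:
            continue  # homopolymer run, not a di- to hexanucleotide SSR
        repeats = len(match.group(0)) // len(unit)
        results.append((match.start(), unit, repeats))
    return results


example_est = "TTGACAGAGAGAGAGCTTACGATCATCATCATCATGG"
print(find_ssrs(example_est))  # [(5, 'AG', 5), (21, 'ATC', 4)]
```

Only perfect repeats are detected; imperfect or compound SSRs would need a more tolerant search.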

12.
13.
Two cDNA libraries were constructed from cultures of the vascular wilt fungus Verticillium dahliae, grown either in simulated xylem fluid medium (SXM) or under conditions that induce near-synchronous development of microsclerotia. Expressed sequence tags (ESTs) were obtained for over 1000 clones from each library. Most sequences in the two EST collections were unique; nearly 55% of the translated ESTs had strong similarity to protein sequences in the NCBI nonredundant database. ESTs corresponding to melanin biosynthetic enzymes were exclusive to the developing microsclerotia (DMS) collection, and sequences corresponding to extracellular hydrolases (plant cell wall degrading enzymes) were more abundant in that collection. ESTs corresponding to proteins involved in transport and cell growth were more abundant in the SXM collection. The results of this preliminary analysis suggest that the in vitro growth conditions used here provide useful model systems that will facilitate studies of pathogenesis and microsclerotia development in V. dahliae.

14.
An EST survey of the sugarcane transcriptome   Total citations: 8, self-citations: 0, citations by others: 8
Its large genome and high polyploidy make sugarcane (Saccharum spp.) a singularly challenging crop to study and improve using genetic approaches. To provide large numbers of functionally characterized candidate genes that might be tested for direct association (rather than distant linkage) with economically important traits, we sequenced the 5' ends of 9,216 clones from three cDNA libraries (apex, leaf and mature internode), representing 3,401 non-redundant sequences. About 57% of these sequences could be assigned a tentative function based on statistically significant similarity to previously characterized proteins or DNA sequences. Another 28% corresponded to previously identified, but uncharacterized, sequences. Some of the remaining unidentified sequences were predicted to be genes that could potentially be new to plants or unique to sugarcane. Comparisons of the sugarcane ESTs to a large sorghum EST database revealed similar compositions of expressed genes between some different tissues. Comparison to a detailed Arabidopsis protein database showed some highly conserved sequences, which might be useful DNA markers for pan-angiosperm comparative mapping. These EST sequences provide a foundation for many new studies to accelerate isolation of agronomically important genes from the cumbersome sugarcane genome.

15.
Expressed sequence tags (ESTs) from the marine red alga Gracilaria gracilis   Total citations: 2, self-citations: 0, citations by others: 2
Expressed sequence tags (ESTs) are partial sequences of cDNAs, and can be used to characterize gene expression in organisms or tissues. We have constructed a 200-sequence EST database from vegetative thalli of Gracilaria gracilis, the first ESTs reported from any alga. This database contains recognizable ESTs corresponding to genes of carbohydrate metabolism (seven), amino acid metabolism (three), photosynthesis (five), nucleic acid synthesis, repair and processing (three), protein synthesis (14), protein degradation (six), cellular maintenance and stress response (three), other identifiable protein-coding genes (13) and 146 sequences for which significant matches were not found in existing sequence databases. We have already used this EST database to recover genes of carbohydrate biosynthesis from G. gracilis. This revised version was published online in August 2006 with corrections to the Cover Date.

16.
Lotus japonicus has received increased attention as a potential model legume plant. In order to study gene expression in reproductive organs and to identify genes that play a crucial role in sexual reproduction, we constructed a cDNA library from immature flower buds containing anthers at the stage of developing tapetum cells in L. japonicus, and characterized 919 expressed sequence tags (ESTs) randomly selected from this library. The 919 ESTs analyzed were clustered into 821 non-redundant EST groups. In a database search, 436 groups (53%) out of the 821 groups showed sequence similarity to genes registered in the public database. Out of these 436 groups, 109 groups showed similarity to genes encoding hypothetical proteins whose function had not yet been estimated. Three hundred eighty-five groups (47%) showed no significant homology to known sequences and were classified as novel sequences. A comparison of the 821 non-redundant EST sequences with EST sequences derived from the whole plant of L. japonicus revealed that 474 EST sequences derived from immature flower buds were not found in the EST sequences of the whole plant. In order to confirm the expression pattern of potential reproductive-organ specific EST clones, nine clones that did not match ESTs derived from the whole plant were selected, and RT-PCR analysis was performed on these clones. By RT-PCR, we found two novel anther-specific clones. One clone was homologous to a gene encoding a human cleft lip and palate associated transmembrane protein (CLPTM1)-like protein, and the other clone did not show significant similarity to any gene deposited in the public database. These results indicate that the ESTs analyzed here represent a valuable resource for finding reproductive-organ specific genes in Lotus japonicus.
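The group counts and percentages quoted in this abstract can be cross-checked with a few lines of arithmetic. The variable names are illustrative; the numbers are taken directly from the abstract.

```python
# Counts quoted in the abstract.
TOTAL_ESTS = 919
TOTAL_GROUPS = 821
SIMILAR_GROUPS = 436   # similarity to genes in the public database
NOVEL_GROUPS = 385     # no significant homology to known sequences


def percent(part, whole):
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


similar_pct = percent(SIMILAR_GROUPS, TOTAL_GROUPS)
novel_pct = percent(NOVEL_GROUPS, TOTAL_GROUPS)
redundant_ests = TOTAL_ESTS - TOTAL_GROUPS

print(SIMILAR_GROUPS + NOVEL_GROUPS == TOTAL_GROUPS)  # True
print(similar_pct, novel_pct)  # 53 47
print(redundant_ests)  # 98
```

The two classes partition the 821 groups exactly, and the rounded percentages agree with the 53% and 47% stated in the abstract.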

17.
18.
MOTIVATION: Using bioinformatic approaches, we aimed to characterize poorly understood abnormalities in splicing known as exon scrambling, exon repetition and trans-splicing. RESULTS: We developed a software package that allows large-scale comparison of all human expressed sequence tag (EST) sequences to the entire set of human gene sequences. Among 5,992,495 EST sequences, 401 cases of exon repetition and 416 cases of exon scrambling were found. The vast majority of identified ESTs contain fragments rather than full-length repeated or scrambled exons. Their structures suggest that the scrambled or repeated exon fragments may have arisen in the process of cDNA cloning and not from splicing abnormalities. Nevertheless, we found 11 cases of full-length exon repetition, showing that this phenomenon is real yet very rare. In searching for examples of trans-splicing, we looked only at reproducible events where at least two independent ESTs represent the same putative trans-splicing event. We found 15 ESTs representing five types of putative trans-splicing. However, all 15 cases were derived from human malignant tissues and could have resulted from genomic rearrangements. Our results provide support for a very rare but physiological occurrence of exon repetition, but suggest that apparent exon scrambling and trans-splicing result, respectively, from in vitro artifact and gene-level abnormalities. AVAILABILITY: Exon-Intron Database (EID) is available at http://www.meduohio.edu/bioinfo/eid. Programs are available at http://www.meduohio.edu/bioinfo/software.html. The Laboratory website is available at http://www.meduohio.edu/medicine/fedorov. Supplementary information: Supplementary file is available at http://www.meduohio.edu/bioinfo/software.html.
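The distinction between exon repetition and exon scrambling can be illustrated on the order in which a gene's exons appear along an aligned EST. The sketch below is a simplified illustration under assumed definitions (repetition: the same exon twice in a row; scrambling: a downstream exon placed before an upstream one). It is not the authors' software, it works on whole exons only, and it does not model the exon fragments that the abstract reports as the majority of cases.

```python
def classify_exon_order(exon_indices):
    """Classify an EST by the genomic exon numbers it covers, in EST order.

    Returns "repetition" if an exon is immediately followed by itself,
    "scrambling" if an exon is followed by one lying upstream of it,
    and "canonical" otherwise (skipped exons still count as canonical).
    The first irregular junction encountered decides the label.
    """
    for previous, current in zip(exon_indices, exon_indices[1:]):
        if current == previous:
            return "repetition"
        if current < previous:
            return "scrambling"
    return "canonical"


print(classify_exon_order([1, 2, 3]))      # canonical
print(classify_exon_order([1, 2, 2, 3]))   # repetition
print(classify_exon_order([1, 3, 2, 4]))   # scrambling
```

Deciding whether such a pattern reflects real splicing or a cloning artifact, as discussed in the abstract, requires evidence beyond exon order.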

19.
20.