Similar articles
1.
Construction and application of a PC/Linux-based in silico elongation system for nucleic acid sequences   Total citations: 5 (self-citations: 0; citations by others: 5)
Obtaining the full-length cDNA sequence of a novel gene is often a difficult problem for molecular biologists. The Human Genome Project and related projects have generated large numbers of expressed sequence tags (ESTs), and with suitable bioinformatics algorithms these ESTs can often be used to extend fragments of novel genes. Using the Linux operating system, the Blast and Phrap software, and an EST database, an in silico EST elongation system was built on a personal computer and applied to 11,386 EST sequences and 511 full-length insert cDNA sequences from human fetal liver. Of these, 8,373 EST sequences and 389 insert cDNA sequences were extended to varying degrees, and some of the results were confirmed by RACE experiments. The system can extend EST sequences efficiently and at scale, and can provide important clues for obtaining full-length cDNA sequences of novel genes experimentally.
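The elongation idea in this abstract can be illustrated with a minimal sketch. It does not use Blast or Phrap: exact suffix/prefix matching stands in for the similarity search and assembly steps, and the function names (`overlap_len`, `extend_seed`) and the `min_overlap` threshold are hypothetical choices made only for illustration.

```python
def overlap_len(left: str, right: str, min_overlap: int) -> int:
    """Length of the longest suffix of `left` that equals a prefix of `right`.

    Returns 0 when no overlap of at least `min_overlap` bases exists.
    """
    max_len = min(len(left), len(right))
    for size in range(max_len, min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return size
    return 0


def extend_seed(seed: str, ests: list[str], min_overlap: int = 8) -> str:
    """Greedily extend `seed` at both ends with exactly overlapping ESTs.

    Toy stand-in for a search-then-assemble pipeline: no mismatches,
    no reverse complements, no quality scores.
    """
    contig = seed
    # ESTs already contained in the seed cannot extend it.
    unused = [est for est in ests if est not in contig]
    extended = True
    while extended:
        extended = False
        for est in list(unused):
            right = overlap_len(contig, est, min_overlap)  # EST continues the 3' end
            left = overlap_len(est, contig, min_overlap)   # EST continues the 5' end
            if right and right < len(est):
                contig += est[right:]
            elif left and left < len(est):
                contig = est[:-left] + contig
            else:
                continue
            unused.remove(est)
            extended = True
    return contig
```

Calling `extend_seed(seed, ests)` repeats the scan until no remaining EST overlaps either end, which loosely mirrors running rounds of database search and assembly until the sequence stops growing.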

2.
Xu Z  Jablons DM  Gruenert DC 《Gene》2001,263(1-2):265-272
Current strategies for cDNA cloning are based on construction of cDNA libraries and colony screening. The process of obtaining a full-length cDNA clone can be highly time- and labor-intensive. Using the human actin gene as a model target cDNA, we have developed an RNA-capture method for rapid cloning of full-length cDNAs. The approach involves the capture of mRNA with expressed sequence tag (EST)-derived, biotin-labeled antisense "capture" primers and streptavidin-coated magnetic beads. Full-length cDNA is then synthesized from purified EST-specific mRNA and cloned directly into plasmid vectors. Using beta-actin-based capture primers on cytoplasmic RNA resulted in the isolation of both beta- and gamma-actin cDNA clones. Of the 16 actin-specific cDNA clones analyzed, 15 (93%) were full-length. This approach for cloning full-length cDNAs from available ESTs or partial cDNA sequences will facilitate a more rapid and efficient characterization of gene structure and function.

3.
EST-based strategies for cloning novel genes   Total citations: 1 (self-citations: 0; citations by others: 1)
刘媛  蔡嘉斌  蒋国松  童强松 《遗传》2008,30(3):257-262
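Expressed sequence tags (ESTs) are short cDNA sequences obtained by single-pass, one-directional sequencing of randomly selected cDNA clones; each represents part of a complete gene. With the rapid development of bioinformatics and gene mapping, ESTs have become a powerful tool for gene mapping, gene cloning, and gene expression analysis. In recent years, the rapid expansion of EST databases and the use of ESTs to clone and map genes have revolutionized strategies for cloning novel genes. Despite some shortcomings, practice has shown that ESTs can greatly accelerate the discovery and study of novel genes. This article describes EST technology in detail, particularly strategies for applying it to the cloning of novel genes.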

4.
The haploid genome of spinach is about 1000 Mbp, and spinach is a model material for studying the molecular mechanisms of plant sex differentiation. Cytological studies indicated that the largest spinach chromosome (chromosome 1) is the sex chromosome and carries three sex-related alleles: X, Xm and Y. Many researchers have tried to clone the sex-determining genes and to investigate the molecular mechanism of spinach sex differentiation, but no successful cloning of these genes has been reported. A new approach combining chromosome microdissection with hybridization-specific amplification (HSA) was adopted. Degenerate oligonucleotide-primed PCR (DOP-PCR) products from the spinach Y chromosome were hybridized with cDNA from male spinach flowers at the flowering stage. The female spinach genome was used as a blocker, and a cDNA library of sequences specifically expressed from the Y chromosome was constructed. Expressed sequence tag (EST) sequences in the cDNA library were then cloned, sequenced, and analysed bioinformatically. In total, 63 valid EST sequences were obtained, with fragment sizes between 53 and 486 bp. BLASTn alignment indicated that 12 EST sequences had homologous nucleic acid sequences; the rest were new sequences. BLASTx alignment indicated that 16 EST sequences had homologous protein-encoding nucleic acid sequences. These spinach Y chromosome-specific EST sequences lay the foundation for cloning functional genes specifically expressed from the spinach Y chromosome. The technical system established in this work also provides a reference for the rapid cloning of sex chromosome-specific EST sequences in other organisms.

5.
6.
We have designed a simple and efficient polymerase chain reaction (PCR)-based cDNA subtraction protocol for high-throughput cloning of differentially expressed genes from plants that can be applied to any experimental system and as an alternative to DNA chip technology. A sequence-independent, PCR-amplifiable first-strand cDNA population was synthesized by priming with an oligo-dT primer carrying a defined 5' heel sequence and ligating another specified single-stranded oligonucleotide primer to the 3' ends of first-strand cDNAs with T4 RNA ligase. A biotin label was introduced into the sense strands of the cDNA to be subtracted by using a 5' biotinylated forward primer during PCR amplification, so that the sense strand could be immobilized on streptavidin-linked paramagnetic beads. The unamplified first strand (antisense) of the interrogating cDNA population was hybridized with a large excess of amplified sense strands of control cDNA. We used magnetic bead technology for the efficient removal of the common cDNA population after hybridization to reduce the complexity of the cDNA prior to PCR amplification for the enrichment and sequence abundance normalization of differentially expressed genes. Construction of a subtracted and normalized cDNA library efficiently eliminates common abundant cDNA messages and also increases the probability of identifying clones differentially expressed in low-abundance cDNA messages. We used this method to successfully isolate differentially expressed genes from Pennisetum seedlings in response to salinity stress. Sequence analysis of the selected clones showed homologies to genes that were reported previously and shown to be involved in plant stress adaptation.

7.
We have adapted the "directional tag subtractive hybridization" technique as a means of investigating stage-specific gene expression in Plasmodium falciparum. This technique utilizes unidirectional cDNA libraries cloned into separate lambda vectors and involves hydroxyapatite chromatographic separation of target antisense cDNA and driver sense strand cRNA followed by PCR amplification of cDNA sequences specific to the target stage. This technique enabled efficient subtraction of asexual blood stage sequences from a P. falciparum sporozoite cDNA library and led to identification of novel sporozoite sequences. This technique can be applied to study gene expression in parasite stages that are difficult to obtain routinely.

8.
H Liu  Y Fu  J Xie  J Cheng  SA Ghabrial  G Li  X Yi  D Jiang 《PloS one》2012,7(7):e42147
Genome sequences of viruses can contribute greatly to the study of viral evolution, diversity and the interaction between viruses and hosts. Traditional molecular cloning methods for obtaining RNA viral genomes are time-consuming and often difficult because many viruses occur in extremely low titers. DsRNA viruses in the families Partitiviridae, Totiviridae, Endornaviridae, and Chrysoviridae, and other related unclassified dsRNA viruses, are generally associated with symptomless or persistent infections of their hosts. These characteristics indicate that samples or materials derived from eukaryotic organisms and used for cDNA library construction and EST sequencing might carry these viruses without the researchers detecting them. Therefore, the EST databases may include numerous unknown viral sequences. In this study, we performed in silico cloning, a procedure for obtaining the full or partial cDNA sequence of a gene by bioinformatics analysis, using known dsRNA viral sequences as queries to search against the NCBI Expressed Sequence Tag (EST) database. From this analysis, we obtained 119 novel virus-like sequences related to members of the families Endornaviridae, Chrysoviridae, Partitiviridae, and Totiviridae. Many of them were identified in cDNA libraries of eukaryotic lineages that were not known to be hosts for these viruses. Furthermore, comprehensive phylogenetic analysis of these newly discovered virus-like sequences with known dsRNA viruses revealed that these dsRNA viruses may have co-evolved with their respective host supergroups over a long evolutionary time, while horizontal transmission of viruses between different host supergroups is also possible. We also found that some of the plant partitiviruses may have originated from fungal viruses by horizontal transmission. These findings extend our knowledge of the diversity and possible host range of dsRNA viruses and offer insight into the origin and evolution of relevant viruses with their hosts.

9.
Over three million sequences from approximately 200 plant species have been deposited in the publicly available plant expressed sequence tag (EST) sequence databases. Many of the ESTs have been sequenced as an alternative to complete genome sequencing or as a substrate for cDNA array-based expression analyses. This creates a formidable resource from both biodiversity and gene-discovery standpoints. Bioinformatics-based sequence analysis tools have extended the scope of EST analysis into the fields of proteomics, marker development and genome annotation. Although EST collections are certainly no substitute for a whole genome scaffold, this "poor man's genome" resource forms the core foundation for various genome-scale experiments within the as yet unsequenceable plant genomes.

10.
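Screening and analysis of 47 ESTs of low-abundance expressed genes from early human embryos   Total citations: 1 (self-citations: 0; citations by others: 1)
Constructing high-quality cDNA libraries plays a very important role in studies such as gene cloning, mRNA differential display, expressed sequence tag sequencing, and gene mapping. To isolate novel human genes from early embryos, a cDNA library was constructed from a human embryo at 3 weeks post-fertilization. In situ colony hybridization of 6,508 clones from this library with labeled first-strand cDNA probes yielded 1,677 low-abundance clones that showed no hybridization signal. Forty-seven of these were picked at random for partial 5′-end sequencing, and the sequences were compared for homology against the three major gene databases, showing that 18 clones (38.3%)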

11.
The generation of large numbers of partial cDNA sequences, or expressed sequence tags (ESTs), has provided a method with which to sample a large number of genes from an organism. More than 25,000 Arabidopsis thaliana ESTs have been deposited in public databases, producing the largest collection of ESTs for any plant species. We describe here the application of a method of reducing redundancy and increasing information content in this collection by grouping overlapping ESTs representing the same gene into a "contig" or assembly. The increased information content of these assemblies allows more putative identifications to be assigned based on the results of similarity searches with nucleotide and protein databases. The results of this analysis indicate that sequence information is available for approximately 12,600 nonoverlapping ESTs from Arabidopsis. Comparison of the assemblies with 953 Arabidopsis coding sequences indicates that up to 57% of all Arabidopsis genes are represented by an EST. Clustering analysis of these sequences suggests that between 300 and 700 gene families are represented by between 700 and 2000 sequences in the EST database. A database of the assembled sequences, their putative identifications, and cellular roles is available through the World Wide Web.
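Grouping overlapping ESTs to reduce redundancy can be sketched as single-linkage clustering. The sketch below is an assumption for illustration, not the method used in this study: two ESTs are linked whenever they share an exact k-mer, groups are merged with a union-find structure, and reverse complements and sequencing errors are ignored. The name `cluster_ests` and the default `k` are hypothetical.

```python
from collections import defaultdict


def cluster_ests(ests: dict[str, str], k: int = 12) -> list[set[str]]:
    """Group EST names into clusters of sequences that share an exact k-mer."""
    parent = {name: name for name in ests}

    def find(name: str) -> str:
        # Follow parent links to the cluster root, compressing the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    def union(a: str, b: str) -> None:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Remember which EST first produced each k-mer; a later EST with the
    # same k-mer is merged into that EST's cluster.
    first_seen: dict[str, str] = {}
    for name, seq in ests.items():
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if kmer in first_seen:
                union(first_seen[kmer], name)
            else:
                first_seen[kmer] = name

    groups: dict[str, set[str]] = defaultdict(set)
    for name in ests:
        groups[find(name)].add(name)
    return list(groups.values())
```

The number of clusters returned is a rough analogue of the count of nonredundant groups; because linkage is transitive, a chimeric clone or a shared repeat can join unrelated ESTs, which is one reason real pipelines use alignment-based assembly rather than exact k-mer matching.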

12.
13.
To study gene expression in the water flea Daphnia magna we constructed a cDNA library and characterized the expressed sequence tags (ESTs) of 7210 clones. The EST sequences clustered into 2958 nonredundant groups. BLAST analyses of both protein and DNA databases showed that 1218 (41%) of the unique sequences shared significant similarities to known nucleotide or amino acid sequences, whereas the remaining 1740 (59%) showed no significant similarities to other genes. Clustering analysis revealed particularly high expression of genes related to ATP synthesis, structural proteins, and proteases. The cDNA clones and EST sequence information should be useful for future functional analysis of daphnid biology and investigation of the links between ecology and genomics.

14.
15.
In order to study gene expression in a reproductive organ, we constructed a cDNA library of mature flower buds in Lotus japonicus and characterized the expressed sequence tags (ESTs) of 842 randomly selected clones. The EST sequences were clustered into 718 non-redundant groups. From BLAST and FASTA search analyses of both protein and DNA databases, 58.5% of the EST groups showed significant sequence similarities to known genes. Several genes encoding these EST clones were identified as pollen-specific genes, such as pectin methylesterase, ascorbate oxidase, and polygalacturonase, and as homologous genes involved in pollen-pistil interaction. Comparison of these EST sequences with those derived from the whole plant of L. japonicus revealed that 64.8% of EST sequences from the flower buds were not found in EST sequences of the whole plant. Taken together, the EST data from flower buds generated in this study are useful in dissecting gene expression in the floral organs of L. japonicus.

16.
The damage aphids cause to crops is largely determined by their spectacular rate of population increase, which is due to their parthenogenetic generations. Despite this, the molecular processes triggering the transition between the parthenogenetic and sexual phases of their annual life cycle have received little attention. Here, we describe a collection of genes from the cereal aphid Rhopalosiphum padi expressed during the switch from parthenogenetic to sexual reproduction. After cDNA cloning and sequencing, 726 expressed sequence tags (ESTs) were annotated. The R. padi EST collection contained a substantial number (139) of bacterial endosymbiont sequences. The majority of R. padi cDNAs encoded either unknown proteins (56%) or housekeeping polypeptides (38%). The large proportion of sequences without similarities in the databases is related to both their small size and their high GC content, which probably corresponds to the presence of 5'-untranslated regions. Fifteen genes involved in developmental and differentiation events were identified by similarity to known genes. Some of these may be useful candidates for markers of the early steps of sexual differentiation.

17.
The sequences at the splice junctions of many early region 4 (E4) mRNAs from adenovirus 2 (Ad2) were determined by analysis of cDNA clones. The cDNAs were synthesized from poly(A)+ mRNA isolated from HeLa cells early during Ad2 infection. A standard library was constructed, in pBR322, from double-stranded cDNAs initiated by oligo-dT priming. Approximately 1% of total recombinants contained E4 sequences; however, among eighty clones analyzed in detail, only four contained the 5' leader sequence. A second library was prepared using a new method that led to a greatly increased representation of desired clones. This method employed oligo-dT to prime the synthesis of the first strand and an oligonucleotide ligated to pBR322, whose sequence was present in the 5' leader, to prime the synthesis of the second strand. With this method the percentage of recombinants containing E4 sequences ranged between 15 and 50% of the total colonies. Virtually all of these E4 cDNA clones contained the 5' leader sequence, and several hundred were analyzed by comparing the results from single channel dideoxy sequencing reactions. Nine unique sequence patterns were identified and representative clones were completely sequenced.

18.
The public EST (expressed sequence tag) databases represent an enormous but heterogeneous repository of sequences, including many from a broad selection of plant species and a wide range of distinct varieties. The significant redundancy within large EST collections makes them an attractive resource for rapid pre-selection of candidate sequence polymorphisms. Here we present a strategy that allows rapid identification of candidate SNPs in barley (Hordeum vulgare L.) using publicly available EST databases. Analysis of 271,630 EST sequences from different cDNA libraries, representing 23 different barley varieties, resulted in the generation of 56,302 tentative consensus sequences. In all, 8171 of these unigene sequences are members of clusters with six or more ESTs. By applying a novel SNP detection algorithm (SNiPpER) to these sequences, we identified 3069 candidate inter-varietal SNPs. In order to verify these candidate SNPs, we selected a small subset of 63 present in 36 ESTs. Of the 63 SNPs selected, we were able to validate 54 (86%) using a direct sequencing approach. For further verification, 28 ESTs were mapped to distinct loci within the barley genome. The polymorphism information content (PIC) and nucleotide diversity (π) values of the SNPs identified by the SNiPpER algorithm are significantly higher than those that were obtained by random sequencing. This demonstrates the efficiency of our strategy for SNP identification and the cost-efficient development of EST-based SNP markers. The first two authors contributed equally to this work.
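Pre-selecting candidate SNPs from redundant EST clusters can be illustrated with a short sketch. This is not the SNiPpER algorithm, whose details the abstract does not give. It assumes the ESTs of one cluster are already aligned to equal length, reports a column only when the second most common base is seen at least `min_minor_count` times (a hypothetical guard against single-read sequencing errors), and computes PIC with the simplified form 1 − Σ p_i², which may differ from the formula the authors used.

```python
from collections import Counter


def candidate_snps(aligned: list[str], min_minor_count: int = 2) -> list[tuple[int, dict[str, int]]]:
    """Return (column index, base counts) for columns that look polymorphic.

    `aligned` holds equal-length, pre-aligned reads from one EST cluster;
    characters other than A, C, G, T (gaps, N) are ignored.
    """
    snps = []
    for pos, column in enumerate(zip(*aligned)):
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) < 2:
            continue  # monomorphic column
        second_count = counts.most_common()[1][1]
        if second_count >= min_minor_count:
            snps.append((pos, dict(counts)))
    return snps


def pic(counts: dict[str, int]) -> float:
    """Simplified polymorphism information content: 1 - sum of squared allele frequencies."""
    total = sum(counts.values())
    return 1.0 - sum((n / total) ** 2 for n in counts.values())
```

With six aligned reads in which one column splits four to two, `candidate_snps` reports that column and `pic` gives 1 − (4/6)² − (2/6)² = 4/9, while a column where the alternative base appears in only one read is skipped.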

19.
Searches of zebrafish EST and whole genome shotgun sequence databases for sequences encoding the sterol-sensing domain (SSD) protein motif identified two sets of DNA sequences with significant homology to the Drosophila dispatched gene required for release of secreted Hedgehog protein. Using morpholino antisense oligonucleotides, we found that inhibition of one of these genes, designated Disp1, results in a phenotype similar to that of the "you-type" mutants, previously implicated in signalling by Hedgehog proteins in the zebrafish embryo. Injection of disp1 mRNA into embryos homozygous for one such mutation, chameleon (con), results in rescue of the mutant phenotype. Radiation hybrid mapping localised disp1 to the same region of LG20 to which the con mutation was mapped by meiotic recombination analysis. Sequence analysis of disp1 cDNA derived from homozygous con mutant embryos revealed that both mutant alleles are associated with premature termination codons in the disp1 coding sequence. By analysing the expression of markers of specific cell types in the neural tube, pancreas and myotome of con mutant and Disp1 morphant embryos, we conclude that Disp1 activity is essential for the secretion of lipid-modified Hh proteins from midline structures.

20.