首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
一个鼻咽癌相关EST的鉴定及其全长cDNA序列分析   总被引:1,自引:0,他引:1  
鼻咽癌是我国南方及东南亚地区常见的恶性肿瘤之一.通过对鼻咽癌染色体高频率杂合性丢失区域3p21的表达序列标签(expressedsequencetag,EST)进行同源性比较分析,运用逆转录聚合酶链式反应的方法,筛选到一个在41.18%(14/34)的鼻咽癌活检组织及20.0%(1/5)的鼻咽癌细胞系中表达下调的ESTBG772301;并用Northern杂交方法,检测了该EST在多种正常成人组织中的表达状况及其所代表基因的转录本大小.在此基础上,对该EST来源的cDNA克隆(IMAGE:4839190)进行直接测序,获得了一个全长为2377bp的新cDNA序列;经生物信息学分析,发现它与已知基因序列无明显同源性,属于一个新基因,定位于染色体3p21.3,被命名为鼻咽癌表达下调基因(NPCEDRG,GenBank登录号:AF538150).其编码的蛋白质含169个氨基酸,与一个已报道的在进化上相对保守、功能未知的人类蛋白Nicolin1(简称NICN1)N端170个氨基酸残基的序列同源性为97%,但缺少NICN1蛋白C端43个氨基酸残基,可能是nicolin1基因不同剪接本的编码产物.  相似文献   

3.
Using the sequence information of expressed sequences tags (ESTs), cDNAs and genes from Lotus japonicus and other legumes, 73 TAC (transformation-competent artificial chromosomes) clones were selected from a genomic library of L. japonicus accession MG-20, and their nucleotide sequences were determined. The length of the DNA sequenced in this study was 7,455,959 bp, and the total length of the DNA regions sequenced so far is 26,167,443 bp together with the nucleotide sequences of 183 TAC clones previously reported. By similarity searches against the sequences in protein and EST databases and prediction by computer programs, a total of 699 potential protein-encoding genes with known or predicted functions, 163 gene segments and 267 pseudogenes were assigned to the newly sequenced regions. Based oil the nucleotide sequences of the clones, simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated, and each clone was located onto the linkage map of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

4.
We determined the nucleotide sequences of 64 TAC (transformation-competent artificial chromosome) clones selected from genomic libraries of Lotus japonicus accession Miyakojima MG-20 based on the sequence information of expressed sequence tags (ESTs), cDNAs, genes and DNA markers from L. japonicus and other legumes. The length of the DNA regions sequenced in this study was 6,370,255 bp, and the total length of the L. japonicus genome sequenced so far is 32,537,698 bp together with the nucleotide sequences of 256 TAC clones previously reported. Five hundred forty-eight potential protein-encoding genes with known or predicted functions, 127 gene segments and 224 pseudogenes were assigned to the newly sequenced regions by computer prediction and similarity searches against the sequences in protein and EST databases. Based on the nucleotide sequences of the clones, simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated, and each clone was genetically localized onto the linkage map of two accessions of L. japonicus, MG-20 and Gifu B-129. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

5.
Sixty-five TAC (transformation-competent artificial chromosomes) clones were selected from a genomic library of Lotus japonicus accession MG-20 based on the sequence information of expressed sequences tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined. The average insert size of the TAC clone was approximately 100 kb, and the total length of the sequenced regions in this study is 6,556,100 bp. Together with the nucleotide sequences of 56 TAC clones previously reported, the regions sequenced so far total 12,029,295 bp. By comparison with the sequences in protein and EST databases and by analysis with computer programs for gene modeling, a total of 711 potential protein-encoding genes with known or predicted functions, 239 gene segments and 90 pseudogenes were identified in the newly sequenced regions. The average gene density assigned so far was 1 gene/9140 bp. The average length of the assigned genes was 2.6 kb, which is considerably larger than that assigned in the Arabidopsis thaliana genome (1.9 kb for 6451 genes). Introns were identified in approximately 73% of the potential genes, and the average number and length of the introns per gene were 3.4 and 377 bp, respectively. Simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated based on the nucleotide sequences of the genomic clones obtained, and each clone was mapped onto the linkage map using the F2 mapping population derived from a cross of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

6.
7.
8.
In a database search for homologs of acyl-coenzyme A oxidases (ACX) in Arabidopsis, we identified a partial genomic sequence encoding an apparently novel member of this gene family. Using this sequence information we then isolated the corresponding full-length cDNA from etiolated Arabidopsis cotyledons and have characterized the encoded recombinant protein. The polypeptide contains 675 amino acids. The 34 residues at the amino terminus have sequence similarity to the peroxisomal targeting signal 2 of glyoxysomal proteins, including the R-[I/Q/L]-X5-HL-XL-X15-22-C consensus sequence, suggesting a possible microsomal localization. Affinity purification of the encoded recombinant protein expressed in Escherichia coli followed by enzymatic assay, showed that this enzyme is active on C8:0- to C14:0-coenzyme A with maximal activity on C12:0-coenzyme A, indicating that it has medium-chain-specific activity. These data indicate that the protein reported here is different from previously characterized classes of ACX1, ACX2, and short-chain ACX (SACX), both in sequence and substrate chain-length specificity profile. We therefore, designate this new gene AtACX3. The temporal and spatial expression patterns of AtACX3 during development and in various tissues were similar to those of the AtSACX and other genes expressed in glyoxysomes. Currently available database information indicates that AtACX3 is present as a single copy gene.  相似文献   

9.
A total of sixty-two clones were selected from a TAC (transformation-competent artificial chromosome) genomic library of the Lotus japonicus accession MG-20 based on the sequence information of expressed sequence tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined. The length of the sequenced regions in this study is 6,682,189 bp, and the total length of the regions sequenced so far is 18,711,484 bp together with the nucleotide sequences of 121 TAC clones previously reported. By comparison with the sequences in protein and EST databases and analysis with computer programs for gene modeling, a total of 573 potential protein-coding genes with known or predicted functions, 91 gene segments and 272 pseudogenes were identified in the newly sequenced regions. Each of the sequenced clones was localized onto the linkage map of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20, using simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers generated based on the nucleotide sequences of the clones. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

10.
本研究对眼镜蛇科广西华珊瑚蛇(Sinomicrurus peinani)线粒体基因组序列进行测定与分析,并探究其与近缘种的系统发育关系。结果表明,广西华珊瑚蛇线粒体基因组是一条全长19 477 bp的环状DNA,基因组碱基构成为A(33.4%)、T(28.1%)、C(26.6%)和G(11.9%)。共编码38个基因,包含2个核糖体RNA(rRNA)基因、22个转移RNA(tRNA)基因、13个蛋白质编码基因及1个线粒体基因控制区(D-loop)。13个蛋白质编码基因均采用AUG作为起始密码子,UAA和UGA作为终止密码子;蛋白质编码基因编码频率较高的氨基酸分别为亮氨酸(Leu)、异亮氨酸(Ile)、苏氨酸(Thr)和丝氨酸(Ser);相对密码子使用度(RSCU)频率最高的4个密码子依次是CGA、UGA、CUA和CCA。22个tRNA,除tRNASer(一臂两环)外其他均可形成典型三叶草结构。基于眼镜蛇科线粒体基因组系统发育分析结果表明,与广西华珊瑚蛇关系最密切的是中华珊瑚蛇(Sinomicrurus macclellandi),其次是孟加拉眼镜蛇(Naja kaouthia)与眼镜王蛇(Ophiophagus hannah)。  相似文献   

11.
12.
13.
14.
MOTIVATION: Locating protein-coding exons (CDSs) on a eukaryotic genomic DNA sequence is the initial and an essential step in predicting the functions of the genes embedded in that part of the genome. Accurate prediction of CDSs may be achieved by directly matching the DNA sequence with a known protein sequence or profile of a homologous family member(s). RESULTS: A new convention for encoding a DNA sequence into a series of 23 possible letters (translated codon or tron code) was devised to improve this type of analysis. Using this convention, a dynamic programming algorithm was developed to align a DNA sequence and a protein sequence or profile so that the spliced and translated sequence optimally matches the reference the same as the standard protein sequence alignment allowing for long gaps. The objective function also takes account of frameshift errors, coding potentials, and translational initiation, termination and splicing signals. This method was tested on Caenorhabditis elegans genes of known structures. The accuracy of prediction measured in terms of a correlation coefficient (CC) was about 95% at the nucleotide level for the 288 genes tested, and 97. 0% for the 170 genes whose product and closest homologue share more than 30% identical amino acids. We also propose a strategy to improve the accuracy of prediction for a set of paralogous genes by means of iterative gene prediction and reconstruction of the reference profile derived from the predicted sequences. AVAILABILITY: The source codes for the program 'aln' written in ANSI-C and the test data will be available via anonymous FTP at ftp.genome.ad.jp/pub/genomenet/saitama-cc. CONTACT: gotoh@cancer-c.pref.saitama.jp  相似文献   

15.
16.
Complete nucleotide sequence of mitochondrial genome (mitogenome) of the Catla catla (Ostariophysi: Cypriniformes: Cyprinidae) was determined in the present study. Its length is 16,594 bp and contains 13 protein coding genes, 22 transfer RNAs, two ribosomal RNAs and one non-coding control region. Most of the genes were encoded on the H-strand, while the ND6 and eight tRNA (Gln, Ala, Asn, Cys, Tyr, Ser (UCN), Glu and Pro) genes were encoded on the L-strand. The reading frames of two pair of genes overlapped: ATPase 8 with 6 and ND4L with ND4 by seven nucleotides each. The main non-coding region was 929 bp, with three conserved sequence blocks (CSB-I, CSB-II, and CSB-III) and an unusual simple sequence repeat, (TA)7. Phylogenetic analyses based on complete mitochondrial genome sequences were in favor of the traditional taxonomy of family Cyprinidae. In conclusion present mitogenome of Catla catla adds more information to our understanding of diversity and evolution of mitogenome in fishes.  相似文献   

17.
BAGET (Bacterial and Archaeal Gene Exploration Tool) is a web service designed to facilitate extraction, by molecular geneticists and phylogeneticists, of specific gene and protein sequences from completely determined prokaryotic genomes. Upon selection of a particular prokaryotic organism and gene, two levels of visual gene context information are provided on a single dynamic page: (i) a graphical representation of a user defined portion of the chromosome centered on the gene of interest and (ii) the DNA sequence of the query gene, of the immediate neighboring genes and the intergenic regions each identified by a consistent color code. The aminoacid sequence is provided for protein-coding query genes. Query results can be exported as a rich text format (RTF) word processor file for printing, archival or further analysis. AVAILABILITY: http://archaea.u-psud.fr/bin/baget.dll.  相似文献   

18.
Two major chloroplast proteins are encoded by nuclear genes and synthesized on free cytoplasmic ribosomes: the small subunit of ribulose 1,5-bisphosphate carboxylase and the apoprotein components of the chlorophyll a/b light harvesting complex. We have recently reported the isolation of two cDNA clones from pea which encode both the small subunit of ribulose 1,5-bisphosphate carboxylase (pSS15) and the polypeptide 15 (pAB96), the major chlorophyll a/b binding protein (Broglie, R., Bellemare, G., Bartlett, S., Chua, N.-H., and Cashmore, A. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 7304-7308). To further characterize these clones, we determined their nucleotide sequence. Clone pSS15 contains a 691-base pair cDNA insert which encodes the entire 123 amino acids of the mature small subunit protein. In addition, this clone also encodes 33 amino acids of the NH2-terminal transit peptide extension and 148 nucleotides of the 3' noncoding region preceding the poly(A)tail. A second cDNA clone (pAB96) contains an 833-nucleotide insert which encodes most of polypeptide 15. The DNA sequence of this cloned cDNA was used to deduce the previously undetermined amino acid sequence of this integral thylakoid membrane protein. The nucleotide sequence of the cDNA clone, pSS15, should provide information concerning the role of the transit sequence in the transport of cytoplasmically synthesized chloroplast proteins. Similarly, the deduced amino acid sequence of polypeptide 15 will provide information for predicting its orientation in thylakoid membranes as well as its role in binding chlorophyll.  相似文献   

19.
The Exon/Intron (ExInt) database incorporates information on the exon/intron structure of eukaryotic genes. Features in the database include: intron nucleotide sequence, amino acid sequence of the corresponding protein, position of the introns at the amino acid level and intron phase. From ExInt, we have also generated four additional databases each with ExInt entries containing predicted introns, introns experimentally defined, organelle introns or nuclear introns. ExInt is accessible through a retrieval system with pointers to GenBank. The database can be searched by keywords, locus name, NID, accession number or length of the protein. ExInt is freely accessible at http://intron.bic.nus.edu.sg/exint/exint.html  相似文献   

20.
The complete CDS sequences of three porcine genes: UCHL3, RIT1 and CCND3 were amplified using RT-PCR based on the sequence information of the mouse or other mammals and referenced highly homologous pig ESTs. Sequence analysis of these three genes revealed that the porcine UCHL3 gene encodes a protein of 230 amino acids and has high homology with the ubiquitin carboxyl-terminal hydrolase isozyme L3 (UCHL3) of four species-bovine (97%), human (96%), mouse (95%) and rat (94%). The porcine RIT1 gene encodes a protein of 219 amino acids and has high homology with the GTP-binding protein Rit1 (RIT1) of two species-human (97%), mouse (97%). The porcine CCND3 gene encodes a protein of 292 amino acids and has high homology with the G1/S-specific cyclin-D3 (CCND3) of four species-bovine (98%), human (97%), mouse (93%) and rat (92%). The phylogenetic tree analysis revealed that the swine UCHL3 has a closer genetic relationship with the UCHL3 of bovine, and the swine RIT1 has closer genetic relationships with the RIT1 of human, but the swine CCND3 has a closer genetic relationship with the CCND3 of bovine. The RT-PCR gene expression analysis indicated that the swine UCHL3, RIT1 and CCND3 genes were differentially expressed in tissues including small intestine, large intestine, liver, muscle, fat, lung, spleen and kidney. Our experiment established the primary foundation for further research on these three swine genes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号