首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
3.
Nineteen Pl and TAC clones, which have been mapped on the finephysical map of the Arabidopsis thaliana chromosome 5, weresequenced according to the shotgun-based strategy, and theirstructural features were analysed. The total length of the regionssequenced in this study was 1,367,185 bp. Combining this withthe regions covered by 90 P1 and TAC clones proviously reported,the total length of chromosome 5 sequenced to date becomes 8,058,855bp. On the basis of similarity search against protein and ESTdatabases and gene modeling with computer programs, a totalof 330 potential protein-coding regions were identified, bringingan average density of the genes to approximately one gene per4.1 kb. Introns were identified in 81.0% of the potential proteingenes for which the entire gene structure was predicted, withan average number per gene of 4.2 and an average length of theintrons of 180 bp. The RNA-coding genes identified were 9 tRNAgenes corresponding to 8 amino acid species and 2 genes forU2 nuclear RNA. These sequence features are essentially identicalto those in the previously reported sequences. The sequencedata and gene information are available on the World Wide Webdatabase KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

4.
5.
6.
7.
In this series of projects sequencing the entire genome of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced according to the fine physical map, and as of May 7, 1999, the sequences of 16.2 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. In parallel, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and to date we have predicted a total of 2380 potential protein-coding genes in the 10,154,580 bp regions, which are covered by 142 P1 and TAC clones. In this paper, we newly analyzed the structural features of the 1,011,550 bp regions covered by additional 17 P1 and TAC clones, and predicted 298 protein-coding genes. The average density of the genes identified was 1 gene per 3394 bp. Introns were observed in 67% of the genes, and the average number per gene and the average length of the introns were 3.2 and 159 bp, respectively. The gene density became higher than the value estimated in the previously analyzed regions (1 gene per 4,267 bp), as the data in this paper were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 8 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available on the database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

8.
In our ongoing project to deduce the nucleotide sequence of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced on the basis of the fine physical map, and as of January, 2000, the sequences of 16.6 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. Along with the sequence determination, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and we already predicted a total of 2697 potential protein coding genes in the 11,166,130 bp regions, which are covered by 159 P1 and TAC clones. In this paper, we describe the structural features of the 3,076,755 bp regions covered by newly analyzed 60 P1 and TAC clones. A total of 715 potential protein coding genes were identified, giving an average density of the genes identified of 1 gene per 4001 bp. Introns were observed in 80% of the genes, and the average number per gene and the average length of the introns were 4.5 and 147 bp, respectively. These sequence features are nearly identical to those in our latest report in which the data were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 12 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

9.
A total of 17 Pl and TAC clones each representing an assigned region of chromosome 5 were isolated from P1 and TAC genomic libraries of Arabidopsis thaliana Columbia, and their nucleotide sequences were determined. The length of the clones sequenced in this study summed up to 1,081,958 bp. As we have previously reported the sequence of 9,072,622 bp by analysis of 125 P1 and TAC clones, the total length of the sequences of chromosome 5 determined so far is now 10,154,580 bp. The sequences were subjected to similarity search against protein and EST databases and analysis with computer programs for gene modeling. As a consequence, a total of 253 potential protein-coding genes with known or predicted functions were identified. The positions of exons which do not show apparent similarity to known genes were also assigned using computer programs for exon prediction. The average density of the genes identified in this study was 1 gene per 4277 bp. Introns were observed in 74% of the potential protein genes, and the average number per gene and the average length of the introns were 4.3 and 168 bp, respectively. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

10.
Based on the physical map of Arabidopsis thaliana chromosome 3 previously constructed with CIC YAC, TAC, P1 and BAC clones (Sato, S. et al., DNA Res., 5, 163-168, 1998), a total of 60 P1 and TAC clones were sequenced, and the sequence features of the resulting 4,504,864 bp regions were analyzed by applying various computer programs for similarity search and gene modeling. As a result, a total of 1054 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4066 bp. Introns were observed in 77% of the genes, and the average number per gene and the average length of the introns were 3.9 and 156 bp, respectively. These sequence features are essentially identical to those of chromosome 5 in our previous reports, but the gene density was slightly higher than that observed for chromosomes 2 and 4. The regions also contained 10 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

11.
Sixty-five TAC (transformation-competent artificial chromosomes) clones were selected from a genomic library of Lotus japonicus accession MG-20 based on the sequence information of expressed sequences tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined. The average insert size of the TAC clone was approximately 100 kb, and the total length of the sequenced regions in this study is 6,556,100 bp. Together with the nucleotide sequences of 56 TAC clones previously reported, the regions sequenced so far total 12,029,295 bp. By comparison with the sequences in protein and EST databases and by analysis with computer programs for gene modeling, a total of 711 potential protein-encoding genes with known or predicted functions, 239 gene segments and 90 pseudogenes were identified in the newly sequenced regions. The average gene density assigned so far was 1 gene/9140 bp. The average length of the assigned genes was 2.6 kb, which is considerably larger than that assigned in the Arabidopsis thaliana genome (1.9 kb for 6451 genes). Introns were identified in approximately 73% of the potential genes, and the average number and length of the introns per gene were 3.4 and 377 bp, respectively. Simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated based on the nucleotide sequences of the genomic clones obtained, and each clone was mapped onto the linkage map using the F2 mapping population derived from a cross of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

12.
To deduce the entire sequence of the top arm of the Arabidopsis thaliana chromosome 3, the sequence determination was performed on a total of 90 P1, TAC and BAC clones chosen according to our sequencing strategy. Sequence features of the resulting 4,251,695 bp regions were analyzed with various computer programs for similarity search and gene modeling. As a result, a total of 941 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4210 bp. Introns were observed in 73% of the genes, and the average number per gene and the average length of the introns were 3.6 and 159 bp, respectively. These sequence features are essentially identical to those of chromosomes 3 and 5 in our previous reports. The regions also contained 14 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

13.
A total of sixty-two clones were selected from a TAC (transformation-competent artificial chromosome) genomic library of the Lotus japonicus accession MG-20 based on the sequence information of expressed sequence tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined. The length of the sequenced regions in this study is 6,682,189 bp, and the total length of the regions sequenced so far is 18,711,484 bp together with the nucleotide sequences of 121 TAC clones previously reported. By comparison with the sequences in protein and EST databases and analysis with computer programs for gene modeling, a total of 573 potential protein-coding genes with known or predicted functions, 91 gene segments and 272 pseudogenes were identified in the newly sequenced regions. Each of the sequenced clones was localized onto the linkage map of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20, using simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers generated based on the nucleotide sequences of the clones. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

14.
We determined the nucleotide sequences of 64 TAC (transformation-competent artificial chromosome) clones selected from genomic libraries of Lotus japonicus accession Miyakojima MG-20 based on the sequence information of expressed sequence tags (ESTs), cDNAs, genes and DNA markers from L. japonicus and other legumes. The length of the DNA regions sequenced in this study was 6,370,255 bp, and the total length of the L. japonicus genome sequenced so far is 32,537,698 bp together with the nucleotide sequences of 256 TAC clones previously reported. Five hundred forty-eight potential protein-encoding genes with known or predicted functions, 127 gene segments and 224 pseudogenes were assigned to the newly sequenced regions by computer prediction and similarity searches against the sequences in protein and EST databases. Based on the nucleotide sequences of the clones, simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated, and each clone was genetically localized onto the linkage map of two accessions of L. japonicus, MG-20 and Gifu B-129. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

15.
A total of 56 TAC clones with an average insert size of 100 kb were isolated from a TAC library of the Lotus japonicus genome based on the expressed sequences tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined according to the shot-gun based strategy. The total length of the sequenced regions is 5,473,195 bp. By comparison with the sequences in protein and EST databases and analysis with computer programs for gene modeling, a total of 605 potential protein-encoding genes with known or predicted functions, 69 gene segments, and 172 pseudogenes were identified. The average density of the genes assigned so far is 1 gene/8120 bp. Introns were identified in approximately 78% of the potential genes. There was an average of 3.8 introns per gene and the average length of the introns was 375 bp. DNA markers were generated based on the nucleotide sequences obtained, and each clone was mapped onto the linkage map using the F2 mapping population derived from a cross of L. japonicus Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

16.
Using the sequence information of expressed sequences tags (ESTs), cDNAs and genes from Lotus japonicus and other legumes, 73 TAC (transformation-competent artificial chromosomes) clones were selected from a genomic library of L. japonicus accession MG-20, and their nucleotide sequences were determined. The length of the DNA sequenced in this study was 7,455,959 bp, and the total length of the DNA regions sequenced so far is 26,167,443 bp together with the nucleotide sequences of 183 TAC clones previously reported. By similarity searches against the sequences in protein and EST databases and prediction by computer programs, a total of 699 potential protein-encoding genes with known or predicted functions, 163 gene segments and 267 pseudogenes were assigned to the newly sequenced regions. Based oil the nucleotide sequences of the clones, simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated, and each clone was located onto the linkage map of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

17.
Within the framework of an international project for the sequencingof the entire Bacillus subtilis genome, a 36-kb chromosome segment,which covers the region between the gnt and iol operons, hasbeen cloned and sequenced. This region (36447 bp) contains 33complete open reading frames (ORFs; genes) including the fourgnt genes and one partial gene. A homology search for the productsof the 33 complete ORFs revealed significant homology to knownproteins in 16 of them such as tetracycline resistance protein(Clostridium perfringens), asparagine synthetase (Arabidopsisthaliana), aldehyde dehydrogenase (Pseudomonas oleovorans),2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase (P. paucimobilis),heat shock protein HtpG (Escherichia coli), galactose-protonsymporter (E. coli), auxin-induced protein (common tobacco),glucitol operon repressor (E. coli) and methylmalonate-semialdehydedehydrogenase (P. aeruginosa). Unlike the regions we sequencedso far, this region contained two short sequence multiplications:one was a tandem sequence duplication (409 and 410 bp), andthe other a triplication consisting of two highly conserved118-bp tandem sequences preceded by a less conserved similarsequence (129 bp). The reasons for the presence of these sequencemultiplications in the gnt to iol region were deduced.  相似文献   

18.
To improve the comparative map for pig chromosome 2 and increase the gene density on this chromosome, a porcine bacterial artificial chromosome (BAC) library was screened with 17 microsatellite markers and 18 genes previously assigned to pig chromosome 2. Fifty-one BAC clones located in the region of a maternally imprinted quantitative trait locus for backfat thickness (BFT) were identified. From these BACs 372 kb were sample sequenced. The average read length of a subclone was 442 basepair (bp). Contig assembly analysis showed that every bp was sequenced 1.28 times. Subsequently, sequences were compared with sequences in the nucleotide databases to identify homology with other mammalian sequences. Sequence identity was observed with sequences derived from 35 BACs. The average percentage identity with human sequences was 87.6%, with an average length of 143 bp. In total, sample sequencing of all BACs resulted in sequence identity with 29 human genes, 13 human expressed sequence tags (ESTs), 17 human genomic clones, one rat gene, one porcine gene and nine porcine ESTs. Eighteen genes located on human chromosome 11 and 19, and seven genes from other human locations, one rat gene and one porcine gene were assigned to pig chromosome 2 for the first time. The new genes were added to the radiation hybrid map at the same position as the locus from which the BAC that was sequenced was derived. In total 57 genes were placed on the radiation hybrid map of SSC2p-q13.  相似文献   

19.
The Down syndrome (DS) region on chromosome 21, which is responsiblefor the DS main features, has been defined by analysis of DSpatients with partial trisomy 21. Within the DS region, we constructeda 1.6-Mb P1 contig map previously. To isolate gene fragmentsfrom the 1.6-Mb region, we performed direct cDNA library screeningand exon trapping using the P1 clones and a human fetal braincDNA library, and obtained 67 cDNA fragments and 52 possibleexons. Among them, 23 cDNA fragments and 4 exons were interpretedto be derived from a single gene by localization on P1 clonesand by Northern analysis. To obtain the full-length cDNA sequence,longer cDNA clones were further screened from another humancDNA library which was enriched with longer cDNA species. Theseclones were sequenced and assembled to a sequence of 9045 bp.This transcribed sequence encodes a novel 2025 amino-acid proteincontaining tetratricopeptide repeat (TPR) motifs and thereforethe gene was designated as TPRD (a gene containing the TPR motifson the Down syndrome region). The TPR domain has been foundin a certain protein phosphatase and in other proteins involvedin the regulation of RNA synthesis or mitosis. The TPRD gene,the novel gene which was proved to be in the 1.6-Mb region andto have the interesting features described above, is a candidatefor genes responsible for the DS phenotypes.  相似文献   

20.
Identification of Open Reading Frames in Schizosaccharomyces pombe cDNAs   总被引:1,自引:0,他引:1  
A total of 214 non-overlapping cDNA clones from Schizosaccharomycespombe were selected and completely sequenced. The clones notpreviously reported were divided into the following three groups:1) homologous to Saccharomyces cerevisiae genes (139 clones);2) homologous to genes from other organisms but not to thosefrom Sac. cerevisiae (4 clones); and 3) no similar sequences(40 clones). Among the 31 sequences identical to those in thepublic databases, 4 genes have regions corresponding to introns.Protein sequences which had homologs both in budding yeast andmammals were compared with those from Sac. cerevisiae and mammals.The search revealed that the evolutionary distances among thesespecies are similar at least with genes of this category.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号