首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Sixty-five TAC (transformation-competent artificial chromosomes) clones were selected from a genomic library of Lotus japonicus accession MG-20 based on the sequence information of expressed sequences tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined. The average insert size of the TAC clone was approximately 100 kb, and the total length of the sequenced regions in this study is 6,556,100 bp. Together with the nucleotide sequences of 56 TAC clones previously reported, the regions sequenced so far total 12,029,295 bp. By comparison with the sequences in protein and EST databases and by analysis with computer programs for gene modeling, a total of 711 potential protein-encoding genes with known or predicted functions, 239 gene segments and 90 pseudogenes were identified in the newly sequenced regions. The average gene density assigned so far was 1 gene/9140 bp. The average length of the assigned genes was 2.6 kb, which is considerably larger than that assigned in the Arabidopsis thaliana genome (1.9 kb for 6451 genes). Introns were identified in approximately 73% of the potential genes, and the average number and length of the introns per gene were 3.4 and 377 bp, respectively. Simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated based on the nucleotide sequences of the genomic clones obtained, and each clone was mapped onto the linkage map using the F2 mapping population derived from a cross of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

2.
A total of sixty-two clones were selected from a TAC (transformation-competent artificial chromosome) genomic library of the Lotus japonicus accession MG-20 based on the sequence information of expressed sequence tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined. The length of the sequenced regions in this study is 6,682,189 bp, and the total length of the regions sequenced so far is 18,711,484 bp together with the nucleotide sequences of 121 TAC clones previously reported. By comparison with the sequences in protein and EST databases and analysis with computer programs for gene modeling, a total of 573 potential protein-coding genes with known or predicted functions, 91 gene segments and 272 pseudogenes were identified in the newly sequenced regions. Each of the sequenced clones was localized onto the linkage map of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20, using simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers generated based on the nucleotide sequences of the clones. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

3.
We determined the nucleotide sequences of 64 TAC (transformation-competent artificial chromosome) clones selected from genomic libraries of Lotus japonicus accession Miyakojima MG-20 based on the sequence information of expressed sequence tags (ESTs), cDNAs, genes and DNA markers from L. japonicus and other legumes. The length of the DNA regions sequenced in this study was 6,370,255 bp, and the total length of the L. japonicus genome sequenced so far is 32,537,698 bp together with the nucleotide sequences of 256 TAC clones previously reported. Five hundred forty-eight potential protein-encoding genes with known or predicted functions, 127 gene segments and 224 pseudogenes were assigned to the newly sequenced regions by computer prediction and similarity searches against the sequences in protein and EST databases. Based on the nucleotide sequences of the clones, simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated, and each clone was genetically localized onto the linkage map of two accessions of L. japonicus, MG-20 and Gifu B-129. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

4.
Using the sequence information of expressed sequences tags (ESTs), cDNAs and genes from Lotus japonicus and other legumes, 73 TAC (transformation-competent artificial chromosomes) clones were selected from a genomic library of L. japonicus accession MG-20, and their nucleotide sequences were determined. The length of the DNA sequenced in this study was 7,455,959 bp, and the total length of the DNA regions sequenced so far is 26,167,443 bp together with the nucleotide sequences of 183 TAC clones previously reported. By similarity searches against the sequences in protein and EST databases and prediction by computer programs, a total of 699 potential protein-encoding genes with known or predicted functions, 163 gene segments and 267 pseudogenes were assigned to the newly sequenced regions. Based oil the nucleotide sequences of the clones, simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated, and each clone was located onto the linkage map of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

5.
A total of 17 Pl and TAC clones each representing an assigned region of chromosome 5 were isolated from P1 and TAC genomic libraries of Arabidopsis thaliana Columbia, and their nucleotide sequences were determined. The length of the clones sequenced in this study summed up to 1,081,958 bp. As we have previously reported the sequence of 9,072,622 bp by analysis of 125 P1 and TAC clones, the total length of the sequences of chromosome 5 determined so far is now 10,154,580 bp. The sequences were subjected to similarity search against protein and EST databases and analysis with computer programs for gene modeling. As a consequence, a total of 253 potential protein-coding genes with known or predicted functions were identified. The positions of exons which do not show apparent similarity to known genes were also assigned using computer programs for exon prediction. The average density of the genes identified in this study was 1 gene per 4277 bp. Introns were observed in 74% of the potential protein genes, and the average number per gene and the average length of the introns were 4.3 and 168 bp, respectively. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

6.
In our ongoing project to deduce the nucleotide sequence of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced on the basis of the fine physical map, and as of January, 2000, the sequences of 16.6 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. Along with the sequence determination, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and we already predicted a total of 2697 potential protein coding genes in the 11,166,130 bp regions, which are covered by 159 P1 and TAC clones. In this paper, we describe the structural features of the 3,076,755 bp regions covered by newly analyzed 60 P1 and TAC clones. A total of 715 potential protein coding genes were identified, giving an average density of the genes identified of 1 gene per 4001 bp. Introns were observed in 80% of the genes, and the average number per gene and the average length of the introns were 4.5 and 147 bp, respectively. These sequence features are nearly identical to those in our latest report in which the data were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 12 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

7.
8.
Sixteen Pl and TAC clones assigned to Arabidopsis thaliana chromosome5 were sequenced, and their sequence features were analyzedusing various computer programs. The total length of the sequencesdetermined was 1,013,767 bp. Together with the nucleotide sequencesof 109 clones previously reported, the regions of chromosome5 sequenced so far now total 9,072,622 bp, which presumablycovers approximately one-third of the chromosome. A similaritysearch against the reported gene sequences predicted the presenceof a total of 225 protein-coding genes and/or gene segmentsin the newly sequenced regions, indicating an average gene densityof one gene per 4.5 kb. Introns were identified in 72.4% ofthe potential protein genes for which the entire gene structurewas predicted, and the average number per gene and the averagelength of the introns were 3.3 and 163 bp, respectively. Thesesequence features are essentially identical to those in thepreviously reported sequences. The sequence data and gene informationare available on the World Wide Web database KAOS (Kazusa Arabidopsisdata Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

9.
In this series of projects sequencing the entire genome of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced according to the fine physical map, and as of May 7, 1999, the sequences of 16.2 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. In parallel, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and to date we have predicted a total of 2380 potential protein-coding genes in the 10,154,580 bp regions, which are covered by 142 P1 and TAC clones. In this paper, we newly analyzed the structural features of the 1,011,550 bp regions covered by additional 17 P1 and TAC clones, and predicted 298 protein-coding genes. The average density of the genes identified was 1 gene per 3394 bp. Introns were observed in 67% of the genes, and the average number per gene and the average length of the introns were 3.2 and 159 bp, respectively. The gene density became higher than the value estimated in the previously analyzed regions (1 gene per 4,267 bp), as the data in this paper were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 8 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available on the database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

10.
11.
To deduce the entire sequence of the top arm of the Arabidopsis thaliana chromosome 3, the sequence determination was performed on a total of 90 P1, TAC and BAC clones chosen according to our sequencing strategy. Sequence features of the resulting 4,251,695 bp regions were analyzed with various computer programs for similarity search and gene modeling. As a result, a total of 941 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4210 bp. Introns were observed in 73% of the genes, and the average number per gene and the average length of the introns were 3.6 and 159 bp, respectively. These sequence features are essentially identical to those of chromosomes 3 and 5 in our previous reports. The regions also contained 14 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

12.
Based on the physical map of Arabidopsis thaliana chromosome 3 previously constructed with CIC YAC, TAC, P1 and BAC clones (Sato, S. et al., DNA Res., 5, 163-168, 1998), a total of 60 P1 and TAC clones were sequenced, and the sequence features of the resulting 4,504,864 bp regions were analyzed by applying various computer programs for similarity search and gene modeling. As a result, a total of 1054 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4066 bp. Introns were observed in 77% of the genes, and the average number per gene and the average length of the introns were 3.9 and 156 bp, respectively. These sequence features are essentially identical to those of chromosome 5 in our previous reports, but the gene density was slightly higher than that observed for chromosomes 2 and 4. The regions also contained 10 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

13.
14.
Nineteen Pl and TAC clones, which have been mapped on the finephysical map of the Arabidopsis thaliana chromosome 5, weresequenced according to the shotgun-based strategy, and theirstructural features were analysed. The total length of the regionssequenced in this study was 1,367,185 bp. Combining this withthe regions covered by 90 P1 and TAC clones proviously reported,the total length of chromosome 5 sequenced to date becomes 8,058,855bp. On the basis of similarity search against protein and ESTdatabases and gene modeling with computer programs, a totalof 330 potential protein-coding regions were identified, bringingan average density of the genes to approximately one gene per4.1 kb. Introns were identified in 81.0% of the potential proteingenes for which the entire gene structure was predicted, withan average number per gene of 4.2 and an average length of theintrons of 180 bp. The RNA-coding genes identified were 9 tRNAgenes corresponding to 8 amino acid species and 2 genes forU2 nuclear RNA. These sequence features are essentially identicalto those in the previously reported sequences. The sequencedata and gene information are available on the World Wide Webdatabase KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

15.
16.
17.
Complete structure of the chloroplast genome of a legume, Lotus japonicus.   总被引:4,自引:0,他引:4  
The nucleotide sequence of the entire chloroplast genome (150,519 bp) of a legume, Lotus japonicus, has been determined. The circular double-stranded DNA contains a pair of inverted repeats of 25,156 bp which are separated by a small and a large single copy region of 18,271 bp and 81,936 bp, respectively. A total of 84 predicted protein-coding genes including 7 genes duplicated in the inverted repeat regions, 4 ribosomal RNA genes and 37 tRNA genes (30 gene species) representing 20 amino acids species were assigned on the genome based on similarity to genes previously identified in other chloroplasts. All the predicted genes were conserved among dicot plants except that rpl22, a gene encoding chloroplast ribosomal protein CL22, was missing in L. japonicus. Inversion of a 51-kb segment spanning rbcL to rpsl6 (positions 5161-56,176) in the large single copy region was observed in the chloroplast genome of L. japonicus. The sequence data and gene information are available on our World Wide Web database at http://www.kazusa.or.jp/en/plant/database.html.  相似文献   

18.
19.
A series of 18 small overlapping restriction fragments has been cloned, covering the complete mitochondrial genome of Schizosaccharomyces pombe. By hybridizing mitochondrial gene probes from Saccharomyces cerevisiae and Neurospora crassa with restriction fragments of Schizosaccharomyces pombe mitochondrial DNA, the following homologous genes were localized on the mitochondrial genome of S. pombe: cob, cox1, cox2 and cox3, ATPase subunit 6 and 9 genes, the large rRNA gene and both types of open reading frames occurring in mitochondrial introns of various ascomycetes. The region of the genome, hybridizing with cob exon probes is separated by an intervening sequence of about 2500 bp, which is homologous with the first two introns of the cox1 gene in Saccharomyces cerevisiae (class II introns according to Michel et al. 1982). Similarly, in the cox1 homologous region, which covers about 4000 bp, two regions were detected hybridizing with class I intron probes, suggesting the existence of two cox1 introns in Schizosaccharomyces pombe. Hybridization with several specific exon probes with a determined order has revealed that cob, cox1, cox3 and the large rRNA gene are all transcribed from the same DNA strand. The low intensities of hybridization signals suggest a large evolutionary distance between Schizosaccharomyces pombe and Saccharomyces cerevisiae or Neurospora crassa mitochondrial genes. Considering the length of the mitochondrial DNA of Schizosaccharomyces pombe (about 19.4 kbp) and the expected length of the localized genes and intron sequences there is enough space left for encoding the expected set of tRNAs and the small rRNA gene. The existence of leader-, trailer-, ori- and spacer sequences or further unassigned reading frames is then restricted to a total length of about 3000 bp only.  相似文献   

20.
The human genome is revisited using exon and intron distribution profiles. The 26,564 annotated genes in the human genome (build October, 2003) contain 233,785 exons and 207,344 introns. On average, there are 8.8 exons and 7.8 introns per gene. About 80% of the exons on each chromosome are < 200 bp in length. < 0.01% of the introns are < 20 bp in length and < 10% of introns are more than 11,000 bp in length. These results suggest constraints on the splicing machinery to splice out very long or very short introns and provide insight to optimal intron length selection. Interestingly, the total length in introns and intergenic DNA on each chromosome is significantly correlated to the determined chromosome size with a coefficient of correlation r = 0.95 and r = 0.97, respectively. These results suggest their implication in genome design.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号