首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
During the last ten years, Arabidopsis thaliana has become the most favoured plant system for the study of many aspects of development and adaptation to adverse conditions and diseases. The sequencing of the Arabidopsis thaliana genome is nearly completed with more than 90% of the sequence being released in public databases. This is the first plant genome to be analysed and it has revealed a tremendous amount of information about the nature of the genes it contains and its largely duplicated organisation. French groups have been involved in Arabidopsis genomics at several steps: EST (expressed sequence tags) sequencing, construction and ordering (physical mapping of chromosomes) of a YAC (yeast artificial chromosomes) library, genomic sequencing. In parallel an extensive programme of functional genomics is being undertaken through the systematic analysis of insertional mutants. This information provides a support for analysing other more economically important plant genomes such as the rice genome and constitutes the beginning of a systematic investigation on plant gene functions and will promote new strategies for plant improvement.  相似文献   

2.
Nearly 4 years after launching the International Rice Genome Sequencing Project (IRGSP), the rice genome sequence is almost completed. This is the second plant genome after Arabidopsis thaliana and one expect that it is more representative of other cereal genomes. Indeed, no more than 4 sequences have been independently reported as a result of a tough competition between economy, politics and media. The efficiency and impact of this way of managing a large scale project is questionable. This paper reports the various phases in sequencing rice genome as well as what we start to learn.  相似文献   

3.
4.
5.
MIPS: a database for genomes and protein sequences   总被引:17,自引:0,他引:17       下载免费PDF全文
The Munich Information Center for Protein Sequences (MIPS-GSF), Martinsried, near Munich, Germany, continues its longstanding tradition to develop and maintain high quality curated genome databases. In addition, efforts have been intensified to cover the wealth of complete genome sequences in a systematic, comprehensive form. Bioinformatics, supporting national as well as European sequencing and functional analysis projects, has resulted in several up-to-date genome-oriented databases. This report describes growing databases reflecting the progress of sequencing the Arabidopsis thaliana (MATDB) and Neurospora crassa genomes (MNCDB), the yeast genome database (MYGD) extended by functional analysis data, the database of annotated human EST-clusters (HIB) and the database of the complete cDNA sequences from the DHGP (German Human Genome Project). It also contains information on the up-to-date database of complete genomes (PEDANT), the classification of protein sequences (ProtFam) and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database. These databases can be accessed through the MIPS WWW server (http://www. mips.biochem.mpg.de).  相似文献   

6.
7.
Genome sequence information has continued to accumulate at a spectacular pace during the past year. Details of the sequence and gene content of human chromosome 22 were published. The sequencing and annotation of the first two Arabidopsis thaliana chromosomes was completed. The sequence of chromosome 3 from Plasmodium falciparum, the second sequenced malaria chromosome, was reported, as was that of chromosome 1 from Leishmania major. The complete genomic sequences of five microbes were reported. Approaches to using data from completely sequenced microbial genomes in phylogenetic studies are being explored, as is the application of microarrays to whole genome expression analysis.  相似文献   

8.
Recent advances, such as the availability of extensive genome survey sequence (GSS) data and draft physical maps, are radically transforming the means by which we can dissect Brassica genome structure and systematically relate it to the Arabidopsis model. Hitherto, our view of the co-linearities between these closely related genomes had been largely inferred from comparative RFLP data, necessitating substantial interpolation and expert interpretation. Sequencing of the Brassica rapa genome by the Multinational Brassica Genome Project will, however, enable an entirely computational approach to this problem. Meanwhile we have been developing databases and bioinformatics tools to support our work in Brassica comparative genomics, including a recently completed draft physical map of B. rapa integrated with anchor probes derived from the Arabidopsis genome sequence. We are also exploring new ways to display the emerging Brassica-Arabidopsis sequence homology data. We have mapped all publicly available Brassica sequences in silico to the Arabidopsis TIGR v5 genome sequence and published this in the ATIDB database that uses Generic Genome Browser (GBrowse). This in silico approach potentially identifies all paralogous sequences and so we colour-code the significance of the mappings and offer an integrated, real-time multiple alignment tool to partition them into paralogous groups. The MySQL database driving GBrowse can also be directly interrogated, using the powerful API offered by the Perl BioColon, two colonsDBColon, two colonsGFF methods, facilitating a wide range of data-mining possibilities.  相似文献   

9.
Arabidopsis thaliana has a relatively small genome of approximately 130 Mb containing about 10% repetitive DNA. Genome sequencing studies reveal a gene-rich genome, predicted to contain approximately 25000 genes spaced on average every 4.5 kb. Between 10 to 20% of the predicted genes occur as clusters of related genes, indicating that local sequence duplication and subsequent divergence generates a significant proportion of gene families. In addition to gene families, repetitive sequences comprise individual and small clusters of two to three retroelements and other classes of smaller repeats. The clustering of highly repetitive elements is a striking feature of the A. thaliana genome emerging from sequence and other analyses.  相似文献   

10.
Rice as a model for cereal genomics.   总被引:9,自引:0,他引:9  
Over the past two years, selected regions of the rice genome have been sequenced and shown to be colinear at the sequence level with limited regions of other cereal genomes. A large number of expressed gene sequences and molecular markers have accumulated in the public databases. Large insert clone libraries of the rice genome have been constructed, and rice has become an increasingly attractive candidate for whole genome sequencing.  相似文献   

11.
GDB: the Human Genome Database.   总被引:6,自引:0,他引:6       下载免费PDF全文
The Genome Database (GDB, http://www.gdb.org ) is a public repository of data on human genes, clones, STSs, polymorphisms and maps. GDB entries are highly cross-linked to each other, to literature citations and to entries in other databases, including the sequence databases, OMIM, and the Mouse Genome Database. Mapping data from large genome centers and smaller mapping efforts are added to GDB on an ongoing basis. The database can be searched by a variety of methods, ranging from keyword searches to complex queries. Major functionality extensions in the last year include the ongoing computation of integrated human genome maps, called Comprehensive Maps, and the use of those maps to support positional queries and graphic displays. The capabilities of the GDB map viewer (Mapview) have been extended to include map printing and the graphical display of ad hoc query results. The HUGO Nomenclature Committee continues to curate the proposed and official gene symbols and related data in collaboration with GDB. As genome research shifts its emphasis from mapping to sequencing and functional analysis, the scope of the GDB schema is being extended. We are in the process of adding representations of gene function and expression, and improving our representation of human polymorphism and mutation.  相似文献   

12.
An extensive effort of the International Rice Genome Sequencing Project (IRGSP) has resulted in rapid accumulation of genome sequence, and >137 Mb has already been made available to the public domain as of August 2001. This requires a high-throughput annotation scheme to extract biologically useful and timely information from the sequence data on a regular basis. A new automated annotation system and database called Rice Genome Automated Annotation System (RiceGAAS) has been developed to execute a reliable and up-to-date analysis of the genome sequence as well as to store and retrieve the results of annotation. The system has the following functional features: (i) collection of rice genome sequences from GenBank; (ii) execution of gene prediction and homology search programs; (iii) integration of results from various analyses and automatic interpretation of coding regions; (iv) re-execution of analysis, integration and automatic interpretation with the latest entries in reference databases; (v) integrated visualization of the stored data using web-based graphical view. RiceGAAS also has a data submission mechanism that allows public users to perform fully automated annotation of their own sequences. The system can be accessed at http://RiceGAAS.dna.affrc.go.jp/.  相似文献   

13.
A small freshwater fish medaka (Oryzias latipes) has been one of the most attractive experimental systems for research in genetics and developmental biology. We have formed an international consortium Medaka Genome Initiative (MGI) to collect and share various information and resources on medaka. The MGI has set an ambitious goal aiming at the complete sequencing of the medaka genome and as a feasibility study we have begun sequencing one particular chromosome, linkage group 22 (LG22) of 22 Mb in size. Initial sequence analysis revealed unique features of the medaka genome in comparison to fugu genome.  相似文献   

14.
The year 2001 may well be called the Year of the Human Genome. Less in the limelight, but equally exciting for plant scientists, is the rapid progress in plant genomics. With relatively modest resources, a lot has been achieved. The Arabidopsis genomic sequence (125 megabases [Mb]) is essentially finished, and rice sequencing is progressing rapidly. For many species, expressed sequence tag (EST) resources are plentiful, allowing broad inter-specific comparisons. At the same time, development of integrated physical-genetic maps for large-genome crop species is not progressing as rapidly as desired, while resources for the complete sequencing of these crops are not likely to become available. Some important plant genomes are so large that their complete sequencing may not be practical for many years. Significant plant genome research is concentrated in industry, and not freely available, creating some frustration in the academic community. Growing interest is anticipated in the development of metabolic profiling technologies, RNA profiling, proteomics and integrated systems approaches to plant biology.  相似文献   

15.
宋述慧  滕徐菲  肖景发 《遗传》2018,40(11):1048-1054
随着人类基因组计划和国际千人基因组计划的实施,已公开数百个中国人个体的全基因组数据。建立高精度的中国人群参考基因组序列,发现并解析中国人群特有的序列变异,是我国未来精准医学研究的基础。为满足未来精准医学研究中国人基因组数据持续增长的科学管理和深入研究的需求,中国科学院北京基因组研究所发展并建立了基于中国人群全基因组测序数据的虚拟中国人基因组数据库(Virtual Chinese Genome Database, VCGDB)和中国人群基因组变异数据库(Genome Variation Map, GVM),面向国内外用户提供数据检索、共享、下载和在线分析服务。本文重点介绍了这两个数据库的特点和功能,以及未来发展与应用前景,以期为中国人群参考基因组及基因组变异图谱资源库的推广使用、发展完善提供有益信息。  相似文献   

16.
DNA Data Bank of Japan at work on genome sequence data.   总被引:5,自引:3,他引:2       下载免费PDF全文
We at the DNA Data Bank of Japan (DDBJ) (http://www.ddbj.nig.ac.jp) have recently begun receiving, processing and releasing EST and genome sequence data submitted by various Japanese genome projects. The data include those for human, Arabidopsis thaliana, rice, nematode, Synechocystis sp. and Escherichia coli. Since the quantity of data is very large, we organized teams to conduct preliminary discussions with project teams about data submission and handling for release to the public. We also developed a mass submission tool to cope with a large quantity of data. In addition, to provide genome data on WWW, we developed a genome information system using Java. This system (http://mol.genes.nig.ac.jp/ecoli/) can in theory be used for any genome sequence data. These activities will facilitate processing of large quantities of EST and genome data.  相似文献   

17.
Large volumes of genomic data have been generated for several plant species over the past decade, including structural sequence data and functional annotation at the genome level. Various technologies such as expressed sequence tags (ESTs), massively parallel signature sequencing (MPSS) and microarrays have been used to study gene expression and to provide functional data for many genes simultaneously. This review focuses on recent advances in the application of microarrays in plant genomic research and in gene expression databases available for plants. Large sets of Arabidopsis microarray data are publicly available. Recently developed array platforms are currently being used to generate genome-wide expression profiles for several crop species. Coupled to these platforms are public databases that provide access to these large-scale expression data, which can be used to aid the functional discovery of gene function.  相似文献   

18.
With the advent of DNA sequencing technologies, more and more reference genome sequences are available for many organisms. Analyzing sequence variation and understanding its biological importance are becoming a major research aim. However, how to store and process the huge amount of eukaryotic genome data, such as those of the human, mouse and rice, has become a challenge to biologists. Currently available bioinformatics tools used to compress genome sequence data have some limitations, such as the requirement of the reference single nucleotide polymorphisms (SNPs) map and information on deletions and insertions. Here, we present a novel compression tool for storing and analyzing Genome ReSequencing data, named GRS. GRS is able to process the genome sequence data without the use of the reference SNPs and other sequence variation information and automatically rebuild the individual genome sequence data using the reference genome sequence. When its performance was tested on the first Korean personal genome sequence data set, GRS was able to achieve ~159-fold compression, reducing the size of the data from 2986.8 to 18.8 MB. While being tested against the sequencing data from rice and Arabidopsis thaliana, GRS compressed the 361.0 MB rice genome data to 4.4 MB, and the A. thaliana genome data from 115.1 MB to 6.5 KB. This de novo compression tool is available at http://gmdd.shgmo.org/Computational-Biology/GRS.  相似文献   

19.
A complete genome sequence provides unlimited information in the sequenced organism as well as in related taxa. According to the guidance of the Multinational Brassica Genome Project (MBGP), the Korea Brassica Genome Project (KBGP) is sequencing chromosome 1 (cytogenetically oriented chromosome #1) of Brassica rapa. We have selected 48 seed BACs on chromosome 1 using EST genetic markers and FISH analyses. Among them, 30 BAC clones have been sequenced and 18 are on the way. Comparative genome analyses of the EST sequences and sequenced BAC clones from Brassica chromosome 1 revealed their homeologous partner regions on the Arabidopsis genome and a syntenic comparative map between Brassica chromosome 1 and Arabidopsis chromosomes. In silico chromosome walking and clone validation have been successfully applied to extending sequence contigs based on the comparative map and BAC end sequences. In addition, we have defined the (peri)centromeric heterochromatin blocks with centromeric tandem repeats, rDNA and centromeric retrotransposons. In-depth sequence analyses of five homeologous BAC clones and an Arabidopsis chromosomal region reveal overall co-linearity, with 82% sequence similarity. The data indicate that the Brassica genome has undergone triplication and subsequent gene losses after the divergence of Arabidopsis and Brassica. Based on in-depth comparative genome analyses, we propose a comparative genomics approach for conquering the Brassica genome. In 2005 we intend to construct an integrated physical map, including sequence information from 500 BAC clones and integration of fingerprinting data and end sequence data of more than 100 000 BAC clones. The sequences have been submitted to GenBank with accession numbers: 10 204 BAC ends of the KBrH library (CW978640-CW988843); KBrH138P04, AC155338; KBrH117N09, AC155337; KBrH097M21, AC155348; KBrH093K03, AC155347; KBrH081N08, AC155346; KBrH080L24, AC155345; KBrH077A05, AC155343; KBrH020D15, AC155340; KBrH015H17, AC155339; KBrH001H24, AC155335; KBrH080A08, AC155344; KBrH004D11, AC155341; KBrH117M18, AC146875; KBrH052O08, AC155342.  相似文献   

20.
This report illustrates development of plant sequencing programmes. So far Arabidopsis genome has been completely sequenced and a draft of the rice genome is available. The Arabidopsis programmes stimulated sequencing of EST (expressed sequence tags) from numerous cultivated species thus creating an enormous resource. The major challenge is now to correctly annotate all the genes in Arabidopsis and find out a biological and biochemical function for each one. The availability of EST and genome sequence now allows one to analyse the expression of genes at the level of the whole genome.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号