首页 | 本学科首页   官方微博 | 高级检索  
     


Biological sequences integrated: a relational database approach
Authors:Bergholz A  Heymann S  Schenk J A  Freytag J C
Affiliation:(1) Institute of Computer Science, Humboldt-University Berlin, Unter den Linden 6, D-10099 Berlin, Germany;(2) Max-Delbrück-Center for Molecular Medicine (MDC), Robert-Rössle–Str. 10, D-13125 Berlin, Germany;(3) Kelman GmbH, Berlin, Germany;(4) Max-Delbrück-Center for Molecular Medicine (MDC), Robert-Rössle–Str. 10, D-13125 Berlin, Germany;(5) Institute of Biochemistry and Biology, Department of Biotechnology, University of Potsdam, Golm, Germany
Abstract:Over the last decade the modeling and the storage of biological data has been a topic of wide interest for scientists dealing with biological and biomedical research. Currently most data is still stored in text files which leads to data redundancies and file chaos.In this paper we show how to use relational modeling techniques and relational database technology for modeling and storing biological sequence data, i.e. for data maintained in collections like EMBL or SWISS-PROT to better serve the needs for these application domains.For this reason we propose a two step approach. First, we model the structure (and therefore the meaning of the) data using an Entity-Relationship approach. The ER model leads to a clean design of a relational database schema for storing and retrieving the DNA and protein data extracted from various sources. Our approach provides the clean basis for building complex biological applications that are more amenable to changes and software ports than their file-base counterparts.
Keywords:
本文献已被 PubMed SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号