Found 20 similar articles (search time: 15 ms)
1.
2.
Background
Large molecular sequence databases are fundamental resources for modern bioscientists. Whether for project-specific purposes or sharing data with colleagues, it is often advantageous to maintain smaller sequence databases. However, this is usually not an easy task for the average bench scientist.
3.
Background
Improvements in DNA sequencing technology and methodology have led to the rapid expansion of databases comprising DNA sequence, gene and genome data. Lower operational costs and heightened interest resulting from intriguing early discoveries in genomics are also contributing to the accumulation of these data sets. A major challenge is to analyze and mine data from these databases, especially whole genomes. There is a need for computational tools that look globally at genomes for data mining.
4.
Background
The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge of analyzing the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in ad hoc approaches to address specific problems.
5.
Background
Although molecular pathway information and the International HapMap Project data can help biomedical researchers to investigate the aetiology of complex diseases more effectively, such information is missing or insufficient in current genetic association databases. In addition, only a few of the environmental risk factors are included as gene-environment interactions, and the risk measures of associations are not indexed in any association databases.
6.
Edyta Koscianska Julia Starega-Roslan Lukasz J Sznajder Marta Olejniczak Paulina Galka-Marciniak Wlodzimierz J Krzyzosiak 《BMC molecular biology》2011,12(1):14
Background
Numerous microRNAs (miRNAs) have heterogeneous ends resulting from imprecise cleavages by processing nucleases and from various non-templated nucleotide additions. The scale of miRNA end-heterogeneity is best shown by deep sequencing data revealing not only the major miRNA variants but also those that occur in only minute amounts and are unlikely to be of functional importance. All RNA interference (RNAi) technology reagents that are expressed and processed in cells are also exposed to the same machinery generating end-heterogeneity of the released short interfering RNAs (siRNAs) or miRNA mimetics.
7.
Background
With the completion of the Human Genome Project and recent advancements in mutation detection technologies, the volume of data available on genetic variations has risen considerably. These data are stored in online variation databases and provide important clues to the cause of diseases and potential side effects or resistance to drugs. However, the data presentation techniques employed by most of these databases make them difficult to use and understand.
8.
Norman Wang Alison R Sherwood Akira Kurihara Kimberly Y Conklin Thomas Sauvage Gernot G Presting 《BMC plant biology》2009,9(1):117-5
Background
Organization and presentation of biodiversity data are greatly facilitated by databases that are specially designed to allow easy data entry and organized data display. Such databases also have the capacity to serve as Laboratory Information Management Systems (LIMS). The Hawaiian Algal Database was designed to showcase specimens collected from the Hawaiian Archipelago, enabling users around the world to compare their specimens with our photographs and DNA sequence data, and to provide lab personnel with an organizational tool for storing various biodiversity data types.
9.
Background
Integration of heterogeneous data types is a challenging problem, especially in biology, where the number of databases and data types increases rapidly. Amongst the problems that one has to face are integrity, consistency, redundancy, connectivity, expressiveness and updatability.
10.
Xue Gong Ruihong Wu Yuannv Zhang Wenyuan Zhao Lixin Cheng Yunyan Gu Lin Zhang Jing Wang Jing Zhu Zheng Guo 《BMC bioinformatics》2010,11(1):76
Background
Hundreds of genes that are causally implicated in oncogenesis have been found and collected in various databases. For efficient application of these abundant but diverse data sources, it is of fundamental importance to evaluate their consistency.
11.
Jan Küntzer Christina Backes Torsten Blum Andreas Gerasch Michael Kaufmann Oliver Kohlbacher Hans-Peter Lenhof 《BMC bioinformatics》2007,8(1):367
Background
Technological advances in high-throughput techniques and efficient data acquisition methods have resulted in a massive amount of life science data. The data is stored in numerous databases that have been established over the last decades and are essential resources for scientists nowadays. However, the diversity of the databases and the underlying data models make it difficult to combine this information for solving complex problems in systems biology. Currently, researchers typically have to browse several, often highly focused, databases to obtain the required information. Hence, there is a pressing need for more efficient systems for integrating, analyzing, and interpreting these data. Standardizing and virtually consolidating the databases, so as to give unified access to a variety of data sources, is a major challenge.
12.
Background
Recent, rapid growth in the quantity of available genomic data has generated many protein sequences that are not yet biochemically classified. Thus, the prediction of biochemical function based on structural motifs is an important task in post-genomic analysis. The InterPro databases are a major resource for protein function information. For optimal results, these databases should be searched at regular intervals, since they are frequently updated.
13.
Background
With advances in high-throughput genomics and proteomics, it is challenging for biologists to deal with large data files and to map their data to annotations in public databases.
14.
Riadh Hammami Abdelmajid Zouhir Karim Naghmouchi Jeannette Ben Hamida Ismail Fliss 《BMC bioinformatics》2008,9(1):121
Background
The exponential growth of research in molecular biology has brought a concomitant proliferation of databases for storing its findings. A variety of protein sequence databases exist. While all of these strive for completeness, the range of user interests is often beyond their scope. Large databases covering a broad range of domains tend to offer less detailed information than smaller, more specialized resources, often creating a need to combine data from many sources in order to obtain a complete picture. Scientific researchers are continually developing new specific databases to enhance their understanding of biological processes.
15.
Michael Baitaluk Xufei Qian Shubhada Godbole Alpan Raval Animesh Ray Amarnath Gupta 《BMC bioinformatics》2006,7(1):55-13
Background
The goal of information integration in systems biology is to combine information from a number of databases and data sets, which are obtained from both high and low throughput experiments, under one data management scheme such that the cumulative information provides greater biological insight than is possible with individual information sources considered separately.
16.
Background
The analysis of biological data is greatly enhanced by existing or emerging databases. Most existing databases, with few exceptions, are not designed to easily support large-scale computational analysis, but rather offer exclusively a web interface to the resource. We have recognized the growing need for a database which can be used successfully as a backend to computational analysis tools and pipelines. Such a database should be sufficiently versatile to allow easy system integration.
17.
Background
The nematode Caenorhabditis elegans is widely used for the genetic analysis of neuronal cell biology, development, and behavior. Because traditional methods for evaluating behavioral phenotypes are qualitative and imprecise, there is a need for tools that allow quantitation and standardization of C. elegans behavioral assays.
18.
Background
The majority of information in the biological literature resides in full-text articles rather than in abstracts. Yet, abstracts remain the focus of many publicly available literature data mining tools. Most literature mining tools rely on pre-existing lexicons of biological names, often extracted from curated gene or protein databases. This is a limitation, because such databases have low coverage of the many name variants that are used to refer to biological entities in the literature.
19.
Diogo FT Veiga Helena F Deus Caner Akdemir Ana Tereza R Vasconcelos Jonas S Almeida 《BMC systems biology》2009,3(1):109-9
Background
DAS is a widely adopted protocol for providing syntactic interoperability among biological databases. The popularity of DAS is due to a simplified and elegant mechanism for data exchange that consists of sources exposing their RESTful interfaces for data access. As a growing number of DAS services are available for molecular biology resources, there is an incentive to explore this protocol in order to advance data discovery and integration among these resources.
20.