首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 687 毫秒
1.
从信息处理的角度来看,生物信息学与自然语言处理中的许多问题是非常相似的,因此,可以将一些自然语言处理中的经典方法应用到生物信息学文字中。本文介绍了自然语言处理和生物信息学中共有的问题,如比对、分类、预测等,以及这些问题的解决方法。通过对两个领域形似问题的分析可知,优秀的自然语言处理技术也可用来解决生物信息学方面的问题,并且一些还未在生物信息学领域得到应用的自然语言理解技术也有其潜在的应用价值。最后给出了一个分类问题的解决方案,演示了如何在生物数据上应用算法进行实验。  相似文献   

2.
The discovery of candidate biomarkers from biological materials coupled with the development of detection methods holds both incredible clinical potential as well as significant challenges. However, the proteomic techniques still provide the low dynamic range of protein detection at lower abundances. This review describes the current development of potential methods to enhance the detection and quantification in proteome studies. It also includes the bioinformatics tools that are helpfully used for data mining of protein ontology. Therefore, we believe that this review provided many proteomic approaches, which would be very potent and useful for proteome studies and for further diagnostic and therapeutic applications.  相似文献   

3.
Proteomic studies involve the identification as well as qualitative and quantitative comparison of proteins expressed under different conditions, and elucidation of their properties and functions, usually in a large-scale, high-throughput format. The high dimensionality of data generated from these studies will require the development of improved bioinformatics tools and data-mining approaches for efficient and accurate data analysis of biological specimens from healthy and diseased individuals. Mining large proteomics data sets provides a better understanding of the complexities between the normal and abnormal cell proteome of various biological systems, including environmental hazards, infectious agents (bioterrorism) and cancers. This review will shed light on recent developments in bioinformatics and data-mining approaches, and their limitations when applied to proteomics data sets, in order to strengthen the interdependence between proteomic technologies and bioinformatics tools.  相似文献   

4.
Quantitative proteome profiling using mass spectrometry and stable isotope dilution is being widely applied for the functional analysis of biological systems and for the detection of clinical, diagnostic or prognostic marker proteins. Because of the enormous complexity of proteomes, their comprehensive analysis is unlikely to be routinely achieved in the near future. However, in recent years, significant progress has been achieved focusing quantitative proteomic analyses on specific protein classes or subproteomes that are rich in biologically or clinically important information. Such projects typically combine the use of chemical probes that are specific for a targeted group of proteins and may contain stable isotope signatures for accurate quantification with automated tandem mass spectrometry and bioinformatics tools for data analysis. In this review, we summarize technical and conceptual advances in quantitative subproteome profiling based on tandem mass spectrometry and chemical probes.  相似文献   

5.
Proteomic studies involve the identification as well as qualitative and quantitative comparison of proteins expressed under different conditions, and elucidation of their properties and functions, usually in a large-scale, high-throughput format. The high dimensionality of data generated from these studies will require the development of improved bioinformatics tools and data-mining approaches for efficient and accurate data analysis of biological specimens from healthy and diseased individuals. Mining large proteomics data sets provides a better understanding of the complexities between the normal and abnormal cell proteome of various biological systems, including environmental hazards, infectious agents (bioterrorism) and cancers. This review will shed light on recent developments in bioinformatics and data-mining approaches, and their limitations when applied to proteomics data sets, in order to strengthen the interdependence between proteomic technologies and bioinformatics tools.  相似文献   

6.
Functional proteomics can be defined as a strategy to couple proteomic information with biochemical and physiological analyses with the aim of understanding better the functions of proteins in normal and diseased organs. In recent years, a variety of publicly available bioinformatics databases have been developed to support protein-related information management and biological knowledge discovery. In addition to being used to annotate the proteome, these resources also offer the opportunity to develop global approaches to the study of the functional role of proteins both in health and disease. Here, we present a comprehensive review of the major human protein bioinformatics databases. We conclude this review by discussing a few examples that illustrate the importance of these databases in functional proteomics research.  相似文献   

7.
Recent advances in analytical methods, particularly in the area of protein microarrays, have brought the field of proteomics to the forefront of biological science. Protein arrays have shown to be useful for the multiplexed analysis of several hundreds of proteins in parallel. While much of the effort has focused on developing methods to identify expressed proteins, the identification of post-translational modifications is equally important for comprehensive proteome characterization. Protein phosphorylation constitutes a major type of post-translational modification that mobilizes a high number of genes, is involved in many crucial cell functions and largely contriubutes to the complexity of the proteome. One of the major challenges to analyze phosphoproteins using arrays is the availability of specific antibodies. Thus far, this has hampered the development of highly complex phosphoprotein arrays. This review discusses some of the recent progress made in the development of techniques and reagents to quantitatively determine sites of protein phosphorylation.  相似文献   

8.
Extensive proteome discovery projects using a variety of mass spectrometric techniques have identified proteins matching to 50-70% of the predicted gene models of various species. Comprehensive proteome coverage is desirable for the unbiased comparison of protein quantities between different biological states and for the meaningful comparison of data from multiple samples. Here we discuss the feasibility of this goal in the light of recent technological developments.  相似文献   

9.
随着深度测序和基因芯片技术的不断发展,基因组、转录组、表达谱数据大量积累。目前,至少有10多个昆虫的基因组已被测序,30多个昆虫的转录组数据被报道。显然,传统的生物统计学方法无法处理如此海量的生物数据。量变引发质变,生物数据的大量积累催生了一门新兴学科,生物信息学。生物信息学融合了统计学、信息科学和生物学等各学科的理论和研究内容,在医学、基础生物学、农业科学以及昆虫学等方面获得了广泛的应用。生物信息学的目标是存储数据、管理数据和数据挖掘。因此,建立维护生物学数据库、设计开发基于模式识别、机器学习、数据挖掘等方法的生物软件,以及运用上述工具进行深度的数据挖掘,是生物信息学的重要研究内容。本文首先简要介绍了生物信息学的历史、研究现状及其在昆虫学科中的应用,然后综述了昆虫基因组学和转录组学的研究进展,最后对生物信息学在昆虫学研究中的应用前景进行了展望。  相似文献   

10.
ABSTRACT

Introduction: Protein microarray is a powerful tool for both biological study and clinical research. The most useful features of protein microarrays are their miniaturized size (low reagent and sample consumption), high sensitivity and their capability for parallel/high-throughput analysis. The major focus of this review is functional proteome microarray.

Areas covered: For proteome microarray, this review will discuss some recently constructed proteome microarrays and new concepts that have been used for constructing proteome microarrays and data interpretation in past few years, such as PAGES, M-NAPPA strategy, VirD technology, and the first protein microarray database. this review will summarize recent proteomic scale applications and address the limitations and future directions of proteome microarray technology.

Expert opinion: Proteome microarray is a powerful tool for basic biological and clinical research. It is expected to see improvements in the currently used proteome microarrays and the construction of more proteome microarrays for other species by using traditional strategies or novel concepts. It is anticipated that the maximum number of features on a single microarray and the number of possible applications will be increased, and the information that can be obtained from proteome microarray experiments will more in-depth in the future.  相似文献   

11.
12.
Maintenance of proteome integrity (proteostasis) is essential for cellular and organismal survival. Various cellular mechanisms work to preserve proteostasis by ensuring correct protein maturation and efficient degradation of misfolded and damaged proteins. Despite this cellular effort, under certain circumstances subsets of aggregation-prone proteins escape the quality control surveillance, accumulate within the cell and form insoluble aggregates that can lead to the development of disorders including late-onset neurodegenerative diseases. Cells respond to the appearance of insoluble aggregates by actively transporting them to designated deposition sites where they often undergo degradation. Although several protein aggregate deposition sites have been described and extensively studied, key questions regarding their biological roles and how they are affected by aging remained unanswered. Here we review the recent advances in the field, describe the different subtypes of these cellular compartments and outline the evidence that these structures change their properties over time. Finally, we propose models to explain the possible mechanistic links between aggregate deposition sites, neurodegenerative disorders and the aging process.  相似文献   

13.
应用CGH数据和树模型探索癌症的发病机理   总被引:1,自引:0,他引:1  
李小波  陈俭  吕炳建  来茂德 《遗传》2008,30(4):407-412
比较基因组杂交技术(comparative genomic hybridization, CGH)主要用于检测肿瘤的染色体缺失和扩嘱, 迄今已积累了大量的实验数据, 为全基因组分析肿瘤的发生机制提供了可能。树模型在生物信息学领域通常被用于研究生物形成和进化的历史, 物种之间的进化关系常以系统发生树来表示。树模型同样可以作为一种有力的生物信息学工具来分析CGH数据, 探索癌症的发病机理。文中介绍了两种常见的树模型—— 分支树和距离树, 详细叙述了重建树模型的基本原理和方法, 分析了创建树模型时要注意的几个技术问题, 并对其在肿瘤研究中的应用进行了回顾和总结。肿瘤的树状模型作为单路径线性模型的泛化, 克服了以往单路径线性模型的缺点, 理论上能更加精确地概括到肿脉的多基因、多路径、多阶段的发生发展模式, 从不同角度探讨肿瘤发生发展的分子机制。该模型除可用于分析肿瘤的CGH数据外, 还可用于分析其他多种类型的数据, 包括微阵列CGH(array-CGH)技术等产生的高分辨率数据。  相似文献   

14.
In analysis of bioinformatics data, a unique challenge arises from the high dimensionality of measurements. Without loss of generality, we use genomic study with gene expression measurements as a representative example but note that analysis techniques discussed in this article are also applicable to other types of bioinformatics studies. Principal component analysis (PCA) is a classic dimension reduction approach. It constructs linear combinations of gene expressions, called principal components (PCs). The PCs are orthogonal to each other, can effectively explain variation of gene expressions, and may have a much lower dimensionality. PCA is computationally simple and can be realized using many existing software packages. This article consists of the following parts. First, we review the standard PCA technique and their applications in bioinformatics data analysis. Second, we describe recent 'non-standard' applications of PCA, including accommodating interactions among genes, pathways and network modules and conducting PCA with estimating equations as opposed to gene expressions. Third, we introduce several recently proposed PCA-based techniques, including the supervised PCA, sparse PCA and functional PCA. The supervised PCA and sparse PCA have been shown to have better empirical performance than the standard PCA. The functional PCA can analyze time-course gene expression data. Last, we raise the awareness of several critical but unsolved problems related to PCA. The goal of this article is to make bioinformatics researchers aware of the PCA technique and more importantly its most recent development, so that this simple yet effective dimension reduction technique can be better employed in bioinformatics data analysis.  相似文献   

15.
The intermediary steps between a biological hypothesis, concretized in the input data, and meaningful results, validated using biological experiments, commonly employ bioinformatics tools. Starting with storage of the data and ending with a statistical analysis of the significance of the results, every step in a bioinformatics analysis has been intensively studied and the resulting methods and models patented. This review summarizes the bioinformatics patents that have been developed mainly for the study of genes, and points out the universal applicability of bioinformatics methods to other related studies such as RNA interference. More specifically, we overview the steps undertaken in the majority of bioinformatics analyses, highlighting, for each, various approaches that have been developed to reveal details from different perspectives. First we consider data warehousing, the first task that has to be performed efficiently, optimizing the structure of the database, in order to facilitate both the subsequent steps and the retrieval of information. Next, we review data mining, which occupies the central part of most bioinformatics analyses, presenting patents concerning differential expression, unsupervised and supervised learning. Last, we discuss how networks of interactions of genes or other players in the cell may be created, which help draw biological conclusions and have been described in several patents.  相似文献   

16.
Over the past few decades, major advances in the field of molecular biology, coupled with advances in genomic technologies, have led to an explosive growth in the biological data generated by the scientific community. The critical need to process and analyze such a deluge of data and turn it into useful knowledge has caused bioinformatics to gain prominence and importance. Bioinformatics is an interdisciplinary research area that applies techniques, methodologies, and tools in computer and information science to solve biological problems. In Nigeria, bioinformatics has recently played a vital role in the advancement of biological sciences. As a developing country, the importance of bioinformatics is rapidly gaining acceptance, and bioinformatics groups comprised of biologists, computer scientists, and computer engineers are being constituted at Nigerian universities and research institutes. In this article, we present an overview of bioinformatics education and research in Nigeria. We also discuss professional societies and academic and research institutions that play central roles in advancing the discipline in Nigeria. Finally, we propose strategies that can bolster bioinformatics education and support from policy makers in Nigeria, with potential positive implications for other developing countries.  相似文献   

17.
Bioinformatics is a central discipline in modern life sciences aimed at describing the complex properties of living organisms starting from large-scale data sets of cellular constituents such as genes and proteins. In order for this wealth of information to provide useful biological knowledge, databases and software tools for data collection, analysis and interpretation need to be developed. In this paper, we review recent advances in the design and implementation of bioinformatics resources devoted to the study of metals in biological systems, a research field traditionally at the heart of bioinorganic chemistry. We show how metalloproteomes can be extracted from genome sequences, how structural properties can be related to function, how databases can be implemented, and how hints on interactions can be obtained from bioinformatics.  相似文献   

18.
19.
Computer science and biology have enjoyed a long and fruitful relationship for decades. Biologists rely on computational methods to analyze and integrate large data sets, while several computational methods were inspired by the high‐level design principles of biological systems. Recently, these two directions have been converging. In this review, we argue that thinking computationally about biological processes may lead to more accurate models, which in turn can be used to improve the design of algorithms. We discuss the similar mechanisms and requirements shared by computational and biological processes and then present several recent studies that apply this joint analysis strategy to problems related to coordination, network analysis, and tracking and vision. We also discuss additional biological processes that can be studied in a similar manner and link them to potential computational problems. With the rapid accumulation of data detailing the inner workings of biological systems, we expect this direction of coupling biological and computational studies to greatly expand in the future.  相似文献   

20.
Whereas genomics describes the study of genome, mainly represented by its gene expression on the DNA or RNA level, the term proteomics denotes the study of the proteome, which is the protein complement encoded by the genome. In recent years, the number of proteomic experiments increased tremendously. While all fields of proteomics have made major technological advances, the biggest step was seen in bioinformatics. Biological information management relies on sequence and structure databases and powerful software tools to translate experimental results into meaningful biological hypotheses and answers. In this resource article, I provide a collection of databases and software available on the Internet that are useful to interpret genomic and proteomic data. The article is a toolbox for researchers who have genomic or proteomic datasets and need to put their findings into a biological context.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号