
1.

Extracting biologically significant patterns from short time series gene expression data

Alain B Tchagang Kevin V Bui Thomas McGinnis Panayiotis V Benos 《BMC bioinformatics》2009,10(1):255

Background

Time series gene expression data analysis is used widely to study the dynamics of various cell processes. Most of the time series data available today consist of few time points only, thus making the application of standard clustering techniques difficult.

2.

Estimating woody and herbaceous vegetation cover from time series satellite observations

Michael L. Roderick Ian R. Noble Shane W. Cridland 《Global Ecology and Biogeography》1999,8(6):501-508

In this paper we test a method to estimate the tree and grass vegetation cover over Australia from satellite-derived normalized difference vegetation index (NDVI) time series (monthly 1981–91, ≈5 km pixels) observations. The evergreen cover is assumed to track along the base of the NDVI time series, which is assumed to be equivalent to the woody vegetation cover. The base of the NDVI time series is estimated using modifications to a classical econometric model (i.e. time series is the sum of trend, seasonal and random components). Estimates of the average evergreen component during 1982–85 and 1986–89 were generally consistent with known vegetation distributions. Changes in evergreen cover were largely restricted to the south-west and south-east of Australia. Those changes were largely the result of differences in rainfall between the two periods. The proposed method for estimating woody vegetation cover is found to be generally robust. However, there are some regions where the grass (or pasture) is mostly evergreen. Some possible refinements are proposed to handle such cases.
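The classical additive model mentioned in this abstract (series = trend + seasonal + random) can be sketched as follows. This is only an illustration of the textbook decomposition with a centered moving average; it is not the authors' modified procedure for estimating the base of the NDVI series. The function name `decompose_additive` and the synthetic monthly series are assumptions made for the example.

```python
from statistics import mean

def decompose_additive(series, period=12):
    """Split a series into trend, seasonal and random parts (additive model).

    Needs at least two full cycles so every position in the cycle
    has at least one detrended value.
    """
    n = len(series)
    half = period // 2
    trend = [None] * n
    # Centered moving average; for an even period the two end points
    # of the window get half weight so the total weight equals `period`.
    for i in range(half, n - half):
        window = series[i - half:i + half + 1]
        if period % 2 == 0:
            total = 0.5 * window[0] + sum(window[1:-1]) + 0.5 * window[-1]
        else:
            total = sum(window)
        trend[i] = total / period
    # Average the detrended values for each position in the cycle.
    detrended = [[] for _ in range(period)]
    for i in range(n):
        if trend[i] is not None:
            detrended[i % period].append(series[i] - trend[i])
    seasonal_means = [mean(v) for v in detrended]
    # Centre the seasonal component so it averages to zero over one cycle.
    offset = mean(seasonal_means)
    seasonal = [seasonal_means[i % period] - offset for i in range(n)]
    # Whatever is left after removing trend and seasonality.
    random_part = [
        None if trend[i] is None else series[i] - trend[i] - seasonal[i]
        for i in range(n)
    ]
    return trend, seasonal, random_part

# Hypothetical monthly data: linear trend plus a zero-mean 12-month cycle.
pattern = [0, 1, 2, 3, 2, 1, 0, -1, -2, -3, -2, -1]
series = [10 + 0.5 * i + pattern[i % 12] for i in range(48)]
trend, seasonal, random_part = decompose_additive(series, period=12)
```

The trend is undefined (`None`) for the first and last half-cycle because the centered window does not fit there.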

3.

Eco-climatic image segmentation based on time series

Lhermitte S Verbesselt J Jonckheere I Van Aardt J Coppin P 《Communications in agricultural and applied biological sciences》2005,70(2):165-168

4.

Discovering gene expression patterns in time course microarray experiments by ANOVA-SCA Total citations: 2, self-citations: 0, citations by others: 2

Nueda MJ Conesa A Westerhuis JA Hoefsloot HC Smilde AK Talón M Ferrer A 《Bioinformatics (Oxford, England)》2007,23(14):1792-1800

5.

Discovering local patterns of co-evolution: computational aspects and biological examples

Tamir Tuller Yifat Felder Martin Kupiec 《BMC bioinformatics》2010,11(1):1-19

Background

Co-evolution is the process in which two (or more) sets of orthologs exhibit a similar or correlative pattern of evolution. Co-evolution is a powerful way to learn about the functional interdependencies between sets of genes and cellular functions and to predict physical interactions. More generally, it can be used for answering fundamental questions about the evolution of biological systems. Orthologs that exhibit a strong signal of co-evolution in a certain part of the evolutionary tree may show a mild signal of co-evolution in other branches of the tree. The major reasons for this phenomenon are noise in the biological input, genes that gain or lose functions, and the fact that some measures of co-evolution relate to rare events such as positive selection. Previous publications in the field dealt with the problem of finding sets of genes that co-evolved along an entire underlying phylogenetic tree, without considering the fact that often co-evolution is local.

Results

In this work, we describe a new set of biological problems that are related to finding patterns of local co-evolution. We discuss their computational complexity and design algorithms for solving them. These algorithms outperform other bi-clustering methods as they are designed specifically for solving the set of problems mentioned above. We use our approach to trace the co-evolution of fungal, eukaryotic, and mammalian genes at high resolution across the different parts of the corresponding phylogenetic trees. Specifically, we discover regions in the fungi tree that are enriched with positive evolution. We show that metabolic genes exhibit a remarkable level of co-evolution and different patterns of co-evolution in various biological datasets. In addition, we find that protein complexes that are related to gene expression exhibit non-homogenous levels of co-evolution across different parts of the fungi evolutionary line. In the case of mammalian evolution, signaling pathways that are related to neurotransmission exhibit a relatively higher level of co-evolution along the primate subtree.

Conclusions

We show that finding local patterns of co-evolution is a computationally challenging task and we offer novel algorithms that allow us to solve this problem, thus opening a new approach for analyzing the evolution of biological systems.

6.

Discovering patterns to extract protein-protein interactions from full texts Total citations: 5, self-citations: 0, citations by others: 5

Huang M Zhu X Hao Y Payan DG Qu K Li M 《Bioinformatics (Oxford, England)》2004,20(18):3604-3612

MOTIVATION: Although there are several databases storing protein-protein interactions, most such data still exist only in the scientific literature. They are scattered in scientific literature written in natural languages, defying data mining efforts. Much time and labor have to be spent on extracting protein pathways from literature. Our aim is to develop a robust and powerful methodology to mine protein-protein interactions from biomedical texts. RESULTS: We present a novel and robust approach for extracting protein-protein interactions from literature. Our method uses a dynamic programming algorithm to compute distinguishing patterns by aligning relevant sentences and key verbs that describe protein interactions. A matching algorithm is designed to extract the interactions between proteins. Equipped only with a dictionary of protein names, our system achieves a recall rate of 80.0% and precision rate of 80.5%. AVAILABILITY: The program is available on request from the authors.

7.

Discovering statistically significant biclusters in gene expression data Total citations: 1, self-citations: 0, citations by others: 1

Tanay A Sharan R Shamir R 《Bioinformatics (Oxford, England)》2002,18(Z1):S136-S144

In gene expression data, a bicluster is a subset of the genes exhibiting consistent patterns over a subset of the conditions. We propose a new method to detect significant biclusters in large expression datasets. Our approach is graph theoretic coupled with statistical modelling of the data. Under plausible assumptions, our algorithm is polynomial and is guaranteed to find the most significant biclusters. We tested our method on a collection of yeast expression profiles and on a human cancer dataset. Cross validation results show high specificity in assigning function to genes based on their biclusters, and we are able to annotate in this way 196 uncharacterized yeast genes. We also demonstrate how the biclusters lead to detecting new concrete biological associations. In cancer data we are able to detect and relate finer tissue types than was previously possible. We also show that the method outperforms the biclustering algorithm of Cheng and Church (2000).

8.

Concerted evolution of satellite DNA in Sarcocapnos: a matter of time

Pérez-Gutiérrez MA Suárez-Santiago VN López-Flores I Romero AT Garrido-Ramos MA 《Plant molecular biology》2012,78(1-2):19-29

SarkOne is a genus-specific satellite-DNA family, isolated from the genomes of the species of the genus Sarcocapnos. This satellite DNA is composed of repeats with a consensus length of 855 bp and a mean G+C content of 52.5%. We have sequenced a total of 189 SarkOne monomeric repeats belonging to a total of seven species of the genus Sarcocapnos. The comparative analysis of these sequences at both the intraspecific and the interspecific levels has revealed that divergence patterns between species are proportional to between-species divergence according to the phylogeny of the genus. Our study demonstrates that the molecular drive leading to the concerted-evolution pattern of this satellite DNA is a time-dependent process by which new mutations are spreading through genomes and populations at a gradual pace. However, time is a limiting factor in the observation of concerted evolution in some pairwise comparisons. Thus, pairwise comparisons of species sharing a recent common ancestor did not reveal nucleotide sites in transitional stages higher than stage III according to Strachan's model. By contrast, there was a gradation in the percentage of upper transition stages (IV, V, VI) the more phylogenetically distant the species were. In addition, closely related species shared a high number of polymorphic sites, but these types of sites were not common when comparing more distant species. All these data are discussed in the light of current life-cycle models of satellite-DNA evolution.

9.

A method for analyzing temporal patterns of variability of a time series from Poincare plots

Fishman M Jacono FJ Park S Jamasebi R Thungtong A Loparo KA Dick TE 《Journal of applied physiology (Bethesda, Md. : 1985)》2012,113(2):297-306

The Poincaré plot is a popular two-dimensional, time series analysis tool because of its intuitive display of dynamic system behavior. Poincaré plots have been used to visualize heart rate and respiratory pattern variabilities. However, conventional quantitative analysis relies primarily on statistical measurements of the cumulative distribution of points, making it difficult to interpret irregular or complex plots. Moreover, the plots are constructed to reflect highly correlated regions of the time series, reducing the amount of nonlinear information that is presented and thereby hiding potentially relevant features. We propose temporal Poincaré variability (TPV), a novel analysis methodology that uses standard techniques to quantify the temporal distribution of points and to detect nonlinear sources responsible for physiological variability. In addition, the analysis is applied across multiple time delays, yielding a richer insight into system dynamics than the traditional circle return plot. The method is applied to data sets of R-R intervals and to synthetic point process data extracted from the Lorenz time series. The results demonstrate that TPV complements the traditional analysis and can be applied more generally, including Poincaré plots with multiple clusters, and more consistently than the conventional measures and can address questions regarding potential structure underlying the variability of a data set.
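For readers unfamiliar with the construction, a Poincaré plot pairs each value of a series with the value `delay` steps later. The sketch below builds those points and computes the conventional SD1/SD2 dispersion descriptors, which are assumed here as a typical example of the "conventional" statistical measures; it does not implement the TPV method proposed in the abstract. The function names and the toy data are hypothetical.

```python
from math import sqrt
from statistics import pstdev

def poincare_points(x, delay=1):
    """Return the (x[n], x[n + delay]) pairs that make up the plot; delay >= 1."""
    return list(zip(x[:-delay], x[delay:]))

def sd1_sd2(x, delay=1):
    """Dispersion across (SD1) and along (SD2) the line of identity."""
    pts = poincare_points(x, delay)
    # Rotating the point cloud by 45 degrees turns the two spreads
    # into the standard deviations of the scaled difference and sum.
    sd1 = pstdev([(b - a) / sqrt(2) for a, b in pts])
    sd2 = pstdev([(b + a) / sqrt(2) for a, b in pts])
    return sd1, sd2

# Toy series: a steady ramp has no spread across the identity line.
ramp = [1, 2, 3, 4, 5]
points = poincare_points(ramp, delay=1)
sd1, sd2 = sd1_sd2(ramp, delay=1)
```

Calling `sd1_sd2` with several values of `delay` gives one pair of descriptors per lag, which is the simplest way to look at a series across multiple time delays.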

10.

Discovering well-ordered folding patterns in nucleotide sequences Total citations: 3, self-citations: 0, citations by others: 3

Le SY Chen JH Konings D Maizel JV 《Bioinformatics (Oxford, England)》2003,19(3):354-361

11.

Discovering approximate-associated sequence patterns for protein-DNA interactions

Chan TM Wong KC Lee KH Wong MH Lau CK Tsui SK Leung KS 《Bioinformatics (Oxford, England)》2011,27(4):471-478

12.

Discovering patterns to extract protein-protein interactions from the literature: Part II

Hao Y Zhu X Huang M Li M 《Bioinformatics (Oxford, England)》2005,21(15):3294-3300

13.

A polynomial time biclustering algorithm for finding approximate expression patterns in gene expression time series

Sara C Madeira Arlindo L Oliveira 《Algorithms for molecular biology : AMB》2009,4(1):8-39

Background

The ability to monitor the change in expression patterns over time, and to observe the emergence of coherent temporal responses using gene expression time series, obtained from microarray experiments, is critical to advance our understanding of complex biological processes. In this context, biclustering algorithms have been recognized as an important tool for the discovery of local expression patterns, which are crucial to unravel potential regulatory mechanisms. Although most formulations of the biclustering problem are NP-hard, when working with time series expression data the interesting biclusters can be restricted to those with contiguous columns. This restriction leads to a tractable problem and enables the design of efficient biclustering algorithms able to identify all maximal contiguous column coherent biclusters.

14.

On phase-lag estimation from non-Gaussian time series

WALDEN A. T. 《Biometrika》1988,75(4):785-787

15.

Ecogeographic patterns of rabies in southern Ontario based on time series analysis

Tinline RR MacInnes CD 《Journal of wildlife diseases》2004,40(2):212-221

We describe a method based on time series analysis that divided the rabies enzootic area of southern Ontario into 13 regions using data collected at the township level, the smallest available geographical unit for Ontario (Canada). The intent was to discover ecogeographic patterns if such existed. For the period 1957-89, the quarterly time series of fox rabies cases for each of the 423 townships in the study area was correlated with the time series of its adjacent neighbors. Townships were then linked to adjacent townships provided the pair-wise correlations had significant correlation coefficients. This procedure produced 13 clusters that remained stable when additional lead/lag relationships between townships were examined. Furthermore, those clusters, which we then termed "rabies units," had different behaviors in terms of species distribution, persistence, and periodicity. Time series in adjacent units were not synchronous. We discuss how our findings influenced the rabies control program in Ontario, how they relate to recent findings about the distribution of fox rabies virus subtypes, and how they lend support for the role of metapopulation structure in persistence of disease.
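The linking step described in this abstract (correlate each township's series with its neighbors, join pairs whose correlation is significant, read off the resulting clusters) can be sketched roughly as below. A fixed correlation `threshold` stands in for the significance test, which the abstract does not specify; the union-find grouping, the function names, and the four toy regions are likewise assumptions for illustration.

```python
from statistics import mean

def pearson(a, b):
    """Pearson correlation of two equal-length series (0.0 if either is constant)."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / (var_a * var_b) ** 0.5

def link_regions(series, neighbors, threshold=0.5):
    """Group regions connected by adjacent pairs with correlation >= threshold."""
    parent = {name: name for name in series}

    def find(u):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[u] != u:
            parent[u] = parent[parent[u]]
            u = parent[u]
        return u

    for a, b in neighbors:
        if pearson(series[a], series[b]) >= threshold:
            parent[find(a)] = find(b)  # merge the two clusters

    clusters = {}
    for name in series:
        clusters.setdefault(find(name), set()).add(name)
    return sorted(sorted(c) for c in clusters.values())

# Hypothetical case counts for four regions arranged in a line A-B-C-D.
series = {
    "A": [1, 2, 3, 4],
    "B": [2, 4, 6, 8],
    "C": [4, 3, 2, 1],
    "D": [8, 6, 4, 2],
}
neighbors = [("A", "B"), ("B", "C"), ("C", "D")]
clusters = link_regions(series, neighbors)
```

Only adjacent pairs are tested, so two regions with similar series but no chain of correlated neighbors between them stay in separate clusters.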

16.

Discovering numerical laws of plant microRNA by evolution

Zhu R Li X Chen Q 《Biochemical and biophysical research communications》2011,(2):313-318

17.

Discovering functional interaction patterns in protein-protein interaction networks

Mehmet E Turanalp Tolga Can 《BMC bioinformatics》2008,9(1):276

Background

In recent years, a considerable amount of research effort has been directed to the analysis of biological networks with the availability of genome-scale networks of genes and/or proteins of an increasing number of organisms. A protein-protein interaction (PPI) network is a particular biological network which represents physical interactions between pairs of proteins of an organism. Major research on PPI networks has focused on understanding the topological organization of PPI networks, evolution of PPI networks and identification of conserved subnetworks across different species, discovery of modules of interaction, use of PPI networks for functional annotation of uncharacterized proteins, and improvement of the accuracy of currently available networks.

18.

Extended causal modeling to assess Partial Directed Coherence in multiple time series with significant instantaneous interactions

Faes L Nollo G 《Biological cybernetics》2010,103(5):387-400

The Partial Directed Coherence (PDC) and its generalized formulation (gPDC) are popular tools for investigating, in the frequency domain, the concept of Granger causality among multivariate (MV) time series. PDC and gPDC are formalized in terms of the coefficients of an MV autoregressive (MVAR) model which describes only the lagged effects among the time series and forsakes instantaneous effects. However, instantaneous effects are known to affect linear parametric modeling, and are likely to occur in experimental time series. In this study, we investigate the impact on the assessment of frequency domain causality of excluding instantaneous effects from the model underlying PDC evaluation. Moreover, we propose the utilization of an extended MVAR model including both instantaneous and lagged effects. This model is used to assess PDC either in accordance with the definition of Granger causality when considering only lagged effects (iPDC), or with an extended form of causality, when we consider both instantaneous and lagged effects (ePDC). The approach is first evaluated on three theoretical examples of MVAR processes, which show that the presence of instantaneous correlations may produce misleading profiles of PDC and gPDC, while ePDC and iPDC derived from the extended model provide here a correct interpretation of extended and lagged causality. It is then applied to representative examples of cardiorespiratory and EEG MV time series. They suggest that ePDC and iPDC are better interpretable than PDC and gPDC in terms of the known cardiovascular and neural physiologies.

19.

Poisson patterns in behavioural time series: The perception of randomness in complexity

Thomas Getty 《Animal behaviour》1981,29(3):960-961

20.

Detecting periodic patterns in unevenly spaced gene expression time series using Lomb-Scargle periodograms Total citations: 3, self-citations: 0, citations by others: 3

Glynn EF Chen J Mushegian AR 《Bioinformatics (Oxford, England)》2006,22(3):310-316

MOTIVATION: Periodic patterns in time series resulting from biological experiments are of great interest. The commonly used Fast Fourier Transform (FFT) algorithm is applicable only when data are evenly spaced and when no values are missing, which is not always the case in high-throughput measurements. The choice of statistic to evaluate the significance of the periodic patterns for unevenly spaced gene expression time series has not been well substantiated. METHODS: The Lomb-Scargle periodogram approach is used to search time series of gene expression to quantify the periodic behavior of every gene represented on the DNA array. The Lomb-Scargle periodogram analysis provides a direct method to treat missing values and unevenly spaced time points. We propose the combination of a Lomb-Scargle test statistic for periodicity and a multiple hypothesis testing procedure with controlled false discovery rate to detect significant periodic gene expression patterns. RESULTS: We analyzed the Plasmodium falciparum gene expression dataset. In the Quality Control Dataset of 5080 expression patterns, we found 4112 periodic probes. In addition, we identified 243 probes with periodic expression in the Complete Dataset, which could not be examined in the original study by the FFT analysis due to an excessive number of missing values. While most periodic genes had a period of 48 h, some had a period close to 24 h. Our approach should be applicable for detection and quantification of periodic patterns in any unevenly spaced gene expression time-series data.
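As a rough illustration of why the Lomb-Scargle periodogram suits unevenly spaced data, the sketch below evaluates the classical normalized periodogram directly at the sampled time points, so missing values can simply be left out. It covers only the periodogram itself and omits the significance test and false discovery rate control described in the abstract. The function name, the sampling times, and the candidate periods are assumptions chosen for the example.

```python
from math import atan2, cos, pi, sin
from statistics import mean, variance

def lomb_scargle_power(times, values, period):
    """Classical normalized Lomb-Scargle power at one trial period."""
    omega = 2 * pi / period
    y_mean = mean(values)
    var = variance(values)  # sample variance, N - 1 in the denominator
    # The time offset tau makes the sine and cosine terms orthogonal
    # on the actual (uneven) sampling times.
    tau = atan2(
        sum(sin(2 * omega * t) for t in times),
        sum(cos(2 * omega * t) for t in times),
    ) / (2 * omega)
    c = [cos(omega * (t - tau)) for t in times]
    s = [sin(omega * (t - tau)) for t in times]
    cos_term = sum((y - y_mean) * ci for y, ci in zip(values, c)) ** 2 / sum(ci * ci for ci in c)
    sin_term = sum((y - y_mean) * si for y, si in zip(values, s)) ** 2 / sum(si * si for si in s)
    return (cos_term + sin_term) / (2 * var)

# Hypothetical unevenly spaced sampling times (hours) of a 48 h rhythm.
times = [0, 2, 3, 7, 8, 12, 15, 16, 21, 23, 26, 30, 31, 35, 38, 42, 45,
         47, 50, 54, 55, 59, 62, 66, 69, 70, 75, 78, 81, 85, 88, 92, 95]
values = [sin(2 * pi * t / 48) for t in times]

candidate_periods = [12, 24, 48, 96]
powers = {p: lomb_scargle_power(times, values, p) for p in candidate_periods}
best_period = max(powers, key=powers.get)
```

In practice one would scan a dense grid of trial periods rather than four candidates, and a library routine such as SciPy's `scipy.signal.lombscargle` may be preferable to this plain-Python loop.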