期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Maximally efficient modeling of DNA sequence motifs at all levels of complexity

Stormo GD 《Genetics》2011,187(4):1219-1224

相似文献

2.

Statistical significance versus fit: estimating the importance of individual factors in ecological analysis of variance 总被引：3，自引：0，他引：3

Michael H. Graham Matthew S. Edwards 《Oikos》2001,93(3):505-513

相似文献

3.

Does your species have memory? Analyzing capture–recapture data with memory models

下载免费PDF全文

Diana J. Cole Byron J. T. Morgan Rachel S. McCrea Roger Pradel Olivier Gimenez Remi Choquet 《Ecology and evolution》2014,4(11):2124-2133

We examine memory models for multisite capture–recapture data. This is an important topic, as animals may exhibit behavior that is more complex than simple first‐order Markov movement between sites, when it is necessary to devise and fit appropriate models to data. We consider the Arnason–Schwarz model for multisite capture–recapture data, which incorporates just first‐order Markov movement, and also two alternative models that allow for memory, the Brownie model and the Pradel model. We use simulation to compare two alternative tests which may be undertaken to determine whether models for multisite capture–recapture data need to incorporate memory. Increasing the complexity of models runs the risk of introducing parameters that cannot be estimated, irrespective of how much data are collected, a feature which is known as parameter redundancy. Rouan et al. (JABES, 2009, pp 338–355) suggest a constraint that may be applied to overcome parameter redundancy when it is present in multisite memory models. For this case, we apply symbolic methods to derive a simpler constraint, which allows more parameters to be estimated, and give general results not limited to a particular configuration. We also consider the effect sparse data can have on parameter redundancy and recommend minimum sample sizes. Memory models for multisite capture–recapture data can be highly complex and difficult to fit to data. We emphasize the importance of a structured approach to modeling such data, by considering a priori which parameters can be estimated, which constraints are needed in order for estimation to take place, and how much data need to be collected. We also give guidance on the amount of data needed to use two alternative families of tests for whether models for multisite capture–recapture data need to incorporate memory. 相似文献

4.

The gambin model provides a superior fit to species abundance distributions with a single free parameter: evidence,implementation and interpretation

Thomas J. Matthews Michael K. Borregaard Karl I. Ugland Paulo A. V. Borges François Rigal Pedro Cardoso Robert J. Whittaker 《Ecography》2014,37(10):1002-1011

The species abundance distribution (SAD) has been a central focus of community ecology for over fifty years, and is currently the subject of widespread renewed interest. The gambin model has recently been proposed as a model that provides a superior fit to commonly preferred SAD models. It has also been argued that the model's single parameter (α) presents a potentially informative ecological diversity metric, because it summarises the shape of the SAD in a single number. Despite this potential, few empirical tests of the model have been undertaken, perhaps because the necessary methods and software for fitting the model have not existed. Here, we derive a maximum likelihood method to fit the model, and use it to undertake a comprehensive comparative analysis of the fit of the gambin model. The functions and computational code to fit the model are incorporated in a newly developed free‐to‐download R package (gambin). We test the gambin model using a variety of datasets and compare the fit of the gambin model to fits obtained using the Poisson lognormal, logseries and zero‐sum multinomial distributions. We found that gambin almost universally provided a better fit to the data and that the fit was consistent for a variety of sample grain sizes. We demonstrate how α can be used to differentiate intelligibly between community structures of Azorean arthropods sampled in different land use types. We conclude that gambin presents a flexible model capable of fitting a wide variety of observed SAD data, while providing a useful index of SAD form in its single fitted parameter. As such, gambin has wide potential applicability in the study of SADs, and ecology more generally. 相似文献

5.

Detection of autoimmunity by pharmaceuticals

Pieters R 《Methods (San Diego, Calif.)》2007,41(1):112-117

Despite the important health and economic impact of autoimmunogenicity or allergenicity by pharmaceuticals models to detect such adverse effects are not available yet. The most important reason for this is the related complex interplay of multiple factors, for which reason these adverse effects are also referred to as idiosyncratic in nature. Moreover, clinical effects are quite diverse, and involve both organ-specific and systemic effects, including a diversity of skin diseases. Because of its complexity on the one hand and the fundamental knowledge on certain particular mechanistic effects it may be more relevant to design a rationalistic toolbox of test models from which a predictive strategy can be composed. Since one mechanistic aspect centers around T cell sensitization a straightforward lymph node assay such as the reporter antigen-popliteal lymph node assay (RA-PLNA) would fit in such a toolbox. This RA-PLNA holds a strong promise to distinguish sensitizing and/or neoantigen-forming capacity of low molecular weight pharmaceuticals. In addition, from the pharmacokinetic point of view a rationalistic toolbox should also contain oral exposure models with immunological read out parameters in normal or in genetically predisposed animal strains. This review focuses on these two categories of candidate test methods, PLNA and oral exposure models, and proposes to use these in tandem in order to predict the hazard of induction of allergy or autoimmune phenomena by new pharmaceutical candidates. 相似文献

6.

Rapid Global Fitting of Large Fluorescence Lifetime Imaging Microscopy Datasets

Sean C. Warren Anca Margineanu Dominic Alibhai Douglas J. Kelly Clifford Talbot Yuriy Alexandrov Ian Munro Matilda Katan Chris Dunsby Paul M. W. French 《PloS one》2013,8(8)

Fluorescence lifetime imaging (FLIM) is widely applied to obtain quantitative information from fluorescence signals, particularly using Förster Resonant Energy Transfer (FRET) measurements to map, for example, protein-protein interactions. Extracting FRET efficiencies or population fractions typically entails fitting data to complex fluorescence decay models but such experiments are frequently photon constrained, particularly for live cell or in vivo imaging, and this leads to unacceptable errors when analysing data on a pixel-wise basis. Lifetimes and population fractions may, however, be more robustly extracted using global analysis to simultaneously fit the fluorescence decay data of all pixels in an image or dataset to a multi-exponential model under the assumption that the lifetime components are invariant across the image (dataset). This approach is often considered to be prohibitively slow and/or computationally expensive but we present here a computationally efficient global analysis algorithm for the analysis of time-correlated single photon counting (TCSPC) or time-gated FLIM data based on variable projection. It makes efficient use of both computer processor and memory resources, requiring less than a minute to analyse time series and multiwell plate datasets with hundreds of FLIM images on standard personal computers. This lifetime analysis takes account of repetitive excitation, including fluorescence photons excited by earlier pulses contributing to the fit, and is able to accommodate time-varying backgrounds and instrument response functions. We demonstrate that this global approach allows us to readily fit time-resolved fluorescence data to complex models including a four-exponential model of a FRET system, for which the FRET efficiencies of the two species of a bi-exponential donor are linked, and polarisation-resolved lifetime data, where a fluorescence intensity and bi-exponential anisotropy decay model is applied to the analysis of live cell homo-FRET data. A software package implementing this algorithm, FLIMfit, is available under an open source licence through the Open Microscopy Environment. 相似文献

7.

An empirical comparison of models for the phenology of bird migration

下载免费PDF全文

Andreas Lindén Kalle Meller Jonas Knape 《Journal of avian biology》2017,48(2):255-265

Bird migration phenology shows strong responses to climate change. Studies of trends and patterns in phenology are typically based on annual summarizing metrics, such as means and quantiles calculated from raw daily count data. However, with irregularly sampled data and large day‐to‐day variation, such metrics can be biased and noisy, and may be analysed using phenological functions fitted to the data. Here we use count data of migration passage from a Finnish bird observatory to compare different models for the phenological distributions of spring migration (27 species) and autumn migration (57 species). We assess parsimony and goodness‐of‐fit in a set of models, with phenological functions of different complexity, optionally with covariates accounting for day‐to‐day variability. The covariates describe migration intensities of related species or relative migration intensities the previous day (autocovariates). We found that parametric models are often preferred over the more flexible generalized additive models with constrained degrees of freedom. Models corresponding to a mixture of two distinct passing populations were frequently preferred over simpler ones, but usually no more complex models are needed. Slightly more complex models were favoured in spring compared to autumn. Related species’ migration activity effectively improves the model by accounting for the large day‐to‐day variation. Autocovariates were usually not that relevant, implying that autocorrelation is generally not a major concern if phenology is modelled properly. We suggest that parametric models are relatively good for studying single‐population migration phenology, or a mix of two groups with distinct phenologies, especially if daily variation in migration intensity can be controlled for. Generalized additive models may be useful when the migrating population composition is unknown. Despite these guidelines, choosing an appropriate model involves case‐by‐case assessment or the biological relevance and rationale for modelling phenology. 相似文献

8.

Markov chain Monte Carlo fitting of single-channel data from inositol trisphosphate receptors

Elan Gin David I. Yule 《Journal of theoretical biology》2009,257(3):460-1929

In many cell types, the inositol trisphosphate receptor (IPR) is one of the important components that control intracellular calcium dynamics, and an understanding of this receptor (which is also a calcium channel) is necessary for an understanding of calcium oscillations and waves. Recent advances in experimental techniques now allow for the measurement of single-channel activity of the IPR in conditions similar to its native environment, and these data can be used to determine the rate constants in Markov models of the IPR. We illustrate a parameter estimation method based on Markov chain Monte Carlo, which can be used to fit directly to single-channel data, and determining, as an intrinsic part of the fit, the times at which the IPR is opening and closing. We show, using simulated data, the most complex Markov model that can be unambiguously determined from steady-state data and show that non-steady-state data is required to determine more complex models. 相似文献

9.

A Kinetic Model of the Inositol Trisphosphate Receptor Based on Single-Channel Data

Elan Gin Larry E. Wagner II James Sneyd 《Biophysical journal》2009,96(10):4053-4062

In many cell types, the inositol trisphosphate receptor is one of the important components controlling intracellular calcium dynamics, and an understanding of this receptor is necessary for an understanding of calcium oscillations and waves. Based on single-channel data from the type-I inositol trisphosphate receptor, and using a Markov chain Monte Carlo approach, we show that the most complex time-dependent model that can be unambiguously determined from steady-state data is one with three closed states and one open state, and we determine how the rate constants depend on calcium. Because the transitions between these states are complex functions of calcium concentration, each model state must correspond to a group of physical states. We fit two different topologies and find that both models predict that the main effect of [Ca²⁺] is to modulate the probability that the receptor is in a state that is able to open, rather than to modulate the transition rate to the open state. 相似文献

10.

Growth curve models of repeated binary response

E J Stanek S R Diehl 《Biometrics》1988,44(4):973-983

Experimental designs that include repeated measures of binary response variables over time and under different conditions are common in biology. In such settings, it is often desirable to characterize the response pattern over time. When response variables are continuous, this characterization can be made in terms of a growth model such as the Potthoff-Roy growth curve model. We illustrate how a similar growth curve modeling strategy can be implemented using weighted least squares (WLS) methods for binary response data. The growth models are constructed in terms of polynomial functions across marginal response. However, when growth models are fit to repeated binary response, the nonsignificant higher-order polynomial functions are dropped from the model, rather than used as covariates. Dropping the nonsignificant polynomials from the model will reduce the number of response functions, and help avoid small-sample problems that can occur when the number of correlated response functions is large and sample sizes are small. The reduced set of response functions are then modeled using WLS methods. We illustrate such models with an example of binary fly oviposition response (accept or reject) exhibited by two populations of flies at four ages to two types of fruit. 相似文献

11.

Phylodynamic Inference for Structured Epidemiological Models

David A. Rasmussen Erik M. Volz Katia Koelle 《PLoS computational biology》2014,10(4)

Coalescent theory is routinely used to estimate past population dynamics and demographic parameters from genealogies. While early work in coalescent theory only considered simple demographic models, advances in theory have allowed for increasingly complex demographic scenarios to be considered. The success of this approach has lead to coalescent-based inference methods being applied to populations with rapidly changing population dynamics, including pathogens like RNA viruses. However, fitting epidemiological models to genealogies via coalescent models remains a challenging task, because pathogen populations often exhibit complex, nonlinear dynamics and are structured by multiple factors. Moreover, it often becomes necessary to consider stochastic variation in population dynamics when fitting such complex models to real data. Using recently developed structured coalescent models that accommodate complex population dynamics and population structure, we develop a statistical framework for fitting stochastic epidemiological models to genealogies. By combining particle filtering methods with Bayesian Markov chain Monte Carlo methods, we are able to fit a wide class of stochastic, nonlinear epidemiological models with different forms of population structure to genealogies. We demonstrate our framework using two structured epidemiological models: a model with disease progression between multiple stages of infection and a two-population model reflecting spatial structure. We apply the multi-stage model to HIV genealogies and show that the proposed method can be used to estimate the stage-specific transmission rates and prevalence of HIV. Finally, using the two-population model we explore how much information about population structure is contained in genealogies and what sample sizes are necessary to reliably infer parameters like migration rates. 相似文献

12.

From pattern to process: identifying predator–prey models from time-series data

C. Jost R. Arditi 《Population Ecology》2001,43(3):229-243

Fitting nonlinear models to time-series is a technique of increasing importance in population ecology. In this article, we apply it to assess the importance of predator dependence in the predation process by comparing two alternative models of equal complexity (one with and one without predator dependence) to predator–prey time-series. Stochasticities in such data come from both observation error and process error. We consider how these errors must be taken into account in the fitting process, and we develop eight different model selection criteria. Applying these criteria to laboratory data on simple protozoan and arthropod predator–prey systems shows that little predator dependence is present, with one interesting exception. Field data are more ambiguous (either selection depends on the particular criterion or no significant differences can be detected), and we show that both models fit reasonably well. We conclude that, within our modeling framework, predator dependence is in general insignificant in simple systems in homogeneous environments. Relatively complex systems show significant predator dependence more often than simple ones but the data are also often inconclusive. The analysis of such systems should rely on several models to detect predictions that are sensitive to predator dependence and to direct further research if necessary. Received: July 13, 2000 / Accepted: September 25, 2001 相似文献

13.

Bayesian phylogenetic analysis of combined data

Nylander JA Ronquist F Huelsenbeck JP Nieves-Aldrey JL 《Systematic biology》2004,53(1):47-67

The recent development of Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC) techniques has facilitated the exploration of parameter-rich evolutionary models. At the same time, stochastic models have become more realistic (and complex) and have been extended to new types of data, such as morphology. Based on this foundation, we developed a Bayesian MCMC approach to the analysis of combined data sets and explored its utility in inferring relationships among gall wasps based on data from morphology and four genes (nuclear and mitochondrial, ribosomal and protein coding). Examined models range in complexity from those recognizing only a morphological and a molecular partition to those having complex substitution models with independent parameters for each gene. Bayesian MCMC analysis deals efficiently with complex models: convergence occurs faster and more predictably for complex models, mixing is adequate for all parameters even under very complex models, and the parameter update cycle is virtually unaffected by model partitioning across sites. Morphology contributed only 5% of the characters in the data set but nevertheless influenced the combined-data tree, supporting the utility of morphological data in multigene analyses. We used Bayesian criteria (Bayes factors) to show that process heterogeneity across data partitions is a significant model component, although not as important as among-site rate variation. More complex evolutionary models are associated with more topological uncertainty and less conflict between morphology and molecules. Bayes factors sometimes favor simpler models over considerably more parameter-rich models, but the best model overall is also the most complex and Bayes factors do not support exclusion of apparently weak parameters from this model. Thus, Bayes factors appear to be useful for selecting among complex models, but it is still unclear whether their use strikes a reasonable balance between model complexity and error in parameter estimates. 相似文献

14.

A Generalized Logistic Model for Quantal Response Bioassay

M. A. El-Saidi E. O. George 《Biometrical journal. Biometrische Zeitschrift》1990,32(8):943-954

In bioassay, where different levels of the stimulus may represent different doses of a drug, the binary response is the death or survival of an individual receiving a specified dose. In such applications, it is common to model the probability of a positive response P at the stimulus level x by P = F(x′β), where F is a cumulative distribution function and β is a vector of unknown parameters which characterize the response function. The two most popular models used for modelling binary response bioassay involve the probit model [BLISS (1935), FINNEY (1978)], and the logistic model [BERKSON (1944), BROWN (1982)]. However, these models have some limitations. The use of the probit model involves the inverse of the standard normal distribution function, making it rather intractable. The logistic model has a simple form and a closed expression for the inverse distribution function, however, neither the logistic nor the probit can provide a good fit to response functions which are not symmetric or are symmetric but have a steeper or gentler incline in the central probability region. In this paper we introduce a more realistic model for the analysis of quantal response bioassay. The proposed model, which we refer to it as the generalized logistic model, is a family of response curves indexed by shape parameters m₁ and m₂. This family is rich enough to include the probit and logistic models as well as many others as special cases or limiting distributions. In particular, we consider the generalized logistic three parameter model where we assume that m₁ = m, m is a positive real number, and m₂ = 1. We apply this model to various sets of data, comparing the fit results to those obtained previously by other dose-response curves such as the logistic and probit, and showing that the fit can be improved by using the generalized logistic. 相似文献

15.

Reverse-Engineering Post-Transcriptional Regulation of Gap Genes in Drosophila melanogaster

Kolja Becker Eva Balsa-Canto Damjan Cicin-Sain Astrid Hoermann Hilde Janssens Julio R. Banga Johannes Jaeger 《PLoS computational biology》2013,9(10)

相似文献

16.

PyFolding: Open-Source Graphing,Simulation, and Analysis of the Biophysical Properties of Proteins

Alan R. Lowe Albert Perez-Riba Laura S. Itzhaki Ewan R.G. Main 《Biophysical journal》2018,114(3):516-521

相似文献

17.

Assessment and parameter identification of simplified models to describe the kinetics of semi-continuous biomethane production from anaerobic digestion of green and food waste

Raymond O. Owhondah Mark Walker Lin Ma Bill Nimmo Derek B. Ingham Davide Poggio Mohamed Pourkashanian 《Bioprocess and biosystems engineering》2016,39(6):977-992

Biochemical reactions occurring during anaerobic digestion have been modelled using reaction kinetic equations such as first-order, Contois and Monod which are then combined to form mechanistic models. This work considers models which include between one and three biochemical reactions to investigate if the choice of the reaction rate equation, complexity of the model structure as well as the inclusion of inhibition plays a key role in the ability of the model to describe the methane production from the semi-continuous anaerobic digestion of green waste (GW) and food waste (FW). A parameter estimation method was used to investigate the most important phenomena influencing the biogas production process. Experimental data were used to numerically estimate the model parameters and the quality of fit was quantified. Results obtained reveal that the model structure (i.e. number of reactions, inhibition) has a much stronger influence on the quality of fit compared with the choice of kinetic rate equations. In the case of GW there was only a marginal improvement when moving from a one to two reaction model, and none with inclusion of inhibition or three reactions. However, the behaviour of FW digestion was more complex and required either a two or three reaction model with inhibition functions for both ammonia and volatile fatty acids. Parameter values for the best fitting models are given for use by other authors. 相似文献

18.

Modelling Representative Life Cycle Inventories for Swiss Arable Crops (9 pp)

Thomas?Nemecek Email author Stefan?Erzinger Email author 《The International Journal of Life Cycle Assessment》2005,10(1):68-76

Goal, Scope and Background Life cycle inventories (LCIs) of agricultural products, infrastructure, inputs and processes are required to optimise food supply chains. In the past, the use of LCA was hindered by the limited availability of databases with LCIs for such agricultural inputs, processes and products in combination with LCIs of other major economic sectors. The ecoinvent database covers this need for the Swiss, and to an extent, the European context. A suitable approach had to be outlined for defining representative datasets for products from arable crops, since there was no comprehensive survey of agricultural production.Methods No single data source was available for defining representative datasets for arable crops. It was therefore decided to define model crops on the basis of a variety of sources in collaboration with experts on the crops in question. The datasets were validated by experts and by comparison with literature. Field emissions were calculated using a set of models taking into account situation-specific parameters. Data defined by this procedure are more generally usable, but their definition is also more laborious. Results and Discussion Selected results (inventories and impact assessment) are presented for infrastructure (buildings, machinery), work processes, fertilisers, pesticides, seed and arable crop products. Infrastructure has a higher share of environmental impacts than in typical industrial processes, often due to low utilisation rates. Energy use is dominated by mechanisation, the use of mineral fertilisers (particularly nitrogen) and grain drying. Eutrophication is caused mainly by nitrogen compounds. In general, field emissions are of decisive importance for many environmental impacts. Conclusion and Outlook The ecoinvent database provides representative agricultural data for the Swiss, and to an extent, the European context. It also provides the meta-information necessary for deciding whether a dataset is suitable for the purpose of a particular LCA study. To further improve the representativeness of the datasets, an environmental farm monitoring network is required. 相似文献

19.

Logistic regression for clustered data from environmental monitoring programs

《Ecological Informatics》2018

Large-scale surveys, such as national forest inventories and vegetation monitoring programs, usually have complex sampling designs that include geographical stratification and units organized in clusters. When models are developed using data from such programs, a key question is whether or not to utilize design information when analyzing the relationship between a response variable and a set of covariates. Standard statistical regression methods often fail to account for complex sampling designs, which may lead to severely biased estimators of model coefficients. Furthermore, ignoring that data are spatially correlated within clusters may underestimate the standard errors of regression coefficient estimates, with a risk for drawing wrong conclusions. We first review general approaches that account for complex sampling designs, e.g. methods using probability weighting, and stress the need to explore the effects of the sampling design when applying logistic regression models. We then use Monte Carlo simulation to compare the performance of the standard logistic regression model with two approaches to model correlated binary responses, i.e. cluster-specific and population-averaged logistic regression models. As an example, we analyze the occurrence of epiphytic hair lichens in the genus Bryoria; an indicator of forest ecosystem integrity. Based on data from the National Forest Inventory (NFI) for the period 1993–2014 we generated a data set on hair lichen occurrence on >100,000 Picea abies trees distributed throughout Sweden. The NFI data included ten covariates representing forest structure and climate variables potentially affecting lichen occurrence. Our analyses show the importance of taking complex sampling designs and correlated binary responses into account in logistic regression modeling to avoid the risk of obtaining notably biased parameter estimators and standard errors, and erroneous interpretations about factors affecting e.g. hair lichen occurrence. We recommend comparisons of unweighted and weighted logistic regression analyses as an essential step in development of models based on data from large-scale surveys. 相似文献

20.

Conformational Studies of Nucleic Acids: III. Empirical Multiple Correlation Functions for Nucleic Acid Torsion Angles

David A. Pearlman Sung-Hou Kim 《Journal of biomolecular structure & dynamics》2013,31(1):49-67

Abstract

There are seven significantly variable torsion angles in each monomer unit of a polynucleotide. Because of this, it is computationally infeasible to consider the energetics of all conformations available to a nucleic acid without the use of simplifications. In this paper, we develop functions suggested by and regression fit to crystallographic data which allow three of these torsion angles, α (03′-P-05′-C5′), δ (C5′-C4′-C3′-03′) and ε (C4′-C3′-03′-P), to be calculated as dependent variables of those remaining. Using these functions, the seven independent torsions are reduced to four, a reduction in complexity sufficient to allow an examination of the global conformational energetics of a nucleic acid for the remaining independent torsion angles. These functions are the first to quantitatively relate a dependent nucleic acid torsion angle to several different independent angles. In all three cases the data are fit reasonably well, and in one case, α, the fit is exceptionally good, lending support for the suitability of the functions in conformational searches. In addition, an examination of the most significant terms in each of the correlation functions allows insight into the physical basis for the correlations. 相似文献