共查询到20条相似文献,搜索用时 34 毫秒
1.
2.
Due to a problem of identification, how to estimate the distinct effects of age, time period and cohort has been a controversial issue in the analysis of trends in health outcomes in epidemiology. In this study, we propose a novel approach, partial least squares (PLS) analysis, to separate the effects of age, period, and cohort. Our example for illustration is taken from the Glasgow Alumni cohort. A total of 15,322 students (11,755 men and 3,567 women) received medical screening at the Glasgow University between 1948 and 1968. The aim is to investigate the secular trends in blood pressure over 1925 and 1950 while taking into account the year of examination and age at examination. We excluded students born before 1925 or aged over 25 years at examination and those with missing values in confounders from the analyses, resulting in 12,546 and 12,516 students for analysis of systolic and diastolic blood pressure, respectively. PLS analysis shows that both systolic and diastolic blood pressure increased with students' age, and students born later had on average lower blood pressure (SBP: -0.17 mmHg/per year [95% confidence intervals: -0.19 to -0.15] for men and -0.25 [-0.28 to -0.22] for women; DBP: -0.14 [-0.15 to -0.13] for men; -0.09 [-0.11 to -0.07] for women). PLS also shows a decreasing trend in blood pressure over the examination period. As identification is not a problem for PLS, it provides a flexible modelling strategy for age-period-cohort analysis. More emphasis is then required to clarify the substantive and conceptual issues surrounding the definitions and interpretations of age, period and cohort effects. 相似文献
3.
The aim of this study was to analyze the accuracy of bone pose estimation based on sub-clusters of three skin-markers characterized by triangular Cosserat point elements (TCPEs) and to evaluate the capability of four instantaneous physical parameters, which can be measured non-invasively in vivo, to identify the most accurate TCPEs. Moreover, TCPE pose estimations were compared with the estimations of two least squares minimization methods applied to the cluster of all markers, using rigid body (RBLS) and homogeneous deformation (HDLS) assumptions. Analysis was performed on previously collected in vivo treadmill gait data composed of simultaneous measurements of the gold-standard bone pose by bi-plane fluoroscopy tracking the subjects' knee prosthesis and a stereophotogrammetric system tracking skin-markers affected by soft tissue artifact. Femur orientation and position errors estimated from skin-marker clusters were computed for 18 subjects using clusters of up to 35 markers. Results based on gold-standard data revealed that instantaneous subsets of TCPEs exist which estimate the femur pose with reasonable accuracy (median root mean square error during stance/swing: 1.4/2.8 deg for orientation, 1.5/4.2 mm for position). A non-invasive and instantaneous criteria to select accurate TCPEs for pose estimation (4.8/7.3 deg, 5.8/12.3 mm), was compared with RBLS (4.3/6.6 deg, 6.9/16.6 mm) and HDLS (4.6/7.6 deg, 6.7/12.5 mm). Accounting for homogeneous deformation, using HDLS or selected TCPEs, yielded more accurate position estimations than RBLS method, which, conversely, yielded more accurate orientation estimations. Further investigation is required to devise effective criteria for cluster selection that could represent a significant improvement in bone pose estimation accuracy. 相似文献
4.
In the linear model with right-censored responses and many potential explanatory variables, regression parameter estimates may be unstable or, when the covariates outnumber the uncensored observations, not estimable. We propose an iterative algorithm for partial least squares, based on the Buckley-James estimating equation, to estimate the covariate effect and predict the response for a future subject with a given set of covariates. We use a leave-two-out cross-validation method for empirically selecting the number of components in the partial least-squares fit that approximately minimizes the error in estimating the covariate effect of a future observation. Simulation studies compare the methods discussed here with other dimension reduction techniques. Data from the AIDS Clinical Trials Group protocol 333 are used to motivate the methodology. 相似文献
5.
Microarray experiments generate data sets with information on the expression levels of thousands of genes in a set of biological samples. Unfortunately, such experiments often produce multiple missing expression values, normally due to various experimental problems. As many algorithms for gene expression analysis require a complete data matrix as input, the missing values have to be estimated in order to analyze the available data. Alternatively, genes and arrays can be removed until no missing values remain. However, for genes or arrays with only a small number of missing values, it is desirable to impute those values. For the subsequent analysis to be as informative as possible, it is essential that the estimates for the missing gene expression values are accurate. A small amount of badly estimated missing values in the data might be enough for clustering methods, such as hierachical clustering or K-means clustering, to produce misleading results. Thus, accurate methods for missing value estimation are needed. We present novel methods for estimation of missing values in microarray data sets that are based on the least squares principle, and that utilize correlations between both genes and arrays. For this set of methods, we use the common reference name LSimpute. We compare the estimation accuracy of our methods with the widely used KNNimpute on three complete data matrices from public data sets by randomly knocking out data (labeling as missing). From these tests, we conclude that our LSimpute methods produce estimates that consistently are more accurate than those obtained using KNNimpute. Additionally, we examine a more classic approach to missing value estimation based on expectation maximization (EM). We refer to our EM implementations as EMimpute, and the estimate errors using the EMimpute methods are compared with those our novel methods produce. The results indicate that on average, the estimates from our best performing LSimpute method are at least as accurate as those from the best EMimpute algorithm. 相似文献
6.
Many biomedical studies have identified important imaging biomarkers that are associated with both repeated clinical measures and a survival outcome. The functional joint model (FJM) framework, proposed by Li and Luo in 2017, investigates the association between repeated clinical measures and survival data, while adjusting for both high-dimensional images and low-dimensional covariates based on the functional principal component analysis (FPCA). In this paper, we propose a novel algorithm for the estimation of FJM based on the functional partial least squares (FPLS). Our numerical studies demonstrate that, compared to FPCA, the proposed FPLS algorithm can yield more accurate and robust estimation and prediction performance in many important scenarios. We apply the proposed FPLS algorithm to a neuroimaging study. Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. 相似文献
7.
C G Turner 《American journal of physical anthropology》1985,68(2):263-267
Expression count is the term used for a new method of calculating the frequency of a morphological dental trait in a population sample. This method incorporates into a single value the total trait variation expressed in the sample. It has been designed to increase the information content of small samples of dental remains common with archeological recovery. 相似文献
8.
Zhou Y Zheng C Cao H Li X Sha J 《Biochemical and biophysical research communications》2012,420(1):205-209
In this work, we propose a signal selection procedure for determination of hemoglobin (Hb) concentration in whole blood using near infrared (NIR) transmission spectral signals. A dataset of 190 whole blood NIR transmission spectra with reference Hb concentrations was used to evaluate the method. Spectral signals were selected based on the squared correlation coefficient (R(2)) between the signal and the Hb concentration. An improved uninformative variable elimination (UVE) procedure was performed to remove redundant signals from the primary selected signal set. A partial least squares (PLS) regression model was built with the final selected signals and the corresponding Hb concentrations. The results indicate that the proposed method is effective at increasing the predictive power of the NIR-PLS spectral model for determining Hb concentration in whole blood samples. 相似文献
9.
Donald A 《BMJ (Clinical research ed.)》1999,319(7220):1298-1299
10.
As an important transmembrane protein family in eukaryon, G-protein coupled receptors (GPCRs) play a significant role in cellular signal transduction and are important targets for drug design. However, it is very difficult to resolve their tertiary structure by X-ray crystallography. In this study, we have developed a Delaunay model, which constructs a series of simplexes with latent variables to classify the families of GPCRs and projects unknown sequences to principle component space (PC-space) to predict their topology. Computational results show that, for the classification of GPCRs, the method achieves the accuracy of 91.0 and 87.6% for Class A, more than 80% for the other three classes in differentiating GPCRs from non-GPCRs and 70% for discriminating between four major classes of GPCR, respectively. When recognizing the structure of GPCRs, all the N-terminals of sequences can be determined correctly. The maximum accuracy of predicting transmembrane segments is achieved in the 7th transmembrane segment of Rhodopsin, which is 99.4%, and the average error is 2.1 amino acids, which is the lowest in all of the segments prediction. This method could provide structural information of a novel GPCR as a tool for experiments and other algorithms of structure prediction of GPCRs. Academic users should send their request for the MATLAB program for classifying GPCRs and predicting the topology of them at liml@scu.edu.cn . 相似文献
11.
12.
J J Lee 《Biometrics》1991,47(4):1573-1580
In the calibration problem, the need to construct a confidence interval to estimate the unknown chi 0 arises when the null hypothesis of zero slope is rejected. Otherwise, the resulting confidence interval will be infinite to reflect the fact that the slope of the regression line may be zero. Under the condition of rejecting the hypothesis of zero slope, we study the properties of the conditional coverage rate of the calibration confidence interval. The conditional coverage rate (P1) is a function of the slope, distance between chi 0 and the mean of the trailing sample means, the sum of squares of chi, and n. When the true slope is close to 0 and chi 0 is away from means, P1 can go down to 0. On the other hand, as the power of testing zero slope reaches 1, with or without chi 0 close to means, P1 will tend to the desired nominal coverage rate. In summary, one should choose a reasonably small alpha in testing zero slope to avoid constructing a confidence interval for chi 0 when the true slope is 0. In addition, it is desirable to have high power in testing zero slope so that the resulting confidence interval will maintain the desired coverage rate when using the conditional approach in the calibration problem. 相似文献
13.
The problem with "the Paleoptera Problem:" sense and sensitivity 总被引:3,自引:0,他引:3
T.Heath Ogden Michael F Whiting 《Cladistics : the international journal of the Willi Hennig Society》2003,19(5):432-442
While the monophyly of winged insects (Pterygota) is well supported, phylogenetic relationships among the most basal extant pterygote lineages are problematic. Ephemeroptera (mayflies) and Odonata (dragonflies) represent the two most basal extant lineages of winged insects, and determining their relationship with regard to Neoptera (remaining winged insects) is a critical step toward understanding insect diversification. A recent molecular analysis concluded that Paleoptera (Odonata Ephemeroptera) is monophyletic. However, we demonstrate that this result is supported only under a narrow range of alignment parameters. We have further tested the monophyly of Paleoptera using additional sequence data from 18SrDNA, 28S rDNA, and Histone 3 for a broader selection of taxa and a wider range of analytical methodologies. Our results suggest that the current suite of molecular data ambiguously resolve the three basal winged insect lineages and do not provide independent confirmation of Odonata + Neoptera as supported via morphological data. 相似文献
14.
Ronald A. Heiner 《Behavioural processes》1984,9(4):339-344
Herrnstein (1979) recently claimed that the matching law could be derived from an ordinary differential equation. He failed, however, to analyze the dynamic properties of his proposed equation. I show that it implies behavior cannot stabilize in accordance with the matching law. Consequently, Herrnstein's equation cannot explain why matching behavior has been widely observed. 相似文献
15.
16.
Jesualdo A. Fuentes-G. Paul David Polly Emília P. Martins 《Evolution; international journal of organic evolution》2020,74(2):311-325
Phylogenetic comparative methods use tree topology, branch lengths, and models of phenotypic change to take into account nonindependence in statistical analysis. However, these methods normally assume that trees and models are known without error. Approaches relying on evolutionary regimes also assume specific distributions of character states across a tree, which often result from ancestral state reconstructions that are subject to uncertainty. Several methods have been proposed to deal with some of these sources of uncertainty, but approaches accounting for all of them are less common. Here, we show how Bayesian statistics facilitates this task while relaxing the homogeneous rate assumption of the well-known phylogenetic generalized least squares (PGLS) framework. This Bayesian formulation allows uncertainty about phylogeny, evolutionary regimes, or other statistical parameters to be taken into account for studies as simple as testing for coevolution in two traits or as complex as testing whether bursts of phenotypic change are associated with evolutionary shifts in intertrait correlations. A mixture of validation approaches indicates that the approach has good inferential properties and predictive performance. We provide suggestions for implementation and show its usefulness by exploring the coevolution of ankle posture and forefoot proportions in Carnivora. 相似文献
17.
Assessing calibration uncertainty in molecular dating: the assignment of fossils to alternative calibration points 总被引:4,自引:0,他引:4
Although recent methodological advances have allowed the incorporation of rate variation in molecular dating analyses, the calibration procedure, performed mainly through fossils, remains resistant to improvements. One source of uncertainty pertains to the assignment of fossils to specific nodes in a phylogeny, especially when alternative possibilities exist that can be equally justified on morphological grounds. Here we expand on a recently developed fossil cross-validation method to evaluate whether alternative nodal assignments of multiple fossils produce calibration sets that differ in their internal consistency. We use an enlarged Crypteroniaceae-centered phylogeny of Myrtales, six fossils, and 72 combinations of calibration points, termed calibration sets, to identify (i) the fossil assignments that produce the most internally consistent calibration sets and (ii) the mean ages, derived from these calibration sets, for the split of the Southeast Asian Crypteroniaceae from their West Gondwanan sister clade (node X). We found that a correlation exists between s values, devised to measure the consistency among the calibration points of a calibration set (Near and Sanderson, 2004), and nodal distances among calibration points. By ranking all sets according to the percent deviation of s from the regression line with nodal distance, we identified the sets with the highest level of corrected calibration-set consistency. These sets generated lower standard deviations associated with the ages of node X than sets characterized by lower corrected consistency. The three calibration sets with the highest corrected consistencies produced mean age estimates for node X of 79.70, 79.14, and 78.15 My. These timeframes are most compatible with the hypothesis that the Crypteroniaceae stem lineage dispersed from Africa to the Deccan plate as it drifted northward during the Late Cretaceous. 相似文献
18.
MOTIVATION: The immunogenicity of peptides depends on their ability to bind to MHC molecules. MHC binding affinity prediction methods can save significant amounts of experimental work. The class II MHC binding site is open at both ends, making epitope prediction difficult because of the multiple binding ability of long peptides. RESULTS: An iterative self-consistent partial least squares (PLS)-based additive method was applied to a set of 66 peptides no longer than 16 amino acids, binding to DRB1*0401. A regression equation containing the quantitative contributions of the amino acids at each of the nine positions was generated. Its predictability was tested using two external test sets which gave r(pred) = 0.593 and r(pred) = 0.655, respectively. Furthermore, it was benchmarked using 25 known T-cell epitopes restricted by DRB1*0401 and we compared our results with four other online predictive methods. The additive method showed the best result finding 24 of the 25 T-cell epitopes. AVAILABILITY: Peptides used in the study are available from http://www.jenner.ac.uk/JenPep. The PLS method is available commercially in the SYBYL molecular modelling software package. The final model for affinity prediction of peptides binding to DRB1*0401 molecule is available at http://www.jenner.ac.uk/MHCPred. Models developed for DRB1*0101 and DRB1*0701 also are available in MHCPred. 相似文献
19.
Beyond binding: using phage display to select for structure, folding and enzymatic activity in proteins. 总被引:6,自引:0,他引:6
Phage display of a wide range of polypeptides has been increasingly used to identify novel molecules with useful binding properties for research, medical and industrial applications. Recent developments include methods for the selection of stabilized variants of a protein, the selection of regulatable enzymes and promising strategies for the selection and evolution of protein catalysts. 相似文献
20.
Type II restriction endonucleases (REs) are widely used tools in molecular biology, biotechnology and diagnostics. Efforts to generate new specificities by structure-guided design and random mutagenesis have been unsuccessful so far. We have developed a new procedure called the methylation activity-based selection (MABS) for generating REs with a new specificity. MABS uses a unique property of bifunctional type II REs to methylate DNA targets they recognize. The procedure includes three steps: (1) conversion of a bifunctional RE into a monofunctional DNA-modifying enzyme by cleavage center disruption; (2) mutagenesis and selection of mutants with altered DNA modification specificity based on their ability to protect predetermined DNA targets; (3) reconstitution of the cleavage center's wild-type structure. The efficiency of the MABS technique was demonstrated by altering the sequence specificity of the bifunctional RE Eco57I from 5'-CTGAAG to 5'-CTGRAG, and thus generating the mutant restriction endonuclease (and DNA methyltransferase) of a specificity not known before. This study provides evidence that MABS is a promising technique for generation of REs with new specificities. 相似文献