共查询到20条相似文献,搜索用时 15 毫秒
1.
In this work, we fit pattern-mixture models to data sets with responses that are potentially missing not at random (MNAR, Little and Rubin, 1987). In estimating the regression parameters that are identifiable, we use the pseudo maximum likelihood method based on exponential families. This procedure provides consistent estimators when the mean structure is correctly specified for each pattern, with further information on the variance structure giving an efficient estimator. The proposed method can be used to handle a variety of continuous and discrete outcomes. A test built on this approach is also developed for model simplification in order to improve efficiency. Simulations are carried out to compare the proposed estimation procedure with other methods. In combination with sensitivity analysis, our approach can be used to fit parsimonious semi-parametric pattern-mixture models to outcomes that are potentially MNAR. We apply the proposed method to an epidemiologic cohort study to examine cognition decline among elderly. 相似文献
2.
Robust estimation of multivariate covariance components 总被引:1,自引:0,他引:1
In many settings, such as interlaboratory testing, small area estimation in sample surveys, and heritability studies, investigators are interested in estimating covariance components for multivariate measurements. However, the presence of outliers can seriously distort estimates obtained using standard procedures such as maximum likelihood. We propose a procedure based on M-estimation for robustly estimating multivariate covariance components in the presence of outliers; the procedure applies to balanced and unbalanced data. We present an algorithm for computing the robust estimates and examine the performance of the estimator through a simulation study. The estimator is used to find covariance components and identify outliers in a study of variability of egg length and breadth measurements of American coots. 相似文献
3.
Microsatellite analysis of genetic variation in black bear populations 总被引:16,自引:0,他引:16
Measuring levels of genetic variation is an important aspect of conservation genetics The informativeness of such measurements is related to the variability of the genetic markers used; a particular concern in species, such as bears, which are characterized by low levels of genetic variation resulting from low population densities and small effective population sizes We describe the development of microsatellite analysis in bears and its use in assessing interpopulation differences in genetic variation in black bears from three Canadian National Parks These markers are highly variable and allowed identification of dramatic differences in both distribution and amount of variation between populations Low levels of variation were observed in a population from the Island of Newfoundland The significance of interpopulation differences in variability was tested using a likelihood ratio test of estimates of θ= 4 Ne u. 相似文献
4.
5.
Aitkin M 《Biometrics》1999,55(1):117-128
This paper describes an EM algorithm for nonparametric maximum likelihood (ML) estimation in generalized linear models with variance component structure. The algorithm provides an alternative analysis to approximate MQL and PQL analyses (McGilchrist and Aisbett, 1991, Biometrical Journal 33, 131-141; Breslow and Clayton, 1993; Journal of the American Statistical Association 88, 9-25; McGilchrist, 1994, Journal of the Royal Statistical Society, Series B 56, 61-69; Goldstein, 1995, Multilevel Statistical Models) and to GEE analyses (Liang and Zeger, 1986, Biometrika 73, 13-22). The algorithm, first given by Hinde and Wood (1987, in Longitudinal Data Analysis, 110-126), is a generalization of that for random effect models for overdispersion in generalized linear models, described in Aitkin (1996, Statistics and Computing 6, 251-262). The algorithm is initially derived as a form of Gaussian quadrature assuming a normal mixing distribution, but with only slight variation it can be used for a completely unknown mixing distribution, giving a straightforward method for the fully nonparametric ML estimation of this distribution. This is of value because the ML estimates of the GLM parameters can be sensitive to the specification of a parametric form for the mixing distribution. The nonparametric analysis can be extended straightforwardly to general random parameter models, with full NPML estimation of the joint distribution of the random parameters. This can produce substantial computational saving compared with full numerical integration over a specified parametric distribution for the random parameters. A simple method is described for obtaining correct standard errors for parameter estimates when using the EM algorithm. Several examples are discussed involving simple variance component and longitudinal models, and small-area estimation. 相似文献
6.
Roff D 《Evolution; international journal of organic evolution》2002,56(6):1286-1291
There is considerable interest in comparing genetic variance-covariances matrices (G matrix). However, present methods are difficult to implement and cannot readily be extended to incorporate effects of other variables such as habitat, sex, or location. In this paper I present a method based on MANOVA that can be done using only standard statistical packages (coding for the method using SPLUS is available from the author). The crux of the approach is to use the jackknife method to estimate the pseudovalues of the estimates; these estimates can then be used as datapoints in a MANOVA. I illustrate the method using two published datasets: (1) variation in G matrices resulting from differences in rearing condition, species, and sex in the crickets Gryllus firmus and G. pennsylvanicus; and (2) variation in G matrices associated with habitat and history in the amphipod Gammarus minus. 相似文献
7.
Huggins R 《Biometrics》2000,56(2):537-545
In the study of longitudinal twin and family data, interest is often in the covariance structure of the data and the decomposition of this covariance structure into genetic and environmental components rather than in estimating the mean function. Various parametric models for covariance structures have been proposed but, e.g., in studies of children where growth spurts occur at various ages, it is difficult to a priori determine an appropriate parametric model for the covariance structure. In particular, there is a general lack of the visualization procedures, such as lowess, that are invaluable in the initial stages of constructing a parametric model for a mean function. Here we use kernel smoothing to modify a cross-sectional approach based on the sample covariance matrices to obtain smoothed estimates of the genetic and environmental variances and correlations for longitudinal twin data. The methods are proposed to be exploratory as an aid to parametric modeling rather than inferential, although approximate asymptotic standard errors are derived in the Appendix. 相似文献
8.
Coevolution has long been thought to drive the exaggeration of traits, promote major evolutionary transitions such as the evolution of sexual reproduction and influence epidemiological dynamics. Despite coevolution’s long suspected importance, we have yet to develop a quantitative understanding of its strength and prevalence because we lack generally applicable statistical methods that yield numerical estimates for coevolution’s strength and significance in the wild. Here, we develop a novel method that derives maximum likelihood estimates for the strength of direct pairwise coevolution by coupling a well‐established coevolutionary model to spatially structured phenotypic data. Applying our method to two well‐studied interactions reveals evidence for coevolution in both systems. Broad application of this approach has the potential to further resolve long‐standing evolutionary debates such as the role species interactions play in the evolution of sexual reproduction and the organisation of ecological communities. 相似文献
9.
Most statistical solutions to the problem of statistical inferencewith missing data involve integration or expectation. This canbe done in many ways: directly or indirectly, analytically ornumerically, deterministically or stochastically. Missing-dataproblems can be formulated in terms of latent random variables,so that hierarchical likelihood methods of Lee & Nelder(1996) can be applied to missing-value problems to provide onesolution to the problem of integration of the likelihood. Theresulting methods effectively use a Laplace approximation tothe marginal likelihood with an additional adjustment to themeasures of precision to accommodate the estimation of the fixedeffects parameters. We first consider missing at random caseswhere problems are simpler to handle because the integrationdoes not need to involve the missing-value mechanism and thenconsider missing not at random cases. We also study tobit regressionand refit the missing not at random selection model to the antidepressanttrial data analyzed in Diggle & Kenward (1994). 相似文献
10.
11.
D. Scott Taylor Michael T. Fisher Bruce J. Turner 《Environmental Biology of Fishes》2001,61(4):455-459
Rivulus marmoratus is a self-fertilizing hermaphroditic fish found in the tropical Atlantic as populations of homozygous clones, with the exception of a single site in Belize where male fish are abundant and heterozygosity is the norm. The presence of male fish apparently leads to outcrossing and heterozygosity, but males have been found in limited numbers in other populations which are homozygous. DNA fingerprinting now reveals that the Belize population has remained heterozygous, with a high proportion of males (20–25%), for several years. In addition, two newly discovered populations with a lower percentage of males (1–2%) are reported from the Bahamas and Honduras. One of these populations (Bahamas) consists of homozygous clones, while the other (Honduras) displays a limited proportion of heterozygosity. The Honduras population is only the second outcrossing population known in this species, and the limited heterozygosity seen here may reflect the lower proportion of males. 相似文献
12.
Geert Molenberghs Bart Michiels Michael G. Kenward 《Biometrical journal. Biometrische Zeitschrift》1998,40(5):557-572
In this paper we develop pseudo-likelihood methods for the estimation of parameters in a model that is specified in terms of both selection modelling and pattern-mixture modelling quantities. Two cases are considered: (1) the model is specified directly from a joint model for the measurement and dropout processes; (2) conditional models for the measurement process given dropout and vice versa are specified directly. In the latter case, compatibility constraints to ensure the existence of a joint density are derived. The method is applied to data from a psychiatric study, where a bivariate therapeutic outcome is supplemented with covariate information. 相似文献
13.
We explore the estimation of uncertainty in evolutionary parameters using a recently devised approach for resampling entire additive genetic variance–covariance matrices ( G ). Large‐sample theory shows that maximum‐likelihood estimates (including restricted maximum likelihood, REML) asymptotically have a multivariate normal distribution, with covariance matrix derived from the inverse of the information matrix, and mean equal to the estimated G . This suggests that sampling estimates of G from this distribution can be used to assess the variability of estimates of G , and of functions of G . We refer to this as the REML‐MVN method. This has been implemented in the mixed‐model program WOMBAT. Estimates of sampling variances from REML‐MVN were compared to those from the parametric bootstrap and from a Bayesian Markov chain Monte Carlo (MCMC) approach (implemented in the R package MCMCglmm). We apply each approach to evolvability statistics previously estimated for a large, 20‐dimensional data set for Drosophila wings. REML‐MVN and MCMC sampling variances are close to those estimated with the parametric bootstrap. Both slightly underestimate the error in the best‐estimated aspects of the G matrix. REML analysis supports the previous conclusion that the G matrix for this population is full rank. REML‐MVN is computationally very efficient, making it an attractive alternative to both data resampling and MCMC approaches to assessing confidence in parameters of evolutionary interest. 相似文献
14.
R. F. Caro M. Grossman R. L. Fernando 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1985,69(5-6):523-530
Summary Effects of data imbalance on bias, sampling variance and mean square error of heritability estimated with variance components were examined using a random two-way nested classification. Four designs, ranging from zero imbalance (balanced data) to low, medium and high imbalance, were considered for each of four combinations of heritability (h2=0.2 and 0.4) and sample size (N=120 and 600). Observations were simulated for each design by drawing independent pseudo-random deviates from normal distributions with zero means, and variances determined by heritability. There were 100 replicates of each simulation; the same design matrix was used in all replications. Variance components were estimated by analysis of variance (Henderson's Method 1) and by maximum likelihood (ML). For the design and model used in this study, bias in heritability based on Method 1 and ML estimates of variance components was negligible. Effect of imbalance on variance of heritability was smaller for ML than for Method 1 estimation, and was smaller for heritability based on estimates of sire-plus-dam variance components than for heritability based on estimates of sire or dam variance components. Mean square error for heritability based on estimates of sire-plus-dam variance components appears to be less sensitive to data imbalance than heritability based on estimates of sire or dam variance components, especially when using Method 1 estimation. Estimation of heritability from sire-plus-dam components was insensitive to differences in data imbalance, especially for the larger sample size.Supported by grants from the Illinois Agricultural Experiment Station and the University of Illinois Research Board. Charles Smith, H. W. Norton and D. Gianola contributed valuable suggestions 相似文献
15.
The maximum likelihood approach to jointly model the survival time and its longitudinal covariates has been successful to model both processes in longitudinal studies. Random effects in the longitudinal process are often used to model the survival times through a proportional hazards model, and this invokes an EM algorithm to search for the maximum likelihood estimates (MLEs). Several intriguing issues are examined here, including the robustness of the MLEs against departure from the normal random effects assumption, and difficulties with the profile likelihood approach to provide reliable estimates for the standard error of the MLEs. We provide insights into the robustness property and suggest to overcome the difficulty of reliable estimates for the standard errors by using bootstrap procedures. Numerical studies and data analysis illustrate our points. 相似文献
16.
The Daily Hormone Study, a substudy of the Study of Women's Health Across the Nation (SWAN) consisting of more than 600 pre- and perimenopausal women, includes a scalar measure of total hip bone mineral density (BMD) together with repeated measures of creatinine-adjusted follicle stimulating hormone (FSH) assayed from daily urine samples collected over one menstrual cycle. It is of scientific interest to investigate the effect of the FSH time profile during a menstrual cycle on total hip BMD, adjusting for age and body mass index. The statistical analysis is challenged by several features of the data: (1) the covariate FSH is measured longitudinally and its effect on the scalar outcome BMD may be complex; (2) due to varying menstrual cycle lengths, subjects have unbalanced longitudinal measures of FSH; and (3) the longitudinal measures of FSH are subject to considerable among- and within-subject variations and measurement errors. We propose a measurement error partial functional linear model, where repeated measures of FSH are modeled using a functional mixed effects model and the effect of the FSH time profile on BMD is modeled using a partial functional linear model by treating the unobserved true subject-specific FSH time profile as a functional covariate. We develop a two-stage nonparametric regression calibration method using period smoothing splines. Using the connection between smoothing splines and mixed models, we show that a key feature of our approach is that estimation at both stages can be conveniently cast into a unified mixed model framework. A simple testing procedure for constant functional covariate effect is also proposed. The proposed methods are evaluated using simulation studies and applied to the SWAN data. 相似文献
17.
DNA fingerprinting in clonal organisms 总被引:2,自引:0,他引:2
J. F. Y. BROOKFIELD 《Molecular ecology》1992,1(1):21-26
The use of DNA fingerprinting to identify members of the same clone in completely or partially asexual organisms requires that the individuals within a clone share a recent common ancestor. By considering the expected distributions of band–sharing values in asexual and sexual organisms, it is shown that DNA fingerprinting may be effective in distinguishing members of the same clone, provided that the frequency of sexual reproduction is considerably greater than the minisatellite mutation rate. 相似文献
18.
Inference regarding the inclusion or exclusion of random effects in linear mixed models is challenging because the variance components are located on the boundary of their parameter space under the usual null hypothesis. As a result, the asymptotic null distribution of the Wald, score, and likelihood ratio tests will not have the typical χ(2) distribution. Although it has been proved that the correct asymptotic distribution is a mixture of χ(2) distributions, the appropriate mixture distribution is rather cumbersome and nonintuitive when the null and alternative hypotheses differ by more than one random effect. As alternatives, we present two permutation tests, one that is based on the best linear unbiased predictors and one that is based on the restricted likelihood ratio test statistic. Both methods involve weighted residuals, with the weights determined by the among- and within-subject variance components. The null permutation distributions of our statistics are computed by permuting the residuals both within and among subjects and are valid both asymptotically and in small samples. We examine the size and power of our tests via simulation under a variety of settings and apply our test to a published data set of chronic myelogenous leukemia patients. 相似文献
19.
20.